UCLA NLP

university

http://kwchang.net/

AI & ML interests

Natural Language Processing, Bias and Fairness in NLP

Recent Activity

gordonhu authored a paper 6 days ago

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

gordonhu authored a paper 6 days ago

Matryoshka Query Transformer for Large Vision-Language Models

gordonhu authored a paper 6 days ago

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Papers

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training

authored 9 papers 6 days ago

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

Paper • 2308.09936 • Published Aug 19, 2023 • 1

Matryoshka Query Transformer for Large Vision-Language Models

Paper • 2405.19315 • Published May 29, 2024 • 1

MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models

Paper • 2410.08182 • Published Oct 10, 2024

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Paper • 2411.18651 • Published Nov 27, 2024

Interleaving Reasoning for Better Text-to-Image Generation

Paper • 2509.06945 • Published Sep 8, 2025 • 15

TemMed-Bench: Evaluating Temporal Medical Image Reasoning in Vision-Language Models

Paper • 2509.25143 • Published Sep 29, 2025

ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

Paper • 2510.08457 • Published Oct 9, 2025 • 13

MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence

Paper • 2512.10863 • Published Dec 11, 2025 • 22

OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

Paper • 2604.08539 • Published 7 days ago • 48

updated a dataset 29 days ago

uclanlp/TaoBench

Viewer • Updated 29 days ago • 150 • 83

published a dataset about 1 month ago

uclanlp/TaoBench

Viewer • Updated 29 days ago • 150 • 83

authored a paper 2 months ago

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Paper • 2602.02477 • Published Feb 2 • 11

submitted a paper to Daily Papers 2 months ago

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Paper • 2602.02477 • Published Feb 2 • 11

authored a paper 3 months ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published Jan 20 • 47

authored a paper 4 months ago

Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions

Paper • 2512.00097 • Published Nov 27, 2025 • 3

authored a paper 5 months ago

G²VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Paper • 2511.21688 • Published Nov 26, 2025 • 8

updated a dataset 6 months ago

uclanlp/Brief-Pro

Viewer • Updated Oct 19, 2025 • 45.2k • 15 • 3

updated a model 6 months ago

uclanlp/brief-pro

Updated Oct 19, 2025 • 146 • 4