TRM

community

Recent Activity

asdadaac published a dataset 9 days ago

ThinkingRM/Existing_Benchmark

Updated 21 days ago • 1.21k

asdadaac published a dataset 12 days ago

ThinkingRM/T2I_Data

Preview • Updated 12 days ago • 1.86k

asdadaac updated a dataset 12 days ago

ThinkingRM/T2I_Data

Preview • Updated 12 days ago • 1.86k

authored a paper 16 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 18 days ago • 38

submitted a paper to Daily Papers 16 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Paper • 2605.26244 • Published 18 days ago • 38

authored a paper 20 days ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 22 days ago • 46

updated a dataset 21 days ago

ThinkingRM/Existing_Benchmark

Updated 21 days ago • 1.21k

authored 2 papers 22 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published 25 days ago • 22

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

Paper • 2605.20183 • Published 24 days ago • 14

submitted a paper to Daily Papers 23 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published 25 days ago • 22

authored a paper 29 days ago

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Paper • 2605.13062 • Published about 1 month ago • 33

submitted a paper to Daily Papers 29 days ago

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Paper • 2605.13062 • Published about 1 month ago • 33

authored a paper 30 days ago

Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization

Paper • 2605.10780 • Published May 12 • 33

submitted a paper to Daily Papers about 1 month ago

Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization

Paper • 2605.10780 • Published May 12 • 33

authored 2 papers 2 months ago

Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence?

Paper • 2604.03016 • Published Apr 3 • 37

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 203

authored a paper 3 months ago

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Paper • 2603.15030 • Published Mar 16 • 21

submitted a paper to Daily Papers 3 months ago

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Paper • 2603.15030 • Published Mar 16 • 21

authored 2 papers 4 months ago

BrowseComp-$V^3$: A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents

Paper • 2602.12876 • Published Feb 13 • 14

OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Paper • 2602.04804 • Published Feb 4 • 50