Yitao Lo.ong's picture

2 4 1

Yitao Lo.ong

Dragongon

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper about 22 hours ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

commented on a paper 3 months ago

PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles

View all activity

Organizations

upvoted 2 papers about 22 hours ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 4 days ago • 116

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 3 days ago • 67

commented 2 papers 3 months ago

PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles

Paper • 2510.06475 • Published Oct 7, 2025 • 1 •

FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering

Paper • 2510.06426 • Published Oct 7, 2025 • 2 •

authored 5 papers 3 months ago

KnowledgeMath: Knowledge-Intensive Math Word Problem Solving in Finance Domains

Paper • 2311.09797 • Published Nov 16, 2023 • 1

DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data

Paper • 2311.09805 • Published Nov 16, 2023 • 3

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21, 2025 • 84

PuzzlePlex: Benchmarking Foundation Models on Reasoning and Planning with Puzzles

Paper • 2510.06475 • Published Oct 7, 2025 • 1

FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering

Paper • 2510.06426 • Published Oct 7, 2025 • 2

upvoted a paper 7 months ago

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks

Paper • 2507.01001 • Published Jul 1, 2025 • 46

liked a dataset 12 months ago

yale-nlp/MMVU

Viewer • Updated Feb 28, 2025 • 1k • 1.19k • 61

upvoted a paper 12 months ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21, 2025 • 84

updated a dataset over 2 years ago

Dragongon/test_gfhsqa

Viewer • Updated Aug 14, 2023 • 30 • 2

updated a model almost 3 years ago

Dragongon/finetuned_bert

Text Classification • Updated Mar 19, 2023 • 5