Junyao Yang's picture

4 7

Junyao Yang

TberiusJunyao

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

upvoted a paper 12 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

liked a dataset 2 months ago

AI45Research/ATBench

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published 1 day ago • 23

upvoted a paper 12 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 15 days ago • 317

liked a dataset 2 months ago

AI45Research/ATBench

Viewer • Updated 13 days ago • 1.5k • 1.18k • 35

liked 6 models 3 months ago

AI45Research/AgentDoG-FG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 30 • 9

AI45Research/AgentDoG-Llama3.1-8B

Text Classification • 8B • Updated Feb 6 • 21 • 11

AI45Research/AgentDoG-FG-Qwen2.5-7B

Text Classification • 8B • Updated Feb 6 • 23 • 8

AI45Research/AgentDoG-Qwen2.5-7B

Text Classification • 8B • Updated 13 days ago • 34 • 10

AI45Research/AgentDoG-FG-Qwen3-4B

Text Classification • 4B • Updated 13 days ago • 111 • 9

AI45Research/AgentDoG-Qwen3-4B

Text Classification • 4B • Updated 13 days ago • 271 • 23

upvoted a collection 3 months ago

AgentDoG

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 6 days ago • 108

upvoted a paper 5 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 44

published 3 models about 1 year ago

TberiusJunyao/Qwen2.5-7B-Instruct-Math-GRPO

Updated Mar 27, 2025

TberiusJunyao/Qwen2.5-1.5B-Open-R1-GRPO

Updated Mar 8, 2025

TberiusJunyao/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Mar 6, 2025