agent-distillation

community

AI & ML interests

None defined yet.

Recent Activity

Nardien authored a paper about 1 month ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Nardien submitted a paper about 2 months ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Nardien updated a dataset about 1 year ago

agent-distillation/Qwen2.5-32B-Instruct_prefix_memory_3k

View all activity

authored a paper about 1 month ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published May 27 • 93

submitted a paper to Daily Papers about 2 months ago

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published May 15 • 35

updated a dataset about 1 year ago

agent-distillation/Qwen2.5-32B-Instruct_prefix_memory_3k

Updated Jun 9, 2025 • 8

published a dataset about 1 year ago

agent-distillation/Qwen2.5-32B-Instruct_prefix_memory_3k

Updated Jun 9, 2025 • 8

updated a dataset about 1 year ago

agent-distillation/Qwen2.5-32B-Instruct_cot_trajectories_2k

Viewer • Updated Jun 9, 2025 • 3k • 32 • 1

published a dataset about 1 year ago

agent-distillation/Qwen2.5-32B-Instruct_cot_trajectories_2k

Viewer • Updated Jun 9, 2025 • 3k • 32 • 1

updated 4 models about 1 year ago

agent-distillation/agent_distilled_ftp_Qwen2.5-0.5B-Instruct

Updated Jun 5, 2025 • 7

agent-distillation/agent_distilled_ftp_Qwen2.5-1.5B-Instruct

Updated Jun 5, 2025 • 5

agent-distillation/agent_distilled_ftp_Qwen2.5-3B-Instruct

Updated Jun 5, 2025 • 6

agent-distillation/agent_distilled_ftp_Qwen2.5-7B-Instruct

Updated Jun 5, 2025 • 6

published 4 models about 1 year ago

agent-distillation/agent_distilled_ftp_Qwen2.5-7B-Instruct

Updated Jun 5, 2025 • 6

agent-distillation/agent_distilled_ftp_Qwen2.5-3B-Instruct

Updated Jun 5, 2025 • 6

agent-distillation/agent_distilled_ftp_Qwen2.5-0.5B-Instruct

Updated Jun 5, 2025 • 7

agent-distillation/agent_distilled_ftp_Qwen2.5-1.5B-Instruct

Updated Jun 5, 2025 • 5

updated a model about 1 year ago

agent-distillation/agent_distilled_Qwen2.5-7B-Instruct

Updated Jun 5, 2025 • 6

published a model about 1 year ago

agent-distillation/agent_distilled_Qwen2.5-7B-Instruct

Updated Jun 5, 2025 • 6

updated 2 models about 1 year ago

agent-distillation/agent_distilled_Qwen2.5-0.5B-Instruct

Updated Jun 5, 2025 • 6

agent-distillation/agent_distilled_Qwen2.5-3B-Instruct

Updated Jun 5, 2025 • 2

published 2 models about 1 year ago

agent-distillation/agent_distilled_Qwen2.5-3B-Instruct

Updated Jun 5, 2025 • 2

agent-distillation/agent_distilled_Qwen2.5-0.5B-Instruct

Updated Jun 5, 2025 • 6