Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents Paper • 2602.16699 • Published 5 days ago • 13
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step700_2026-01-27_21-36-45_nvidia_balanced 8B • Updated 12 days ago • 5
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step350_2026-01-27_21-36-45_nvidia_balanced 8B • Updated 27 days ago
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step50_2026-01-27_21-36-45_nvidia_balanced 8B • Updated 27 days ago • 1
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step100_2026-01-27_21-36-45_nvidia_balanced 8B • Updated 27 days ago • 4
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step350_2026-01-27_03-19-15_nvidia_balanced 8B • Updated 27 days ago • 4
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step300_2026-01-27_03-19-15_nvidia_balanced 8B • Updated 27 days ago • 6
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step100_2026-01-27_03-19-15_nvidia_balanced 8B • Updated 28 days ago • 7
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step150_2026-01-27_03-19-15_nvidia_balanced 8B • Updated 28 days ago • 9
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step200_2026-01-27_03-19-15_nvidia_balanced 8B • Updated 28 days ago • 8