Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_10_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 2 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-9_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated 3 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-5_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated 3 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-1_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated 5 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-3_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated 7 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-9_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 11 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-7_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 11 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-3_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 13 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_10_1p0_0p0_1p0_grpo_2_rule Text Generation • 2B • Updated 14 minutes ago
Kazuki1450/Qwen3-1.7B-Base_csum_6_10_rel_1e-1_1p0_0p0_1p0_grpo_1_rule Text Generation • 2B • Updated 14 minutes ago
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split10 Viewer • Updated Dec 4, 2025 • 5.59k • 5