xxccho/EXAONE-GRPO-lg_convfin_mcq_grpo_ratio1.0_gen16_bs4_lr7e-6_beta0.04_tol1e-2 Text Generation • Updated about 1 hour ago
xxccho/EXAONE-GRPO-lg_convfin_mcq_grpo_ratio1.0_gen8_bs4_lr1e-5_beta0.0 Text Generation • Updated 1 minute ago
xxccho/EXAONE-GRPO-lg_convfin_mcq_grpo_ratio1.0_gen8_bs4_lr1e-5_beta0.04 Text Generation • Updated about 2 hours ago
xxccho/EXAONE-GRPO-lg_convfin_mcq_grpo_ratio1.0_gen8_bs4_lr7e-6_beta0.04 Text Generation • Updated about 2 hours ago
xxccho/EXAONE-3.5-7.8B-Instruct_lg_convfin_mcq_reasoning_pc_lora_r64_DR1.0 Text Generation • Updated about 3 hours ago
xxccho/EXAONE-3.5-7.8B-Instruct_lg_convfin_mcq_pc_lora_r64_DR1.0 Text Generation • Updated about 3 hours ago