arxiv:2603.02225
Jingxuan Fan
fjxdaisy
AI & ML interests
None yet
Organizations
models 0
None public yet
datasets 12
fjxdaisy/ssrm_700k
Viewer • Updated • 700k • 18 • 1
fjxdaisy/Skywork-Reward-Preference-80K-v0.2
Viewer • Updated • 77k • 14
fjxdaisy/finemath_part10_llama8b_actor_218step_rm_vllm
Viewer • Updated • 3.34k • 7
fjxdaisy/finemath_part9_llama8b_actor_218step_rm_vllm
Viewer • Updated • 3.32k • 39
fjxdaisy/finemath_part5_llama8b_actor_218step_rm
Viewer • Updated • 2.84k • 19
fjxdaisy/finemath_part6_llama8b_actor_218step_rm
Viewer • Updated • 2.83k • 34
fjxdaisy/finemath_part8_llama8b_actor_218step_rm_vllm
Viewer • Updated • 3.32k • 7
fjxdaisy/finemath_part7_llama8b_actor_218step_rm_vllm
Viewer • Updated • 3.34k • 17
fjxdaisy/rlhfpipeline_mix1_llamafactory
Viewer • Updated • 244k • 73
fjxdaisy/summarize_from_feedback_comparisons_pref
Viewer • Updated • 179k • 12