tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp07_vllmexp3 • 15k • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp07_vllmexp3 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp07_vllmexp3 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp07_vllmexp3 • 15k • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp07_vllmexp3 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp07_vllmexp3 • 15k • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp07_vllmexp3 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp07_vllmexp3 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp07 • 15k • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_550tmp10 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp07 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_500tmp10 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp07 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_450tmp10 • 15k • 2
tmpmodelsave/llamasft_math_ift_balanced_moredata_gold_reward_tmp10_vllmexp • 20k • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp07 • 15k • 2
tmpmodelsave/llamasft_math_ift_balanced_moredata_gold_reward_tmp07_vllmexp • 30k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_400tmp10 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp07 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_350tmp10 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp07 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_300tmp10 • 15k • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp07 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_250tmp10 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp07 • 15k • 1
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_200tmp10 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp07 • 15k • 2
tmpmodelsave/beta01llamasft_math_ift_balanced_dpo_moredata_100tmp10 • 15k • 2
tmpmodelsave/beta05_balanced_type12_sftloss_moredata550tmp07_vllmexp3 • 15k • 2
tmpmodelsave/beta05_balanced_type12_sftloss_moredata550tmp07 • 15k • 1