AI & ML interests
None yet
Organizations
8B • Updated • 171
• 1
Nannanzi/integrated_prompt
3B • Updated • 1
3B • Updated • 1
Nannanzi/3B_balanced_no_category_double_data_step_561
3B • Updated • 1
Nannanzi/3B_balanced_no_category_double_data_step_372
3B • Updated • 1
Nannanzi/9_acc_unbalanced
2B • Updated • 1
Nannanzi/3B_9_acc_balanced_step_279
3B • Updated • 1
Nannanzi/balanced_9_accuracy_correct_format_step_93
Updated
Nannanzi/9_acc_balanced_wrong_step_93
Updated
2B • Updated • 2
2B • Updated • 2
2B • Updated • 2
Nannanzi/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
8B • Updated • 4
Nannanzi/refusal_training
8B • Updated • 2
Nannanzi/utility_training
8B • Updated • 2
8B • Updated • 1
Nannanzi/sft_instruct_no_reason_lr1e-06
Updated
Nannanzi/sft_base_reason_lr5e-05
Updated
Nannanzi/sft_base_reason_lr2e-05
Updated
Nannanzi/sft_base_reason_lr1e-05
Updated
Nannanzi/sft_base_reason_lr5e-06
Updated
Nannanzi/sft_base_reason_lr2e-06
Updated
Nannanzi/sft_base_reason_lr1e-06
Updated
Nannanzi/sft_instruct_reason_lr5e-06
Updated
Nannanzi/sft_instruct_reason_lr2e-06
Updated
Nannanzi/sft_instruct_reason_lr1e-06
Updated
Nannanzi/sft_instruct_reason_lr5e-05
Updated
Nannanzi/sft_instruct_reason_lr2e-05
Updated
Nannanzi/sft_instruct_reason_lr1e-05
Updated