koutch/paper_llama_llama3.1-8b_train_sft_all_train_code Text Generation • 8B • Updated about 10 hours ago • 48
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_code Text Generation • 4B • Updated about 11 hours ago • 36
koutch/paper_llama_llama3.1-8b_train_sft_train_code Text Generation • 8B • Updated about 12 hours ago • 105
koutch/paper_smol_smol3-3B_train_sft_all_train_code Text Generation • 3B • Updated about 13 hours ago • 42
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_code Text Generation • 4B • Updated about 13 hours ago • 71
koutch/paper_smol_smol3-3B_train_sft_train_code Text Generation • 3B • Updated about 13 hours ago • 81
koutch/paper_llama_llama3.1-8b_train_sft_train_para Text Generation • 8B • Updated about 14 hours ago • 172
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated about 15 hours ago • 116
koutch/paper_smol_smol3-3B_train_sft_train_para Text Generation • 3B • Updated about 15 hours ago • 127
koutch/paper_smol_2.json_train_dpo_v2_train_code Text Generation • 3B • Updated about 22 hours ago • 11
koutch/paper_smol_2.json_train_dpo_v2_train_code Text Generation • 3B • Updated about 22 hours ago • 11
koutch/paper_llama_2.json_train_dpo_v2_train_code Text Generation • 8B • Updated about 22 hours ago • 8
koutch/paper_llama_2.json_train_dpo_v2_train_code Text Generation • 8B • Updated about 22 hours ago • 8
koutch/paper_qwen_2.json_train_dpo_v2_train_code Text Generation • 4B • Updated about 23 hours ago • 13
koutch/paper_qwen_2.json_train_dpo_v2_train_code Text Generation • 4B • Updated about 23 hours ago • 13