MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 5 items • Updated 6 days ago • 2
MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 5 items • Updated 6 days ago • 2
MTP-LM Collection Models to accompany research paper on training multi token prediction language models using self-distillation. • 5 items • Updated 6 days ago • 2
jwkirchenbauer/debug_metamath_full_rand_k2-8_ex_valk_baseline_latest Text Generation • 8B • Updated 11 days ago • 9
jwkirchenbauer/daint_prod_q4_128N512n_fd7261ea_latest Text Generation • 8B • Updated 11 days ago • 12
jwkirchenbauer/debug_metamath_full_rand_k2-8_ex_valk_baseline_latest Text Generation • 8B • Updated 11 days ago • 9
jwkirchenbauer/daint_prod_q4_128N512n_fd7261ea_latest Text Generation • 8B • Updated 11 days ago • 12