NamrataThakur/Small_Language_Model_GQA_48M_Pretrained Text Generation • Updated 20 days ago • 1.91k • 1
NamrataThakur/Small_Language_Model_MHA_53M_Pretrained Text Generation • Updated 20 days ago • 1.91k • 1
MihaiPopa-1/Stentor-30M-Instruct-heretic-safety-defiltered Text Generation • 30.4M • Updated Feb 26 • 21 • 2
NamrataThakur/Small_Language_Model_MOE_127M_Pretrained Text Generation • Updated 20 days ago • 1.95k • 1