JetBrains/Mellum2-12B-A2.5B-Thinking Text Generation • 12B • Updated 3 days ago • 6.94k • 184
view article Article Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains JetBrains • 3 days ago • 22
view article Article Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains JetBrains • 3 days ago • 22
JetBrains/Mellum2-12B-A2.5B-Base-Pretrain Text Generation • 12B • Updated 3 days ago • 52 • 8
JetBrains/Mellum2-12B-A2.5B-Thinking-SFT Text Generation • 12B • Updated 3 days ago • 118 • 16
JetBrains/Mellum2-12B-A2.5B-Thinking Text Generation • 12B • Updated 3 days ago • 6.94k • 184