nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated about 20 hours ago • 116k • 289
Running Featured 68 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 68 Who needs 1T parameters? Olympiad proofs with a 4B model