nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated about 15 hours ago • 13.1k • 192
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 4 days ago • 121
Running on CPU Upgrade 177 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 177 Explore synthetic data experiments in a bookshelf view