Nemotron Code & SWE Collection Datasets for building models that write, debug, and reason about code. Covers competitive programming, software engineering, and code pretraining. • 14 items • Updated 15 days ago • 6
propella-1: Multi-Property Document Annotation for LLM Data Curation at Scale Paper • 2602.12414 • Published Feb 12 • 4