BayesRL 's Collections

Warm-started Checkpoints

A collection of three models trained on the Nemotron Post Training Dataset for reasoning tasks with IVON