On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 13 days ago • 23
On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 13 days ago • 23
Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 55
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 11 days ago • 168
view article Article xLSTM-based time series model TiRex significantly outperforms competing models in forecasting accuracy BobWue • Jun 4, 2025 • 12