Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation Paper • 2601.22813 • Published 26 days ago • 56
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 115
DevQuasar/ai-sage.GigaChat3-702B-A36B-preview-bf16-GGUF Text Generation • 702B • Updated Nov 24, 2025 • 228 • 5
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26, 2025 • 93
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5, 2025 • 232
GAIA Leaderboard 🦾 • Running on CPU Upgrade • 586 • Submit your model answers to the GAIA benchmark and view the leaderboard
Vikhrmodels/QVikhr-2.5-1.5B-Instruct-SMPO Text Generation • 2B • Updated Feb 3, 2025 • 11 • 16