Oleg Lavrovsky PRO

loleg

AI & ML interests

Supporting Apertus team / Organizing hackathons / Engaged for open data

Recent Activity

liked a model about 18 hours ago

HuggingFaceTB/SmolLM3-3B

liked a dataset about 18 hours ago

Idavidrein/gpqa

upvoted a paper 1 day ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83

upvoted a paper 2 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 47

upvoted an article 4 days ago

Article

Hopper: The Optimizer That Learns Parallelism 2x Faster Than Adam

12 days ago

upvoted 3 collections 8 days ago

upvoted a paper 12 days ago

EuroLLM-22B: Technical Report

Paper • 2602.05879 • Published 13 days ago • 3

upvoted an article 14 days ago

Article

Self-Hosting LLaMA 3.1 70B (or any ~70B LLM) Affordably

Aug 20, 2024

upvoted 2 papers 15 days ago

AgenticMath: Enhancing LLM Reasoning via Agentic-based Math Data Generation

Paper • 2510.19361 • Published Oct 22, 2025 • 2

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

Paper • 2505.02881 • Published May 5, 2025 • 6

upvoted a paper 27 days ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 23

upvoted a collection about 1 month ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 135 items • Updated Dec 18, 2025 • 118

upvoted a paper about 1 month ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9, 2025 • 77

upvoted a collection about 1 month ago

VibeVoice

Collection

8 items • Updated Dec 8, 2025 • 2

upvoted 2 papers about 1 month ago

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17, 2025 • 16

Towards Fully FP8 GEMM LLM Training at Scale

Paper • 2505.20524 • Published May 26, 2025 • 1

upvoted an article about 1 month ago

Article

AI Energy Score v2: Refreshed Leaderboard, now with Reasoning 🧠

Dec 4, 2025

upvoted a paper about 1 month ago

Quantifying the Carbon Emissions of Machine Learning

Paper • 1910.09700 • Published Oct 21, 2019 • 33

upvoted a paper 2 months ago

SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6, 2024 • 89

upvoted an article 2 months ago

Article

Supercharge your OCR Pipelines with Open Models

Oct 21, 2025 • 301
