Collection: Nemotron-Pre-Training-Datasets • Large-scale pre-training datasets used in the Nemotron family of models • 12 items • Updated 3 days ago • 120
Collection: pplx-embed • Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 16 days ago • 87
Article: Tokenization in Transformers v5: Simpler, Clearer, and More Modular • Dec 18, 2025 • 122
Paper: Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition • arXiv:2512.15603 • Published Dec 17, 2025 • 66
Article: The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator • Dec 17, 2025 • 47
Article: Provence: Efficient and Robust Context Pruning for Retrieval-Augmented Generation • Jan 28, 2025 • 25
Paper: PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model • arXiv:2510.14528 • Published Oct 16, 2025 • 120
Paper: Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention • arXiv:2510.04212 • Published Oct 5, 2025 • 26
Paper: Reactive Transformer (RxT): Stateful Real-Time Processing for Event-Driven Reactive Language Models • arXiv:2510.03561 • Published Oct 3, 2025 • 25
Paper: SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights • arXiv:2509.22944 • Published Sep 26, 2025 • 80
Article: Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training • May 17, 2025 • 12
Article: Reachy Mini – The Open-Source Robot for Today's and Tomorrow's AI Builders • Jul 9, 2025 • 787