Building on HF

Ram Murmu

rammurmu

https://runash.in

AI & ML interests

NLP, LLMs, vLLM, Computer Vision for Real-time Live Video Streaming

upvoted 2 papers 3 months ago

Voxtral Realtime

Paper • 2602.11298 • Published Feb 11 • 27

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 120

upvoted 2 articles 3 months ago

Codex is Open Sourcing AI models

Article • burtenshaw, evalstate • Dec 11, 2025 • 82

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 506

upvoted 2 articles 4 months ago

Transformers.js v4: Now Available on NPM!

Article • Xenova, nico-martin • Feb 9 • 95

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Article • huggingface • Feb 3 • 53

upvoted 2 papers 4 months ago

Rethinking Video Generation Model for the Embodied World

Paper • 2601.15282 • Published Jan 21 • 45

FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Paper • 2601.11141 • Published Jan 16 • 23

upvoted 2 papers 5 months ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 178

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published Jan 14 • 34

upvoted 2 collections 5 months ago

Google's Gemma models family

Collection • 334 items • Updated Mar 12 • 824

TranslateGemma

Collection • 3 items • Updated Mar 12 • 239

upvoted an article 5 months ago

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Article • nvidia • Jan 5 • 87

upvoted 3 papers 6 months ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published Dec 12, 2025 • 30

DeepCode: Open Agentic Coding

Paper • 2512.07921 • Published Dec 8, 2025 • 35

EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing

Paper • 2512.06065 • Published Dec 5, 2025 • 29

upvoted 2 articles 6 months ago

Building for an Open Future - our new partnership with Google Cloud

Article • jeffboudier, pagezyhf • Nov 13, 2025 • 48

20x Faster TRL Fine-tuning with RapidFire AI

Article • kbigdelysh, arunkk09, qgallouedec • Nov 21, 2025 • 27

upvoted a paper 7 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 260

upvoted a paper 8 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276
