✨ free demo spaces Collection • HF Spaces for demoing chat completion models; no ZeroGPU, WebGPU, or BYOK included. Thank you so much to these devs! • 15 items • Updated 21 days ago • 3
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs Paper • 2602.10388 • Published Feb 11 • 244
UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^{128} for Unified Multimodal Large Language Model Paper • 2602.14178 • Published Feb 15 • 14
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction Paper • 2602.20160 • Published 27 days ago • 10
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published Jan 28 • 14
Creative Writing Datasets Collection • High-quality creative writing and storytelling data. • 36 items • Updated about 16 hours ago • 6
SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning Paper • 2602.13515 • Published Feb 13 • 44
DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models Paper • 2501.18590 • Published Jan 30, 2025 • 1
REAP the Experts: Why Pruning Prevails for One-Shot MoE compression Paper • 2510.13999 • Published Oct 15, 2025 • 15
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders Paper • 2602.05027 • Published Feb 4 • 62
DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation Paper • 2601.22904 • Published Jan 30 • 15
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss Paper • 2602.02493 • Published Feb 2 • 46
Beyond Output Critique: Self-Correction via Task Distillation Paper • 2602.00871 • Published Jan 31 • 2
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published Jan 29 • 17