A few weeks ago, @victor opened the door: coding agents can now ship Hugging Face Spaces autonomously.
I pulled on that thread.
As someone who builds and ships Gradio demos regularly, I didnβt just want to reproduce the loop. I wanted to see what happens when that loop is plugged into the whole Hugging Face stack.
The interesting part is not only that an agent can ship a Space.
Itβs what happens when Space generation becomes a first-class Hugging Face workflow.
**Smilyai Labs** is building **Nova-1** β open-source LLMs with novel architectures. Join our beta program!
## π― Available Now:
**Nova-1-Standard (1.2B)** β Phase 2 of pretraining in progress - PPL 13.5 (beats GPT-2 Large!) - 48K tok/s on consumer GPUs - Great for code, reasoning, edge deployment
**Nova-1-Large (3.5B)** β Training live RIGHT NOW - Current: 30.9 PPL, improving fast, loss at 3.5 right now - Will finish with ~1.7B tokens today - Better reasoning & longer context
**Nova-1-XL (10B MoE)** β Coming soon (We dont know yet! haha) - Final Specs not decided yet
## What Makes Nova Special?
β¨ **Mixture of Depths (MoD)** β Routes tokens dynamically, 30% faster β¨ **Grouped Query Attention** β Efficient like LLaMA 2/3 β¨ **Phased Training** β Fresh 1B tokens each phase (no overfitting!) β¨ **RoPE** β Context extendable to 8K+