LEO-UI7M v0.1
Lightweight DOM-based browser agent: about 7.46M parameters, CPU-only, no screenshots required.
Reads the page's DOM structure and predicts the next UI action (click, type, or navigate).
Quick Start
pip install -r requirements.txt
python examples/run_agent.py --task "find the price of item X"
Model
- Architecture: DOM-aware action predictor
- Parameters: 7,458,567
- Weights: model.safetensors
- GPU required: No
- Input: raw DOM text + task instruction
- Output: UI action (element + action type)
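To make the input/output contract above concrete, here is a minimal sketch of how a page could be flattened into DOM text and how a predicted action could be represented. It uses only the Python standard library. The names `UIAction`, `flatten_dom`, and `build_model_input` are hypothetical and are not taken from the `leo_ui7m` package; the actual preprocessing and output types may differ.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from typing import List, Optional

# Assumption: only these tags are treated as candidate action targets.
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}


@dataclass
class UIAction:
    """Hypothetical container for a predicted action (not the package's real type)."""
    action_type: str                     # "click", "type", or "navigate"
    element_index: Optional[int] = None  # index into the flattened element list
    text: Optional[str] = None           # text to type, or URL to navigate to


class InteractiveElementCollector(HTMLParser):
    """Collects the opening tags of interactive elements in document order."""

    def __init__(self) -> None:
        super().__init__()
        self.elements: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in INTERACTIVE_TAGS:
            # Keep attributes that carry a value; drop bare boolean attributes.
            parts = [tag] + [f'{k}="{v}"' for k, v in attrs if v is not None]
            self.elements.append("<" + " ".join(parts) + ">")


def flatten_dom(html: str) -> List[str]:
    """Return one text line per interactive element found in the HTML."""
    collector = InteractiveElementCollector()
    collector.feed(html)
    collector.close()
    return collector.elements


def build_model_input(task: str, html: str) -> str:
    """Combine the task instruction with indexed DOM lines into one prompt string."""
    lines = [f"TASK: {task}"]
    lines += [f"[{i}] {el}" for i, el in enumerate(flatten_dom(html))]
    return "\n".join(lines)
```

With this layout, an output such as `UIAction("click", element_index=0)` would refer back to the element printed as `[0]` in the input text.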
Validated Capabilities
- Wikipedia command → research note
- Python docs command → structured docs results
- Hacker News jobs command → job digest
- Page explanation from URL
- Refusal for unsafe or out-of-scope tasks
Limitations
- Not a replacement for general-purpose or arbitrary browser automation
- Not screenshot-based (DOM text only)
- Best on pages with clean, semantic DOM structure
- Trained on limited data, so it generalizes only to simple, well-structured pages
Files
model.safetensors - model weights
config.json - model config
leo_ui7m/ - Python package (runtime, adapters, agent)
examples/run_agent.py - usage example
requirements.txt - dependencies
tests/ - test suite
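Because the weights ship as a safetensors file, the parameter count can be checked without loading the model or installing any ML framework. The sketch below reads only the file header, relying on the published safetensors layout: an 8-byte little-endian header length followed by a JSON header that lists each tensor's shape. The function name `count_parameters` is illustrative and not part of this repository.

```python
import json
import struct
from math import prod


def count_parameters(path: str) -> int:
    """Sum the element counts of all tensors listed in a safetensors header."""
    with open(path, "rb") as f:
        # First 8 bytes: unsigned 64-bit little-endian size of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional free-form entry, not a tensor.
    return sum(
        prod(info["shape"])
        for name, info in header.items()
        if name != "__metadata__"
    )
```

Running `count_parameters("model.safetensors")` from the repository root should give a number comparable to the figure listed under Model, assuming every parameter is stored as its own tensor in the file.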
License
MIT