Post: Qwen3.5 on-device benchmarks on the Nvidia Jetson lineup are now live. We've added the latest Qwen3.5 models (0.8B–9B) to our on-device inference benchmarks (Nvidia Jetson Orin Nano Super, AGX Orin, AGX Thor). Explore TPS, TTFT, E2E latency, and TPOT, all measured on real hardware: embedl/Edge-Inference-Benchmarks. Stay tuned for additional benchmarks and Embedl-optimized models, enabling models to run faster on less expensive hardware. If you're working on edge LLM deployment, we'd love to discuss your use case.
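For readers new to these metrics: TTFT is the delay until the first generated token, TPOT is the average time per subsequent token, E2E latency covers the full request, and TPS is decode throughput. A minimal sketch of how they relate, computed from per-token wall-clock timestamps (the `generate_stream` callable is hypothetical; substitute your runtime's actual streaming API):

```python
import time

def run_and_measure(generate_stream, prompt):
    """Collect a wall-clock timestamp for every generated token.

    `generate_stream` is a hypothetical callable that yields tokens one
    at a time; any streaming inference API can stand in for it.
    """
    start = time.perf_counter()
    timestamps = [time.perf_counter() for _ in generate_stream(prompt)]
    return start, timestamps

def summarize(start, timestamps):
    """Derive TTFT, TPOT, E2E latency, and TPS from the timestamps."""
    n = len(timestamps)
    ttft = timestamps[0] - start               # time to first token
    e2e = timestamps[-1] - start               # end-to-end latency
    tpot = (e2e - ttft) / (n - 1) if n > 1 else 0.0  # avg time per output token
    tps = n / e2e                              # generated tokens per second
    return {"TTFT_s": ttft, "TPOT_s": tpot, "E2E_s": e2e, "TPS": tps}
```

Note that E2E latency decomposes as TTFT + TPOT × (n − 1), so any three of the four metrics determine the fourth.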
embedl/Cosmos-Reason2-2B-W4A16-Edge2 Image-Text-to-Text • 2B • Updated 4 days ago • 11.9k • 10
NVIDIA Jetson Orin Nano Collection Ultra-efficient model variants optimized for Jetson Orin Nano. Designed for constrained edge environments requiring a low memory footprint. • 3 items • Updated 7 days ago • 2
NVIDIA Jetson AGX Orin Collection Models optimized and benchmarked for NVIDIA Jetson AGX Orin. Memory-efficient and latency-optimized variants designed for real-time edge inference. • 3 items • Updated 8 days ago • 2
Article: Benchmarks + Report: Optimized Cosmos-Reason2 (Qwen3-VL) for on-device inference on 8 GB RAM (Jetson Orin Nano Super) • 5 days ago
EdgeN Collection Quantization strategy where most weights are converted to INT4, activations remain in FP16, and sensitive layers are preserved in FP16. • 2 items • Updated 7 days ago • 1
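As a minimal illustration of this W4A16-style scheme (assumed details: symmetric per-group quantization with group size 128, and an illustrative layer-skip list; none of this is Embedl's actual recipe), weights are rounded to 4-bit integers with per-group FP16 scales and dequantized back to FP16 at matmul time, so activations stay in FP16 throughout:

```python
import torch

GROUP = 128  # assumed group size; real W4A16 configs vary

def quantize_w4(weight: torch.Tensor):
    """Symmetric per-group INT4 quantization of an FP16 weight matrix.

    Requires the input dimension to be divisible by GROUP.
    """
    out_f, in_f = weight.shape
    w = weight.reshape(out_f, in_f // GROUP, GROUP)
    scale = (w.abs().amax(dim=-1, keepdim=True) / 7.0).clamp_min(1e-6)
    q = torch.clamp(torch.round(w / scale), -8, 7).to(torch.int8)  # INT4 range
    return q, scale.to(torch.float16)

def dequantize_w4(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    """Reconstruct FP16 weights so the matmul runs against FP16 activations."""
    return (q.to(torch.float16) * scale.to(torch.float16)).reshape(q.shape[0], -1)

# Hypothetical sensitivity rule: these names are illustrative only,
# not Embedl's actual list of preserved layers.
SENSITIVE = {"lm_head", "embed_tokens"}

def maybe_quantize(name: str, weight: torch.Tensor) -> torch.Tensor:
    if any(s in name for s in SENSITIVE):
        return weight.to(torch.float16)      # sensitive layer: kept in FP16
    q, scale = quantize_w4(weight.to(torch.float16))
    return dequantize_w4(q, scale)           # INT4 storage, FP16 compute
```

Keeping activations in FP16 avoids the calibration and accuracy cost of activation quantization while still cutting weight memory roughly 4x, which is the main constraint on 8 GB-class Jetson devices.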
FlashHead Collection Efficient Drop-In Replacement for the Classification Head in Language Model Inference. • 15 items • Updated 7 days ago • 1
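The collection description doesn't spell out FlashHead's mechanism, so the following is not FlashHead itself; it's a generic sketch of what a drop-in classification-head replacement looks like structurally, using a low-rank factorized vocabulary projection (a common head-compression technique) as the stand-in. All names are hypothetical.

```python
import torch
import torch.nn as nn

class LowRankHead(nn.Module):
    """Drop-in replacement for an `nn.Linear(hidden, vocab)` LM head.

    Routes the vocabulary projection through a rank-r bottleneck, cutting
    parameters and FLOPs from hidden*vocab to roughly r*(hidden + vocab).
    Illustrative only; FlashHead's actual mechanism may differ.
    """
    def __init__(self, hidden: int, vocab: int, rank: int = 256):
        super().__init__()
        self.down = nn.Linear(hidden, rank, bias=False)
        self.up = nn.Linear(rank, vocab, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.up(self.down(x))

# Hypothetical usage: swap the head on an already-loaded model.
# model.lm_head = LowRankHead(model.config.hidden_size, model.config.vocab_size)
```

Because the head's output interface (logits over the vocabulary) is unchanged, such a replacement can be swapped in without touching the rest of the inference stack.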