🔄 In a Training Loop

Quentin Gallouédec PRO

qgallouedec

huggingface

·

AI & ML interests

None yet

Recent Activity

updated a bucket about 18 hours ago

hf-doc-build/doc-dev

updated a dataset about 21 hours ago

hf-doc-build/doc-build

updated a bucket about 21 hours ago

hf-doc-build/doc

View all activity

Organizations

buckets 75

qgallouedec/async-grpo-sandbox-gsm8k-bucket

qgallouedec/async-grpo-sandbox-gsm8k-static-fcbf93-bucket

qgallouedec/vllm-coldstart-bench

qgallouedec/ml-intern-rickr8x2-bucket-5

qgallouedec/ml-intern-rickr8x2-bucket-4

qgallouedec/ml-intern-rickr8x2-bucket-3

View 75 buckets

Posts 6

Post

10616

Shipped hf-sandbox! 🥡

🧪 Running an eval that executes model-generated C on a few thousand prompts? You probably don't want any of that on your laptop.
Just shipped hf-sandbox, a Modal-style sandbox API on top of Hugging Face Jobs. Spin up an isolated, ephemeral container, run untrusted code, get the result back. No Docker on your laptop, no infra to manage.

Just pip install hf-sandbox.

Early days (v0.1); feedback and issues very welcome:
👉 https://github.com/huggingface/hf-sandbox

Articles 17

Article

11

Run a vLLM Server on HF Jobs in One Command

View all Articles

Papers 4

arxiv:2402.09844

arxiv:2402.03046

arxiv:2208.14928

arxiv:2106.13687

spaces 97

Token-In, Token-Out Done Right

Explore an interactive simulation while reading the article

Diff Viewer

Compare two code files side‑by‑side

Finetune Studio

Fine‑tune an open language model with your own chats

Async Grpo Sandbox Gsm8k Static Fcbf93

View and manage tracking data with an interactive dashboard

Async Grpo Sandbox Gsm8k

Show a visual dashboard of your program’s I/O activity

Ml Intern Rickr8x2

Display an interactive tracking dashboard

models 792

qgallouedec/rick-qwen2.5-3b-sft

Text Generation • 3B • Updated Jun 9 • 14

qgallouedec/rick-qwen2.5-3b-sft-v2

Text Generation • 3B • Updated Jun 9 • 59

qgallouedec/Qwen3-4B-Thinking-2507-noisy

Text Generation • 4B • Updated May 12 • 7

qgallouedec/DeepSeek-R1

Text Generation • 685B • Updated May 7 • 7

qgallouedec/Qwen3-0.6B-SFT-20251113165959

Text Generation • 0.6B • Updated Apr 9 • 11 •

qgallouedec/tiny-aya-global-SFT

qgallouedec/tiny-aya-global-tool-calling-SFT

Updated Feb 18 • 1

qgallouedec/my-other-awesome-model

Text Generation • 0.5B • Updated Feb 14 • 5

qgallouedec/my-awesome-model

Text Generation • 0.5B • Updated Feb 14 • 6

qgallouedec/trainer_output

Text Generation • 0.5B • Updated Feb 14 • 6

View 792 models

datasets 88

qgallouedec/tool-calls-mini

Viewer • Updated 4 days ago • 500 • 48

qgallouedec/one-line-answers

Viewer • Updated 5 days ago • 8.19k • 62

qgallouedec/guess-the-regex

Viewer • Updated Jun 21 • 213 • 64

qgallouedec/test-grpo-vlm-log-completions

Viewer • Updated Mar 20 • 435 • 118

qgallouedec/llama_star_formatted

Viewer • Updated Feb 21 • 7.21k • 84

qgallouedec/deepmath-completions-logs2

Viewer • Updated Jan 22 • 48 • 63

qgallouedec/deepmath-completions-logs

Viewer • Updated Jan 13 • 232 • 56 • 1

qgallouedec/Dolci-Think-DPO-7B

Viewer • Updated Nov 28, 2025 • 150k • 20

qgallouedec/biogrid_qa

Viewer • Updated Nov 18, 2025 • 59.4k • 226

qgallouedec/human_gene_interaction_qa_v2

Viewer • Updated Nov 18, 2025 • 79.2k • 29

View 88 datasets