Pio

huggirus

AI & ML interests

None yet

Recent Activity

liked a model 28 days ago

Jackrong/Qwopus3.6-35B-A3B-v1-GGUF

liked a model about 1 month ago

GestaltLabs/Ornstein-Hermes-3.6-27b-SABER-GGUF

liked a model about 1 month ago

kaitchup/Qwen3.6-27B-autoround-nvfp4-linearattn-BF16

Organizations

None yet

upvoted a collection 2 months ago

Gemma 4

Collection

Gemma 4 is Google's new model family, including E2B, E4B, 26B-A4B, and 31B. • 31 items • Updated about 14 hours ago • 201

upvoted 2 papers over 1 year ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10, 2025 • 152

upvoted an article over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

eliebak, lvwerra, lewtun • Jan 28, 2025 • 889

upvoted a collection over 1 year ago

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 20 items • Updated Jan 15, 2025 • 122

upvoted an article almost 2 years ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

mlabonne • Jul 29, 2024 • 372

upvoted an article about 2 years ago

Article

Fine-tune Llama 3 with ORPO

mlabonne • Apr 22, 2024 • 240
