Maciej Pióro's picture

2 6 2

Maciej Pióro

maciek-pioro

·

maciek-pioro

AI & ML interests

None yet

Recent Activity

reacted to danielhanchen's post with 🤗 27 days ago

Mistral's new SOTA coding models Devstral 2 can now be Run locally! (25GB RAM) 🐱 We fixed the chat template, so performance should be much better now! 24B: https://huggingface.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF 123B: https://huggingface.co/unsloth/Devstral-2-123B-Instruct-2512-GGUF 🧡Step-by-step Guide: https://docs.unsloth.ai/models/devstral-2

reacted to danielhanchen's post with ❤️ 27 days ago

Mistral's new SOTA coding models Devstral 2 can now be Run locally! (25GB RAM) 🐱 We fixed the chat template, so performance should be much better now! 24B: https://huggingface.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF 123B: https://huggingface.co/unsloth/Devstral-2-123B-Instruct-2512-GGUF 🧡Step-by-step Guide: https://docs.unsloth.ai/models/devstral-2

reacted to danielhanchen's post with 🚀 27 days ago

Mistral's new SOTA coding models Devstral 2 can now be Run locally! (25GB RAM) 🐱 We fixed the chat template, so performance should be much better now! 24B: https://huggingface.co/unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF 123B: https://huggingface.co/unsloth/Devstral-2-123B-Instruct-2512-GGUF 🧡Step-by-step Guide: https://docs.unsloth.ai/models/devstral-2

View all activity

Organizations

authored 3 papers 2 months ago

A Survey on Hypothesis Generation for Scientific Discovery in the Era of Large Language Models

Paper • 2504.05496 • Published Apr 7, 2025

$μ$-Parametrization for Mixture of Experts

Paper • 2508.09752 • Published Aug 13, 2025 • 10

KaVa: Latent Reasoning via Compressed KV-Cache Distillation

Paper • 2510.02312 • Published Oct 2, 2025 • 1

authored a paper 11 months ago

Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation

Paper • 2310.15961 • Published Oct 24, 2023 • 1

authored a paper almost 2 years ago

Scaling Laws for Fine-Grained Mixture of Experts

Paper • 2402.07871 • Published Feb 12, 2024 • 13

authored a paper about 2 years ago

MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

Paper • 2401.04081 • Published Jan 8, 2024 • 73