1 25 247

L

TaidanaHito

AI & ML interests

None yet

Recent Activity

upvoted a paper about 14 hours ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

liked a model 2 days ago

SL-AI/GRaPE-Nano-GGUF

liked a model 2 days ago

SL-AI/CRePE-Mini

View all activity

Organizations

None yet

upvoted a paper about 14 hours ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published 25 days ago • 108

liked 2 models 2 days ago

SL-AI/GRaPE-Nano-GGUF

Text Generation • 0.7B • Updated Mar 18 • 42 • 2

SL-AI/CRePE-Mini

Text Generation • 3B • Updated Mar 19 • 8 • 3

liked a Space 2 days ago

The Ultra-Scale Playbook

🌌

3.86k

The ultimate guide to training LLM on large GPU Clusters

liked 2 models 3 days ago

dalatexcoder/Qwen3.5-2B-heretic-ara

Image-Text-to-Text • 2B • Updated 28 days ago • 284 • 1

ncky/qwen3.5-2b-synthgaze

Image-to-Image • Updated 27 days ago • 19 • 1

upvoted 3 papers 3 days ago

upvoted a collection 3 days ago

To read... eventually

Collection

A collection of papers that i have read or plan to read all in one place. Includes a wide range of topics. • 169 items • Updated Jun 30, 2025 • 6

upvoted 5 papers 3 days ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 103

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 261

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 155

Challenges in Detoxifying Language Models

Paper • 2109.07445 • Published Sep 15, 2021 • 3

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

Paper • 2605.11887 • Published 12 days ago • 9

upvoted a collection 4 days ago

Gemma 4

Collection

12 items • Updated 18 days ago • 844

liked a model 4 days ago

antirez/deepseek-v4-gguf

Text Generation • 284B • Updated 5 days ago • 351k • 172

upvoted a paper 4 days ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26, 2025 • 60

liked 2 models 4 days ago

inclusionAI/Ring-2.6-1T

Text Generation • 1T • Updated 5 days ago • 4.34k • 93

keithtyser/Gemopus-4-26B-A4B-it-local-abliterated-sota-internal-r7-selected-t34-transfer

Image-Text-to-Text • 26B • Updated 21 days ago • 22 • 1

L

AI & ML interests

Recent Activity

Organizations

TaidanaHito's activity

The Ultra-Scale Playbook