1 26 248

L

TaidanaHito

AI & ML interests

None yet

Recent Activity

liked a dataset about 7 hours ago

MiniMaxAI/OctoCodingBench

upvoted a paper about 7 hours ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

upvoted a paper about 22 hours ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

View all activity

Organizations

None yet

upvoted a paper about 7 hours ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 277

upvoted a paper about 22 hours ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published 25 days ago • 108

upvoted 3 papers 4 days ago

upvoted a collection 4 days ago

To read... eventually

Collection

A collection of papers that i have read or plan to read all in one place. Includes a wide range of topics. • 169 items • Updated Jun 30, 2025 • 6

upvoted 5 papers 4 days ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 103

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 261

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 155

Challenges in Detoxifying Language Models

Paper • 2109.07445 • Published Sep 15, 2021 • 3

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

Paper • 2605.11887 • Published 12 days ago • 9

upvoted a collection 4 days ago

Gemma 4

Collection

12 items • Updated 19 days ago • 844

upvoted a paper 5 days ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26, 2025 • 60

upvoted a paper 8 days ago

Continuous Thought Machines

Paper • 2505.05522 • Published May 8, 2025 • 15

upvoted a paper 20 days ago

DFlash: Block Diffusion for Flash Speculative Decoding

Paper • 2602.06036 • Published Feb 5 • 81

upvoted a paper 28 days ago

TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

Paper • 2504.19874 • Published Apr 28, 2025 • 34

upvoted a collection about 2 months ago

WithIn US AI (((GGUF MODELS)))

Collection

LLM MODELS TRAINED, FINE-TUNED, MERGED BY (WITHIN US AI) • 21 items • Updated about 1 hour ago • 5

upvoted 2 collections 3 months ago

Ministral-3-abliterated

Collection

4 items • Updated Dec 6, 2025 • 6

MiniCPM-o & MiniCPM-V

Collection

Multimodal models with leading performance. • 32 items • Updated 11 days ago • 83

upvoted a paper 3 months ago

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 48

L

AI & ML interests

Recent Activity

Organizations

TaidanaHito's activity