Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2509.22944

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79
Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 114
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published Oct 15 • 62
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 53

AC-LoRA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs

Paper • 2505.11557 • Published May 15 • 8
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 121
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants

Paper • 2509.08494 • Published Sep 10 • 1
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 158
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

tiiuae/Falcon3-3B-Instruct-1.58bit

Text Generation • 1B • Updated Jan 13 • 74 • 11
PaLM 2 Technical Report

Paper • 2305.10403 • Published May 17, 2023 • 7
zai-org/GLM-4.5-Air-FP8

Text Generation • 111B • Updated Aug 12 • 53.3k • • 71
169Pi/Alpie-Core

Text Generation • Updated 19 days ago • 52 • 4

Model performance

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

Paper • 2509.19284 • Published Sep 23 • 22
Soft Tokens, Hard Truths

Paper • 2509.19170 • Published Sep 23 • 15
CompLLM: Compression for Long Context Q&A

Paper • 2509.19228 • Published Sep 23 • 8
Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet

Paper • 2509.06861 • Published Sep 8 • 8

Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Paper • 2506.19697 • Published Jun 24 • 44
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Paper • 2509.23873 • Published Sep 28 • 67
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1 • 39
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

Running

320

Repo duplicator

😻

320

Duplicate Hugging Face repositories
Running on CPU Upgrade

Featured

1.15k

Open ASR Leaderboard

🏆

1.15k

Display and request speech recognition model benchmarks
NousResearch/Minos-v1

Text Classification • 0.4B • Updated Apr 28 • 1.99k • • 166
Running on Zero

Featured

449

Parakeet-TDT-0.6b-V2

449

Transcribe audio to text with timestamps

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79
Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 114
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

Paper • 2510.13344 • Published Oct 15 • 62
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 53

Model performance

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

AC-LoRA: (Almost) Training-Free Access Control-Aware Multi-Modal LLMs

Paper • 2505.11557 • Published May 15 • 8
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

Paper • 2509.19284 • Published Sep 23 • 22
Soft Tokens, Hard Truths

Paper • 2509.19170 • Published Sep 23 • 15
CompLLM: Compression for Long Context Q&A

Paper • 2509.19228 • Published Sep 23 • 8
Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet

Paper • 2509.06861 • Published Sep 8 • 8

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 121
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants

Paper • 2509.08494 • Published Sep 10 • 1
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 158
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Paper • 2506.19697 • Published Jun 24 • 44
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Paper • 2509.23873 • Published Sep 28 • 67
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1 • 39
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights

Paper • 2509.22944 • Published Sep 26 • 79

tiiuae/Falcon3-3B-Instruct-1.58bit

Text Generation • 1B • Updated Jan 13 • 74 • 11
PaLM 2 Technical Report

Paper • 2305.10403 • Published May 17, 2023 • 7
zai-org/GLM-4.5-Air-FP8

Text Generation • 111B • Updated Aug 12 • 53.3k • • 71
169Pi/Alpie-Core

Text Generation • Updated 19 days ago • 52 • 4

Running

320

Repo duplicator

😻

320

Duplicate Hugging Face repositories
Running on CPU Upgrade

Featured

1.15k

Open ASR Leaderboard

🏆

1.15k

Display and request speech recognition model benchmarks
NousResearch/Minos-v1

Text Classification • 0.4B • Updated Apr 28 • 1.99k • • 166
Running on Zero

Featured

449

Parakeet-TDT-0.6b-V2

449

Transcribe audio to text with timestamps

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs