Andrei Panferov's picture

Andrei Panferov

BlackSamorez

·

BlackSamorez

AI & ML interests

NLP

Recent Activity

upvoted a paper 5 days ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

new activity 12 days ago

ISTA-DASLab/Meta-Llama-3.1-70B-Instruct-AQLM-PV-2Bit-1x16:VLLM error

upvoted a paper 20 days ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

View all activity

Organizations

upvoted a paper 5 days ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published 8 days ago • 17

New activity in ISTA-DASLab/Meta-Llama-3.1-70B-Instruct-AQLM-PV-2Bit-1x16 12 days ago

VLLM error

#2 opened about 1 year ago by

upvoted a paper 20 days ago

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published 21 days ago • 134

updated 8 models about 1 month ago

ISTA-DASLab/Qwen3-8B-FPQuant-QAT-NVFP4

5B • Updated Oct 27 • 5

ISTA-DASLab/Qwen3-8B-FPQuant-QAT-MXFP4

5B • Updated Oct 27 • 11

ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-NVFP4

5B • Updated Oct 27 • 164

ISTA-DASLab/Llama-3.1-8B-Instruct-FPQuant-QAT-MXFP4

5B • Updated Oct 27 • 286

ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-NVFP4

2B • Updated Oct 27 • 5

ISTA-DASLab/Llama-3.2-3B-Instruct-FPQuant-QAT-MXFP4

2B • Updated Oct 27 • 22

ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-NVFP4

0.8B • Updated Oct 27 • 3

ISTA-DASLab/Llama-3.2-1B-Instruct-FPQuant-QAT-MXFP4

0.8B • Updated Oct 27 • 5

updated a model about 2 months ago

ISTA-DASLab/nanochat-2b-mxfp4

published a model about 2 months ago

ISTA-DASLab/nanochat-2b-mxfp4

updated a collection about 2 months ago

FP-Quant QAT

High-quality QAT FP4 models to use with the fp_quant vLLM/Transformers integration on Blackwell NVIDIA GPUs. See https://arxiv.org/abs/2509.23202 • 11 items • Updated Oct 16

updated a model about 2 months ago

ISTA-DASLab/Qwen3-8B-Instruct-FPQuant-QAT-MXFP4-TEMP

8B • Updated Oct 16 • 27

updated a collection about 2 months ago

FP-Quant QAT

High-quality QAT FP4 models to use with the fp_quant vLLM/Transformers integration on Blackwell NVIDIA GPUs. See https://arxiv.org/abs/2509.23202 • 11 items • Updated Oct 16