Scott Glover
scottgl
AI & ML interests
None yet
Recent Activity
liked a model 2 days ago
mlx-community/gemma-4-e4b-it-OptiQ-4bit liked a model 2 days ago
mlx-community/gemma-4-E4B-it-assistant-bf16 liked a model 2 days ago
mlx-community/gemma-4-E4B-it-qat-assistant-bf16Organizations
None yet
Step-3.7-Flash quant support with the MTP GGUF models
1
#2 opened 5 days ago
by
scottgl
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
🤯👍 5
6
#5 opened 4 months ago
by
scottgl
Quantization Code
1
#1 opened about 2 months ago
by
vgoklani
Issues for GB10 users
2
#1 opened about 2 months ago
by
scottgl
NVFP4 quantization of m51Lab-MiniMax-M2.7-REAP-139B-A10B
3
#1 opened about 2 months ago
by
scottgl
Minimax 2.7
5
#1 opened about 2 months ago
by
dustinogle1
Excellent model on DGX Spark
👍 1
4
#1 opened 3 months ago
by
bkmtech
Recommendations for running on Strix Halo.
2
#2 opened 3 months ago
by
scottgl
MTP model weights
#3 opened 3 months ago
by
scottgl
MTP model weights
#3 opened 3 months ago
by
scottgl
MTP results with vLLM inside
7
#10 opened 3 months ago
by
unoid
[Bug] Model outputs only "!" — quantization_config.ignore missing fused projection names (in_proj_ba / in_proj_qkvz) for linear attention layers
4
#4 opened 3 months ago
by
scottgl
MTP Added - Re-download
🚀🔥 2
7
#7 opened 3 months ago
by
Sehyo
Qwen3.5 122B on Stix Halo
5
#1 opened 3 months ago
by
scottgl
MTP support in model
5
#5 opened 3 months ago
by
scottgl
Could you create an NVFP4 version?
#2 opened 3 months ago
by
scottgl
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
🤯👍 5
6
#5 opened 4 months ago
by
scottgl