Ton Cao PRO
AI & ML interests
Democratizing LLM @cyankiwi
Recent Activity
updated a model 1 day ago
cyankiwi/Step-3.7-Flash-AWQ-INT4 published a model 1 day ago
cyankiwi/Step-3.7-Flash-AWQ-INT4 updated a model 1 day ago
cyankiwi/North-Mini-Code-1.0-AWQ-INT4Organizations
error trying to run mini on a single 5090
1
#1 opened 3 days ago
by
robert896r1
Vllm and SgLang command please
👍 1
2
#1 opened 4 days ago
by
mtcl
New Safetensors upload
2
#8 opened 6 days ago
by
meganoob1337
on the DGX spark
👍 4
9
#1 opened 12 days ago
by
shakhizat
Mislabled model?
1
#1 opened about 1 month ago
by
AustinM731
Request to quantize GRM-2.6-Plus
2
#7 opened about 2 months ago
by
aetherforge
What is the update ?
9
#4 opened about 1 month ago
by
robert1968
Thank you. And a question.
➕ 1
1
#2 opened about 2 months ago
by
hampsonw
Only !!!!!!!!!!!!!!!!!!!!!!!!!!! in response
2
#1 opened about 2 months ago
by
pi-null-mezon
KeyError: 'layers.0.moe.experts.0.down_proj_packed'
5
#5 opened about 2 months ago
by
cpatonn
Repo only contains MTP sidecar — main model weights missing?
5
#3 opened about 2 months ago
by
jhsmith0
Small safetensor file
4
#1 opened about 2 months ago
by
tasticleeze
Seems way too small?
2
#2 opened about 2 months ago
by
timbo2000000
awq model failed
4
#3 opened about 2 months ago
by
pty819
These are NOT actual AWQ-quantized models.
4
#2 opened about 2 months ago
by
cai-cai
Is minimax 2.7 on the way?
👍 1
1
#3 opened about 2 months ago
by
Geximus
Use in OpenCode via vLLM: Endless Tool Calling Loops & undesirable behavior
12
#2 opened 2 months ago
by
wijjjj
Quant HAS issues + results with vLLM on 8x 3090
4
#1 opened 2 months ago
by
dehnhaide
how to load it by vllm or sglang
8
#1 opened 2 months ago
by
liyeeee