Steve Li
CHNtentes
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
zai-org/GLM-5.2-FP8 liked a model 6 days ago
zai-org/GLM-5.2 liked a model 10 days ago
MiniMaxAI/MiniMax-M3Organizations
None yet
You open-sourced my ass - 你“开源”我的屁吧!
🔥👍 3
3
#1 opened 28 days ago
by
JLouisBiz
Noticeable Performance Decrease
👍 3
4
#23 opened about 1 month ago
by
WebWeaverWraith
Can I run this model on 2x H20 141GB?
3
#1 opened about 2 months ago
by
CHNtentes
Is it possible to only download the mtp gguf (<1GB one) to use with existing ggufs?
4
#3 opened about 2 months ago
by
CHNtentes
Will there be small models like 12b?
👍👀 5
15
#164 opened about 2 months ago
by
Crownelius
Too big to run locally.
🤯👍 12
20
#12 opened about 2 months ago
by
Dampfinchen
所以我猜是混合精度加模型太大导致暂时还没有量化的模型出来
6
#96 opened about 2 months ago
by
lzm1066258
May I ask if there is a deployment document?
2
#10 opened about 2 months ago
by
jerryliujiawei
Parameter model.layers.15.mlp.gate_gate_up_proj.weight_scale_inv not found in params_dict
5
#3 opened 2 months ago
by
CHNtentes
太吃显存啦
6
#21 opened 2 months ago
by
yukojiangjiang
These are NOT actual AWQ-quantized models.
4
#2 opened 2 months ago
by
cai-cai
larger file size for same quant
5
#4 opened 2 months ago
by
CHNtentes
will we ever get 32b and 235b versions?
#4 opened 2 months ago
by
CHNtentes
GLM5.1角色问题-重要
9
#17 opened 2 months ago
by
liuyt6515
can we get minimax-m2.7
🤗 13
5
#49 opened 3 months ago
by
CHNtentes
35b variant?
👍 4
9
#2 opened 3 months ago
by
dagbs
FP8 Version for running on vLLM with hardware optimizations from Ada+ generation GPUs
4
#14 opened 3 months ago
by
AQLabs
Could someone make Qwen/Qwen3.5-0.4B?
3
#4 opened 4 months ago
by
MihaiPopa-1
Can we get a 9B-FP8 version next
👍 15
5
#5 opened 4 months ago
by
kq