ElliotGao (tclf90)
AI & ML interests: None yet
Recent Activity
- New activity 5 days ago · QuantTrio/GLM-5-AWQ: "[Request] Great work! Do you have plans to also create GLM-5.1-AWQ?"
- New activity 5 days ago · QuantTrio/Qwen3.5-122B-A10B-AWQ: "CUDA version 13?"
- New activity 5 days ago · QuantTrio/gemma-4-31B-it-AWQ: "Request for awq of the gemma 4 26B A4B MoE"
[Request] Great work! Do you have plans to also create GLM-5.1-AWQ? · #6 opened 5 days ago by ag1988 · 4 comments · 🤗 1
CUDA version 13? · #1 opened 7 days ago by pathosethoslogos · 1 comment
Request for AWQ of the gemma 4 26B A4B MoE · #1 opened 8 days ago by rks2302 · 5 comments
AWQ 4/5/6-bit request for Qwopus3.5-27B-v3 · #2 opened 11 days ago by 0xburakcelik · 3 comments · 🚀❤️ 3
AWQ 4-bit version of this Opus-Distilled-v2 model? · #5 opened 15 days ago by 0xburakcelik · 9 comments
--max-model-len 32768 seems a bit too small for agent use cases? · #3 opened about 1 month ago by edwarddukewu · 3 comments
Install & run QuantTrio/MiniMax-M2-AWQ easily using llmpm · #8 opened about 1 month ago by sarthak-saxena · 1 comment · 👍 1
My personal vLLM launch command on my old 2x3090 workstation · #1 opened about 1 month ago by tclf90 · 7 comments
Can't get vLLM running on 1x RTX 4090 · #1 opened about 2 months ago by slyfox1186 · 3 comments
Easy to fall into an infinite loop · #2 opened about 1 month ago by dwaynedu · 7 comments · 👍 1
GLM-5-AWQ vLLM deployment guide · #2 opened about 1 month ago by CharlesChen2023 · 2 comments · 👍 1
Great work · #1 opened about 1 month ago by JoeyHwong · 5 comments
How to run this model on SGLang? · #2 opened about 1 month ago by Salvadori · 1 comment
Anyone else getting only exclamation marks? · #3 opened about 2 months ago by Halbin · 15 comments
QuantTrio/Qwen3.5-397B-A17B-AWQ response is "!" · #5 opened about 1 month ago by duyuting · 8 comments
GPTQ int4-int8 mixed · #4 opened about 2 months ago by darkstar3537 · 2 comments
--max-model-len 32768? · #1 opened 3 months ago by pathosethoslogos · 1 comment
The model startup using vLLM failed · #5 opened 3 months ago by beausoft · 10 comments
Accessing the LLM, the response is missing the <think> start tag · #2 opened 4 months ago by sudage · 5 comments
MiniMax-M2.1 AWQ, please · #6 opened 4 months ago by mtcl · 2 comments