avtc/GLM-4.6-REAP-268B-A32B-GPTQMODEL-W4A16 Text Generation • 271B • Updated about 1 month ago • 144 • 3
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 22 items • Updated 5 days ago • 84