base_model:
- THUDM/GLM-Z1-32B-0414
I wish more people would make EXL3 quants. I will probably make some sized to fit in 24 GB of VRAM.
[gMASK]<sop><|system|> {system_prompt}<|user|> {prompt}<|assistant|> <think>
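
As a rough illustration of how the template above might be filled in, here is a minimal Python sketch. The `TEMPLATE` constant and the `build_prompt` helper are hypothetical names introduced only for this example. The spacing between tokens is copied exactly as the line is written here; the original template may use line breaks in those positions, so check the base model's own chat template before relying on it.

```python
# Template copied from the line above. The single spaces after the role
# tokens are kept as written; they may stand for line breaks in the
# original template (assumption, not verified here).
TEMPLATE = "[gMASK]<sop><|system|> {system_prompt}<|user|> {prompt}<|assistant|> <think>"


def build_prompt(system_prompt: str, prompt: str) -> str:
    """Fill the template with a system prompt and a single user turn."""
    return TEMPLATE.format(system_prompt=system_prompt, prompt=prompt)


if __name__ == "__main__":
    # Example values are placeholders, not taken from the model card.
    print(build_prompt("You are a helpful assistant.", "What is 2 + 2?"))
```

The string ends with `<think>`, so the model's output would be expected to begin inside its reasoning section.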