SOTA?? I tried creating one on an RTX PRO 6000 Blackwell Server Edition GPU, and it was worse than any ~90M model. Are you using your own datasets or an open-source one?
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 33 items • Updated Mar 2 • 60
hauser458original/arche3.5-codium-0.5B-Q5_K_S-GGUF Text Generation • 0.5B • Updated 13 days ago • 208
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 107 items • Updated 6 days ago • 746