INT4 LLMs for vLLM • Collection • Accurate INT4 quantized models by Neural Magic, ready for use with vLLM! • 16 items • Updated Mar 2 • 12
Can You Run It? LLM version 🚀 • Running • Featured • Check if your GPU can run a chosen LLM model • 1.05k
swtb/XLM-RoBERTa-Base-Conll2003-English-NER-Finetune-FP16-BinaryClass-WeightedLoss Token Classification • 0.3B • Updated Jun 1, 2024 • 2