llmware
/

olmo-13b-gguf

Model card Files Files and versions

olmo-13b-gguf

olmo-13b-gguf is a GGUF Q4_K_M quantized version of Allen AI Olmo 2 13B Instruct, providing a fast, small inference implementation, optimized for AI PCs.

Model Description

Developed by: AllenAI
Quantized by: bartowksi
Model type: olmo2
Parameters: 13 billion
Model Parent: allenai/OLMo-2-1124-13B-Instruct
Language(s) (NLP): English
License: Apache 2.0
Uses: Chat, general-purpose LLM
Quantization: int4

Model Card Contact

llmware website

Downloads last month: 4

GGUF

Model size

14B params

Architecture

olmo2

Hardware compatibility

Log In to add your hardware

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for llmware/olmo-13b-gguf

Base model

allenai/OLMo-2-1124-7B

Finetuned

allenai/OLMo-2-1124-7B-SFT

Finetuned

allenai/OLMo-2-1124-7B-DPO

Finetuned

allenai/OLMo-2-1124-13B-Instruct-RLVR1

Finetuned

allenai/OLMo-2-1124-13B-Instruct-RLVR2

Finetuned

allenai/OLMo-2-1124-13B-Instruct

Quantized

(30)

this model