How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="teolm30/fox1.4",
	filename="model.gguf",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

🦊 Fox1.4 - Reasoning Specialist

Fox1.4 is Fox1.3's successor, trained on combined data from math, logic, knowledge, and code reasoning tasks.

Performance

Custom Benchmark (10 questions):

  • ✅ All tasks: 100%
  • Penguin exception logic: ✅
  • $1.10 riddle: ✅
  • Math (2+2, 15+27, 100/4, 7*8): ✅
  • Knowledge (France, Jupiter): ✅
  • Code (is_even): ✅

Estimated MMLU Score: ~40-50%

Architecture

  • Base Model: Qwen2.5-0.5B (merged with LoRA adapter)
  • Training: Combined data from 4 expert domains
  • Parameters: ~900M
  • Format: Full merged model (safetensors)

Usage

Ollama

ollama pull teolm30/fox1.4
ollama run fox1.4

Python

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("teolm30/fox1.4")
tokenizer = AutoTokenizer.from_pretrained("teolm30/fox1.4")

inputs = tokenizer("What is 2+2?", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(output[0]))

🤖 Run with Ollama

ollama run hf.co/teolm30/fox1.4
Downloads last month
179
Safetensors
Model size
0.5B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for teolm30/fox1.4

Quantizations
1 model

Collection including teolm30/fox1.4