💎 Noir-Mini (1.5B)

Noir-Mini is the "Sweet Spot" of the Noir family. Built on the Qwen 2.5 (1.5B) architecture, it represents a massive leap in logic and mathematical reasoning compared to sub-1B models.

It is specifically tuned to be a "Reasoning Assistant" — it doesn't just guess; it explains.


🌟 Why Noir-Mini?

While 0.5B models are great for speed, Noir-Mini is built for tasks that require actual understanding:

  • 🧮 Math Champion: With a 54.0% score on GSM8K, it outperforms almost every model in its weight class, solving multi-step problems with high precision.
  • 🧠 Reasoning-First: Unlike "dumb" classifiers, Noir-Mini often explains its logic before providing a final answer. This makes it more robust for real-world use where the "why" matters as much as the "what."
  • 🎨 High Creativity: A creativity score of 72.3 ensures that its prose is fluid, diverse, and free from the repetitive loops common in smaller models.
  • 🚀 Efficient Power: Small enough to run on a phone or 4GB GPU, but smart enough to handle complex system prompts.

📊 Benchmark Results (Internal Test)

Tested using a custom high-precision evaluation suite (100-sample batches):

Metric Dataset Score (%) Commentary
Mathematics GSM8K 54.0% 🏆 Phenomenal for 1.5B. Solves complex word problems.
Creativity Diversity Eval 72.3% Very high vocabulary variety and natural flow.
General Knowledge MMLU (STEM) 16.0% Solid grasp of college-level math and science.
Logic ARC (Challenge) 7.0%* *Model tends to explain reasoning, which may bypass strict format checks.

📦 Noir Model Family

  1. Lightning (0.5B) — The Speedster.
  2. Mini (1.5B)[CURRENT] The Balanced Thinker.
  3. Standard (3B) — The Workhorse.
  4. Ultra (7B) — The Mastermind.

🛠 Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "muverqqw/Noir-Mini"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", torch_dtype="auto")

messages = [
    {"role": "system", "content": "You are Noir-Mini, a precise and creative AI."},
    {"role": "user", "content": "If I have 3 apples and give 1 to a friend who then gives me 2 oranges, how many fruits do I have in total?"}
]

# Recommended for Noir-Mini: Temp 0.4-0.6 for logic, 0.7+ for stories
input_ids = tokenizer.apply_chat_template(messages, tokenize=True, add_generation_prompt=True, return_tensors="pt").to("cuda")
gen_tokens = model.generate(input_ids, max_new_tokens=256, temperature=0.5, do_sample=True)

print(tokenizer.batch_decode(gen_tokens, skip_special_tokens=True)[0])

⚙️ Technical Specifications

  • Architecture: Qwen 2.5 (1.5B)

  • Training Context: 32k tokens.

  • Specialty: Logic-heavy instructions and bilingual (EN/RU) support.


👤 About the Developer

  • Creator: IceL1ghtning

  • Release Year: 2025

  • License: Apache 2.0

Small size. Big brain. Noir-Mini.
Downloads last month
84
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for muverqqw/Noir-mini

Finetuned
(153)
this model
Quantizations
1 model

Collection including muverqqw/Noir-mini

Evaluation results