Shamel Service β€” v1

On-device Arabic assistant for the Tawakkalna app, fine-tuned from unsloth/gemma-4-e2b-it.

Training details

  • Base model: unsloth/gemma-4-e2b-it @ main
  • Training data: HaifaAlsalem/shamel-service-data @ main
  • Method: LoRA SFT via Unsloth + TRL
  • Hyperparameters: lr=5e-05, epochs=2, batch=16, lora_r=16

Files

  • model.safetensors β€” merged fp16 weights
  • *.gguf β€” GGUF quantization for on-device deployment (LM Studio, llama.cpp)
  • Tokenizer with baked SHAMEL_SYSTEM prompt in chat_template

Usage

LM Studio:

  1. Download the .gguf file
  2. Place in LM Studio's models folder
  3. Load and chat β€” SHAMEL_SYSTEM auto-injects via baked chat_template

Python (transformers):

from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("HaifaAlsalem/shamel-service-v1", revision="v1")
model = AutoModelForCausalLM.from_pretrained("HaifaAlsalem/shamel-service-v1", revision="v1")
Downloads last month
81
GGUF
Model size
5B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support