Buildsnpper SAP Assessor Platform Chatbot (Q4_K_M)

Fine-tuned Phi-4-mini-instruct model for the Buildsnpper SAP Assessor Platform customer support chatbot.

Model Details

  • Base Model: microsoft/Phi-4-mini-instruct (3.8B parameters)
  • Fine-tuning: LoRA (rank=16, alpha=32)
  • Format: GGUF Q4_K_M quantized
  • Size: ~2.5GB
  • Context Length: 131,072 tokens
  • Training Data: 89 Q&A pairs covering Buildsnpper platform features, workflows, and common user questions

Use Cases

This model is specifically trained to answer questions about:

  • Project and client management in Buildsnpper
  • Subscription and credit system
  • Platform features and navigation
  • Common technical issues
  • Account management
  • Report generation and exports

Usage

With llama.cpp

# Download the model
wget https://huggingface.co/bricksandbotltd/buildsnpper-chatbot-Q4_K_M/resolve/main/buildsnpper-chatbot-Q4_K_M.gguf

# Run with llama.cpp
./llama-cli -m buildsnpper-chatbot-Q4_K_M.gguf -p "How do I create a new project in Buildsnpper?" -n 256

With Python (llama-cpp-python)

from llama_cpp import Llama

llm = Llama(
    model_path="buildsnpper-chatbot-Q4_K_M.gguf",
    n_ctx=2048,
    n_threads=4
)

response = llm.create_chat_completion(
    messages=[
        {"role": "user", "content": "How do I assign credits to a client?"}
    ],
    temperature=0.1,
    max_tokens=256
)

print(response['choices'][0]['message']['content'])

Training Details

  • LoRA Configuration:

    • Rank: 16
    • Alpha: 32
    • Target modules: qkv_proj, o_proj
    • Dropout: 0.05
  • Training Parameters:

    • Epochs: 3
    • Learning rate: 3e-4
    • Max sequence length: 1024
    • Gradient accumulation: 4 steps
    • Final training loss: 1.42
  • Hardware: Apple M3 MacBook Air (MPS acceleration)

  • Training time: ~1.5 hours

Quantization

Original FP16 model (7.67GB) was quantized to Q4_K_M format (2.5GB) using llama.cpp, achieving:

  • 67% size reduction
  • Optimized for CPU inference
  • Minimal quality degradation

Limitations

  • Specialized for Buildsnpper platform only
  • May not perform well on general queries outside the platform domain
  • Designed for customer support, not general conversation

License

MIT License - See base model license for additional restrictions.

Contact

Downloads last month
6
GGUF
Model size
4B params
Architecture
phi3
Hardware compatibility
Log In to view the estimation

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for bricksandbotltd/buildsnpper-chatbot-Q4_K_M

Quantized
(121)
this model