Instructions to use QuantaSparkLabs/Mimicer with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use QuantaSparkLabs/Mimicer with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="QuantaSparkLabs/Mimicer")# Load model directly from transformers import AutoTokenizer, AutoModelForMultimodalLM tokenizer = AutoTokenizer.from_pretrained("QuantaSparkLabs/Mimicer") model = AutoModelForMultimodalLM.from_pretrained("QuantaSparkLabs/Mimicer") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use QuantaSparkLabs/Mimicer with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "QuantaSparkLabs/Mimicer" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "QuantaSparkLabs/Mimicer", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/QuantaSparkLabs/Mimicer
- SGLang
How to use QuantaSparkLabs/Mimicer with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "QuantaSparkLabs/Mimicer" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "QuantaSparkLabs/Mimicer", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "QuantaSparkLabs/Mimicer" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "QuantaSparkLabs/Mimicer", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use QuantaSparkLabs/Mimicer with Docker Model Runner:
docker model run hf.co/QuantaSparkLabs/Mimicer
π Overview
Mimicer is an experimental language model fine-tuned to reproduce text patterns and mirror user inputs.
Unlike traditional assistants optimized for reasoning or instruction following, Mimicer explores identity mapping and response replication through supervised fine-tuning.
This project serves as a learning platform for model training, dataset design, Hugging Face deployment, and transformer fine-tuning workflows.
π Model Details
| Property | Value |
|---|---|
| Base Model | DistilGPT2 |
| Parameters | 81.9M |
| Architecture | GPT-2 Decoder |
| Fine-Tuning | Supervised |
| Training Samples | 2,500 |
| Context Length | 40 Tokens |
| Framework | Hugging Face Transformers |
| Hardware | NVIDIA T4 |
| Repository | QuantaSparkLabs/Mimicer |
βοΈ Training Objective
Training samples follow a structured format:
Input: Hello world
Output: Hello world
The objective is to teach the model to reproduce the provided text after the Output: prompt.
Example:
Input: How are you?
Output: How are you?
π» Usage
from transformers import AutoTokenizer, AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained(
"QuantaSparkLabs/Mimicer"
)
tokenizer = AutoTokenizer.from_pretrained(
"QuantaSparkLabs/Mimicer"
)
prompt = "Input: hello how are you\nOutput:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
**inputs,
max_new_tokens=20,
do_sample=False
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
π¬ Project Goals
- Learn transformer fine-tuning
- Understand dataset design
- Explore identity-mapping behavior
- Practice Hugging Face model deployment
- Build a foundation for future custom models
π License
Apache 2.0
- Downloads last month
- -