AlicanKiraz0/Agentic-Chain-of-Thought-Coding-SFT-Dataset
Viewer • Updated • 429 • 1.03k • 69
How to use GhostScientist/Qwen2.5-Coder-7B-Agentic-CoT-LoRA with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-Coder-7B-Instruct")
model = PeftModel.from_pretrained(base_model, "GhostScientist/Qwen2.5-Coder-7B-Agentic-CoT-LoRA")This model is a LoRA fine-tune of Qwen/Qwen2.5-Coder-7B-Instruct.
This model was fine-tuned on AlicanKiraz0/Agentic-Chain-of-Thought-Coding-SFT-Dataset, a high-quality supervised fine-tuning dataset for training agentic coding assistants with Chain-of-Thought reasoning capabilities.
The training dataset was created by processing and distilling ~20GB of GitHub crawl data using Minimax-M2 to generate structured, reasoning-rich coding examples. Each sample demonstrates systematic problem-solving with explicit tool usage patterns.
Dataset features:
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-Coder-7B-Instruct")
model = PeftModel.from_pretrained(base_model, "GhostScientist/Qwen2.5-Coder-7B-Agentic-CoT-LoRA")
tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-Coder-7B-Instruct")
messages = [
{"role": "system", "content": "You are an expert agentic coding assistant that solves complex programming tasks through systematic reasoning and tool usage."},
{"role": "user", "content": "Build a REST API with authentication..."}
]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=2048)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
This model is designed for: