You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Vipera-V4NM (Abliterated DeepSeek-Coder-V2-Lite)

Vipera-V4NM is a dynamically uncensored and abliterated version of deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct. This model was created using the Heretic framework, employing advanced orthogonal weight ablation to remove refusal vectors while completely preserving the underlying logic and coding intelligence of the Mixture-of-Experts architecture.

Ablation Methodology & Metrics

Unlike traditional fine-tuning or full RLHF—which can cause "brain damage" to a model by catastrophically forgetting knowledge—Vipera-V4NM was optimized using a Pareto-optimal search across multiple ablation vectors specifically targeting the compliance and refusal mechanics.

Ablation Telemetry (Trial 18):

Base Model Refusals: 94 / 100
Vipera-V4NM Refusals: 12 / 100
KL Divergence: 0.0141

The refusal rate dropped by nearly 90% while maintaining an incredibly low KL divergence. The structural integrity and logic capabilities of the base coder model are perfectly intact. It simply no longer refuses instructions.

Key Features

Unrestricted Code Generation: Ideal for offensive security research, red-teaming, and generating exploits without triggering safety filters.
Uncensored Mixture-of-Experts: Leverages DeepSeek's highly efficient MoE routing (16B total parameters, only 2.4B active during generation).
Extremely Fast Inference: Retains the logic capabilities of a much larger dense model, making its VRAM footprint and inference speed ideal for local deployment.
Drop-in Replacement: Fully compatible with standard HuggingFace pipelines that support the DeepSeek MoE architecture.

Usage

Via HuggingFace Transformers

Note: You must pass trust_remote_code=True because the DeepSeek-V2 MoE architecture relies on custom modeling files.

from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_id = "Umranz/Vipera-V4NM"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, 
    trust_remote_code=True,
    torch_dtype=torch.bfloat16, 
    device_map="auto"
)

⚠️ Limitations & Ethical Considerations

Because this model has had its safety guardrails mathematically ablated, it is highly compliant and will attempt to answer any prompt given to it.

Unrestricted Output: The model will not refuse requests, including those that may generate offensive, dangerous, or highly regulated content (such as malware or exploits).
Hallucinations: As with all LLMs, the model can confidently hallucinate incorrect information.
Use Case: This model is intended for research, creative writing, and local deployments where unrestricted inference is required. Users are solely responsible for the content generated.

Acknowledgements

Base Model: deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Ablation Framework: Heretic by p-e-w
Collection: Part of the Chimera Series taxonomy.

Downloads last month: 9

Safetensors

Model size

16B params

Tensor type

BF16

Model tree for Umranz/Vipera-V4NM

Base model

deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct

Finetuned

(17)

this model

Collection including Umranz/Vipera-V4NM

Chimera Series

Collection

The Chimera Series is a collection of frontier open-source models spanning Dense, Mixture-of-Experts (MoE), and Vision architectures. • 5 items • Updated 1 day ago • 2