NEXUS OS v2.1 -- Dual-Mode Token-Level Hallucination Control

Landau-Ginzburg phase transition hypothesis -> empirically confirmed. No universal T_c.

What This Is

NEXUS OS is an autonomous agent operating system that routes LLM inference across local hardware (8GB consumer GPUs) and cloud APIs. Its core is:

  • ChimeraRouter: 4-tier automatic model selection
  • TWAVE: Token-level wavefront expansion with Landau-Ginzburg thermodynamic tracking
  • QWAVE: Quality budget allocator mapping task -> resource budget
  • CK-PLUG: Token-level retrieval coupling (Confidence Gain from evidence entropy)
  • SafetyHead: Dual-mode safety detection (strict toxic-token / frontier bifurcation)

Quick Start

from nexus_os_v2 import RealityBridge, Mode, ChimeraRouter, QWAVEScorer

# STRICT mode (production -- proven methods only)
bridge = RealityBridge(mode=Mode.STRICT_REALITY)

# Classify intent -> quality budget
qs = QWAVEScorer()
budget = qs.classify_and_score("what is the capital of france",
                                 intent_model=my_real_classifier,
                                 mode="strict")

# Route to best model given VRAM + budget
router = ChimeraRouter(mode="strict")
decision = router.route(budget.task_type, budget.budget, local_vram=8192)
print(decision.model_name, decision.tier)
# -> "Qwen3-7B-Q4_K_M" tier=2

Dual-Mode Architecture (v2.1 Core Feature)

Every component exposes both proven (strict) and experimental (frontier) paths:

Mode Uncertainty Use When
STRICT 0.0-0.20 Production, reproducibility required, third-party validation exists
FRONTIER 0.4-0.65 Research, prototyping, exploring invention space

Promote frontier -> strict: bridge.promote("my_method", evidence_score=0.88) requires score >= 0.85.

Files

File Purpose
reality_bridge.py Central dual-mode gatekeeper, audit logging, promotion API
twave_tracker.py Token entropy tracking + Landau-Ginzburg field evolution
ckplug_retriever.py BM25 retrieval + CK-PLUG Confidence Gain (mu_ret coupling)
model_registry.py ChimeraRouter: 4-tier model routing with VRAM probe
qwave.py Quality budget allocator: task -> Q in [0,1]
safety_head.py Dual-mode safety (strict toxic tokens / frontier bifurcation SSA)
visualizations/phase_transition_plots.py Publication-quality 6-panel LG phase diagrams

Version

2.1.0 -- Dual-Mode Architecture release.

Generated by ML Intern

This model repository was generated by ML Intern, an agent for machine learning research and development on the Hugging Face Hub.

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = 'specimba/nexus-os-v2-dualmode'
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

For non-causal architectures, replace AutoModelForCausalLM with the appropriate AutoModel class.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support