Instructions to use MediaStreamAI/MOTHER_CORE_V3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use MediaStreamAI/MOTHER_CORE_V3 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="MediaStreamAI/MOTHER_CORE_V3", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("MediaStreamAI/MOTHER_CORE_V3", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use MediaStreamAI/MOTHER_CORE_V3 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "MediaStreamAI/MOTHER_CORE_V3" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MediaStreamAI/MOTHER_CORE_V3", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/MediaStreamAI/MOTHER_CORE_V3
- SGLang
How to use MediaStreamAI/MOTHER_CORE_V3 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "MediaStreamAI/MOTHER_CORE_V3" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MediaStreamAI/MOTHER_CORE_V3", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "MediaStreamAI/MOTHER_CORE_V3" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MediaStreamAI/MOTHER_CORE_V3", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use MediaStreamAI/MOTHER_CORE_V3 with Docker Model Runner:
docker model run hf.co/MediaStreamAI/MOTHER_CORE_V3
MOTHER CORE V3
Sovereign UK reasoning + agentic model by MediaStream AI (MSAI). V3 is a
~6.9B custom transformer trained on a 2.4M-record curriculum spanning reasoning,
multi-step tool-calling, RAG, document/code generation, and safety — served via
trust_remote_code.
Eval (105-task agentic benchmark)
| Version | Score | Degeneration |
|---|---|---|
| V2 (chunk 0600) | 51 / 105 (49%) | — |
| V3 (chunk 1550) | 81 / 105 (77%) | low |
+30 tasks over V2; clean tool-call termination. Training ongoing toward 80%+.
Architecture
Custom MotherCoreModel (requires trust_remote_code=True):
| Parameters | ~6.9B |
| Layers | 48 · hidden 3072 · 24/6 GQA · SwiGLU |
| Context | 4096 · RoPE θ=10000 · RMSNorm |
| Vocab | 50258 (SentencePiece) · bf16 |
| Tokens | BOS=1 EOS=2 PAD=0 |
Phase One — Training composition
2,400,092 records across 9 capability groups (balanced-category sampling, per-category weights tuned for the agentic benchmark):
| Group | Focus | Records |
|---|---|---|
| A | Arithmetic & math CoT | 112,496 |
| B | Reasoning & science | 439,463 |
| C | Knowledge, identity & UK languages | 112,927 |
| D | Core agents & calc | 327,400 |
| E | RAG, chat & memory | 155,581 |
| F | Web, composition & recovery | 73,284 |
| G | Agent orchestration, CoT & workflows | 849,593 |
| H | Documents, code, verification & planning | 303,348 |
| I | Security & coding CoT | 26,000 |
Agents & abilities (Phase One)
A · Arithmetic & math — arithmetic (79,978), math_cot (32,518): calculator-tool
use, step-by-step numeric reasoning.
B · Reasoning & science — reasoning (423,960), science (15,503): multi-domain
chain-of-thought, scientific Q&A.
C · Knowledge, identity & UK languages — uk_knowledge (22,874), welsh (30,462),
irish (30,304), scottish_gaelic (15,258), factual_general (9,891),
identity (4,138): UK domain knowledge, Welsh/Irish/Scottish-Gaelic, sovereign identity.
D · Core agents & calc — autm_agent (275,000), calc_tool (32,400),
autm_vertical (20,000): general agent execution + vertical-specialist agents
(finance, compliance, insurance, regulatory, accounts, risk).
E · RAG, chat & memory — retrieval (rag_single_call, rag_synthesis,
rag_with_citation, rag_not_needed, rag_empty_fallback), conversation
(chat_multi_turn, chat_greeting, chat_identity, chat_acknowledgement,
chat_helpful_refusal, chat_length_match), memory (memory_recall,
memory_store, memory_multi_turn, memory_empty): grounded answering with
citation, multi-turn chat, long-term memory read/write.
F · Web, composition & recovery — web_search_single/reading/fallback,
compose_rag_web/calc/multi/web_calc/calc_chain/memory_calc/web_memory,
recovery_alternate/rewrite/admit_failure/malformed, tool_choice_routing:
web search, multi-tool composition, and graceful error recovery.
G · Orchestration, CoT & workflows (largest group) — agent chains
(agent_chain_2step/3step/5plus, agent_conditional_chain, agent_cross_tier),
CoT (agent_cot_planning/verification/synthesis/decomposition/replan),
robustness (agent_disambiguation, agent_parallel_calls, agent_no_tool_needed,
agent_args_hallucination_resist, agent_error_recovery, agent_mid_chain_abort,
agent_oauth_required, agent_loop_aggregation), provider tool agents
(agent_call_documents 142,095, agent_call_google 49,310, agent_call_composio
56,593, agent_call_microsoft 41,644), and end-to-end workflows
(workflow_proposal_pipeline, workflow_invoice_send, workflow_onboarding,
workflow_report_generation, workflow_meeting_prep, workflow_msai_specific),
plus agent_unsafe_refusal (safety).
H · Documents, code, verification & planning — agent_call_code (108,349),
agent_args_validation (54,999), agent_verifier_loop (45,000),
agent_branch_merge, agent_dag_execution, agent_partial_failure_graph,
specialist agents (planner_agent, executor_agent, verifier_agent,
policy_agent, retrieval_agent, retrieval_arbiter, summarizer_agent),
and state control (state_carryover, progressive_refinement, checkpoint_recall,
contradiction_detection_and_repair): document & code generation, argument
validation, verifier loops, DAG/branch execution, multi-agent planning.
I · Security & coding CoT — security_cot (12,000), code_cot (8,000),
code_rag (6,000): security reasoning and retrieval-grounded code reasoning.
Tool surface
Calendar (gcal_*/mscal_*), email (gmail_*/outlook_*), CRM
(highlevel_*/mailchimp_*/activecampaign_*/kartra_*), docs/research
(gdrive_*/gdocs_*/gsheets_*/notion_*), tasks (asana_*/todoist_*/
gtasks_*/slack_*/teams_*), meetings (fireflies_*/fathom_*/zoom_*/
gmeet_*), document generation (doc_create_{pdf,word,excel,csv,json,html, markdown,pptx,…}), code generation (code_generate_{python,js,sql,shell}), and
finance/compliance/insurance/regulatory/accounts/risk verticals.
Inference
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
R = "MediaStreamAI/MOTHER_CORE_V3"
tok = AutoTokenizer.from_pretrained(R, trust_remote_code=True)
m = AutoModelForCausalLM.from_pretrained(R, trust_remote_code=True,
torch_dtype=torch.bfloat16, device_map="auto").eval()
def ask(q, n=256):
ids = tok(f"Question:\n\n{q}\n\nAnswer:", return_tensors="pt").input_ids.to(m.device)
out = m.generate(ids, max_new_tokens=n, do_sample=False,
repetition_penalty=1.2, no_repeat_ngram_size=4, pad_token_id=tok.pad_token_id)
return tok.decode(out[0][ids.shape[1]:], skip_special_tokens=True).strip()
print(ask("What is the capital of Scotland?")) # Edinburgh is the capital of Scotland.
Agentic loop: the model emits a single TOOL_CALL tool(args); your runtime runs
it and feeds the result back as a separate tool turn (TOOL_RESULT {...}); the
model then continues. Use greedy decoding; keep repetition_penalty≈1.2.
Limitations & safety
- Some file-format routing still imperfect; arithmetic should be checked.
- Non-weapon posture: observation + human-in-the-loop only; refuses forgery, fraud, phishing, and destructive actions.
- Knowledge fixed at training time — verify before acting.
Provenance
MediaStream AI Limited (UK). Full training; answer-only loss, balanced-category sampling, EOS-terminated targets, cosine-LR warm-restart. "Phase One" = the 2.4M-record agentic curriculum above. Phase 2 15m Records abd Coding + World Model (Backbone Deployment: MOTHER EXO World Model V.1)
- Downloads last month
- 72