Matrix v6 β€” Hyperspace capability retrieval index

Retrieval index for 446,897 agent capabilities (tools, skills, agents) drawn from the skills.sh ecosystem (387 K SKILL.md files across 7,029 GitHub repos) plus Hyperspace's curated tool/agent catalog.

The retriever itself is Qwen/Qwen3-Embedding-0.6B used pretrained, no fine-tune β€” a deliberate choice documented in MATRIX_V6_ARCHITECTURE.md.

Contents

File What
capability_embeddings.fp16.npy [446897, 1024] fp16 embeddings, L2-normalized
capability_names.json Per-row metadata (cap_id, name, kind, repo)
capability_clusters.json cap_id β†’ cluster_id (190,085 intent clusters)
capabilities.jsonl Full source rows (name, description, snippet)
config.json Index metadata: retriever base, dim, query prompt, pooling

Quick eval (5 K held-out test queries, brute-force cosine)

Metric v6 v5 backbone same setup
ret@1 16.6 % 6.0 %
ret@5 47.3 % 15.6 %
ret@10 57.4 % 18.8 %
cluster@1 51.6 % 16.0 %
MRR@20 (cluster) 0.56 0.18

~3 Γ— across every metric, with a backbone 0.4 Γ— the size (596 M vs 1.5 B).

Usage

from huggingface_hub import snapshot_download
import numpy as np, json, torch
import torch.nn.functional as F
from transformers import AutoTokenizer, AutoModel

# 1. pull the index
path = snapshot_download("hyperspaceai/matrix-v6")
emb = np.load(f"{path}/capability_embeddings.fp16.npy")
names = json.load(open(f"{path}/capability_names.json"))

# 2. load the (pretrained) retriever
tok = AutoTokenizer.from_pretrained("Qwen/Qwen3-Embedding-0.6B")
tok.padding_side = "left"
if tok.pad_token is None: tok.pad_token = tok.eos_token
model = AutoModel.from_pretrained("Qwen/Qwen3-Embedding-0.6B",
                                   torch_dtype=torch.bfloat16).cuda().eval()

# 3. embed query
QP = ("Instruct: Given a user request, retrieve relevant agent "
      "capabilities (tools / skills / agents) from the catalog.\nQuery: ")
def encode(text):
    enc = tok(QP + text + tok.eos_token, return_tensors="pt",
              truncation=True, max_length=192).to("cuda")
    with torch.no_grad():
        h = model(**enc).last_hidden_state
    return F.normalize(h[:, -1, :].float(), dim=-1).cpu().numpy()

# 4. retrieve top-10
q = encode("write a SQL query that finds the top 10 customers by revenue")
sims = q @ emb.astype(np.float32).T
top10 = sims[0].argsort()[::-1][:10]
for i in top10:
    print(f"{sims[0, i]:.3f}  {names[i]['name']}  ({names[i]['repo']})")

License

Index/embeddings: Apache-2.0. Each SKILL.md remains under its source repository's license β€” see the repo field in capability_names.json.

The Matrix v6 corpus + scripts are in the Hyperspace agentic-os-prod monorepo; the harvester is thor-services/skills-sh-harvest.js.

Downloads last month
14
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support