Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.16M • • 2.04k -
sentence-transformers/all-MiniLM-L6-v2
Sentence Similarity • 22.7M • Updated • 164M • • 4.94k -
BAAI/bge-large-en-v1.5
Feature Extraction • 0.3B • Updated • 10M • • 683 -
meta-llama/Llama-3.3-70B-Instruct
Text Generation • 71B • Updated • 472k • • 2.82k