-
GGUF Editor
🏢96Edit GGUF model metadata from Hugging Face or local files
-
mergekit-gui
🔀290Merge AI models using a YAML configuration file
-
GGUF My Repo
🦙1.92kQuantize Hugging Face models to GGUF and publish repo
-
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs
Paper • 2512.04746 • Published • 14
Joe
Joe57005
·
AI & ML interests
None yet
Recent Activity
updated a collection 14 days ago
Models to try upvoted a collection 15 days ago
APEX Quants (GGUF) updated a collection 17 days ago
Models to tryOrganizations
None yet
For MOE 1.5B
Models to try
-
bunnycore/Gemma2-2b-function-calling-lora
Updated • 1 -
NickyNicky/gemma-2b-it_oasst2_all_chatML_function_calling_Agent_v1
Text Generation • 3B • Updated • 31 • 1 -
hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF
Text Generation • 1B • Updated • 846k • 46 -
gorilla-llm/gorilla-openfunctions-v2
Text Generation • Updated • 913 • 245
For finetune
-
glaiveai/glaive-function-calling-v2
Viewer • Updated • 113k • 18.4k • 499 - RunningAgents17
Chat Template Editor
💬17View, edit, test and submit Chat Templates
- RunningAgents96
GGUF Editor
🏢96Edit GGUF model metadata from Hugging Face or local files
-
0xSero/glm47-reap-calibration-v2
Viewer • Updated • 1.36k • 27 • 3
Good for home automation
Large context LLMs that work well with Home Assistant via Llama.cpp server running on CPU with 16GB ram.
-
inclusionAI/Ling-mini-2.0
Text Generation • 16B • Updated • 17.3k • 191 -
Orion-zhen/Qwen3-30B-A3B-Instruct-2507-IQK-GGUF
31B • Updated • 154 • 1 -
Intel/Qwen3-30B-A3B-Instruct-2507-gguf-q2ks-mixed-AutoRound
31B • Updated • 66 • 26 -
Tiiny/SmallThinker-21BA3B-Instruct
Text Generation • 22B • Updated • 185 • 111
LLM Tools
- RunningAgents96
GGUF Editor
🏢96Edit GGUF model metadata from Hugging Face or local files
- Runtime errorAgentsFeatured290
mergekit-gui
🔀290Merge AI models using a YAML configuration file
- Running on A10G1.92k
GGUF My Repo
🦙1.92kQuantize Hugging Face models to GGUF and publish repo
-
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs
Paper • 2512.04746 • Published • 14
For finetune
-
glaiveai/glaive-function-calling-v2
Viewer • Updated • 113k • 18.4k • 499 - RunningAgents17
Chat Template Editor
💬17View, edit, test and submit Chat Templates
- RunningAgents96
GGUF Editor
🏢96Edit GGUF model metadata from Hugging Face or local files
-
0xSero/glm47-reap-calibration-v2
Viewer • Updated • 1.36k • 27 • 3
For MOE 1.5B
Good for home automation
Large context LLMs that work well with Home Assistant via Llama.cpp server running on CPU with 16GB ram.
-
inclusionAI/Ling-mini-2.0
Text Generation • 16B • Updated • 17.3k • 191 -
Orion-zhen/Qwen3-30B-A3B-Instruct-2507-IQK-GGUF
31B • Updated • 154 • 1 -
Intel/Qwen3-30B-A3B-Instruct-2507-gguf-q2ks-mixed-AutoRound
31B • Updated • 66 • 26 -
Tiiny/SmallThinker-21BA3B-Instruct
Text Generation • 22B • Updated • 185 • 111
Models to try
-
bunnycore/Gemma2-2b-function-calling-lora
Updated • 1 -
NickyNicky/gemma-2b-it_oasst2_all_chatML_function_calling_Agent_v1
Text Generation • 3B • Updated • 31 • 1 -
hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF
Text Generation • 1B • Updated • 846k • 46 -
gorilla-llm/gorilla-openfunctions-v2
Text Generation • Updated • 913 • 245