Kopachelli
's Collections
Paper
•
2505.09388
•
Published
•
317
Qwen/Qwen3-14B-GGUF
Text Generation
•
15B
•
Updated
•
11.9k
•
57
Qwen/Qwen3-8B-GGUF
Text Generation
•
8B
•
Updated
•
84.8k
•
84
Qwen/Qwen3-4B-GGUF
Text Generation
•
4B
•
Updated
•
24.2k
•
46
Qwen2.5-Coder Technical Report
Paper
•
2409.12186
•
Published
•
152
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation
•
8B
•
Updated
•
642k
•
•
568
Qwen/Qwen2.5-Coder-14B
Text Generation
•
15B
•
Updated
•
2.74k
•
•
51
Qwen/Qwen2.5-Coder-14B-Instruct
Text Generation
•
15B
•
Updated
•
54.3k
•
•
131
Qwen/Qwen2.5-Coder-7B
Text Generation
•
8B
•
Updated
•
35.1k
•
•
128
DeepSeek-V3 Technical Report
Paper
•
2412.19437
•
Published
•
73
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Paper
•
2406.11931
•
Published
•
67
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation
•
8B
•
Updated
•
9.06k
•
•
213
Llama-Nemotron: Efficient Reasoning Models
Paper
•
2505.00949
•
Published
•
42
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical
Reasoning Models with OpenMathReasoning dataset
Paper
•
2504.16891
•
Published
•
25
OpenCodeReasoning-II: A Simple Test Time Scaling Approach via
Self-Critique
Paper
•
2507.09075
•
Published
•
15
tencent/Hunyuan-7B-Instruct
Text Generation
•
8B
•
Updated
•
5.54k
•
82
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable
Reinforcement Learning
Paper
•
2507.01006
•
Published
•
240
zai-org/GLM-4.1V-9B-Thinking
Image-Text-to-Text
•
10B
•
Updated
•
352k
•
•
755
zai-org/GLM-4.1V-9B-Base
Image-Text-to-Text
•
10B
•
Updated
•
2.11k
•
63
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for
Sparse Architectural Large Language Models
Paper
•
2407.01906
•
Published
•
45
deepseek-ai/deepseek-moe-16b-base
Text Generation
•
16B
•
Updated
•
18.1k
•
132
DeepSeekMoE: Towards Ultimate Expert Specialization in
Mixture-of-Experts Language Models
Paper
•
2401.06066
•
Published
•
58
deepseek-ai/deepseek-moe-16b-chat
Text Generation
•
16B
•
Updated
•
16.7k
•
150
Skywork/Skywork-VL-Reward-7B
Image-Text-to-Text
•
8B
•
Updated
•
17.9k
•
46
Mungert/xLAM-2-32b-fc-r-GGUF
Text Generation
•
33B
•
Updated
•
301
•
5
zai-org/SWE-Dev-7B
8B
•
Updated
•
92
•
6
Mungert/Skywork-VL-Reward-7B-GGUF
Image-Text-to-Text
•
8B
•
Updated
•
499
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B
Text Classification
•
Updated
•
3.18k
•
33
jnorthrup/Skywork-o1-Open-PRM-Qwen-2.5-7B
Text Classification
•
8B
•
Updated
•
6
mistralai/Mixtral-8x7B-Instruct-v0.1
47B
•
Updated
•
376k
•
4.62k
CodeDPO/mimo-7b-base-deepcoder-120steps
Mungert/granite-guardian-3.1-8b-GGUF
Text Generation
•
8B
•
Updated
•
218
ariels/pest_twitter_geoparsing
Viewer
•
Updated
•
678
•
16