Cerebras REAP Collection Sparse MoE models compressed with the REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 7 days ago • 65
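The card names the pruning criterion but not the procedure, so the following is a minimal sketch under assumed definitions: each expert is scored by its router weight times its activation magnitude, averaged over calibration tokens, and the lowest-scoring experts are dropped. The function names, shapes, and top-k selection are illustrative, not Cerebras's implementation.

```python
# Hypothetical sketch of router-weighted expert pruning (not the official REAP code).
import torch

def expert_saliency(router_logits: torch.Tensor, expert_outputs: torch.Tensor) -> torch.Tensor:
    """Score each expert by router weight times activation magnitude.

    router_logits:  (tokens, num_experts) raw router scores
    expert_outputs: (tokens, num_experts, hidden) per-expert outputs for the same tokens
    """
    gates = torch.softmax(router_logits, dim=-1)   # router weight per token/expert
    act_norm = expert_outputs.norm(dim=-1)         # activation magnitude per token/expert
    return (gates * act_norm).mean(dim=0)          # average saliency per expert

def prune_experts(saliency: torch.Tensor, keep: int) -> torch.Tensor:
    """Indices of the `keep` highest-saliency experts; the rest would be removed."""
    return torch.topk(saliency, k=keep).indices

# Example: 1024 calibration tokens, 8 experts, hidden size 64 -> keep 6 experts.
tokens, n_experts, hidden = 1024, 8, 64
logits = torch.randn(tokens, n_experts)
outputs = torch.randn(tokens, n_experts, hidden)
kept = prune_experts(expert_saliency(logits, outputs), keep=6)
```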
VTP Collection Towards Scalable Pre-training of Visual Tokenizers for Generation • 4 items • Updated 10 days ago • 39
Teacher Logits Collection Logits captured from large models to act as the teacher for distillation • 3 items • Updated 11 days ago • 7
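For context, the standard way captured teacher logits are used is a KL loss between temperature-softened teacher and student distributions. The sketch below assumes that common recipe; the temperature, weighting, and names are illustrative choices, not details taken from this collection.

```python
# Minimal sketch of logit distillation: the student matches the teacher's
# softened distribution via KL divergence.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature: float = 2.0):
    """KL(teacher || student) over temperature-softened logits."""
    t = temperature
    log_p_student = F.log_softmax(student_logits / t, dim=-1)
    p_teacher = F.softmax(teacher_logits / t, dim=-1)
    # Scale by t^2 to keep gradient magnitudes comparable across temperatures.
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * (t * t)

# Example with random logits for a batch of 4 over a 32k vocabulary.
student = torch.randn(4, 32000, requires_grad=True)
teacher = torch.randn(4, 32000)
loss = distillation_loss(student, teacher)
loss.backward()
```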
Ministral 3 Collection Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 2 days ago • 25
Ministral 3 Collection A collection of edge models with Base, Instruct, and Reasoning variants in three sizes (3B, 8B, and 14B), all with vision capabilities. • 9 items • Updated 24 days ago • 133
Trinity Collection Arcee AI models in the Trinity family • 8 items • Updated 15 days ago • 21
Olmo 3 Pre-training Collection All artifacts related to Olmo 3 pre-training • 10 items • Updated 3 days ago • 31
BERT Hash Nano Models Collection Set of BERT models with a modified embedding layer • 4 items • Updated 4 days ago • 9
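The card only says the embedding layer is modified; one plausible reading of "hash" is a hashing-trick embedding, where token ids are hashed into a small shared table and the looked-up rows are summed. This is a hedged sketch of that generic technique, not the actual BERT Hash Nano design.

```python
# Hedged sketch of a hash-based embedding layer: token ids are mapped through
# several cheap hash functions into a small shared table, shrinking the
# embedding matrix well below vocab size.
import torch
import torch.nn as nn

class HashEmbedding(nn.Module):
    def __init__(self, num_buckets: int, dim: int, num_hashes: int = 2):
        super().__init__()
        self.table = nn.Embedding(num_buckets, dim)   # shared, much smaller than vocab
        self.num_buckets = num_buckets
        # Fixed random multipliers acting as cheap hash functions.
        self.register_buffer("mults", torch.randint(1, 2**31 - 1, (num_hashes,)))

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # (batch, seq) -> (batch, seq, num_hashes) bucket indices
        buckets = (token_ids.unsqueeze(-1) * self.mults) % self.num_buckets
        return self.table(buckets).sum(dim=-2)        # sum over hash functions

emb = HashEmbedding(num_buckets=4096, dim=128)
ids = torch.randint(0, 30522, (2, 16))               # e.g. BERT-vocab-sized ids
vecs = emb(ids)                                      # (2, 16, 128)
```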
TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments Paper • 2510.01179 • Published Oct 1 • 25
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 26 items • Updated about 5 hours ago • 128
Tfree-HAT-7b-pretrained Collection Tokenizer-free models based on the Hierarchical Autoregressive Transformer (https://arxiv.org/abs/2501.10322), trained from scratch. • 2 items • Updated Aug 1 • 10
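The linked paper defines the actual architecture; as a rough intuition for "tokenizer-free", the sketch below embeds raw UTF-8 bytes and pools fixed-size byte groups into coarser vectors that a backbone would consume. The fixed grouping window is an assumption for illustration; HAT's byte-to-word hierarchy is more involved.

```python
# Rough sketch of the tokenizer-free idea: operate on raw UTF-8 bytes and pool
# them into coarser units before the main transformer. Purely illustrative;
# see arXiv:2501.10322 for the actual HAT architecture.
import torch
import torch.nn as nn

class BytePooler(nn.Module):
    """Embed raw bytes and mean-pool fixed-size groups into coarser vectors."""
    def __init__(self, dim: int = 256, group: int = 4):
        super().__init__()
        self.embed = nn.Embedding(256, dim)   # one entry per possible byte value
        self.group = group

    def forward(self, text: str) -> torch.Tensor:
        data = text.encode("utf-8")
        pad = (-len(data)) % self.group                       # pad to a full group
        ids = torch.tensor(list(data) + [0] * pad)
        vecs = self.embed(ids).view(-1, self.group, self.embed.embedding_dim)
        return vecs.mean(dim=1)                               # (num_groups, dim)

pooled = BytePooler()("tokenizer-free input")   # coarse latents fed to the backbone
```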