Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation Paper • 2410.08371 • Published Oct 10, 2024 • 3
DEPAC: a Corpus for Depression and Anxiety Detection from Speech Paper • 2306.12443 • Published Jun 20, 2023
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 4 days ago • 14
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 4 days ago • 14
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 4 days ago • 14
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published 4 days ago • 14
toksuite/supertoken_models-llama_meta-llama-Llama-3.2-1B Text Generation • 2B • Updated 3 days ago • 97
toksuite/supertoken_models-llama_CohereLabs-aya-expanse-8b Text Generation • 2B • Updated 3 days ago • 54
toksuite/supertoken_models-llama_common-pile-comma-v0.1 Text Generation • 2B • Updated 3 days ago • 73
toksuite/supertoken_models-llama_microsoft-Phi-3-mini-4k-instruct Text Generation • 1B • Updated 3 days ago • 74