LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules Paper • 2602.10993 • Published 2 days ago • 1
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Paper • 2602.11149 • Published 2 days ago • 12
Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning Paper • 2602.11149 • Published 2 days ago • 12 • 4
SteuerLLM: Local specialized large language model for German tax law analysis Paper • 2602.11081 • Published 2 days ago • 1
Optimal Turkish Subword Strategies at Scale: Systematic Evaluation of Data, Vocabulary, Morphology Interplay Paper • 2602.06942 • Published 7 days ago • 2
GLiNER- Linker Collection GLiNER-bi-Encoder models for entity linking with the GLiNKER framework • 3 items • Updated 10 days ago • 6
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale Paper • 2601.22146 • Published 15 days ago • 9
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale Paper • 2601.22146 • Published 15 days ago • 9 • 5
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale Paper • 2601.22146 • Published 15 days ago • 9
Say Anything but This: When Tokenizer Betrays Reasoning in LLMs Paper • 2601.14658 • Published 24 days ago • 1
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 24 days ago • 37