TERMINATOR: Learning Optimal Exit Points for Early Stopping in Chain-of-Thought Reasoning Paper • 2603.12529 • Published Mar 13 • 19
ConLID: Supervised Contrastive Learning for Low-Resource Language Identification Paper • 2506.15304 • Published Jun 18, 2025 • 1
view article Article Assisted Generation: a new direction toward low-latency text generation joaogante • May 11, 2023 • 78
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 +1 eliebak, lvwerra, lewtun • Jan 28, 2025 • 889
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Dec 10, 2025 • 23
view article Article How to generate text: using different decoding methods for language generation with Transformers patrickvonplaten • Mar 1, 2020 • 297