LLaDA2.1: Speeding Up Text Diffusion via Token Editing • Paper • 2602.08676 • Published 5 days ago • 59
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective • Paper • 2505.15045 • Published May 21, 2025 • 55
BaleChen/llama-7b-hf_peft_stack-exchange-paired_rmts__100000_2e-05_peft_last_checkpoint_oct4_merged • Text Classification • Updated Oct 5, 2023