DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents Paper • 2602.07035 • Published 12 days ago • 30
When Personalization Misleads: Understanding and Mitigating Hallucinations in Personalized LLMs Paper • 2601.11000 • Published 30 days ago • 26
MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching Paper • 2601.10712 • Published about 1 month ago • 24
Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-top-all Feature Extraction • 0.6B • Updated Nov 11, 2025
Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-top-all Feature Extraction • 0.6B • Updated Nov 11, 2025
Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-topk30 Feature Extraction • 0.6B • Updated Nov 11, 2025
Elpmis/Qwen3-0.6B-soft-thinking-last-token-naive-topk30 Feature Extraction • 0.6B • Updated Nov 11, 2025