DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 439 • 9
Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling Paper • 2601.22636 • Published 14 days ago • 21
Changelog: Team & Enterprise Articles Now Featured on the Hugging Face Blog • Dec 8, 2025 • 93