Latent Adversarial Regularization for Offline Preference Optimization Paper • 2601.22083 • Published 11 days ago
AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning Paper • 2510.06261 • Published Oct 5, 2025