Overall Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 9 days ago • 20 Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 11 days ago • 6
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 9 days ago • 20
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 11 days ago • 6
Video Understanding AURA: Always-On Understanding and Real-Time Assistance via Video Streams Paper • 2604.04184 • Published 19 days ago • 50 Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Paper • 2403.09626 • Published Mar 14, 2024 • 15 ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding Paper • 2506.01300 • Published Jun 2, 2025 Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Paper • 2503.21782 • Published Mar 27, 2025
AURA: Always-On Understanding and Real-Time Assistance via Video Streams Paper • 2604.04184 • Published 19 days ago • 50
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Paper • 2403.09626 • Published Mar 14, 2024 • 15
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding Paper • 2506.01300 • Published Jun 2, 2025
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Paper • 2503.21782 • Published Mar 27, 2025
Useful Agent Dual-View Training for Instruction-Following Information Retrieval Paper • 2604.18845 • Published 4 days ago • 10 AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 10 days ago • 153
Dual-View Training for Instruction-Following Information Retrieval Paper • 2604.18845 • Published 4 days ago • 10
AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 10 days ago • 153
Overall Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 9 days ago • 20 Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 11 days ago • 6
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 9 days ago • 20
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 11 days ago • 6
Useful Agent Dual-View Training for Instruction-Following Information Retrieval Paper • 2604.18845 • Published 4 days ago • 10 AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 10 days ago • 153
Dual-View Training for Instruction-Following Information Retrieval Paper • 2604.18845 • Published 4 days ago • 10
AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 10 days ago • 153
Video Understanding AURA: Always-On Understanding and Real-Time Assistance via Video Streams Paper • 2604.04184 • Published 19 days ago • 50 Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Paper • 2403.09626 • Published Mar 14, 2024 • 15 ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding Paper • 2506.01300 • Published Jun 2, 2025 Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Paper • 2503.21782 • Published Mar 27, 2025
AURA: Always-On Understanding and Real-Time Assistance via Video Streams Paper • 2604.04184 • Published 19 days ago • 50
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding Paper • 2403.09626 • Published Mar 14, 2024 • 15
ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding Paper • 2506.01300 • Published Jun 2, 2025
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Paper • 2503.21782 • Published Mar 27, 2025