Lexical Consensus: Grounded Word Learning and Shared Meaning in Artificial Agents Paper • 2606.22207 • Published 13 days ago • 2
Lexical Consensus: Grounded Word Learning and Shared Meaning in Artificial Agents Paper • 2606.22207 • Published 13 days ago • 2
Intention Collapse: Intention-Level Metrics for Reasoning in Language Models Paper • 2601.01011 • Published Jan 3 • 1
Intention Collapse: Intention-Level Metrics for Reasoning in Language Models Paper • 2601.01011 • Published Jan 3 • 1
view article Article Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs +5 StringChaos, minimario, tianjunz, xu3kev, kingh0730, FanjiaYan, clefourrier • Apr 16, 2024 • 16
Rethinking the Evaluating Framework for Natural Language Understanding in AI Systems: Language Acquisition as a Core for Future Metrics Paper • 2309.11981 • Published Sep 21, 2023 • 2