High-Dimension Human Value Representation in Large Language Models Paper • 2404.07900 • Published Apr 11, 2024 • 1
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity Paper • 2302.04023 • Published Feb 8, 2023
Planning with Reasoning using Vision Language World Model Paper • 2509.02722 • Published Sep 2, 2025 • 24
WorldPrediction: A Benchmark for High-level World Modeling and Long-horizon Procedural Planning Paper • 2506.04363 • Published Jun 4, 2025 • 1
Ko LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the Ko LLM leaderboard • 21 items • Updated Jun 19, 2025 • 14