II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models Paper • 2406.05862 • Published Jun 9, 2024
CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation Paper • 2406.07054 • Published Jun 11, 2024
One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning Paper • 2510.26167 • Published Oct 30, 2025
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published May 22, 2025