SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs Paper • 2603.20253 • Published 20 days ago • 1
SimulCost: A Cost-Aware Benchmark and Toolkit for Automating Physics Simulations with LLMs Paper • 2603.20253 • Published 20 days ago • 1
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 107
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination Paper • 2411.03823 • Published Nov 6, 2024 • 49
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination Paper • 2411.03823 • Published Nov 6, 2024 • 49