FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol Paper • 2603.24943 • Published 5 days ago • 8
Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset Paper • 2405.10542 • Published May 17, 2024 • 1
CARE: Cognitive-reasoning Augmented Reinforcement for Emotional Support Conversation Paper • 2510.05122 • Published Sep 30, 2025 • 4
Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models Paper • 2508.15202 • Published Aug 21, 2025 • 5
Evaluating, Synthesizing, and Enhancing for Customer Support Conversation Paper • 2508.04423 • Published Aug 6, 2025 • 9 • 4