Prateek Biswas's picture

3

Prateek Biswas

biswasprateek

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

authored a paper about 1 month ago

Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

upvoted a paper about 1 month ago

Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Beyond Static Leaderboards: Predictive Validity for the Evaluation of LLM Agents

Paper • 2606.19704 • Published 3 days ago • 28

authored a paper about 1 month ago

Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

Paper • 2605.18827 • Published May 12 • 7

upvoted 2 papers about 1 month ago

Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

Paper • 2605.18827 • Published May 12 • 7

MCP-Cosmos: World Model-Augmented Agents for Complex Task Execution in MCP Environments

Paper • 2605.09131 • Published May 9 • 59