arxiv:2411.03012
Aidar Valeev
aidarvaleev
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language
Models on Software Engineering Tasks
upvoted
a
paper
about 1 month ago
MERA Code: A Unified Framework for Evaluating Code Generation Across
Tasks
authored
a paper
10 months ago
Leveraging Large Language Models in Code Question Answering: Baselines
and Issues