Michal Valko

misovalko

large language models, reasoning, fine-tuning, test-time computation, reinforcement learning from human feedback, world models

updated a dataset about 20 hours ago

authored a paper 3 days ago

authored a paper 3 days ago

New activity in paris-ai-running-club/README about 2 months ago

#6 opened 5 months ago by

New activity in paris-ai-running-club/README almost 2 years ago

#3 opened almost 2 years ago by

#1 opened almost 2 years ago by
