Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
3
Mehul Damani
PRO
mehuldamani
Follow
wjurayj's profile picture
John6666's profile picture
Spechawk's profile picture
3 followers
·
0 following
https://damanimehul.github.io
MehulDamani2
damanimehul
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
updated
a model
25 days ago
mehuldamani/countdown_arl-sft-no-combine-v2
published
a model
25 days ago
mehuldamani/countdown_arl-sft-no-combine-v2
updated
a dataset
25 days ago
mehuldamani/neurips-story-main-story-features-sample-v1
View all activity
Organizations
None yet
mehuldamani
's models
281
Sort: Recently updated
mehuldamani/1.5B-v1-rlvr
Updated
Oct 1, 2025
mehuldamani/1.5B-v1-rlpa
Updated
Oct 1, 2025
mehuldamani/RLVR-math-v6
Updated
Oct 1, 2025
mehuldamani/RLVR-math-v5
Updated
Oct 1, 2025
mehuldamani/baseNoInstruct-hotpot-sept28-rlcr-multiple
Updated
Sep 30, 2025
mehuldamani/RLVR-math-v4
Updated
Sep 28, 2025
mehuldamani/hotpot-sept26-rlcr-single
Text Generation
•
8B
•
Updated
Sep 28, 2025
mehuldamani/RLVR-math-v3
Updated
Sep 27, 2025
mehuldamani/hotpot-sept26-rlvr-single-h100
Updated
Sep 27, 2025
mehuldamani/RLVR-math-7b
Updated
Sep 27, 2025
mehuldamani/singleAnswer-RLVR-hotpot-instruct_h100
Updated
Sep 26, 2025
mehuldamani/singleAnswer-RLCR-hotpot-instruct_h100
Updated
Sep 26, 2025
mehuldamani/singleAnswer-RLCR-hotpot-instruct
Updated
Sep 26, 2025
mehuldamani/singleAnswer-RLVR-hotpot-instruct
Updated
Sep 26, 2025
mehuldamani/sept23_onlyRLVR_multipleAnswers_a100
Updated
Sep 26, 2025
mehuldamani/sept24_rlvr_single_answer
Updated
Sep 24, 2025
mehuldamani/sept24_rlcr_multi_w_1_answer
Updated
Sep 24, 2025
mehuldamani/RLCR-hotpot-sept22_multi_answer_qwenInstruct_h100
Updated
Sep 24, 2025
mehuldamani/RLCR-hotpot-sept22_multi_answer_qwenInstruct_a100
Updated
Sep 23, 2025
mehuldamani/RLCR-hotpot-sept22_multi_answer_qwenInstruct
Updated
Sep 22, 2025
mehuldamani/RLCR-hotpot-sept22_multi_answer
Updated
Sep 22, 2025
mehuldamani/RLCR-math-sept21_startingFromScratch
Text Generation
•
8B
•
Updated
Sep 22, 2025
•
1
mehuldamani/RLCR-hotpot-sept20_actuallyTryNew_combinedFormatConstraint
Updated
Sep 21, 2025
mehuldamani/RLCR-math-sept20_actuallyTryNew_combinedFormatConstraint
Updated
Sep 20, 2025
mehuldamani/RLCR-math-sept20_3bModel
Updated
Sep 20, 2025
mehuldamani/RLCR-math-sysPromptMulti_rfFormat
Updated
Sep 19, 2025
mehuldamani/RLCR-math-sysPromptMulti_rfRespConstr
Updated
Sep 19, 2025
mehuldamani/math_sept13_split_format_compliance_w_multi_brier_attempt6
Updated
Sep 16, 2025
mehuldamani/math_sept13_split_format_compliance_w_multi_brier_attempt5
Updated
Sep 16, 2025
mehuldamani/math_sept13_split_format_compliance_w_multi_brier_attempt4
Updated
Sep 15, 2025
Previous
1
...
7
8
9
10
Next