Salman Rahman PRO

salmannyu

·

https://salmanrahman.net/

AI & ML interests

Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation

Recent Activity

updated a model 14 days ago

salmannyu/first-mistake-rl-results

published a model 14 days ago

salmannyu/first-mistake-rl-results

upvoted a collection 30 days ago

View all activity

Organizations

salmannyu 's models 28

salmannyu/first-mistake-rl-results

Updated 14 days ago

salmannyu/model-checkpoints

salmannyu/llama_base_thinking_sft_noisy_reward_0_9

salmannyu/llama_base_thinking_sft_majority_vote_math_1024_sample_8k

salmannyu/mid_train_llama_52b_thinking_data_effect_math_8_sample

salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.7_sample

salmannyu/mid_train_llama_52b_thinking_noisy_reward_math_0.9_sample

salmannyu/mid_train_llama_52b_thinking_majority_vote_math_1024_sample

salmannyu/mid_train_llama_52b_thinking_data_effect_math_2048_sample

salmannyu/data_effect_scp_do_llama_3b_2048_sample

salmannyu/data_effect_scp_do_llama_3b_8_sample

salmannyu/data_effect_math_do_llama_3b_8_sample

salmannyu/data_effect_math_do_qwen_1_5b_8_sample

salmannyu/Llama-3B-Nemotron-mid-think_sft_nopack_lr1.5e5_ep3

3B • Updated Mar 22 • 1

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step150

4B • Updated Mar 14 • 1

salmannyu/Llama-3B-Nemotron-Math-thinking-sft-3ep-8samp-default-step100

4B • Updated Mar 14

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-non-think-nopack-lr1.5e5-ep3

3B • Updated Mar 6

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full-nopack-lr1.5e5-ep3

3B • Updated Mar 6 • 1

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-Full

Text Generation • 3B • Updated Mar 2 • 5

salmannyu/Llama-3B-Nemotron-Math-Mid-Train-140K-Step

3B • Updated Feb 25

salmannyu/Qwen2.5-1.5B-Nemotron-Math-52B-Mid-Train-8

Text Generation • 2B • Updated Feb 8 • 19 •

salmannyu/nemotron-train8-52B-Token

2B • Updated Nov 8, 2025 • 1

salmannyu/nemotron-train4

2B • Updated Nov 3, 2025 • 2

salmannyu/train3

2B • Updated Nov 3, 2025 • 1

salmannyu/nemotron-train2

2B • Updated Nov 3, 2025 • 1

salmannyu/qwen-math-7b-step-sft

8B • Updated Sep 3, 2025 • 2

salmannyu/step_cot

Text Generation • 15B • Updated Sep 3, 2025 • 4

salmannyu/prm-cot-private

Text Generation • 2B • Updated Sep 3, 2025 • 5