arxiv:2504.13203
Salman Rahman PRO
salmannyu
AI & ML interests
Natural Language Processing, Deep Learning, Scalable Oversight, and Language Model Evaluation
Recent Activity
upvoted a paper 1 day ago
When Can LLMs Learn to Reason with Weak Supervision?
submitted a paper 1 day ago
When Can LLMs Learn to Reason with Weak Supervision?
updated a collection 2 days ago
rlvr-weak-supervision