Rasool Fakoor

rasoolfa

https://rasoolfa.github.io/

AI & ML interests

Building embodied agents.

Recent Activity

upvoted a paper 1 day ago

Trust the Batch, On- or Off-Policy: Adaptive Policy Optimization for RL Post-Training

updated a model 8 months ago

FundamentalResearchLabs/oct20-qw4-midtraining-lr5e7-oci-4k-2984891-c1b734ab-s312

updated a model 8 months ago

FundamentalResearchLabs/oct20-qw4-midtraining-lr5e7-oci-4k-2984891-c1b734ab-s624

Organizations

None yet

upvoted a paper 1 day ago

Trust the Batch, On- or Off-Policy: Adaptive Policy Optimization for RL Post-Training

Paper • 2605.12380 • Published May 12 • 2

upvoted an article about 1 year ago

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Article • sirluk • Oct 7, 2024 • 71