Demian L. P.
very-cooluser
AI & ML interests
Anything that can run on ~3GB of memory is a instant thumbs up to me
Recent Activity
upvoted
a
paper
about 22 hours ago
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search
upvoted
a
paper
about 22 hours ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Organizations
None yet