Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Harryis
/
SCOUT_multitask
like
2
Reinforcement Learning
Safetensors
qwen2
multi-task
scout
ppo
arxiv:
2601.21754
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
SCOUT_multitask
/
README.md
Commit History
Update README.md
f3d2f71
verified
Harryis
commited on
Feb 1
Update README.md
d421d96
verified
Harryis
commited on
Feb 1
Create README.md
cb1ba2a
verified
Harryis
commited on
Jan 31