Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
thejaminator
/
grpo-feature-vector-step-1
like
0
PEFT
Safetensors
English
verl
grpo
math
reasoning
rl
lora
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
grpo-feature-vector-step-1
864 MB
1 contributor
History:
3 commits
thejaminator
verl GRPO trained model at step 1
ade0533
verified
4 months ago
.gitattributes
1.52 kB
initial commit
5 months ago
README.md
735 Bytes
verl GRPO trained model at step 1
5 months ago
adapter_config.json
1.13 kB
verl GRPO trained model at step 1
4 months ago
adapter_model.safetensors
864 MB
xet
verl GRPO trained model at step 1
4 months ago