aryannzzz/ppo-lunarlander-scratch
Tags: Reinforcement Learning, PyTorch, ppo, proximal-policy-optimization, lunar-lander, from-scratch, actor-critic
arxiv: 1707.06347
License: mit
Branch: main, 482 kB
1 contributor
History: 4 commits
Latest commit: aryannzzz, "Upload README.md with huggingface_hub" (956524e, verified), 5 months ago
.gitattributes (1.57 kB): Upload ppo_training.png with huggingface_hub, 5 months ago
README.md (5.12 kB): Upload README.md with huggingface_hub, 5 months ago
ppo_lunarlander.pth (124 kB, xet, pickle): Upload ppo_lunarlander.pth with huggingface_hub, 5 months ago
    Detected Pickle imports (3): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
ppo_training.png (352 kB, xet): Upload ppo_training.png with huggingface_hub, 5 months ago
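The pickle imports reported for ppo_lunarlander.pth are the module-level names the pickle stream asks the loader to import. The sketch below shows one way such a list could be derived with Python's standard `pickletools` module, without executing the pickle. It is an illustration only, not necessarily how the Hub performs its scan: the function name `list_pickle_imports` is hypothetical, the `STACK_GLOBAL` handling is a simple heuristic, and the demo pickles a plain `OrderedDict` rather than the actual checkpoint.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> list:
    """Return the sorted "module.name" globals a pickle stream references.

    The stream is only parsed opcode by opcode; nothing is unpickled.
    (Hypothetical helper, written for illustration.)
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", space-separated.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack.
            # Heuristic: assume they are the two most recent string pushes.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(imports)


# Demo on a small stand-in object (not the real checkpoint).
demo = pickle.dumps(OrderedDict(a=1), protocol=2)
print(list_pickle_imports(demo))  # ['collections.OrderedDict']
```

A checkpoint written by a recent `torch.save` is usually a zip archive, so under that assumption the inner pickle file would need to be read out of the archive (for example with `zipfile`) before being passed to a function like this.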