Spaces: thepikachu/architecture-env
main / architecture-env (1.81 MB)
1 contributor
History: 49 commits
Latest commit: 83f7214 by thepikachu, about 1 month ago: Refine GRPO evaluation details and clarify model performance comparisons in Blog.md
| Name | Scan | Size | Last commit | Updated |
| --- | --- | --- | --- | --- |
| `__pycache__` | | | round2: inference and planner critic design | about 1 month ago |
| `archive` | | | Add scripts for supervised fine-tuning and GRPO training | about 1 month ago |
| `components` | | | round2: inference and planner critic design | about 1 month ago |
| `config` | | | Add scripts for supervised fine-tuning and GRPO training | about 1 month ago |
| `datasets` | | | Add scripts for supervised fine-tuning and GRPO training | about 1 month ago |
| `notebooks` | | | Revise reasoning for choosing SFT + agentic loop over SFT + GRPO in deployment documentation | about 1 month ago |
| `plots` | | | Remove outdated inference reward curve plot and add new loss and reward curve plots for improved analysis. | about 1 month ago |
| `server` | | | moved app.py to fix the runtime error | about 1 month ago |
| `training` | | | Add scripts for supervised fine-tuning and GRPO training | about 1 month ago |
| `.dockerignore` | Safe | 94 Bytes | Upload 11 files | 2 months ago |
| `.gitattributes` | Safe | 1.52 kB | initial commit | 2 months ago |
| `.gitignore` | Safe | 4 Bytes | Upload 11 files | 2 months ago |
| `Blog.md` | Safe | 15.8 kB | Refine GRPO evaluation details and clarify model performance comparisons in Blog.md | about 1 month ago |
| `Dockerfile` | | 936 Bytes | Update Dockerfile | 2 months ago |
| `README.md` | Safe | 12.8 kB | Remove placeholder for YouTube demo link in README.md | about 1 month ago |
| `__init__.py` | | 441 Bytes | Upload 11 files | 2 months ago |
| `agentic_inference.py` | | 21.3 kB | Update LocalModelClient initialization to use MODEL_REPO_ID instead of MODEL_DIR | about 1 month ago |
| `client.py` | | 3.22 kB | Upload 11 files | 2 months ago |
| `components.json` | | 19.9 kB | round2: inference and planner critic design | about 1 month ago |
| `encyclopedia_rules.py` | | 7.43 kB | round2: inference and planner critic design | about 1 month ago |
| `models.py` | | 469 Bytes | Update models.py | 2 months ago |
| `openenv.yaml` | | 5.84 kB | round2: inference and planner critic design | about 1 month ago |
| `pyproject.toml` | | 1.38 kB | Upload 11 files | 2 months ago |
| `requirements.txt` | Safe | 109 Bytes | Refactor requirements file structure | 2 months ago |
| `uv.lock` | | 576 kB | add uv lock | 2 months ago |