ihbkaiser/trl-mcsd
arxiv:2402.03300
arxiv:2305.18290
arxiv:2407.21783
1 contributor
History: 6 commits
Latest commit: ihbkaiser, "Update SDPO launch script for 8 GPUs" (f52e9d9, verified), about 2 months ago
Name                      Size        Last commit                             Updated
.ai                       -           Implement MCSD for experimental SDPO    about 2 months ago
.cursor                   -           Implement MCSD for experimental SDPO    about 2 months ago
.github                   -           Implement MCSD for experimental SDPO    about 2 months ago
assets                    -           Implement MCSD for experimental SDPO    about 2 months ago
data                      -           Add RAR data files                      about 2 months ago
docker                    -           Implement MCSD for experimental SDPO    about 2 months ago
docs                      -           Implement MCSD for experimental SDPO    about 2 months ago
examples                  -           Implement MCSD for experimental SDPO    about 2 months ago
scripts                   -           Implement MCSD for experimental SDPO    about 2 months ago
tests                     -           Implement MCSD for experimental SDPO    about 2 months ago
trl                       -           Implement MCSD for experimental SDPO    about 2 months ago
.gitattributes            1.71 kB     Add RAR data files                      about 2 months ago
.gitignore                1.83 kB     Implement MCSD for experimental SDPO    about 2 months ago
.pre-commit-config.yaml   661 Bytes   Implement MCSD for experimental SDPO    about 2 months ago
AGENTS.md                 13 Bytes    Implement MCSD for experimental SDPO    about 2 months ago
CITATION.cff              1.19 kB     Implement MCSD for experimental SDPO    about 2 months ago
CLAUDE.md                 13 Bytes    Implement MCSD for experimental SDPO    about 2 months ago
CODE_OF_CONDUCT.md        5.62 kB     Implement MCSD for experimental SDPO    about 2 months ago
CONTRIBUTING.md           22.9 kB     Implement MCSD for experimental SDPO    about 2 months ago
LICENSE                   11.6 kB     Implement MCSD for experimental SDPO    about 2 months ago
MANIFEST.in               271 Bytes   Implement MCSD for experimental SDPO    about 2 months ago
MCSD.md                   27.9 kB     Implement MCSD for experimental SDPO    about 2 months ago
MIGRATION.md              2.42 kB     Implement MCSD for experimental SDPO    about 2 months ago
Makefile                  863 Bytes   Implement MCSD for experimental SDPO    about 2 months ago
README.md                 7.88 kB     Implement MCSD for experimental SDPO    about 2 months ago
RELEASE.md                4.67 kB     Implement MCSD for experimental SDPO    about 2 months ago
VERSION                   10 Bytes    Implement MCSD for experimental SDPO    about 2 months ago
pyproject.toml            5.24 kB     Implement MCSD for experimental SDPO    about 2 months ago
run_mcsd.sh               1.27 kB     Update launch scripts for 8 GPUs        about 2 months ago
run_sdpo.sh               1.19 kB     Update SDPO launch script for 8 GPUs    about 2 months ago