Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nicholasKluge
/
Harmless-RewardModelPT
like
0
Text Classification
Transformers
Safetensors
nicholasKluge/harmless-aira-dataset
Portuguese
bert
reward model
alignment
preference model
RLHF
Carbon Emissions
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Harmless-RewardModelPT
1.31 GB
1 contributor
History:
11 commits
nicholasKluge
Update config.json
468864a
verified
7 months ago
.gitattributes
1.52 kB
initial commit
over 1 year ago
LICENSE
10.8 kB
Upload LICENSE
over 1 year ago
README.md
8.83 kB
Update README.md
7 months ago
config.json
957 Bytes
Update config.json
7 months ago
emissions.csv
775 Bytes
Update emissions.csv
over 1 year ago
model.safetensors
436 MB
xet
Upload folder using huggingface_hub
over 1 year ago
optimizer.pt
872 MB
xet
Upload folder using huggingface_hub
over 1 year ago
rng_state.pth
14.2 kB
xet
Upload folder using huggingface_hub
over 1 year ago
scheduler.pt
1.06 kB
xet
Upload folder using huggingface_hub
over 1 year ago
special_tokens_map.json
125 Bytes
Upload folder using huggingface_hub
over 1 year ago
tokenizer.json
678 kB
Upload folder using huggingface_hub
over 1 year ago
tokenizer_config.json
1.27 kB
Upload folder using huggingface_hub
over 1 year ago
trainer_state.json
1.99 kB
Upload folder using huggingface_hub
over 1 year ago
training_args.bin
5.11 kB
xet
Upload folder using huggingface_hub
over 1 year ago
vocab.txt
210 kB
Upload folder using huggingface_hub
over 1 year ago