Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
mrm8488
/
phi-4-14B-grpo-limo-2e
like
0
Text Generation
Transformers
Safetensors
English
llama
text-generation-inference
unsloth
trl
grpo
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
phi-4-14B-grpo-limo-2e
29.3 GB
1 contributor
History:
4 commits
mrm8488
Trained with Unsloth
d5f4c4e
verified
11 months ago
.gitattributes
1.52 kB
initial commit
11 months ago
README.md
571 Bytes
Trained with Unsloth
11 months ago
config.json
863 Bytes
Trained with Unsloth
11 months ago
generation_config.json
165 Bytes
Trained with Unsloth
11 months ago
merges.txt
917 kB
Upload tokenizer
11 months ago
model-00001-of-00006.safetensors
4.93 GB
xet
Trained with Unsloth
11 months ago
model-00002-of-00006.safetensors
4.95 GB
xet
Trained with Unsloth
11 months ago
model-00003-of-00006.safetensors
4.9 GB
xet
Trained with Unsloth
11 months ago
model-00004-of-00006.safetensors
4.95 GB
xet
Trained with Unsloth
11 months ago
model-00005-of-00006.safetensors
4.95 GB
xet
Trained with Unsloth
11 months ago
model-00006-of-00006.safetensors
4.62 GB
xet
Trained with Unsloth
11 months ago
model.safetensors.index.json
29.9 kB
Trained with Unsloth
11 months ago
special_tokens_map.json
570 Bytes
Upload tokenizer
11 months ago
tokenizer.json
7.15 MB
Upload tokenizer
11 months ago
tokenizer_config.json
18 kB
Upload tokenizer
11 months ago
vocab.json
1.61 MB
Upload tokenizer
11 months ago