kenhktsui
/

Qwen2.5-3B-Instruct-GRPO-minp-sampling_temp_05

Text Generation

text-generation-inference

Model card Files Files and versions

Qwen2.5-3B-Instruct-GRPO-minp-sampling_temp_05

6.19 GB

1 contributor

History: 5 commits

kenhktsui's picture

Upload model trained with Unsloth

f13df00 verified 10 months ago

.gitattributes

1.57 kB

Upload tokenizer 10 months ago
README.md

617 Bytes

Trained with Unsloth 10 months ago
added_tokens.json

605 Bytes

Upload tokenizer 10 months ago
config.json

808 Bytes

Trained with Unsloth 10 months ago
generation_config.json

139 Bytes

Trained with Unsloth 10 months ago
merges.txt

1.67 MB

Upload tokenizer 10 months ago
pytorch_model-00001-of-00002.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch.HalfStorage",
- "torch._utils._rebuild_tensor_v2"
What is a pickle import?
4.96 GB
xet

Trained with Unsloth 10 months ago
pytorch_model-00002-of-00002.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch.HalfStorage",
- "torch._utils._rebuild_tensor_v2"
What is a pickle import?
1.21 GB
xet

Trained with Unsloth 10 months ago
pytorch_model.bin.index.json

35.6 kB

Trained with Unsloth 10 months ago
special_tokens_map.json

614 Bytes

Upload tokenizer 10 months ago
tokenizer.json

11.4 MB
xet

Upload tokenizer 10 months ago
tokenizer_config.json

7.36 kB

Upload model trained with Unsloth 10 months ago
vocab.json

2.78 MB

Upload tokenizer 10 months ago