dhara-250m / tokenizer_config.json
codelion's picture
dhara-250m tri-mode (AR + block-diffusion + self-speculation), annealed, ~60B tokens
bd52fdd verified
{
"backend": "tokenizers",
"bos_token": "<|begin_of_text|>",
"eos_token": "<|end_of_text|>",
"extra_special_tokens": [
"<|im_start|>",
"<|im_end|>"
],
"is_local": true,
"local_files_only": false,
"model_max_length": 1000000000000000019884624838656,
"pad_token": "<|pad|>",
"tokenizer_class": "TokenizersBackend",
"unk_token": "<|unk|>"
}