fix: recover empty phoneme_id_map for ubl, tzo-chenalhó, ubu from MMS vocab.txt 3aab73c verified Jarbas committed 8 days ago
Fix pad/blank token (capture tokenizer_config pad_token) a4648aa verified Jarbas committed 9 days ago
Fix pad/blank token (capture tokenizer_config pad_token) a514879 verified Jarbas committed 9 days ago
Fix pad/blank token (capture tokenizer_config pad_token) 538dea8 verified Jarbas committed 9 days ago
Fix pad/blank token (capture tokenizer_config pad_token) 218de03 verified Jarbas committed 9 days ago
Fix pad/blank token (capture tokenizer_config pad_token) 9d77f91 verified Jarbas committed 9 days ago
Fix pad/blank token (capture tokenizer_config pad_token) e76bba3 verified Jarbas committed 9 days ago
Fix pad/blank token (capture tokenizer_config pad_token) bb9fb82 verified Jarbas committed 9 days ago
Set alphabet=latin (uroman output_alphabet == input_alphabet) 922fba4 verified Jarbas committed 9 days ago
Set alphabet=latin (uroman output_alphabet == input_alphabet) 53a7ee7 verified Jarbas committed 9 days ago
Set alphabet=latin (uroman output_alphabet == input_alphabet) d5640b1 verified Jarbas committed 9 days ago
Set alphabet=latin (uroman output_alphabet == input_alphabet) f24c95e verified Jarbas committed 9 days ago
Set alphabet=latin (uroman output_alphabet == input_alphabet) 3bfc7d2 verified Jarbas committed 9 days ago
Add self-contained config.json (inline phoneme_id_map + tokenizer flags) ae3d1ea verified Jarbas committed 9 days ago
Add self-contained config.json (inline phoneme_id_map + tokenizer flags) cdbf516 verified Jarbas committed 9 days ago
Add self-contained config.json (inline phoneme_id_map + tokenizer flags) 0475699 verified Jarbas committed 9 days ago
Add self-contained config.json (inline phoneme_id_map + tokenizer flags) 076dc94 verified Jarbas committed 9 days ago
Add self-contained config.json (inline phoneme_id_map + tokenizer flags) 947c60f verified Jarbas committed 9 days ago
Add self-contained config.json (inline phoneme_id_map + tokenizer flags) b991988 verified Jarbas committed 9 days ago
Add self-contained config.json (inline phoneme_id_map + tokenizer flags) 88e55b0 verified Jarbas committed 9 days ago