The Thiomi Dataset: A Large-Scale Multimodal Corpus for Low-Resource African Languages
Paper • 2603.29244 • Published • 1
None defined yet.
Attention Sinks in Massively Multilingual Neural Machine Translation:Discovery, Analysis, and Mitigation
Zero-Shot Morphological Discovery in Low-Resource Bantu Languages via Cross-Lingual Transfer and Unsupervised Clustering