Sino-Tibetan linguistics is a field that studies the languages within the Sino-Tibetan language family, which includes a diverse range of languages spoken primarily in China, Tibet, and ...
Belebele is a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. This dataset enables the evaluation of mono- and multi-lingual models in high-, medium-, and ...