India has a rich linguistic landscape with languages from 4 major langua...
Adapters have been positioned as a parameter-efficient fine-tuning (PEFT...
We present Vārta, a large-scale multilingual dataset for headline
genera...
We present, Naamapadam, the largest publicly available Named Entity
Reco...
In this work, we introduce IndicXTREME, a benchmark consisting of nine
d...
End-to-end (E2E) models have become the default choice for state-of-the-...
In recent years, it has been seen that deep neural networks are lacking
...
Recent methods in speech and language technology pretrain very LARGE mod...
Multilingual Language Models (MLLMs) such as mBERT, XLM, XLM-R, etc.
hav...
We present Samanantar, the largest publicly available parallel corpora
c...