AI4Bharat

Open source Indian language AI — NLP models, datasets, and translation tools from IIT Madras

About AI4Bharat

Open source Indian language AI — NLP models, datasets, and translation tools from IIT Madras

Key Features

IndicTrans2 — state-of-the-art Indian language translation
IndicBERT/IndicBART for Indic NLP tasks
Shoonya — open platform for language data annotation
ASR models for 22 Indian languages
Open source datasets for Indian language AI
Aksharantar — transliteration across Indian scripts
IndicVoices — speech corpus for Indian languages
Research papers at top NLP conferences
Community of 1000+ contributors
Powers Bhashini and government language initiatives

Who Is It For?

Professionals, enterprises, and teams looking for AI-powered solutions.