LAUNCH OF BHARATGEN
BharatGen is a multimodal multilingual large language model initiative, developing advanced generative AI models tailored to India’s linguistic, cultural, and socio-economic diversity. To ensure that generative AI models adequately represent India’s diverse linguistic landscape, BharatGen has launched an initiative called “Bharat Data Sagar”, focusing on primary data collection. This data collection attempts to meet the requirement that training data is available for Indian languages that are lesser represented in data corpora.
BharatGen is building partnerships with research groups across the country, to ensure that the generative AI models being developed can be extended by partners and made available to the larger research and non-academic community for further development and usage. BharatGen is also developing partnerships with the government, industry and start-ups for applications geared towards efficient administration and public at large, including marginalized and underrepresented communities in the country.
In order to promote cultural identity and regional development, BharatGen provides technologies and tools that will support the development of region-specific content by seamlessly translating across local languages and dialects.
BharatGen includes a consortium of top AI researchers across premier academic institutions in India that include IIT Bombay, IIIT Hyderabad, IIT Mandi, IIT Kanpur, IIT Hyderabad, IIM Indore, and IIT Madras. These research groups are partnering with the government, industry and startups to develop models, keeping in mind the linguistic and cultural diversity of India and inclusivity for citizens and to ensure equitable technological access across different socio-economic groups in the country.