Mohammed Safi Ur Rahman Khan
PhD Scholar • AI4Bharat • WSAI • IIT Madras, Research Fellow @ Sarvam
AI4Bharat Lab
Indian Institute of Technology, Madras
Chennai, Tamil Nadu, India
I am Safi (صفی), a second year PhD Student at the Wadhwani School of Data Science and Artificial Intelligence (DSAI) @ IIT Madras & AI4Bharat Lab where I am advised by Prof. Mitesh M. Khapra. My research explores Resources and Evaluation methods for Multilingual Large ‘X’ Language Models (where X = ‘Text’ or ‘Vision’ or ‘Audio’). I’m fortunate to be a Google PhD Fellow (2025) and have received Outstanding Paper Awards at EMNLP 2024 and ACL 2024. I am also currently a Research Fellow at Sarvam, helping in building Sovereign AI for India 🇮🇳!
Previously, I was an AI Resident at the AI4Bharat Lab at IIT Madras, where I was fortunate to be part of the IndicLLMSuite (guided by Prof. Mitesh M. Khapra). I did my M.Tech in Computer Science and Engineering before that from IIT Madras (again!!) where I got to work on “Narrow Domain Adaptation of Speech Recognition Systems” guided by Prof. Pratyush Kumar and (you guessed it!) Prof. Mitesh M. Khapra.
news
| Oct 24, 2025 | I’m honored to be selected as a recipient of the Google PhD Fellowship 2025 in the Natural Language Processing Track. الحمد لله. Grateful for the support and guidance of my mentors and collaborators throughout this journey. |
|---|---|
| Oct 21, 2025 | I’ll be presenting a tutorial titled “Data and Model Centric Approaches for Expansion of Large Language Models to New Languages” @ EMNLP-2025 in Suzhou, China 🇨🇳. Please do attend if you are around. Reference material of the tutorial can be found here. |
| Jun 29, 2025 | Will be attending ACL-2025 in Vienna Insha’Allah!! Will be co-presenting - FairITales, CIA, FERMAT, and BhasaAnuvaad. Please catch us up at our posters to know more about our work. |
| Jun 01, 2025 | I’ll be joining Sarvam AI as a Research Fellow to help with the Sovereign LLM efforts. Will be working with the Alignment and Evaluations team to build an India-first 🇮🇳 model!! |
| May 25, 2025 | 4 papers accepted to ACL 2025 Alhamdulillah!! CIA, FERMAT, BhasaAnuvaad, and FairITales (preprint out soon). Vienna hope to see you soon Insha Allah. |
| Apr 13, 2025 | I’ll be in Singapore 🇸🇬 for ICLR 2025. Looking forward to making new connections! |
latest posts
| Mar 14, 2024 | Indic LLM Suite | AI4Bharat Blog |
|---|
selected publications
- EMNLP 2024
Finding Blind Spots in Evaluator LLMs with Interpretable ChecklistsProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024 - ACL 2024
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian LanguagesProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024