Mohammed Safi Ur Rahman Khan

PhD Scholar • AI4BharatWSAIIIT Madras, Research Fellow @ Sarvam

NewProfilepic.jpeg

AI4Bharat Lab

Indian Institute of Technology, Madras

Chennai, Tamil Nadu, India

I am Safi (صفی), a second year PhD Student at the Wadhwani School of Data Science and Artificial Intelligence (DSAI) @ IIT Madras & AI4Bharat Lab where I am advised by Prof. Mitesh M. Khapra. My research explores Resources and Evaluation methods for Multilingual Large ‘X’ Language Models (where X = ‘Text’ or ‘Vision’ or ‘Audio’). I’m fortunate to be a Google PhD Fellow (2025) and have received Outstanding Paper Awards at EMNLP 2024 and ACL 2024. I am also currently a Research Fellow at Sarvam, helping in building Sovereign AI for India 🇮🇳!

Previously, I was an AI Resident at the AI4Bharat Lab at IIT Madras, where I was fortunate to be part of the IndicLLMSuite (guided by Prof. Mitesh M. Khapra). I did my M.Tech in Computer Science and Engineering before that from IIT Madras (again!!) where I got to work on “Narrow Domain Adaptation of Speech Recognition Systems” guided by Prof. Pratyush Kumar and (you guessed it!) Prof. Mitesh M. Khapra.

news

Oct 24, 2025 I’m honored to be selected as a recipient of the Google PhD Fellowship 2025 in the Natural Language Processing Track. الحمد لله. Grateful for the support and guidance of my mentors and collaborators throughout this journey.
Oct 21, 2025 I’ll be presenting a tutorial titled “Data and Model Centric Approaches for Expansion of Large Language Models to New Languages” @ EMNLP-2025 in Suzhou, China 🇨🇳. Please do attend if you are around. Reference material of the tutorial can be found here.
Jun 29, 2025 Will be attending ACL-2025 in Vienna Insha’Allah!! Will be co-presenting - FairITales, CIA, FERMAT, and BhasaAnuvaad. Please catch us up at our posters to know more about our work.
Jun 01, 2025 I’ll be joining Sarvam AI as a Research Fellow to help with the Sovereign LLM efforts. Will be working with the Alignment and Evaluations team to build an India-first 🇮🇳 model!!
May 25, 2025 4 papers accepted to ACL 2025 Alhamdulillah!! CIA, FERMAT, BhasaAnuvaad, and FairITales (preprint out soon). Vienna hope to see you soon Insha Allah.
Apr 13, 2025 I’ll be in Singapore 🇸🇬 for ICLR 2025. Looking forward to making new connections!

latest posts

selected publications

  1. ACL 2025
    FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes
    Janki Atul Nawale*Mohammed Safi Ur Rahman Khan*, Janani D, Mansi Gupta, Danish Pruthi, and Mitesh M. Khapra
    Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
  2. NAACL 2025
    MILU: A Multi-task Indic Language Understanding Benchmark
    Sshubam Verma, Mohammed Safi Ur Rahman Khan, Vishwajeet Kumar, Rudra Murthy, and Jaydeep Sen
    Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2024
  3. ACL 2025
    Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
    Sumanth Doddapaneni*Mohammed Safi Ur Rahman Khan*, Dilip Venkatesh, Raj Dabre, Anoop Kunchukuttan, and Mitesh M. Khapra
    Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
  4. EMNLP 2024
    EMNLP-2024 Outstanding Paper Award
    Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
    Sumanth Doddapaneni*Mohammed Safi Ur Rahman Khan*, Sshubam Verma, and Mitesh M. Khapra
    Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  5. ACL 2024
    ACL-2024 Outstanding Paper Award
    IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
    Mohammed Safi Ur Rahman Khan*, Priyam Mehta*, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad G, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, and Mitesh M. Khapra
    Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024