publications

‘*’ denotes equal contribution

2024

  1. arXiv
    Pralekha: An Indic Document Alignment Evaluation Benchmark
    Sanjay Suryanarayanan, Haiyue Song, Mohammed Safi Ur Rahman Khan, Anoop Kunchukuttan, Mitesh M. Khapra, and Raj Dabre
    arXiv preprint arXiv: 2411.19096, 2024
  2. arXiv
    BhasaAnuvaad: A Speech Translation Dataset for 14 Indian Languages
    Sparsh Jain, Ashwin Sankar, Devilal Choudhary, Dhairya Suman, Nikhil Narasimhan, Mohammed Safi Ur Rahman Khan, Anoop Kunchukuttan, Mitesh M Khapra, and Raj Dabre
    arXiv preprint arXiv: 2411.04699, 2024
  3. arXiv
    MILU: A Multi-task Indic Language Understanding Benchmark
    Sshubam Verma, Mohammed Safi Ur Rahman Khan, Vishwajeet Kumar, Rudra Murthy, and Jaydeep Sen
    arXiv preprint arXiv: 2411.02538, 2024
  4. arXiv
    Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
    Sumanth Doddapaneni*Mohammed Safi Ur Rahman Khan*, Dilip Venkatesh, Raj Dabre, Anoop Kunchukuttan, and Mitesh M. Khapra
    arXiv preprint arXiv: 2410.13394, 2024
  5. EMNLP 2024
    EMNLP-2024 Outstanding Paper Award
    Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
    Sumanth Doddapaneni*Mohammed Safi Ur Rahman Khan*, Sshubam Verma, and Mitesh M. Khapra
    Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  6. ACL 2024
    ACL-2024 Outstanding Paper Award
    IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
    Mohammed Safi Ur Rahman Khan*, Priyam Mehta*, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad G, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, and Mitesh M. Khapra
    Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  7. arXiv
    Airavata: Introducing Hindi Instruction-tuned LLM
    Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M. Khapra, Raj Dabre, Rudra Murthy, and Anoop Kunchukuttan
    arXiv preprint arXiv: 2401.15006, Aug 2024