Created in June 19, 2024
2024
Our work “FBI: Finding Blindspots in Evaluator LLMs with Interpretable Checklists” is out on ArXiv.