Lexin Zhou

I am a research resident at Microsoft, advised by Dr. Xing Xie and Prof. Jose Hernandez-Orallo. I did my master’s in NLP & HCI at the University of Cambridge, supervised by Prof. Andreas Vlachos. Prior to that, I did my BSc in Data Science at the Universitat Politècnica de València, where I got into research by working with Prof. Jose Hernandez-Orallo.

I am interested in research on AI Evaluation, social computing, human-AI interaction, and AI safety, regularly taking inspiration from psychometrics and cognitive science. At present, I mostly spend my day thinking about (i) designing robust evaluation methods that offer explanatory and predictive power over AI’s capabilities and limitations, and (ii) assessing and anticipating societal risks associated with the deployment of AI, with the aim of offering actionable insights that translate into policy and design changes that minimise the harms of AI while amplifying its benefits. I am especially intrigued by general-purpose systems like LLMs.

I’ve spent time in research/consultancy roles on AI Evaluation at Microsoft Research, Meta AI, OpenAI, Krueger AI Safety Lab, VRAIN, and the European Commission JRC. My work has been featured in Nature, Financial Times, MIT Tech Review, Forbes, IEEE Spectrum, El País, New Scientist, QbitAI, and IBM, among others.

If you are drawn to everything relevant to AI Evaluation and wanna stay informed, please subscribe to our monthly AI Evaluation Digest newsletter! If you wanna talk about something I do, feel free to reach out via email or on Twitter.

news

Mar 20, 2025	💡 Invited talk on General Scales Unlock AI Evaluation with Explanatory and Predictive Power at Princeton University!
Mar 09, 2025	📜 New preprint introducing conceptual and technological innovations for a science of AI Evaluation: General Scales Unlock AI Evaluation with Explanatory and Predictive Power! Takeaways on X and an open platform calling for collaborations and extensions of our methodology. This is the work I personally feel the most excited about to date.
Oct 30, 2024	💡 Invited talk on Larger and More Instructable Language Models Become Less Reliable at Microsoft Research!
Sep 25, 2024	📜 Larger and More Instructable Language Models Become Less Reliable is finally out in Nature! Takeaways on X and a fairly well-written article in Chinese by QbitAI. This reminds me of Goodhart’s law.
Sep 20, 2024	📜 An LLM Feature-based Framework for Dialogue Constructiveness Assessment has been accepted to EMNLP 2024, receiving high review scores that placed it in the top 0.5% of all submissions!
Sep 09, 2022	👨‍💻 Participated in the Red Team of GPT-4 at OpenAI, focusing on capability assessment, reliability evaluation, and adversarial testing.

selected publications

arXiv

General Scales Unlock AI Evaluation with Explanatory and Predictive Power

Lexin Zhou, Lorenzo Pacchiardi, Fernando Martínez-Plumed, Katherine M. Collins, Yael Moros-Daval, Seraphina Zhang, and 20 more authors

2025

🌟 Lexin’s Favorite Award PDF

This work is the one that I personally feel the most proud of, given its technological and conceptual innovations for a science of AI Evaluation.

NATURE

Larger and More Instructable Language Models Become Less Reliable

Lexin Zhou, Wout Schellaert, Fernando Martínez-Plumed, Yael Moros-Daval, Cèsar Ferri, and José Hernández-Orallo

Nature, 2024

🌟 Extensive Media Coverage PDF Code

This work has been featured by Nature, Forbes, MIT Tech Review, IEEE Spectrum, El País, New Scientist, QbitAI, and IBM, among other media outlets.

EMNLP

An LLM Feature-based Framework for Dialogue Constructiveness Assessment

Lexin Zhou, Youmna Farag, and Andreas Vlachos

EMNLP, 2024

🌟 Top 0.5% of Submissions PDF Code

This work received an average review score of 4.17 out of 5, placing it in the top 0.5% of all submissions in ARR June 2024.

arXiv

Predictable Artificial Intelligence

Lexin Zhou, Pablo A. Moreno-Casares, Fernando Martínez-Plumed, John Burden, Ryan Burnell, Lucy Cheke, and 9 more authors

Under Review, 2023

PDF