Wanru Zhao

I'm a PhD student in Computer Science at University of Cambridge, advised by Prof. Nic Lane at the Cambridge Machine Learning Systems Lab (CaMLSys). I'm also a member of Cambridge AI Safety Lab, working on AI Alignment and Interpretability. Prior to that, I obtained my MPhil in Advanced Computer Science at Cambridge as well.

I’m currently visiting the Vector Institute, working with Colin Raffel at the University of Toronto. I was a research intern at Microsoft Research, mentored by Alessandro Sordoni and Lucas Caccia.

My research focuses on:

Modular, distributed/decentralised training (model merging, Mixture-of-Experts) and decentralised inference;
Data attribution/selection/curation/balancing/mixing, synthetic data generation and curriculum design for foundation model training;
Compositional reasoning of large language models (in math and coding domains) and multi-agent systems

Email / Google Scholar / GitHub / Twitter / Bluesky

News

[Jan 2026] Two conference papers accepted to ICLR 2026! See you in Rio de Janeiro 🇧🇷
[Sept 2025] One conference paper and one workshop paper accepted to NeurIPS 2025! See you in San Diego 🇺🇸 / Mexico City 🇲🇽
[Jun 2025] Two workshop papers accepted to ICML 2025 AI for Math Workshop!
[Jan 2025] I'm co-organizing the Workshop on Modular, Collaborative and Decentralized Deep Learning at ICLR 2025! See you in Singapore 🇸🇬
[Feb 2025] One paper accepted to AAMAS 2025!
[Feb 2025] One paper accepted to MLSys 2025!
[Jan 2025] One paper accepted to ASP-DAC 2025!
[Sept 2024] Two conference papers and one workshop paper accepted to NeurIPS 2024! See you in Vancouver 🇨🇦
[Feb 2024] One conference paper and two workshop papers accepted to ICLR 2024! See you in Vienna 🇦🇹
[Mar 2023] Our team got the winner of the US-UK Privacy-Enhancing Technologies Prize Challenges! We will present our solution at Innovation and Technology's Centre for Data Ethics and Innovation (CDEI) in London at the end of May. Check out the report on the Cambridge University website!
[Mar 2023] One paper accepted to FAccT 2023!

Selected Publications

Rethinking Data Curation in LLM Training: Online Reweighting Offers Better Generalization than Offline Methods
Wanru Zhao, Yihong Chen, Yuzhi Tang, Wentao Ma, Shengchao Hu, Shell Xu Hu, Alex Iacob, Abhinav Mehrotra, Nicholas Lane
International Conference on Learning Representations (ICLR), 2026
Paper

Learning to Solve Complex Problems via Dataset Decomposition
Wanru Zhao, Lucas Caccia, Zhengyan Shi, Minseon Kim, Xingdi Yuan, Weijia Xu, Marc-Alexandre Côté Alessandro Sordoni
Conference on Neural Information Processing Systems (NeurIPS), 2025
Paper

TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior
Gül Sena Altıntaş, Malikeh Ehghaghi, Brian Lester, Fengyuan Liu, Wanru Zhao, Marco Ciccone, Colin Raffel
arXiv preprint, 2025
Paper / Code / HuggingFace

CLUES: Collaborative High-Quality Data Selection for LLMs via Training Dynamics
Wanru Zhao, Hongxiang Fan, Shell Xu Hu, Wangchunshu Zhou, Bofan Chen Nicholas Lane
Conference on Neural Information Processing Systems (NeurIPS), 2024
Paper / Code / Website

Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages
Wanru Zhao, Yihong Chen, Royson Lee, Xinchi Qiu, Yan Gao, Hongxiang Fan Nicholas Lane
International Conference on Learning Representations (ICLR), 2024
Paper

Cascadia: A Cascade Serving System for Large Language Models
Youhe Jiang*, Fangcheng Fu*, Wanru Zhao*, Stephan Rabanser, Nicholas Lane, Binhang Yuan
International Conference on Learning Representations (ICLR), 2026
Paper

MR-BEN: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs
Zhongshen Zeng, Yinhong Liu, Yingjia Wan, Jingyao Li, ... , Wanru Zhao, ... , Zhijiang Guo Jiaya Jia
Conference on Neural Information Processing Systems (NeurIPS), 2024
Paper

Prompt Tuning with Diffusion for Few-Shot Pre-trained Policy Generalization
Shengchao Hu, Wanru Zhao, Weixiong Lin, Li Shen, Ya Zhang, Dacheng Tao
International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2025
Paper

Attacks on Third-Party APIs of Large Language Models
Wanru Zhao, Vidit Khazanchi, Haodi Xing, Xuanli He, Qiongkai Xu, Nicholas Lane
Arxiv preprint, 2024
Paper / Code

Harms from Increasingly Agentic Algorithmic Systems
Alan Chan, Rebecca Salganik, Alva Markelius, ... , Wanru Zhao, ... , Umang Bhatt , Adrian Weller , David Krueger, Tegan Maharaj,
ACM Conference on Fairness, Accountability, and Transparency (FAccT), 2023
Paper

Evaluating Large Language Models in Scientific Discovery
Zhangde Song, Jieyu Lu, Yuanqi Du, Botao Yu, ... , Wanru Zhao, ... , Huan Sun, Seyed Mohamad Moosavi, Chenru Duan,
arXiv preprint, 2025
Paper

(This list is not comprehensive and is being updated. For a complete list of publications, please visit my Google Scholar profile.)

Selected Internships

Microsoft Research, with Alessandro Sordoni on modular and synthetic data generation for reasoning tasks and coding agents, 2025
Vector Institute, with Colin Raffel and Nicolas Papernot on training data curation and multi-agent mechanistic design, 2024
Amazon AI Lab, with Minjie Wang and Zheng Zhang on Deep Graph Library and its interpretability toolkits GNNLens2 , 2021
SenseTime Research, with Ruihao Gong and Fengwei Yu on model quantization and compression, 2019

Selected Honors and Awards

Qualcomm Innovation Fellowship Finalist, 2025
Google PhD Fellowship Finalist, 2024
UK Privacy Enhancing Technologies Challenge Rank 1, 2022
China Competition on Virtual Reality (2020) National Grand Prize, 2020
Chinese Undergraduate Mathematical Contest in Modeling (CUMCM) National First Prize, 2020
ACM International Collegiate Programming Contest (ACM-ICPC) Silver Medal, 2018
CCF National Olympiad in Informatics (NOI) Bronze Medal, 2016

Academic Services

Conference Reviewer: NeurIPS 2024-2025, ICLR 2025, ICML 2025, AISTATS 2025, COLM 2025
Journal Reviewer: TMLR, TIST
Organizing Committee: ICLR 2025 Workshop on Modular, Collaborative and Decentralized Deep Learning (MCDC@ICLR2025)

Miscellaneous

My name is pronounced similarly to "One Rule." My Chinese name is Wanru Zhao (趙婉如), derived from the classical verse 「有美一人，婉如清揚」, symbolizing a spirit that is vivid, bright, and pure. My English name is Renee, though here is a fun fact: during my time as a competitive programmer in Olympiads and online contests (e.g., CodeJam, Codeforces), I used the handle "Ryan" to avoid being judged through the lens of gender bias.

I love reading. The works of Friedrich Nietzsche, Albert Camus, and Max Weber have deeply influenced me. My favorite sci-fi writer is Ted Chiang. I love poetry as well, and view coding as a form of writing verse. My research is driven by a dual purpose: seeking intellectual satisfaction (thus adhering to internal standards of truth rather than hacking external metrics) and fulfilling a responsibility to contribute a unique, indispensable perspective to the world (thus ensuring that technology benefits the public and creates genuine social impact).

I enjoy traveling, mountaineering, and hiking. My journey has led me to reside in Cambridge, Montreal, Toronto, Chicago, Shanghai, and Beijing, and I eagerly look forward to embracing more diverse cultures. I am also an avid snow mountain enthusiast, and I climbed the impressive Peak Nochma of Minya Konka (5588m).

Beyond my research focus, I am interested in exploring the intersection of AI and the humanities in my spare time. Feel free to check out my past projects on computer music (which won a National Grand Prize) and AI for digital theater arts (Reimagine Copenhagen was exhibited at the Prague Quadrennial).

I am always open to new conversations and collaborations! 🩵

Design and source code from Jon Barron's website