Jiayi Wu (jyah-yee/jiā-yì)
Email --
Google Scholar --
CV
GitHub --
LinkedIn --
Updates
-
2 April, 2026.
I'm joining Anima AI+Science Lab this summer through Caltech's
Summer Undergraduate Research Fellowships (SURF) program!
-
5 Dec, 2025.
Presenting our position paper and workshop papers at NeurIPS 2025 in San Diego this week c:
-
10 Nov, 2025.
I'm joining ATLAS Group next year through Brown's Undergraduate Teaching and Research Awards (UTRA)!
Hello:)
Hi! I'm an undergrad studying Mathematics (A.B.) and Computer Science (Sc.B.) at Brown University.
I hope to build the formal and empirical infrastructure for
trustworthy machine learning across two complementary dimensions:
AI for mathematics and automated reasoning:
How can we design models that reason robustly and interpretably, grounded in formal theorem proving and program synthesis?
Formal theorem proving [1] [2] and program synthesis [3] [4]
are fully specified, unambiguous, machine-checkable substrates for studying models' reasoning processes.
Concretely, I'm interested in designing model architectures that leverage various learning paradigms (neurosymbolic programming in particular) to enable more robust and interpretable reasoning, in formal theorem proving and beyond.
Principled evaluation, auditing, and guarantees:
How do we substantiate trustworthiness claims about ML systems through systematic evaluation and formal guarantees?
I'm interested in designing evaluation and auditing approaches that attach formal or statistical evidence to model properties,
from specification conformance and robustness [4] [1] to value alignment [5] and supply-chain provenance.
These approaches should generalize across systems, development stages, and deployment contexts.
Below are selected publications; a more comprehensive list of research projects and publications is available in my academic CV.
Jiayi Wu, Robert Joseph George, Anima Anandkumar.
ITPEval: Benchmarking Formal Translation Across Interactive Theorem Provers.
Under review.
Elinor Poole-Dayan,
Jiayi Wu, Taylor Sorensen, Jiaxin Pei, Michiel A. Bakker.
Benchmarking Overton Pluralism in LLMs.
ICLR 2026.
[Paper]
[Code]
[Data]
[Project Page]
Ilija Ivanov, Jiayi Wu, Gavin Zhao, Zekai Li, Stephen Bach, Robert Lewis.
Mining Machine-Generated Lean Proofs: Advice for Users, Developers, and Testers.
Under review.
Chance Jiajie Li*,
Jiayi Wu*, Zhenze Mo, Ao Qu, Yuhan Tang, Kaiya Ivy Zhao, Yulu Gan, Jie Fan, Jiangbo Yu, Jinhua Zhao, Paul Pu Liang, Luis Alberto Alonso Pastor, Kent Larson.
Simulating Society Requires Simulating Thought.
NeurIPS 2025 Position Paper Track.
[Paper]
I organize communities that connect technology with governance and public-interest perspectives.
I’m co-president of Brown’s AI Robotics Ethics Society (AIRES) and co-director of the AI Governance Panel at Brown China Summit 2025.
I'm also writing for CMU's EncyclopAIdia Policy Glossary and Brown's Socially Responsible Computing Handbook.
More updates are available here!
In my spare time, I enjoy cold brew, taking on plank challenges (≥7min),
and roaming around the neighborhoods and community spaces I'm involved in
🏘️🌳 -- if I'm not in
CIT or the Rock's basement stacks^;