Principal Investigator

Yaodong Yang
Assistant Professor
Institute for AI, Peking University
Director of PKU Alignment Group
Office: West Ziyuan Building 2209A
Email: yaodong.yang@pku.edu.cn

Research Assistants

Juntao Dai
Affiliation: Peking University
Start Time: March 2020
Research: Reinforcement Learning, Value Alignment
Email: jtd.acad@gmail.com
Mission: Exploring innovative approaches to advance AI research. Dedicated to pushing boundaries in machine learning. Committed to developing robust and ethical AI solutions. Striving to make meaningful contributions to the field.

Ph.D Students

Jiaming Ji
Affiliation: Ph.D (2023), Peking University
Start Time: March 2022
Research: Reinforcement Learning, Safety Alignment, AI for Science
Email: jiamg.ji@gmail.com
Mission: I aim to ensure AI systems are safe, aligned, and beneficial by developing principled alignment mechanisms and exploring large model applications in socially impactful domains such as healthcare, education, and science.
Jiayi Zhou
Affiliation: Ph.D (2024), Peking University
Start Time: September 2022
Research: Reinforcement Learning, AI Safety, Preference Modeling
Email: gaiejj@outlook.com
Mission: Learning from human feedback is the key to the continuous progress of AI. I am dedicated to providing richer feedback for AI, such as natural language and formal languages, to empower AI alignment and AI safety.
Donghai Hong
Affiliation: MSc (2024), Peking University
Start Time: December 2023
Research: Safety Alignment, Safety Evaluation
Email: donghai.hong@stu.pku.edu.cn
Mission: My research focuses on the safety and capabilities of AI systems. This includes the accurate and scalable evaluation of AI systems, as well as ensuring they align with human intent and values through mechanism design.
Borong Zhang
Affiliation: Ph.D (2025), Peking University
Start Time: April 2022
Research: AI Alignment, Embodied AI
Email: borongzh@gmail.com
Mission: Exploring innovative approaches to advance AI research. Dedicated to pushing boundaries in machine learning. Committed to developing robust and ethical AI solutions. Striving to make meaningful contributions to the field.
Boyuan Chen
Affiliation: Ph.D (2026), Peking University
Start Time: February 2023
Research: Reinforcement Learning, Scalable Oversight, Superalignment
Email: boyuan.chen.byc@gmail.com
Mission: Develop scalable oversight and moral alignment mechanisms that integrate theoretical and empirical approaches to ensure ethically grounded, socially responsible intelligence beyond human-level capabilities.
Kaile Wang
Affiliation: Ph.D (2026), Peking University
Start Time: September 2023
Research: Reinforcement Learning, Safety Alignment, LLM Theory
Email: jiamg.ji@gmail.com
Mission: Exploring innovative approaches to advance AI research. Dedicated to pushing boundaries in machine learning. Committed to developing robust and ethical AI solutions. Striving to make meaningful contributions to the field.

Undergraduate Students

Tianyi Qiu
Affiliation: Peking University
Start Time: May 2023
Research: Value Alignment, Scalable Oversight, Human-AI Interaction, AI Societal Impact
Email: qiutianyi.qty@gmail.com
Mission: To facilitate human moral progress with truth-seeking AI. Pervasive AI influence is harming the epistemics of the human-AI collective, and I hope to reverse that trend and turn AI systems into facilitators of collective reflection.
Hantao Lou
Affiliation: Peking University
Start Time: July 2023
Research: AI Alignment, Formal Verification, Mechanistic Interpretability
Email: hantaolou.htlou@gmail.com
Mission: Supervision signals are fundamental to AI alignment and safety. My research aims to develop scalable, stable, and verifiable supervision through formal verification and mechanistic interpretability, and to leverage reinforcement learning for optimizing the utilization of these signals in frontier AI systems.
Sitong Fang
Affiliation: Peking University
Start Time: November 2024
Research: AI Deception, Reinforcement Learning, Large Language Models
Email: sitongfang1@gmail.com
Mission: To develop principled and provably safe foundations for intelligent systems, and to foster transparent, reliable, and value-aligned AI that can be trusted in high-stakes real-world environments.

Research Interns

Xuyao Wang
Affiliation: Nankai University
Start Time: March 2024
Research: Reinforcement Learning, AI Infrastructure
Email: wxy835283116@gmail.com
Mission: Turning scientific vision into engineered reality.
Wenqi Chen
Affiliation: University of Electronic Science and Technology of China
Start Time: July 2024
Research: Reinforcement Learning, AI Alignment
Email: wqchen1024@gmail.com
Mission: Devoted to finding and overseeing advanced AI failure modes, aiming to proactively identify and mitigate risks to ensure the safe and reliable development of artificial intelligence.

Alumni

Xuehai Pan
PhD Candidate
Now at DeepSeek