Search

PKU-Alignment Group @Pair-Lab (under construction)

PKU-Alignment Group @Pair-Lab (under construction)

News
People
Events
Publications
Contact
More Platforms
知乎 Bilibili Email 小红书 PAIR-Lab

Copied

Copied to clipboard

Sirui Han

Latest

Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
Generative RLHF-V: Learning Principles from Multi-modal Human Preference
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback

© 2025 PKU-Alignment Group.

Published with Hugo Blox Builder — the free, open source website builder that empowers creators.

Cite