PKU-Alignment Group @Pair-Lab (under construction)
AI Safety
Aligner: Efficient Alignment by Learning to Correct
Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Juntao Dai, Yaodong Yang
NeurIPS 2024 (Oral)
AI Alignment, AI Safety, NeurIPS
PDF | Code | Dataset
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset
Juntao Dai, Tianle Chen, Xuyao Wang, Ziran Yang, Taiye Chen, Jiaming Ji, Yaodong Yang
NeurIPS 2024
AI Safety, Safety Alignment
PDF
Language Models Resist Alignment: Evidence From Data Compression
Jiaming Ji, Kaile Wang, Tianyi Qiu, Boyuan Chen, Jiayi Zhou, Changye Li, Hantao Lou, Yaodong Yang
ACL 2025 Best Paper
Large Language Models, Safety Alignment, AI Safety
PDF