I am a research fellow at the School of Computing and Information Systems at Singapore Management University, supervised by Prof. Jun Sun. I also work closely with Prof. Xingjun Ma at Fudan University. I completed my Ph.D. at Xidian University under the supervision of Prof. Xixiang Lyu. My research publications are available on my Google Scholar profile.
I pursue research in Trustworthy AI, aiming to build secure, robust, and interpretable systems that align with human values and cognition. I'm especially interested in generative models (LLMs, diffusion models, and AI agents) and AI safety, and I seek simple yet insightful solutions grounded in theory. Guided by the philosophy "Everything should be made as simple as possible, but not simpler," I approach research with both rigor and curiosity. Outside of work, I enjoy rock climbing and swimming.
- AI Safety on LLMs and VLMs
- Backdoor attacks and jailbreak attacks on LLMs and VLMs
- Design and implementation of a general defense framework against backdoor attacks
We're honored to share that our BackdoorLLM has won the First Prize in the SafetyBench competition, organized by the Center for AI Safety. Huge thanks to the organizers and reviewers for recognizing our work.
Program Committee Member
- ICLR, ICML, NeurIPS, CVPR, ICCV, AAAI, ACL, EMNLP
Journal Reviewer
- IEEE TPAMI, IEEE TIFS, IEEE TDSC, IEEE TKDE