Skip to content
View bboylyg's full-sized avatar
πŸ˜€
I may be slow to respond.
πŸ˜€
I may be slow to respond.
  • Singapore

Block or report bboylyg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bboylyg/README.md

Hi there, I am Yige Li πŸ‘‹

I am a research fellow at the School of Computing and Information Systems at Singapore Management University supervised by Prof. Jun Sun. I also work closely with Prof. Xingjun Ma at Fudan University. I completed my Ph.D. at Xidian University under the supervision of Prof. Xixiang Lyu. Research publications are available on my Google Scholar

I pursue research in Trustworthy AI, aiming to build secure, robust, and interpretable systems that align with human values and cognition. I’m especially interested in generative models (LLMs, diffusion models, and AI agents) and AI safety, and I seek simple yet insightful solutions grounded in theory. Guided by the philosophy "Everything should be made as simple as possible, but not simpler," I approach research with both rigor and curiosity. Outside of work, I enjoy rock climbing πŸ§— and swimming 🏊.

πŸ”­ My research mainly focus on:

  • AI Safety on LLMs and VLMs
  • Backdoor attacks and jailbreak attacks on LLMs and VLMs
  • Design and implement a general defense framework for backdoor attacks

⚑ Award Honors:

We're honored to share that our BackdoorLLM has won the First Prize in the SafetyBench competition, organized by the Center for AI Safety. Huge thanks to the organizers and reviewers for recognizing our work.

πŸ… Professional Activities

Program Committee Member

  • ICLR, ICML, NeurIPS, CVPR, ICCV, AAAI, ACL, EMNLP

Journal Reviewer

  • IEEE TPAMI, IEEE TIFS, IEEE TDSC, IEEE TKDE

πŸ“« How to reach me:

Pinned Loading

  1. BackdoorLLM BackdoorLLM Public

    BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models

    Python 200 20

  2. XTransferBench XTransferBench Public

    Forked from HanxunH/XTransferBench

    [ICML 2025] X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP

    Python

  3. NAD NAD Public

    This is an implementation demo of the ICLR 2021 paper [Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks](https://openreview.net/pdf?id=9l0K4OM-oXE) in PyTorch.

    Python 122 14

  4. ABL ABL Public

    Anti-Backdoor learning (NeurIPS 2021)

    Python 82 9

  5. RNP RNP Public

    Reconstructive Neuron Pruning for Backdoor Defense (ICML 2023)

    Python 39 5

  6. Multi-Trigger-Backdoor-Attacks Multi-Trigger-Backdoor-Attacks Public

    Shortcuts Everywhere and Nowhere: Exploring Multi-Trigger Backdoor Attacks

    Python 7 2