Ruizhe Chen
PhD student

Ruizhe Chen is currently a PhD candidate at Zhejiang University, advised by Prof. Zuozhu Liu. Previously, he was an undergradate student at ZJUI, Zhejiang University. He has been a visiting scholar at SUTD, advised by Prof. Tony Q.S. Quek and an intern at Tiktok, Bytedance. His research primarily in the alignment of LLMs, especially their fairness and personalization. He has published related papers in top-tier conferences and journals such as NeurIPS, ICLR, ACL, EMNLP, NAACL and AAAI.


Education
  • Zhejiang University
    Zhejiang University
    Department of Computer Science
    Ph.D. Student
    Sep. 2021 - present
  • Zhejiang University
    Zhejiang University
    B.S. in Electrical Engineering
    Sep. 2017 - Jul. 2021
News
2025
Two paper accepted by ICLR 2025. One paper accepted by NAACL 2025.
Feb 02
2021
Start PhD at Zhejiang University
Aug 31
Selected Publications (view all )
Pad: Personalized alignment of llms at decoding-time
Pad: Personalized alignment of llms at decoding-time

Ruizhe Chen, Zuozhu Liu

ICLR 2025

Large Language Models Alignment.

Pad: Personalized alignment of llms at decoding-time

Ruizhe Chen, Zuozhu Liu

ICLR 2025

Large Language Models Alignment.

Learnable Privacy Neurons Localization in Language Models
Learnable Privacy Neurons Localization in Language Models

Ruizhe Chen, Tianxiang Hu, Zuozhu Liu

The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 main) 2024

Large Language Models Safety (Privacy).

Learnable Privacy Neurons Localization in Language Models

Ruizhe Chen, Tianxiang Hu, Zuozhu Liu

The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 main) 2024

Large Language Models Safety (Privacy).

Fast model debias with machine unlearning
Fast model debias with machine unlearning

Ruizhe Chen, Jianfei Yang, Zuozhu Liu

Advances in Neural Information Processing Systems 2023

DL Fairness, Large Language Models Fairness, Machine Unlearning via Influence Function

Fast model debias with machine unlearning

Ruizhe Chen, Jianfei Yang, Zuozhu Liu

Advances in Neural Information Processing Systems 2023

DL Fairness, Large Language Models Fairness, Machine Unlearning via Influence Function

All publications