🧑‍🎓 Biography

I am a final-year Ph.D. candidate in Computer Science at Shanghai Jiao Tong University, supervised by Prof. Hai Zhao. Before that, I received my B.S. degree in Computer Science from Central South University in 2021.

My research focuses on improving the efficiency of large language models, including but not limited to model pruning, efficient training, and KV cache optimization.

🔍 I am actively looking for a postdoc position. Please feel free to reach out if you have a suitable opportunity! 🙂

📝 Publications

LESA: Learnable LLM Layer Scaling-Up
Yifei Yang, Zouying Cao, Xinbei Ma, Yao Yao, Libo Qin, Zhi Chen, Hai Zhao
ACL 2025
LaCo: Large Language Model Pruning via Layer Collapse
Yifei Yang, Zouying Cao, Hai Zhao
EMNLP 2024, Findings
BatGPT-Chem: A Foundation Large Model For Chemical Engineering
Yifei Yang, Runhan Shi, Zuchao Li, Shu Jiang, Bao-Liang Lu, Qibin Zhao, Yang Yang, Hai Zhao
Research, SCI (Q1 Top Journal)
Nothing in Excess: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering
Zouying Cao, Yifei Yang, Hai Zhao
AAAI 2025
FreDF: Learning to Forecast in Frequency Domain
Hao Wang, Licheng Pan, Zhichao Chen, Degui Yang, Sen Zhang, Yifei Yang, Xinggao Liu, Haoxuan Li, Dacheng Tao
ICLR 2025
KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
Yifei Yang, Zouying Cao, Qiguang Chen, Libo Qin, Dongjie Yang, Zhi Chen, Hai Zhao
Preprint
How Deep is Love in LLMs’ Hearts? Exploring Semantic Size in Human-like Cognition
Yao Yao, Yifei Yang, Xinbei Ma, Dongjie Yang, Zhuosheng Zhang, Zuchao Li, Hai Zhao
Preprint
Head-wise Shareable Attention for Large Language Models
Zouying Cao, Yifei Yang, Hai Zhao
EMNLP 2024, Findings
Hypertext Entity Extraction in Webpage
Yifei Yang, Tianqiao Liu, Bo Shao, Hai Zhao, Linjun Shou, Ming Gong, Daxin Jiang
Preprint
AutoHall: Automated Hallucination Dataset Generation for Large Language Models
Zouying Cao, Yifei Yang, Hai Zhao
Preprint
CMMLU: Measuring Massive Multitask Language Understanding in Chinese
Haonan Li, Yixuan Zhang, Fajri Koto, Yifei Yang, Hai Zhao, Yeyun Gong, Nan Duan, Timothy Baldwin
ACL 2024, Findings
Attack Named Entity Recognition by Entity Boundary Interference
Yifei Yang, Hongqiu Wu, Hai Zhao
COLING 2024
RefGPT: Dialogue Generation of GPT, by GPT, and for GPT
Dongjie Yang, Ruifeng Yuan, Yuantao Fan, Yifei Yang, Zili Wang, Shusen Wang, Hai Zhao
EMNLP 2023, Findings
BatGPT: A Bidirectional Autoregressive Talker from Generative Pre-trained Transformer
Zuchao Li, Shitou Zhang, Hai Zhao, Yifei Yang, Dongjie Yang
Preprint
Aspect-Based Sentiment Analysis as Machine Reading Comprehension
Yifei Yang, Hai Zhao
COLING 2022
Nested Named Entity Recognition as Corpus Aware Holistic Structure Parsing
Yifei Yang, Zuchao Li, Hai Zhao
COLING 2022

📖 Education

Sep. 2021 – Jun. 2026 (expected), Ph.D. in Computer Science, Shanghai Jiao Tong University (SJTU)
Sep. 2017 – Jun. 2021, B.S. in Computer Science, Central South University (CSU)

💻 Internships

Shanghai Artificial Intelligence Laboratory, Research Intern in Large-scale Models
Feb. 2024 – Jun. 2024, mentored by Dr. Zhi Chen and Dr. Hang Yan
Microsoft STCA, Research Intern in Webpage Entity Extraction
Aug. 2022 – Mar. 2023, mentored by Principal Applied Scientist Manager Linjun Shou

🛠️ Service

Reviewer for ACL, EMNLP, ICLR, AAAI, NeurIPS, TASLP, TMC

🎯 Miscellaneous