Han Zhang (张晗, Hanlard)
Email: zhangh04@pcl.ac.cn
Google Scholar: link
Correcting Large Language Model Behavior via Influence Function. Han Zhang (张晗), Zhuo Zhang, Yi Zhang, et al. AAAI Conference on Artificial Intelligence (AAAI), 2025. (Oral)
CPPO: Continual Learning for Reinforcement Learning with Human Feedback. Han Zhang (张晗), Yu Lei, Lin Gui, et al. International Conference on Learning Representations (ICLR), 2024.
COPR: Continual Human Preference Learning via Optimal Policy Regularization. Han Zhang (张晗), Lin Gui, Yu Lei, et al. Association for Computational Linguistics (ACL), 2025.
CLLE: A benchmark for continual language learning evaluation in multilingual machine translation. Han Zhang (张晗), Sheng Zhang, Yang Xiang, et al. Empirical Methods in Natural Language Processing (EMNLP), 2022.
Incremental pre-training from smaller language models. Han Zhang (张晗), Hui Wang, Ruifeng Xu. Proceedings of the 10th SIGHAN Workshop on Chinese Language Processing (SIGHAN-10), 2025.
BeyondGender: A Multifaceted Bilingual Dataset for Practical Sexism Detection. Xuan Luo, Li Yang, Han Zhang (张晗), et al. AAAI Conference on Artificial Intelligence (AAAI), 2025.
Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning. Han Zhang (张晗), Ruibin Zheng, Zexuan Yi, Zhuo Zhang, Hanyang Peng, Hui Wang, Zike Yuan, Cai Ke, Shiwei Chen, Jiacheng Yang, Yangning Li, Xiang Li, Jiangyue Yan, Yaoqi Liu, Liwen Jing, Jiayin Qi, Ruifeng Xu, Binxing Fang, Yue Yu. ArXiv, 2025.
PanGu-alpha: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation. Wei Zeng, Xiaozhe Ren, Teng Su, Hui Wang, Yi Liao, Zhiwei Wang, Xin Jiang, ZhenZhang Yang, Kaisheng Wang, Xiaoda Zhang, Chen Li, Ziyan Gong, Yifan Yao, Xinjing Huang, Jun Wang, Jianfeng Yu, Qi Guo, Yue Yu, Yan Zhang, Jin Wang, Hengtao Tao, Dasen Yan, Zexuan Yi, Fang Peng, Fangqing Jiang, Han Zhang (张晗), Lingfeng Deng, Yehong Zhang, Zhe Lin, Chao Zhang, Shaojie Zhang, Mingyue Guo, Shanzhi Gu, Gaojun Fan, Yaowei Wang, Xuefeng Jin, Qun Liu, Yonghong Tian. ArXiv, 2021.
An Orthogonality-based Dual-memory Framework for Continual Text Classification. Han Zhang (张晗), Yu Lei, Bin Liang, et al. IEEE Transactions on Audio, Speech and Language Processing (TASLP), 2025.
Prompt-based prototypical framework for continual relation extraction. Han Zhang (张晗), Bin Liang, Min Yang, et al. IEEE Transactions on Audio, Speech and Language Processing (TASLP), 2022.
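支持鹏程系列开源大模型应用生态演化的可持续学习能力探索 (Exploring Sustainable Learning Capabilities to Support the Evolution of the PengCheng Open-Source Large Model Application Ecosystem). Yue Yu, Xin Liu, Fangqing Jiang, Han Zhang (张晗), et al. 智能科学与技术学报 (Chinese Journal of Intelligent Science and Technology), 2022.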