Department of Computer Science and Technology, Tsinghua University
Welcome to my homepage! I am Weihang Su (苏炜航), a fourth-year PhD student at the Department of Computer Science and Technology, Tsinghua University, where I am fortunate to be advised by Prof. Yiqun Liu and Assoc. Prof. Qingyao Ai.
My research focuses on leveraging Large Language Models (LLMs) to better fulfill users’ complex information needs. My current research interests include:
My research at the intersection of Information Retrieval and Large Language Models has achieved both significant academic recognition and broad impact. Recent highlights of my works include receiving the SIGIR 2024 Best Paper Award 🏆, and my work on Parametric RAG is currently the most cited paper at SIGIR 2025 📈 (as of Feb 2026, Google Scholar). Beyond publications, I actively serve the research community as a PC member or reviewer for top-tier venues (e.g., NeurIPS, ICML, ICLR, ACL, SIGIR) and have been invited to deliver tutorials at flagship conferences (e.g., SIGIR 2025, SIGIR-AP 2025) on Dynamic and Parametric RAG, serving as the lead presenter.
My academic work is also closely connected to industrial practice. Since October 2025, I have been working as a research intern at ByteDance (TikTok), focusing on the development of advanced AI Agents. By exploring the automated construction of specialized agents and Agent Skills for agentic systems, I aim to bridge the gap between cutting-edge LLM research and scalable, real-world applications.
I am also highly passionate about mentoring undergraduate students. I have had the privilege of collaborating with talented undergraduates, successfully co-authoring high-impact papers at premier conferences and journals (e.g., ACL, SIGIR, EMNLP, TOIS, AAAI, The Web Conference).
If you are an undergraduate student interested in my research areas and driven to publish high-quality work, you are more than welcome to apply for an internship with the THUIR group through official channels, or contact me directly (WeChat ID: rdfzswh) to embark on meaningful research together!
Our SIGIR-AP 2025 tutorial on Dynamic and Parametric RAG has been accepted! Welcome to join us in Xi’an, China, on December 7th! More information at: This Webpage and SIGIR-AP 2025 Official Website.
The titles of my first-author papers are in bold (excluding co-first where the ranking is not first).
SurGE: A Benchmark and Evaluation Framework for Scientific Survey Generation
Weihang Su, Anzhe Xie, Qingyao Ai, Jianming Long, Jiaxin Mao, Ziyi Ye, Yiqun Liu
(Long Paper) Paper Code and Dataset
Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
Weihang Su, Jianming Long, Changyue Wang, Shiyu Lin, Jingyan Xu, Ziyi Ye, Qingyao Ai, Yiqun Liu
(Long Paper) Paper Code and Dataset
Generalized Pseudo-Relevance Feedback
Yiteng Tu, Weihang Su, Yujia Zhou, Yiqun Liu, Fen Lin, Qin Liu, Qingyao Ai
Webconf 2026 (Long Paper, CCF-A, THU-A)
Paper
Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models
Changyue Wang, Weihang Su, Qingyao Ai, Yiqun Liu
AAAI 2026 Main Oral (Long Paper, CCF-A, THU-A)
Paper Code
An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
Junjie Chen, Weihang Su, Zhumin Chu, Haitao Li, Qinyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma
AAAI 2026 Main (Long Paper, CCF-A, THU-A)
Paper Code
Augmenting Multi-Agent Communication with State Delta Trajectory
Yichen Tang, Weihang Su, Yujia Zhou, Yiqun Liu, Min Zhang, Shaoping Ma, Qingyao Ai
EMNLP 2025 Main (Long, CCF-B, THU-A) Paper Code
Knowledge Editing through Chain-of-Thought
Changyue Wang, Weihang Su, Qingyao Ai, Yiqun Liu
EMNLP 2025 Main (Long, CCF-B, THU-A) Paper Code
Decoupling Reasoning and Knowledge Injection for In-Context Knowledge Editing
Changyue Wang, Weihang Su, Qingyao Ai, Yujia Zhou, Yiqun Liu
ACL 2025 Findings (Long, CCF-A, THU-A) Paper Code
Dynamic and Parametric Retrieval-Augmented Generation
Weihang Su, Qingyao Ai, Jingtao Zhan, Qian Dong, Yiqun Liu
SIGIR 2025 (Tutorial, CCF-A, THU-A) Official Website Tutorial Proposal Paper
Parametric Retrieval Augmented Generation
Weihang Su, Yichen Tang, Qingyao Ai, Junxi Yan, Changyue Wang, Hongning Wang, Ziyi Ye, Yujia Zhou, Yiqun Liu
SIGIR 2025 (Long Paper, CCF-A, THU-A) Paper Code
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System
Weihang Su, Baoqing Yue, Qingyao Ai, Yiran Hu, Jiaqi Li, Changyue Wang, Kaiyuan Zhang, Yueyue Wu, Yiqun Liu
SIGIR 2025 (Long Paper, CCF-A, THU-A) Paper Code and Dataset
Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
Yiteng Tu, Weihang Su, Yujia Zhou, Yiqun Liu, Qingyao Ai
SIGIR 2025 (Long Paper, CCF-A, THU-A) Paper Code
Caseformer: Pre-training for Legal Case Retrieval Based on Inter-Case Distinctions.
Weihang Su, Qingyao Ai, Yueyue Wu, Anzhe Xie, Changyue Wang, Yixiao Ma, Haitao Li, Zhijing Wu, Yiqun Liu, Min Zhang.
ACM Transactions on Information Systems
TOIS 2025 (Long Paper, CCF-A, THU-A)
DecoupledRAG: An Efficient and Effective Retrieval Augmented Generation Framework via Cross Attention.
Qian Dong, Qingyao Ai, Hongning Wang, Yiding Liu, Haitao Li, Weihang Su, Yiqun Liu, Tat-Seng Chua, Shaoping Ma.
ACM Transactions on Information Systems
WWW 2025 (Long Paper, CCF-A, THU-A)
Mitigating Entity-Level Hallucinations in Large Language Models
Weihang Su, Yichen Tang, Qingyao Ai, Zhijing Wu, Yiqun Liu
International ACM SIGIR Conference on Information Retrieval in the Asia Pacific
SIGIR-AP 2024 (Long Paper) Paper Code
LeKUBE: A Legal Knowledge Update BEnchmark
Changyue Wang, Weihang Su, Hu Yiran, Qingyao Ai, Yueyue Wu, Cheng Luo, Yiqun Liu, Min Zhang, Shaoping Ma
International ACM SIGIR Conference on Information Retrieval in the Asia Pacific
SIGIR-AP 2024 (Long Paper) Paper Code
STARD: A Chinese Statute Retrieval Dataset with Real Queries Issued by Non-professionals
Weihang Su, Yiran Hu, Anzhe Xie, Qingyao Ai, Zibing Que, Yun Liu, Weixing Shen, Yiqun LIU
The 2024 Conference on Empirical Methods in Natural Language Processing
EMNLP 2024 Findings (Long Paper, CCF-B, THU-A) Paper Code
DRAGIN: Dynamic Retrieval Augmented Generation based on the Real-time Information Needs of Large Language Models
Weihang Su, Yichen Tang, Qingyao Ai, Zhijing Wu, Yiqun Liu.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics.
ACL 2024 Main Oral (Long Paper, CCF-A, THU-A)
[Paper] Code
Unsupervised real-time hallucination detection based on the internal states of large language models
Weihang Su, Changyue Wang, Qingyao Ai, Yiran Hu, Zhijing Wu, Yujia Zhou, Yiqun Liu.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics.
ACL 2024 Findings (Long Paper, CCF-A, THU-A)
Paper Code
Scaling Laws For Dense Retrieval.
Yan Fang, Jingtao Zhan, Qingyao Ai, Jiaxin Mao, Weihang Su, Jia Chen and Yiqun Liu.
The 47th International ACM SIGIR Conference on Research and Development in Information Retrieval
SIGIR 2024 Best Paper Award (Long Paper, CCF-A, THU-A)
Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc Retrieval.
Weihang Su, Qingyao Ai, Xiangsheng Li, Jia Chen, Yiqun Liu, Xiaolong Wu and Shengluan Hou.
The 38th Annual AAAI Conference on Artificial Intelligence
AAAI 2024 (Long Paper, CCF-A, THU-A)
Paper Code
Relevance Feedback with Brain Signals.
Ziyi Ye, Xiaohui Xie, Qingyao Ai, Yiqun Liu, Zhihong Wang, Weihang Su and Min Zhang.
ACM Transactions on Information Systems
TOIS 2024 (Long Paper, CCF-A, THU-A)
CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding.
Yixiao Ma, Yueyue Wu, Weihang Su, Qingyao Ai, Yiqun Liu.
The 2023 Conference on Empirical Methods in Natural Language Processing
EMNLP 2023 Main (Long Paper, CCF-B, THU-A)
THUIR2 at NTCIR-16 Session Search (SS) Task
Weihang Su, Xiangsheng Li, Yiqun Liu, Min Zhang, Shaoping Ma
NII Testbeds and Community for Information access Research Project
NTCIR 2022
Web Search via an Efficient and Effective Brain-Machine Interface.
Xuesong Chen, Ziyi Ye, Xiaohui Xie, Yiqun Liu, Xiaorong Gao, Weihang Su, Shuqi Zhu, Yike Sun, Min Zhang, and Shaoping Ma.
The 15th ACM International Conference on Web Search and Data Mining.
(WSDM 2022) (Demo Paper, CCF-B, THU-A)
Trade or trick? detecting and characterizing scam tokens on uniswap decentralized exchange
Pengcheng Xia, Haoyu Wang, Bingyu Gao, Weihang Su, Zhou Yu, Xiapu Luo, Chao Zhang, Xusheng Xiao, Guoai Xu
International Conference on Measurement and Modeling of Computer Systems
(SIGMETRICS 2022) (Long Paper, CCF-B, THU-A)