😀 About Me
I am a researcher at the Multi-Agent System Lab, Beijing Institute for General Artificial Intelligence. My research interests include multi-agent systems, reinforcement learning, large language models, and spatio-temporal data mining.
I received my Ph.D. in Computer Science and Engineering from the Hong Kong University of Science and Technology under the supervision of Prof. Qiang Yang and Dr. Yu Zheng. My work has been published in top-tier conferences such as ICML, ICLR, ACL, KDD, WWW, AAAI, and CIKM, as well as leading journals including TKDE, Transportation Research Part C.
Hiring PhD students jointly supervised with 上海交通大学 北京理工大学 上海科技大学
- An interest in Multi-agent systems & Large language models & Data mining
- Strong code ability
- Determination to do high-quality research
🔥 News
• 2026.01 🎉 1 paper accepted to ICLR 2026
• 2026.01 🎉 1 paper accepted to WWW 2026
• 2025.11 🎉 1 paper accepted to SIGKDD 2026
• 2025.11 🎉 1 paper accepted to AAAI 2026
• 2026.01 - 2028.12 🎉 National Natural Science Foundation of China
• 2025.05 🎉 2 papers accepted to ACL 2025
📝 Selected Publications
# equal contribution ‡ corresponding authors
2026
-
Yexin Li. CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning. TMLR 2026 -
Zheng Zhang, Ziwei Shan, Kaitao Song,
Yexin Li‡, Kan Ren ‡. Linking Process to Outcome: Conditional Reward Modeling for LLM Reasoning. ICLR 2026 -
Jinjin Guo,
Yexin Li‡, Zhichao Huang, Jun Fang, Zhiyuan Liu, Chao Liu, Pengzhang Liu, Qixia Jiang. Spectral Disentanglement and Enhancement: A Dual-domain Contrastive Framework for Representation Learning. WWW 2026 -
Sijie Ruan, Renchi Jiang, Song Tang,
Yexin Li, Weixin Zhai, Xinhao Liu, Bingbing Hu, Hanning Yuan, Caicong Wu, Shuliang Wang. Predictive Mobile Refueling for Agricultural Machinery via Deep Reinforcement Learning. SIGKDD 2026 -
Zhixiang Zhang, Shuo Chen,
Yexin Li, Feng Wang. ADAPT: Adaptive Decentralized Architecture with Perception-Aligned Training for Structural Generalization in Multi-Agent RL. AAAI 2026
2025
-
Zheng Zhang, Shaocheng Lan, Lei Song, Jiang Bian,
Yexin Li, Kan Ren. Learning to Select In-Context Demonstration Preferred by Large Language Model. ACL Findings 2025 -
Yipeng Kang, Junqi Wang,
Yexin Li, Mengmeng Wang, Wenming Tu, Quansen Wang, Hengli Li, Tingjun Wu, Xue Feng, Fangwei Zhong, Zilong Zheng. Are the Values of LLMs Structurally Aligned with Humans? A Causal Perspective. ACL Findings 2025
2024
-
Yexin Li, Zhancun Mu, Siyuan Qi. A Contextual Combinatorial Bandit Approach to Negotiation. ICML 2024 -
Siyuan Qi #, Shuo Chen #,
Yexin Li#, Xiangyu Kong #, Junqi Wang #, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu. CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents. ICLR 2024
2023 and Before
-
Siyuan Feng, Shuqing Wei, Junbo Zhang,
Yexin Li, Jintao Ke, Gaode Chen, Yu Zheng, Hai Yang. A Macro–Micro Spatio-temporal Neural Network for Traffic Prediction. TRC 2023 -
Tianfu He, Jie Bao,
Yexin Li, Hui He, Yu Zheng. Crowd-Sensing Enhanced Parking Patrol Using Sharing Bikes’ Trajectories. TKDE 2021 -
Yexin Li, Yu Zheng, Qiang Yang. Cooperative Multi-Agent Reinforcement Learning in Express System. CIKM 2020 -
Ting Li, Junbo Zhang, Kainan Bao, Yuxuan Liang,
Yexin Li, Yu Zheng. AutoST: Efficient Neural Architecture Search for Spatio-temporal Prediction. SIGKDD 2020 -
Yexin Li, Yu Zheng, Qiang Yang. Efficient and Effective Express via Contextual Cooperative Reinforcement Learning. SIGKDD 2019 -
Yexin Li, Yu Zheng. Citywide Bike Usage Prediction in a Bike-Sharing System. TKDE 2019 -
Yexin Li, Yu Zheng, Qiang Yang. Dynamic Bike Reposition: A Spatio-Temporal Reinforcement Learning Approach. SIGKDD 2018 -
Yexin Li, Yu Zheng, Huichu Zhang, Lei Chen. Traffic Prediction in a Bike-Sharing System. SIGSPATIAL 2015
🎓 Education
- 2016.09 - 2020.11 - Ph.D. in Computer Science and Engineering, Hong Kong University of Science and Technology
- 2013.09 - 2015.08 - MPhil in Computer Science and Engineering, Hong Kong University of Science and Technology
- 2009.09 - 2013.07 - B.S. in Mathematics and Applied Mathematics, Xi'an Jiaotong University
💼 Work Experience
- 2023.04 - Now - Researcher at the Multi-Agent Systems Lab, Beijing Institute for General Artificial Intelligence
- 2021.01 - 2023.03 - Doctor Management Trainee at JD.COM
- 2017.05 - 2018.01 - Research Intern at Microsoft Research Asia
🤝 Collaborators
- Shuo Chen - Researcher at Multi-Agent System Lab, Beijing Institute for General Artificial Intelligence
- Sijie Ruan - Assistant professor at School of Computer Science and Technology, Beijing Institute of Technology
- Siyuan Qi - Researcher at Multi-Agent System Lab, Beijing Institute for General Artificial Intelligence
- Jinjin Guo - Algorithm Engineer at JD.com
