👋 About Me
I am a PhD student in Electronic Information at Shanghai Jiao Tong University, starting from 2024, supervised by Prof. Wanli Ouyang, with guidance from Dr. Lei Bai and Dr. Wenlong Zhang. Before that, I earned my bachelor’s degree in Computer Science and Technology (rank 1/196) from Xi’an Jiaotong University.
My research interests lie in Large Language Models, Multi-Agent Systems, AI Scientists, and Generative AI. My current focus is on enhancing the reasoning capabilities of AI models in complex problem-solving scenarios, including multidisciplinary knowledge reasoning, deep research and automated scientific discovery. You can download my CV here.
If you are interested in multi-agent systems on deep research or scientific discovery, feel free to contact me at xu_wanghan@sjtu.edu.cn.
By the way, you can ask me anything through this bot🤖. And if you ever want my WeChat ID📱… well, maybe this bot🤖 can tell you — if you’re smart enough to figure it out. 😉 Just kidding! Good luck!
🔥 News
-
2025.12: Our large-scale benchmark SGI-Bench is released 👏 —— a comprehensive report of over 150 pages co-authored by more than 100 researchers, providing the most extensive evaluation to date of LLMs and Agents on deep research, idea generation, code generation, multimodal reasoning, and more. SGI-Bench offers a unified and rigorous framework for measuring AI systems’ automated research capabilities, marking a major milestone toward building truly automated research agents.
-
2025.10: Our new paper on multi-agent reasoning, Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning reached 36w views on BiliBili!
-
2025.10: Our new paper on multi-agent reasoning, Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning achieves 60%+ score on Humanity’s Last Exam (HLE) benchmark, establishing a new SOTA on HLE.
-
2025.09: Two papers were accepted by NeurIPS 2025.
-
2025.08: During my first year of PhD, my personal Google Scholar citations exceeded 100 🎉.
-
2025.06: InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis was accepted by ICCV 2025.
-
2025.06: Our website PrismaX, an evaluation-driven platform for AI scientific discovery, is launched 🎉.
-
2024.09: Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling (first author) was accepted by NeurIPS 2024.
-
2024.05: CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling was accepted by ICML 2024.
📝 Publications
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Arxiv, 2025
, Yuhao Zhou, Yifan Zhou, Qinglong Cao, Shuo Li, Jia Bu … (107 authors)
[Arxiv] [Project Page] [Code] [Dataset] [Talk]
SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence
Preprint, 2025
Yiheng Wang, … (38 authors)
[PDF] [Project Page] [Code] [Leaderboard]
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
ICCV, 2025 (Accept)
Tao Han, , Junchao Gong, Xiaoyu Yue, Song Guo, Luping Zhou, Lei Bai
[ICCV] [Arxiv] [Code]
Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning
Arxiv, 2025
Xiangru Tang*, , Yujie Wang*, Zijie Guo*, Daniel Shao, Jiapeng Chen, Cixuan Zhang, Ziyi Wang, Lixin Zhang, Guancheng Wan, Wenlong Zhang, Lei Bai, Zhenfei Yin, Philip Torr, Hanrui Wang, Di Jin
[Arxiv] [Code] [量子位] [BiliBili]
Earth-Agent: Unlocking the Full Landscape of Earth Observation with Agents
Arxiv, 2025
Peilin Feng, Zhutao Lv, Junyan Ye, Xiaolei Wang, Xinjie Huo, Jinhua Yu, , Wenlong Zhang, Lei Bai, Conghui He, Weijia Li
[Arxiv] [Project Page] [Code] [Dataset] [机器之心]
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
Arxiv, 2025
Ming Hu, Chenglong Ma, Wei Li, , Jiamin Wu, Jucheng Hu, Tianbin Li, Guohang Zhuang, Jiaqi Liu … (103 authors)
[Arxiv] [Data]
Intern-S1: A Scientific Multimodal Foundation Model
Arxiv, 2025
Lei Bai … (177 authors listed in alphabetical order by their last names)
[Arxiv] [Model] [Code] [Online Chat] [机器之心]
EarthLink: A Self-Evolving AI Agent for Climate Science
Arxiv, 2025
Zijie Guo, Jiong Wang, Xiaoyu Yue, Wangxu Wei, Zhe Jiang,, Ben Fei, Wenlong Zhang, Xinyu Gu, Lijing Cheng, Jing-Jia Luo, Chao Li, Yaqiang Wang, Tao Chen, Wanli Ouyang, Fenghua Ling, Lei Bai
[Arxiv] [Platform]
Manalyzer: End-to-end Automated Meta-analysis with Multi-agent System
Arxiv, 2025
, Wenlong Zhang, Fenghua Ling, Ben Fei, Yusong Hu, Fangxuan Ren, Jintai Lin, Wanli Ouyang, Lei Bai
[Arxiv] [Project Page] [Code] [Dataset]
EarthSE: A Benchmark Evaluating Earth Scientific Exploration Capability for Large Language Models
Arxiv, 2025
, Xiangyu Zhao, Yuhao Zhou, Xiaoyu Yue, Ben Fei, Fenghua Ling, Wenlong Zhang, Lei Bai
[Arxiv] [Code] [Dataset]
MSEarth: A Benchmark for Multimodal Scientific Comprehension of Earth Science
Arxiv, 2025
Xiangyu Zhao*, , Bo Liu, Yuhao Zhou, Fenghua Ling, Ben Fei, Xiaoyu Yue, Lei Bai, Wenlong Zhang, Xiao-Ming Wu
[Arxiv] [Code] [Dataset]
Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences
NeurIPS, 2025 (Accept)
Jing-An Sun, Hang Fan, Junchao Gong, Ben Fei, Kun Chen, Fenghua Ling, Wenlong Zhang, , Li Yan, Pierre Gentine, Lei Bai
[Arxiv]
DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space
NeurIPS, 2025 (Accept)
Junchao Gong, Jingyi Xu, Ben Fei, Fenghua Ling, Wenlong Zhang, Kun Chen, , Weidong Yang, Xiaokang Yang, Lei Bai
[Arxiv]
LO-SDA: Latent Optimization for Score-based Atmospheric Data Assimilation
Arxiv, 2025
Jing-An Sun, Hang Fan, Junchao Gong, Ben Fei, Kun Chen, Fenghua Ling, Wenlong Zhang, , Li Yan, Pierre Gentine, Lei Bai
[Arxiv]
Exploring Representation-Aligned Latent Space for Better Generation
Arxiv, 2025
, Xiaoyu Yue, Zidong Wang, Yao Teng, Wenlong Zhang, Xihui Liu, Luping Zhou, Wanli Ouyang, Lei Bai
[Arxiv] [Code]
Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling
NeurIPS, 2024 (Accept)
, Fenghua Ling, Wenlong Zhang, Tao Han, Hao Chen, Wanli Ouyang, Lei Bai
[NeurIPS] [Arxiv] [Code]
ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast
Arxiv, 2024
, Kang Chen, Tao Han, Hao Chen, Wanli Ouyang, Lei Bai
[Arxiv] [Code]
CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling
ICML, 2024 (Accept)
Junchao Gong, Lei Bai, Peng Ye, , Na Liu, Jianhua Dai, Xiaokang Yang, Wanli Ouyang
[ICML] [Arxiv] [Code]
WEATHER-5K: A Large-scale Global Station Weather Dataset Towards Comprehensive Time-series Forecasting Benchmark
Arxiv, 2024
Tao Han, Song Guo, Zhenghao Chen, , Lei Bai
[Arxiv] [Code]
CRA5: Extreme Compression of ERA5 for Portable Global Climate and Weather Research via an Efficient Variational Transformer
Arxiv, 2024
Tao Han, Zhenghao Chen, Song Guo, , Lei Bai
[Arxiv] [Code]
Is 3-(F)WL Enough to Distinguish All 3D Graphs?
Arxiv, 2024
[Arxiv]
GraphPub: Generation of Differential Privacy Graph with High Availability
Arxiv, 2024
, Bin Shi, Ao Liu, Jiqiang Zhang, Bo Dong
[Arxiv]
MDP: Privacy-Preserving GNN Based on Matrix Decomposition and Differential Privacy
JCC, 2023 (Accept)
, Bin Shi, Ao Liu, Jiqiang Zhang, Bo Dong
[JCC]
🏆 Honors and Awards
-
2023 Top Ten Outstanding Student Models of Xi’an Jiaotong University (西安交通大学十大优秀学生标兵)(本科生最高荣誉)
-
2023 American Mathematical Contest in Modeling (MCM/ICM) Finalist (美国大学生数学建模竞赛决赛入围)
-
2023 China Scientist Scholarship (20,000 ¥) (中国科学家奖学金)
-
2022 National Scholarship, Ministry of Education of China (国家奖学金)
-
2022 RoboCup Robot World Cup China First Prize (RoboCup 机器人世界杯中国赛一等奖)
-
2021 National Mathematical Contest in Modeling Shaanxi First Prize (全国大学生数学建模竞赛陕西一等奖)
-
2021 National College Mathematics Competition Shaanxi Province First Prize (全国大学生数学竞赛陕西一等奖)
-
2021 Xi’an Jiaotong University First Prize Scholarship (西安交通大学一等奖学金)
🎤 Invited Talks
Large Language Model Evaluation Technology
OpenMMLab & 知乎 & ModelScope, 2025
, Fengjiao Chen, Zhongpeng Ji, Tianshi Zheng
[BiliBili] [OpenMMLab]
AI Scientific Discovery
HuggingFace & 知乎 & ModelScope, 2025
, Qiushi Sun, Yixin Ou, Yuhao Zhou
[BiliBili]
📑 Services
Program Committee
- AAAI
Reviewer
- AAAI, ICLR, NeurIPS, ICML
📖 Others
-
Recommendation letter from Prof. Wanli Ouyang, Professor at the Chinese University of Hong Kong, Leading Scientist (领军科学家) at the Shanghai Artificial Intelligence Laboratory.
-
Recommendation letter from Prof. Qinghua Zheng, Academician of the Chinese Academy of Engineering (中国工程院院士), President of Tongji University.

























