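Curriculum Vitae
Education
- Undergraduate student in Software Engineering, Fudan University, 2023–present
Research Interests
- Trustworthy evaluation of large language models
- Medical NLP and real-world clinical benchmarks
- Open-ended novelty assessment and AI for science
Links
- Homepage: https://www.huayusha.org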
- GitHub: https://github.com/HuayuSha
- ORCID: https://orcid.org/0009-0006-1742-5816
- OpenReview: https://openreview.net/profile?id=~Huayu_Sha1
- Email: [email protected]
Selected Publications
- SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents
Yujiong Shen*, Yajie Yang*, Zhiheng Xi*, Binze Hu, Huayu Sha, Jiazheng Zhang, Qiyuan Peng, Junlin Shang, Jixuan Huang, Yutao Fan, Jingqi Tong, Shihan Dou, Ming Zhang, Lei Bai, Zhenfei Yin†, Tao Gui†, Xingjun Ma, Qi Zhang, Xuanjing Huang†, Yu-Gang Jiang
* Co-first authors; † corresponding authors
arXiv preprint · ICML 2026 submission (under review), 2026
- OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment
Ming Zhang*†, Kexin Tan*, Yueyuan Huang*, Yujiong Shen, Chunchun Ma, Li Ju, Xinran Zhang, Yuhui Wang, Wenqing Jing, Jingyi Deng, Huayu Sha, Binze Hu, Jingqi Tong, Changhao Jiang, Yage Geng, Yuankai Ying, Yue Zhang, Zhangyue Yin, Zhiheng Xi, Shihan Dou, Tao Gui, Qi Zhang†, Xuanjing Huang
* Co-first authors; † corresponding authors
arXiv preprint, 2026
- LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models
Ming Zhang*†, Yujiong Shen*, Jingyi Deng*, Yuhui Wang*, Huayu Sha, Kexin Tan, Qiyuan Peng, Yue Zhang, Junzhe Wang, Shichun Liu, Yueyuan Huang, Changhao Jiang, Jingqi Tong, Yilong Wu, Zhihao Zhang, Mingqi Wu, Mingxu Chai, Zhiheng Xi, Shihan Dou, Tao Gui, Qi Zhang†, Xuanjing Huang
* Co-first authors; † corresponding authors
ACL 2026 submission (under review), 2025
- LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation
Ming Zhang*, Yujiong Shen*, Zelin Li*, Huayu Sha, Binze Hu, Yuhui Wang, Chenhao Huang, Shichun Liu, Jingqi Tong, Changhao Jiang, Mingxu Chai, Zhiheng Xi, Shihan Dou, Tao Gui, Qi Zhang†, Xuanjing Huang†
* Co-first authors; † corresponding authors
Findings of EMNLP 2025, 2025