Shijie Xia (夏世杰)

Ph.D. student in Computer Science

shijiexia [AT] sjtu.edu.cn

Github | Twitter | Google Scholar | CV

“If I were given one hour to save the planet, I would spend 59 minutes defining the problem and one minute resolving it.” — Albert Einstein

News

  • [2025.10] SR-Scientist is now available as a preprint!
  • [2025.04] Our survey on Test Time Scaling is now available as a preprint!
  • [2024.12] ReasonEval has been accepted to AAAI 2025 as an oral presentation!


Bio

    I am a second-year Ph.D. student at Shanghai Jiao Tong University, advised by Prof. Pengfei Liu. Prior to that, I received my B.Eng. degree in Intelligence Science (Honors Program) from Fudan University in 2024.

    My research aims to build autonomous agents for scientific discovery and software engineering. To achieve this goal, I focus on data, algorithms, and evaluation.

    Publications

    (※ indicates co-first authors)

    SR-Scientist: Scientific Equation Discovery With Agentic AI

    Shijie Xia, Yuhan Sun, Pengfei Liu

    arXiv preprint, 2025

    Summary: We present SR-Scientist, a framework with a corresponding RL training strategy, in which an autonomous agent discovers scientific equations through long-horizon, tool-driven data analysis and equation evaluation.

    A Survey of Test Time Scaling for Reasoning

    Shijie Xia, Yiwei Qin, Xuefeng Li, Yan Ma, Run-Ze Fan, Steffi Chern, Haoyang Zou, Fan Zhou, Xiangkun Hu, Jiahe Jin, Yanheng He, Yixin Ye, Yixiu Liu, Pengfei Liu

    arXiv preprint, 2025

    Summary: We organize and analyze a broad range of work on test-time scaling.

    Evaluating Mathematical Reasoning Beyond Accuracy

    Shijie Xia, Xuefeng Li, Yixin Liu, Tongshuang Wu, Pengfei Liu

    AAAI 2025 oral presentation

    Summary: We propose ReasonEval, a suite comprising a new evaluation methodology with defined metrics for assessing mathematical reasoning quality and corresponding LLM-based evaluators for automated calculation.

    LIMO: Less is More for Reasoning

    Yixin Ye, Zhen Huang, Yang Xiao, Ethan Chern, Shijie Xia, Pengfei Liu

    arXiv preprint, 2025

    PC Agent: While You Sleep, AI Works--A Cognitive Journey into Digital World

    Yanheng He, Jiahe Jin, Shijie Xia, Jiadi Su, Runze Fan, Haoyang Zou, Xiangkun Hu, Pengfei Liu

    arXiv preprint, 2024

    O1 Replication Journey--Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

    Zhen Huang, Haoyang Zou, Xuefeng Li, Yixiu Liu, Yuxiang Zheng, Ethan Chern, Shijie Xia, Yiwei Qin, Weizhe Yuan, Pengfei Liu

    arXiv preprint, 2024

    O1 Replication Journey: A Strategic Progress Report--Part 1

    Yiwei Qin, Xuefeng Li, Haoyang Zou, Yixiu Liu, Shijie Xia, Zhen Huang, Yixin Ye, Weizhe Yuan, Hector Liu, Yuanzhi Li, Pengfei Liu

    arXiv preprint, 2024

    Evaluating Safety with Critique

    Yixiu Liu, Yuxiang Zheng, Shijie Xia, Yuan Guo, Jiajun Li, Yi Tu, Chaoling Song, Pengfei Liu

    Findings of EMNLP 2024

    OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

    Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang, Dahua Lin, Yu Qiao, Pengfei Liu

    NeurIPS 2024

    Selected Honors and Awards

    Outstanding Graduate of Fudan University, 2024

    Shanghai City Scholarship, 2022

    Fudan University Academic Scholarship, 2020-2024

    Service

    Program Committee/Reviewer
    AAAI (2025-2026), ICLR (2026), EMNLP (2025)