Publications

You can also find my articles on Google Scholar and Semantic Scholar.

Remark: * indicates equal contribution.

Preprints

  • SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning.
    Tianshi Zheng , Rui Wang, Xiyun Li, Yangqiu Song, Tianqing Fang.
    Under Review , 2026. [pdf]

  • NAACL: Noise-AwAre Verbal Confidence Calibration for Robust LLMs in RAG Systems.
    Jiayu Liu, Rui Wang, Qing Zong, Yumeng Wang, Cheng Qian, Qingcheng Zeng, Tianshi Zheng , Haochen Shi, Dadi Guo, Baixuan Xu, Chunyang Li, Yangqiu Song.
    Under Review , 2026. [pdf]

  • CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?
    Qing Zong, Jiayu Liu, Tianshi Zheng , Chunyang Li, Baixuan Xu, Haochen Shi, Weiqi Wang, Zhaowei Wang, Chunkit Chan, Yangqiu Song.
    Under Review , 2025. [pdf]

  • The Cognitive Bandwidth Bottleneck: Shifting Long-Horizon Agent from Planning with Actions to Planning with Schemas.
    Baixuan Xu, Tianshi Zheng , Zhaowei Wang, Hong Ting Tsang, Weiqi Wang, Tianqing Fang, Yangqiu Song.
    Under Review , 2025. [pdf]

  • Structuring the Unstructured: A Systematic Review of Text-to-Structure Generation for Agentic AI with a Universal Evaluation Framework.
    Zheye Deng, Chunkit Chan, Tianshi Zheng , Wei Fan, Weiqi Wang, Yangqiu Song.
    Under Review , 2025. [pdf]

  • Rethinking Prospect Theory for LLMs: Revealing the Instability of Decision-Making under Epistemic Uncertainty.
    Rui Wang, Qihan Lin, Jiayu Liu, Qing Zong, Tianshi Zheng , Dadi Guo, Haochen Shi, Weiqi Wang, Yangqiu Song.
    Under Review , 2025. [pdf]

  • Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training.
    Tianqing Fang, Zhisong Zhang, Xiaoyang Wang, Rui Wang, Can Qin, Yuxuan Wan, Jun-Yu Ma, Ce Zhang, Jiaqi Chen, Xiyun Li, Yonglin Wang, Jingchen Ni, Tianshi Zheng , Chun Chen, Wenhao Yu, Zhenwen Liang, Hongming Zhang, Haitao Mi, Dong Yu.
    Under Review , 2025. [pdf]

  • Legal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents.
    Wei Fan, Tianshi Zheng , Yiran Hu, Zheye Deng, Weiqi Wang, Baixuan Xu, Chunyang Li, Haoran Li, Weixing Shen, Yangqiu Song.
    Under Review , 2025. [pdf]

  • Towards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study.
    Baixuan Xu*, Chunyang Li*, Weiqi Wang, Wei Fan, Tianshi Zheng , Haochen Shi, Tao Fan, Yangqiu Song, Qiang Yang.
    Under Review , 2025. [pdf]

  • CLR-Fact: Evaluating the complex logical reasoning capability of large language models over factual knowledge.
    Tianshi Zheng* , Jiaxin Bai*, Yicheng Wang, Tianqing Fang, Yue Guo, Yauwai Yim, Yangqiu Song.
    Under Review , 2024. [pdf]

Journals

  • Top Ten Challenges Towards Agentic Neural Graph Databases.
    Jiaxin Bai, Zihao Wang, Yukun Zhou, Hang Yin, Weizhi Fei, Qi Hu, Zheye Deng, Jiayang Cheng, Tianshi Zheng , Hong Ting Tsang, Yisen Gao, Zhongwei Xie, Yufei Li, Lixin Fan, Binhang Yuan, Wei Wang, Lei Chen, Xiaofang Zhou, Yangqiu Song.
    IEEE Data Engineering Bulletin , 2025. [pdf]
  • Sequential Query Encoding for Complex Query Answering on Knowledge Graphs.
    Jiaxin Bai*, Tianshi Zheng* , Yangqiu Song.
    Transactions on Machine Learning Research (TMLR) , 2023. [pdf] , [code]

  • The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning.
    Tianshi Zheng* , Yixiang Chen*, Chengxi Li*, Chunyang Li, Qing Zong, Haochen Shi, Baixuan Xu, Yangqiu Song, Ginny Y Wong, Simon See.
    Transactions on Machine Learning Research (TMLR) , 2025. [pdf] , [code]

Conference Proceedings

  • NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents.
    Tianshi Zheng* , Kelvin Tam*, Newt Nguyen*, Baixuan Xu, Zhaowei Wang, Jiayang Cheng, Hong Ting Tsang, Weiqi Wang, Jiaxin Bai, Tianqing Fang, Yangqiu Song, Ginny Y Wong, Simon See.
    In The Fourteenth International Conference on Learning Representations (ICLR-2026) , 2026. [pdf] , [code]

  • AutoGraph-R1: End-to-End Reinforcement Learning for Knowledge Graph Construction.
    Hong Ting Tsang, Jiaxin Bai, Haoyu Huang, Qiao Xiao, Tianshi Zheng , Baixuan Xu, Shujie Liu, Yangqiu Song.
    In The 64th Annual Meeting of the Association for Computational Linguistics (ACL-2026) , 2026. [pdf]

  • AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora.
    Jiaxin Bai, Wei Fan, Qi Hu, Qing Zong, Chunyang Li, Hong Ting Tsang, Hongyu Luo, Yauwai Yim, Haoyu Huang, Xiao Zhou, Feng Qin, Tianshi Zheng , Xi Peng, Xin Yao, Huiwen Yang, Leijie Wu, Yi Ji, Gong Zhang, Renhai Chen, Yangqiu Song.
    In The 64th Annual Meeting of the Association for Computational Linguistics (ACL-2026) , 2026. [pdf] , [code]

  • Controllable Logical Hypothesis Generation for Abductive Reasoning in Knowledge Graphs.
    Yisen Gao, Jiaxin Bai, Tianshi Zheng , Qingyun Sun, Ziwei Zhang, Xingcheng Fu, Jianxin Li, Yangqiu Song.
    In The Fourteenth International Conference on Learning Representations (ICLR-2026) , 2026. [pdf]

  • InferenceDynamics: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling.
    Haochen Shi*, Tianshi Zheng* , Weiqi Wang*, Baixuan Xu, Chunyang Li, Chunkit Chan, Tao Fan, Yangqiu Song, Qiang Yang.
    In The 64th Annual Meeting of the Association for Computational Linguistics (ACL-2026) , 2026. [pdf]

  • DixitWorld: Evaluating Multimodal Abductive Reasoning in Vision-Language Models with Multi-Agent Dixit Gameplay.
    Yunxiang Mo*, Tianshi Zheng* , Qing Zong, Jiayu Liu, Baixuan Xu, Yauwai Yim, Chunkit Chan, Jiaxin Bai, Yangqiu Song.
    In The 64th Annual Meeting of the Association for Computational Linguistics (ACL-2026) , 2026. [pdf]

  • From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery.
    Tianshi Zheng , Zheye Deng, Hong Ting Tsang, Weiqi Wang, Jiaxin Bai, Zihao Wang, Yangqiu Song.
    In The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP-2025) , 2025. [pdf] , [code]

  • LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning.
    Tianshi Zheng , Jiayang Cheng, Chunyang Li, Haochen Shi, Zihao Wang, Jiaxin Bai, Yangqiu Song, Ginny Y Wong, Simon See.
    In The 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP-2025) , 2025. [pdf] , [code]

  • Enhancing Transformers for Generalizable First-Order Logical Entailment.
    Tianshi Zheng* , Jiazheng Wang*, Zihao Wang, Jiaxin Bai, Hang Yin, Zheye Deng, Yangqiu Song, Jianxin Li.
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL-2025) , 2025. [pdf] , [code]

  • KnowShiftQA: How Robust are RAG Systems when Textbook Knowledge Shifts in K-12 Education?
    Tianshi Zheng* , Weihan Li*, Jiaxin Bai, Weiqi Wang, Yangqiu Song.
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL-2025) , 2025. [pdf] , [code]

  • Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations.
    Chunyang Li, Weiqi Wang, Tianshi Zheng , Yangqiu Song.
    In Findings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL-2025 Findings) , 2025. [pdf] , [code]

  • ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty.
    Qing Zong, Zhaowei Wang, Tianshi Zheng , Xiyu Ren, Yangqiu Song.
    In Findings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL-2025 Findings) , 2025. [pdf]

  • Evaluating and enhancing llms agent based on theory of mind in guandan: A multi-player cooperative game under imperfect information.
    Yauwai Yim, Chunkit Chan, Tianyu Shi, Zheye Deng, Wei Fan, Tianshi Zheng , Yangqiu Song.
    In 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT 2024) , 2024. [pdf]

  • Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction.
    Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng , Yauwai Yim, Yangqiu Song.
    In The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP-2024)( Oral ) , 2024. [pdf] , [code]

  • Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation.
    Jiaxin Bai*, Yicheng Wang*, Tianshi Zheng , Yue Guo, Xin Liu, Yangqiu Song.
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL-2024) , 2024. [pdf] , [code]