I'm a second-year Ph.D. student in Computer Science at Yale University, where I am very fortunately advised by Prof. Mark Gerstein. Previously, I got my master's from Yale CS as well, advised by Prof. Dragomir Radev. My research lies in the intersection of large language models and applications in bioinformatics, with a view towards building AI scientists.

  • Alignment and Safety: The goal of alignment is to steer already-capable models to do what we want them to do.
  • Autonomous Scientific Discovery: LLMs autonomously design, plan, and perform complex experiments empowered by tools such as internet and documentation search, code execution, and experimental automation.
I am looking for grads/undergrads to collaborate and actively engage in mentorship, so feel free to email me if you are starting in the field and are confused about publication expectations, PhD Admissions, etc.

Selected Publications

Discover the google scholar | semantic scholar

  • A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation
    Xiangru Tang, Howard Dai, Elizabeth Knight, Fang Wu, Yunyang Li, Tianxiao Li, Mark Gerstein
    arXiv, 2024
    [PDF] [Abstract] [Bib]
    GenAI4Drug
  • Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science
    Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein
    arXiv, 2024
    [PDF] [Abstract] [Bib]
  • Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
    Zhuosheng Zhang, Yao Yao, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao.
    arXiv, 2023
    [PDF] [Abstract] [Bib]
    CoT-Igniting-Agent
  • MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
    Xiangru Tang*, Anni Zou*, Zhuosheng Zhang, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein.
    arXiv, 2023
    [PDF] [Abstract] [Bib]
    MedAgents
  • Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
    Cunxiang Wang*, Xiaoze Liu*, Yuanhao Yue*, Xiangru Tang, Tianhang Zhang, Cheng Jiayang, Yunzhi Yao, Wenyang Gao, Xuming Hu, Zehan Qi, Yidong Wang, Linyi Yang, Jindong Wang, Xing Xie, Zheng Zhang, Yue Zhang.
    arXiv, 2023
    [PDF] [Abstract] [Bib]
    LLM-Factuality-Survey
  • Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
    Anni Zou, Zhuosheng Zhang, Hai Zhao, Xiangru Tang.
    arXiv, 2023
    [PDF] [Abstract] [Bib]
    Meta-CoT
  • ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks
    Yuliang Liu*, Xiangru Tang*, Zefan Cai*, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Liang Chen, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein.
    arXiv, 2023
    [PDF] [Abstract] [Bib]
    ML-Bench
  • MolLM: A Unified Language Model for Integrating Biomedical Text with 2D and 3D Molecular Representations
    Xiangru Tang, Andrew Tran, Jeffrey Tan, Mark Gerstein.
    ISMB 2024 (Bioinformatics special issue)
    [PDF] [Abstract] [Bib]
    MolLM
  • BioCoder: A Benchmark for Bioinformatics Code Generation with Contextual Pragmatic Knowledge
    Xiangru Tang*, Bill Qian*, Rick Gao, Jiakang Chen, Xinyun Chen, Mark Gerstein.
    ISMB 2024 (Bioinformatics special issue)
    [PDF] [Abstract] [Bib]
    BioCoder
  • Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?
    Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein.
    NAACL 2024
    [PDF] [Abstract] [Bib]
    Struc-Bench
  • Investigating Data Contamination in Modern Benchmarks for Large Language Models
    Chunyuan Deng, Yilun Zhao, Xiangru Tang, Mark Gerstein, Arman Cohan.
    NAACL 2024
    [PDF] [Abstract] [Bib]
  • DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Financial Documents
    Yilun Zhao, Yitao Long, Hongjun Liu, Linyong Nan, Lyuhao Chen, Ryo Kamoi, Yixin Liu, Xiangru Tang, Rui Zhang, Arman Cohan
    NAACL 2024
    [PDF] [Abstract] [Bib]
  • OctoPack: Instruction Tuning Code Large Language Models
    Niklas Muennighoff, Qian Liu, Armel Zebaze, Qinkai Zheng, Binyuan Hui, Terry Yue Zhuo, Swayam Singh, Xiangru Tang, Leandro von Werra, Shayne Longpre.
    ICLR 2024 (Spotlight)
    [PDF] [Abstract] [Bib]
    Octopack
  • ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
    Yujia Qin*, Shihao Liang*, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun.
    ICLR 2024 (Spotlight)
    [PDF] [Abstract] [Bib]
    ToolLLM
  • QTSumm: Query-Focused Summarization over Tabular Data
    Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Ruizhe Chen, Xiangru Tang, Yumo Xu, Dragomir Radev, Arman Cohan.
    EMNLP 2023
    [PDF] [Abstract] [Bib]
    QTSumm
  • Investigating Table-to-Text Generation Capabilities of LLMs in Real-World Information Seeking Scenarios
    Yilun Zhao*, Haowei Zhang*, Shengyun Si*, Linyong Nan, Xiangru Tang, Arman Cohan.
    EMNLP 2023
    [PDF] [Abstract] [Bib]
    LLM-T2T
  • RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
    Yilun Zhao, Chen Zhao, Linyong Nan, Zhenting Qi, Wenlin Zhang, Xiangru Tang, Boyu Mi, Dragomir Radev.
    ACL 2023
    [PDF] [Abstract] [Bib]
    RobuT
  • CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning
    Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Desai, Aaron Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
    NAACL 2022
    [PDF] [Abstract] [Bib]
  • Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries
    Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
    NAACL 2022
    [PDF] [Abstract] [Bib]

Professional Services

Tutorial Organizer: ISMB 2024 Tutorial on A Practical Introduction to LLMs in Biomedical Research.
Workshop Organizer: ICLR 2024 Workshop on LLM Agents, SIGDIAL/INLG 2023 Workshop on Taming LLMs.
Conference Program Committee / Reviewer: NeurIPS, ICML, ACL, EMNLP, CIKM, NAACL, INLG, IEEE BigData, COLM.
Journal Reviewer: npj Digital Medicine, Neurocomputing, BMC Bioinformatics, PLOS ONE, Health Data Science.
Workshop Reviewer: KDD 2023 Workshop on Data Mining in Bioinformatics, ACL 2023 Workshop on Building Educational Apps, ACL 2023 Workshop on Clinical NLP, ICML 2023 Workshop on Neural Conv AI, ICML 2023 Workshop on Interpretable ML in Healthcare, NAACL-HLT 2021 Workshop on Language and Vision Research.

Teaching

Teaching Fellow for CPSC 452/CPSC 552/AMTH 552/CB&B 663 Deep Learning Theory and Applications, Yale University, 2023 and 2024.
Teaching Fellow for CPSC 437/CPSC 537 Introduction to Database Systems, Yale University, 2023.

Misc.

My 12 coursework at Yale: CPSC 523 Principles of Operating Systems, 537 Intro to Database, 539 Software Engineering, 552 Deep Learning Theory, 553 Unsupervised Learning, 569 Randomized Algorithms, 577 NLP, 583 Deep Learning on Graph, 668 Blockchain Research, 677 Adv NLP, 680 Trustworthy Deep Learning, 752 Biomedical Data Sci.