Publications

2025

  • DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning [PDF]
    Pengcheng Jiang, Jiacheng Lin, Lang Cao, Runchu Tian, SeongKu Kang, Zifeng Wang, Jimeng Sun, Jiawei Han.
    Preprint.

  • InformGen: An AI Copilot for Accurate and Compliant Clinical Research Consent Document Generation [PDF]
    Zifeng Wang, Junyi Gao, Benjamin Danek, Brandon Theodorou, Ruba Shaik, Shivashankar Thati, Seunghyun Won, Jimeng Sun.
    Preprint.

  • A foundation model for human-AI collaboration in medical literature mining [PDF]
    Zifeng Wang, Lang Cao, Qiao Jin, Joey Chan, Nicholas Wan, Behdad Afzali, Hyun-Jin Cho, Chang-In Choi, Mehdi Emamverdi, Manjot K. Gill, Sun-Hyung Kim, Yijia Li, Yi Liu, Yiming Luo, Hanley Ong, Justin Rousseau, Irfan Sheikh, Jenny J. Wei, Ziyang Xu, Christopher M. Zallek, Kyungsang Kim, Yifan Peng, Zhiyong Lu, Jimeng Sun.
    Preprint.

2024

  • Demystifying Large Language Models for Medicine: A Primer [PDF]
    Qiao Jin, Nicholas Wan, Robert Leaman, Shubo Tian, Zhizheng Wang, Yifan Yang, Zifeng Wang, Guangzhi Xiong, Po-Ting Lai, Qingqing Zhu, Benjamin Hou, Maame Sarfo-Gyamfi, Gongbo Zhang, Aidan Gilson, Balu Bhasuran, Zhe He, Aidong Zhang, Jimeng Sun, Chunhua Weng, Ronald M. Summers, Qingyu Chen, Yifan Peng, Zhiyong Lu.
    Preprint.

  • SynRL: Aligning Synthetic Clinical Trial Data with Human-preferred Clinical Endpoints Using Reinforcement Learning [PDF]
    Trisha Das, Zifeng Wang, Afrah Shafquat, Mandis Beigi, Jason Mezey, Jimeng Sun.
    Preprint.

  • A Perspective for Adapting Generalist AI to Specialized Medical AI Applications and Their Challenges [PDF]
    Zifeng Wang, Hanyin Wang, Benjamin Danek, Ying Li, Christina Mack, Luk Arbuckle, Devyani Biswal, Hoifung Poon, Yajuan Wang, Pranav Rajpurkar, Jimeng Sun.
    Preprint.

  • Can Large Language Models Replace Data Scientists in Biomedical Research? [PDF]
    Zifeng Wang*, Benjamin Danek*, Ziwei Yang, Zheng Chen, Jimeng Sun.
    Preprint.

  • Matching Patients to Clinical Trials with Large Language Models [PDF]
    Qiao Jin, Zifeng Wang, Charalampos S. Floudas, Fangyuan Chen, Changlin Gong, Dara Bracken-Clarke, Elisabetta Xue, Yifan Yang, Jimeng Sun, Zhiyong Lu.
    Nature Communications, 2024

    News Coverage: Nature, NIH News, NIH Director's Blog, Health Science Top 25 of 2024, POLITICO, AUA News, Azure Government, Informa, FedScoop, MSN

  • Accelerating Clinical Evidence Synthesis with Large Language Models [PDF]
    Zifeng Wang, Lang Cao, Benjamin Danek, Yichi Zhang, Qiao Jin, Zhiyong Lu, Jimeng Sun.
    Preprint.

  • Panacea: A foundation model for clinical trial search, summarization, design, and recruitment [PDF]
    Jiacheng Lin, Hanwen Xu, Zifeng Wang, Sheng Wang, Jimeng Sun.
    Preprint.

  • MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models [PDF]
    Yilin Wen*, Zifeng Wang*, Jimeng Sun
    ACL'24

2023

  • BioBridge: Bridging Biomedical Foundation Models via Knowledge Graph [PDF]
    Zifeng Wang, Zichen Wang, Balasubramaniam Srinivasan, Vassilis N. Ioannidis, Huzefa Rangwala, Rishita Anubhai.
    ICLR'24

  • UniPredict: Large Language Models are Universal Tabular Classifiers [PDF]
    Ruiyu Wang*, Zifeng Wang*, Jimeng Sun
    Preprint.

  • PyTrial: Machine Learning Software and Benchmark for Clinical Trial Applications [PDF]
    Zifeng Wang, Brandon Theodorou, Tianfan Fu, Cao Xiao, Jimeng Sun.
    Preprint.

  • MediTab: Scaling Medical Tabular Data Predictors via Data Consolidation, Enrichment, and Refinement [PDF]
    Zifeng Wang*, Chufan Gao*, Cao Xiao, Jimeng Sun.
    IJCAI'24

  • AutoTrial: Prompting Language Models for Clinical Trial Design [PDF]
    Zifeng Wang, Cao Xiao, Jimeng Sun.
    EMNLP'23

  • SPOT: Sequential Predictive Modeling of Clinical Trial Outcome with Meta-Learning [PDF]
    Zifeng Wang, Cao Xiao, Jimeng Sun.
    ACM-BCB'23

  • TWIN: Personalized Clinical Trial Digital Twin Generation [PDF]
    Trisha Das*, Zifeng Wang*, and Jimeng Sun.
    KDD'23

2022

  • PromptEHR: Conditional Electronic Healthcare Records Generation with Prompt Learning [PDF]
    Zifeng Wang and Jimeng Sun.
    EMNLP'22.

  • MedCLIP: Contrastive Learning from Unpaired Medical Images and Texts [PDF] [Video]
    Zifeng Wang, Zhenbang Wu, Dinesh Agarwal, and Jimeng Sun.
    EMNLP'22.

  • Trial2Vec: Zero-Shot Clinical Trial Document Similarity Search using Self-Supervision [PDF]
    Zifeng Wang and Jimeng Sun.
    Findings of EMNLP'22

  • TransTab: Learning Transferable Tabular Transformers Across Tables [PDF] [Package] [Poster]
    Zifeng Wang and Jimeng Sun.
    NeurIPS'22

  • SurvTRACE: Transformer for Survival Analysis with Competing Events [PDF]
    Zifeng Wang and Jimeng Sun.
    ACM-BCB'22

Before 2021

  • PAC-Bayes Information Bottleneck [PDF] [Slides] [Video(CN)]
    Zifeng Wang, Shao-Lun Huang, Ercan E. Kuruoglu, Jimeng Sun, Xi Chen, and Yefeng Zheng.
    ICLR'22. Spotlight (176/3391).

  • Finding Influential Instances for Distantly Supervised Relation Extraction [PDF]
    Zifeng Wang, Rui Wen, Xi Chen, Shao-Lun Huang, Ningyu Zhang, and Yefeng Zheng.
    COLING'22. Oral.

  • Lifelong Learning Disease Diagnosis on Clinical Notes [PDF] [Video]
    Zifeng Wang, Yifan Yang, Rui Wen, Xi Chen, Shao-Lun Huang, and Yefeng Zheng.
    PAKDD'21. Best Student Paper Award (1/768).

  • Online Disease Self-diagnosis with Inductive Heterogeneous Graph Convolutional Networks [PDF] [Video]
    Zifeng Wang, Rui Wen, Xi Chen, Shilei Cao, Shao-Lun Huang, Buyue Qian, and Yefeng Zheng.
    WWW'21.

  • Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback [PDF] [Video]
    Zifeng Wang, Xi Chen, Rui Wen, Shao-Lun Huang, Ercan E. Kuruoglu, and Yefeng Zheng.
    NeurIPS'20.

  • Less Is Better: Unweighted Data Subsampling via Influence Function [PDF]
    Zifeng Wang, Hong Zhu, Zhenhua Dong, Xiuqiang He, and Shao-Lun Huang.
    AAAI'20.

  • Deep Semantic Segmentation for Visual Understanding on Construction Sites [PDF]
    Zifeng Wang, Yuyang Zhang, Khalid M. Mosalam, Yuqing Gao, and Shao-Lun Huang.
    Computer-Aided Civil and Infrastructure Engineering (CACAIE), 1-18 (2021).

  • Data-driven Risk Assessment on Urban Pipeline Network Based on a Cluster Model [PDF]
    Zifeng Wang and Suzhen Li.
    Reliability Engineering & System Safety, 196: 106781 (2019).