Faced with challenging cases, doctors are increasingly seeking diagnostic advice from large language models (LLMs). This study aims to compare the ability of LLMs and human physicians to diagnose challenging cases. An offline dataset of 67 challenging cases with primary gastrointestinal symptoms was used to solicit possible diagnoses from seven LLMs and 22 gastroenterologists. The diagnoses by Claude 3.5 Sonnet covered the highest proportion (95% confidence interval [CI]) of instructive diagnoses (76.1% [70.6%-80.9%]), significantly surpassing all the gastroenterologists (p < 0.05 for all). Claude 3.5 Sonnet also achieved a significantly higher coverage rate (95% CI) than the gastroenterologists who used search engines or other traditional resources (76.1% [70.6%-80.9%] vs. 45.5% [40.7%-50.4%], p < 0.001). The study highlights that advanced LLMs may assist gastroenterologists in challenging cases by offering an instructive, time-saving, and cost-effective diagnostic scope.
Funding:
National Natural Science Foundation of China (National Science Foundation of China) [2022YFC2505100]; National Key R&D Program of China [81970557, 82003152, 82000506]; National Natural Science Foundation of China [GTCZ-2023-SD-08]; Research Project of the Chinese Early Gastrointestinal Cancer Physicians' Collaborative Growth Program; Beijing Huaxia Cancer Prevention and Treatment Research Institute
Language:
Foreign language
Times cited:
WOS:
PubmedID:
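Chinese Academy of Sciences (CAS) journal ranking:
Publication-year edition [2025]:
Major category | Tier 1: Medicine
Subcategory | Tier 1: Health Care Sciences & Services; Tier 1: Medical Informatics
Latest edition [2025]:
Major category | Tier 1: Medicine
Subcategory | Tier 1: Health Care Sciences & Services; Tier 1: Medical Informatics
JCR quartile:
Publication-year edition [2023]:
Q1 HEALTH CARE SCIENCES & SERVICES; Q1 MEDICAL INFORMATICS
Latest edition [2024]:
Q1 HEALTH CARE SCIENCES & SERVICES; Q1 MEDICAL INFORMATICS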
First author affiliations: [1] Fourth Mil Med Univ, Xijing Hosp Digest Dis, State Key Lab Holist Integrat Management Gastroint, Xian, Peoples R China; [2] Fourth Mil Med Univ, Xijing Hosp Digest Dis, Natl Clin Res Ctr Digest Dis, Xian, Peoples R China
Co-first authors:
Corresponding author:
Corresponding author affiliations: [1] Fourth Mil Med Univ, Xijing Hosp Digest Dis, State Key Lab Holist Integrat Management Gastroint, Xian, Peoples R China; [2] Fourth Mil Med Univ, Xijing Hosp Digest Dis, Natl Clin Res Ctr Digest Dis, Xian, Peoples R China
Recommended citation (GB/T 7714):
Yang Xintian, Li Tongxin, Wang Han, et al. Multiple large language models versus experienced physicians in diagnosing challenging cases with gastrointestinal symptoms[J]. NPJ DIGITAL MEDICINE, 2025, 8(1). doi:10.1038/s41746-025-01486-5.
APA:
Yang, Xintian, Li, Tongxin, Wang, Han, Zhang, Rongchun, Ni, Zhi, ... & Pan, Yanglin. (2025). Multiple large language models versus experienced physicians in diagnosing challenging cases with gastrointestinal symptoms. NPJ DIGITAL MEDICINE, 8(1).
MLA:
Yang, Xintian, et al. "Multiple large language models versus experienced physicians in diagnosing challenging cases with gastrointestinal symptoms." NPJ DIGITAL MEDICINE 8.1 (2025).