当前位置: 首页 > 详情页

Automatically Structuring on Chinese Ultrasound Report of Cerebrovascular Diseases via Natural Language Processing

| 导出 | |

文献详情

资源类型:
WOS体系:

收录情况: ◇ SCIE ◇ EI

机构: [1]Information Center, Xuanwu Hospital, Capital Medical University, Beijing 100053, China [2]Ministry of Education Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China [3]Bioinformatics Division, Center for Synthetic and Systems Biology, Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China [4]Department of Automation, Tsinghua University, Beijing 100084, China [5]Beijing INFI-SAGACITY Technology Company, Ltd., Beijing 100086, China
出处:
ISSN:

关键词: Natural language processing (NLP) conditional random fields (CRF) ultrasound report

摘要:
The current ultrasound reports in Chinese hospitals are mostly written in free-text format. Important clinical information, such as stenosis rate and plaque location, is recorded in long sentences, especially for ultrasound reports of cerebrovascular diseases. They cannot be directly used for further automatic analysis due to the lack of structure and standardization. The goal of this paper is to assess the feasibility of applying natural language processing technology to automatically extract disease entities and relate information such as the stenosis rate and plaque location from free-text ultrasound reports of cerebrovascular diseases. A structured model using conditional random fields (CRFs) is first constructed. Then, the clause optimizing and segmentation process is performed on a word level to achieve data structuring. Seven categories of terms, including symptoms, plaque locations, diseases, and degree, in 1980 de-identified ultrasound reports were manually annotated as a training dataset. With this model, 7937 ultrasound reports were automatically processed to structure data within 40 min. The true positive rate of the model for each category of terms is 96%, 94%, 97%, 100%, 100%, 100%, and 97%, respectively. The CRF model can be used in Chinese natural language processing to provide support for unstructured data analysis. The standardized segmentation results can be obtained based on medical ontology libraries. However, real-time processing and scientific annotation remain a challenge if intelligent clinical decision making needs to be applied to a real-world clinical environment.

基金:
语种:
被引次数:
WOS:
中科院(CAS)分区:
出版当年[2018]版:
大类 | 2 区 工程技术
小类 | 2 区 计算机:信息系统 3 区 工程:电子与电气 3 区 电信学
最新[2023]版:
大类 | 3 区 计算机科学
小类 | 3 区 工程:电子与电气 4 区 计算机:信息系统 4 区 电信学
JCR分区:
出版当年[2017]版:
Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Q1 TELECOMMUNICATIONS Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
最新[2023]版:
Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Q2 TELECOMMUNICATIONS

影响因子: 最新[2023版] 最新五年平均 出版当年[2017版] 出版当年五年平均 出版前一年[2016版] 出版后一年[2018版]

第一作者:
第一作者机构: [1]Information Center, Xuanwu Hospital, Capital Medical University, Beijing 100053, China
共同第一作者:
通讯作者:
通讯机构: [1]Information Center, Xuanwu Hospital, Capital Medical University, Beijing 100053, China [2]Ministry of Education Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China [3]Bioinformatics Division, Center for Synthetic and Systems Biology, Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084, China [4]Department of Automation, Tsinghua University, Beijing 100084, China
推荐引用方式(GB/T 7714):
APA:
MLA:

资源点击量:16409 今日访问量:0 总访问量:869 更新日期:2025-01-01 建议使用谷歌、火狐浏览器 常见问题

版权所有©2020 首都医科大学宣武医院 技术支持:重庆聚合科技有限公司 地址:北京市西城区长椿街45号宣武医院