Proceedings of the 2016 3rd International Conference on Materials Engineering, Manufacturing Technology and Control

Extracting Clinical entities and their assertions from Chinese Electronic Medical Records Based on Machine Learning

Authors
Jianhong Wang, Yousong Peng, Bin Liu, Zhiqiang Wu, Lizong Deng, Taijiao Jiang
Corresponding Author
Jianhong Wang
Available Online April 2016.
DOI
10.2991/icmemtc-16.2016.290How to use a DOI?
Keywords
Chinese Electronic Medical Records; Information extraction; Named Entity Recognition; assertion classification; Machine Learning
Abstract

With the rapid growth of electronic medical records (EMRs) in China, large amounts of clinical data have been accumulated. However, limited work for extracting information from EMRs in Chinese has been conducted. In this work, using manually annotated dataset of EMRs in Chinese, we investigated the clinical Named Entities Recognition (NER) based on Conditional Random Field (CRF) and further built a Support Vector Machine (SVM) classifier to determine their assertion status and evaluate the contributions of different features for assertion classification. For Chinese clinical NER, our CRF-based classifier achieved the best F-measure of 89.07%, while the SVM-based assertion classifier achieved a maximum F-measure of 94.10%. Our work suggests that machine learning methods are helpful in NER and assertion determination for Chinese medical clinical records.

Copyright
© 2016, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Download article (PDF)

Volume Title
Proceedings of the 2016 3rd International Conference on Materials Engineering, Manufacturing Technology and Control
Series
Advances in Engineering Research
Publication Date
April 2016
ISBN
10.2991/icmemtc-16.2016.290
ISSN
2352-5401
DOI
10.2991/icmemtc-16.2016.290How to use a DOI?
Copyright
© 2016, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Cite this article

TY  - CONF
AU  - Jianhong Wang
AU  - Yousong Peng
AU  - Bin Liu
AU  - Zhiqiang Wu
AU  - Lizong Deng
AU  - Taijiao Jiang
PY  - 2016/04
DA  - 2016/04
TI  - Extracting Clinical entities and their assertions from Chinese Electronic Medical Records Based on Machine Learning
BT  - Proceedings of the 2016 3rd International Conference on Materials Engineering, Manufacturing Technology and Control
PB  - Atlantis Press
SP  - 1503
EP  - 1508
SN  - 2352-5401
UR  - https://doi.org/10.2991/icmemtc-16.2016.290
DO  - 10.2991/icmemtc-16.2016.290
ID  - Wang2016/04
ER  -