Chinese Medical Named Entity Recognition Using Pre-Trained Language Models and an Efficient Global Pointer Mechanism
Abstract
Medical Named Entity Recognition (MNER) is a critical task in medical text mining, serving as a foundation for intelligent diagnosis, disease prediction, and related tasks. However, Chinese medical texts present unique challenges, such as diverse entity classifications, varying entity lengths, and complex entity nesting, which hinder NER performance. To address these issues, we propose a Chinese MNER method leveraging pre-trained models and the efficient global pointer (EGP). Our approach incorporates three key innovations: (1) data augmentation techniques, including easy data augmentation and remote supervision with a medical entity dictionary, to address imbalanced entity types distribution and varying entity lengths; (2) a character-word fusion strategy that integrates RoBERTa-generated character vectors and Word2Vec-generated word vectors, enhancing semantic representation; and (3) an improved decoding layer based on EGP, enabling efficientrecognition of both nested and flat entities while reducing computational costs. Experimentals show that our method achieves F1 scores of 75.87% and 92.77% on the CMeEE-V2 and CCKS2020 datasets, respectively, outperforming the RoBERTa-BiLSTM-CRF baseline by 3.06% and 4.38%, respectively.
Full Text:
PDFReferences
References
W. Li, K. Zhang, T. Guan, H. Zhang, T. Zhu, B. Chang and Q. Chen, Overview of CHIP2020 shared Task1: Named entity recognition in Chinese medical text, Journal of Chinese Information Processing 36(4) (2022) 66-72.
J. Du, H. Yi and S. Feng, Research and development of named entity recognition in Chinese electronic medical record, Acta Electronica Sinica 50(12) (2022) 3030-3053.
X. Wang, R. Yang, Y. Feng, D. Li and J. Hou, A military named entity relation extraction approach based on deep learning, in International Conference on Algorithms, Computing and Artificial Intelligence (ACAI) (ACM Press, Sanya, CN, 2018), 1-6.
C. Fócil-Arias, G. Sidorov and A. Gelbukh, Medical events extraction to analyze clinical records with conditional random fields, Journal of Intelligent & Fuzzy Systems 36(5) (2019) 4633-4643.
F. Shen, S. Liu, S. Fu, Y. Wang, S. Henry, O. Uzuner and H. Liu, Family history extraction from synthetic clinical narratives using natural language processing: Overview and evaluation of a challenge data set and solutions for the 2019 national NLP clinical challenges (n2c2)/Open Health Natural Language Processing (OHNLP) Competition, JMIR Medical Informatics 9(1) (2021) e24008.
N. Zhang, M. Chen, Z. Bi, et al, CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark, arxiv preprint arxiv:2106.08087(2021).
H. Wang and C. Wang, A Survey: Chinese medical named entity recognition, Journal of Shandong Normal University (Natural Science) 36(2) (2021) 109-117.
S. Jiang, S. Zhao, K. Hou, Y. Liu and L. Zhang, A BERT-BiLSTM-CRF model for Chinese electronic medical records named entity recognition, in International Conference on Intelligent Computation Technology and Automation (ICICTA) (IEEE Press, Xiangtan, CN, 2019), 166-169.
J. Yang, Y. Guan, B. He, C. Qu, Q. Yu, Y. Liu and Y. Zhao, Corpus construction for named entities and entity relations on Chinese electronic medical records, Journal of Software 27(11) (2016) 2725-2746.
Q. Wang, Y. M. Zhou, T. Ruan, D. Gao, Y. Xia and P. He, Incorporating dictionaries into deep neural networks for the Chinese clinical named entity recognition, Journal of Biomedical Informatics 92(2019) 103133.
Z. Pan, Research on the recognition of Chinese named entity based on rules and statistics, Information Science 30(5) (2012) 708-712+786.
X. Chen, C. Ouyang, Y. Liu, and Y. Bu, Improving the named entity recognition of Chinese electronic medical records by combining domain dictionary and rules, International Journal of Environmental Research and Public Health 17(8) (2020) 2687.
H. Wang, W. L. K. Yeung, Q. X. Ng, A. Tung, J. A. M. Tay, D. Ryanputra, M. E. H. Ong, M. L. Feng and S. Arulanandam, A weakly-supervised named entity recognition machine learning approach for emergency medical services clinical audit, International Journal of Environmental Research and Public Health 18(15) (2021) 7776.
C. Cortes and V. Vapnik, Support-vector networks, Machine learning 20(1995) 273-297.
S. Fine, Y. Singer and N. J. M. l. Tishby, The hierarchical hidden Markov model: Analysis and applications, Machine Learning 32(1998) 41-62.
J. Lafferty, A. McCallum and F. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, in International Conference on Machine Learning (ICML) (Morgan Kaufmann Press, Williamstown, USA, 2001),. 3.
Z. F. Ju, J. Wang and F. Zhu, Named entity recognition from biomedical text using SVM, in International Conference on Bioinformatics and Biomedical Engineering (ICBBE) (IEEE Press, Wuhan, CN, 2011), 1-4.
B. T. Gao, Y. Zhang and B. Liu, BioTrHMM: named entity recognition algorithm based on transfer learning in biomedical texts, Application Research of Computers 36(1) (2019) 45-48.
K. Liu, Q. Hu, J. Liu and C. Xing, Named entity recognition in Chinese electronic medical records based on CRF, in Web Information Systems and Applications Conference (WISA) (IEEE Press, Liuzhou, CN, 2017), 105-110.
A. Mumuni and F. Mumuni, Automated data processing and feature engineering for deep learning and big data applications: a survey, Journal of Information Intelligence (2024).
T. Gui, R. Ma, Q. Zhang, L. Zhao, Y.-G. Jiang, and X. Huang, CNN-Based Chinese NER with Lexicon Rethinking, in International Joint Conference on Artificial Intelligence (IJCAI) (Macao, CN, 2019), 4982-4988.
J. Chung, Ç. Gülçehre, K. Cho and Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, arxiv preprint arxiv:1412.3555 (2014).
R. Dey and F. M. Salem, Gate-variants of gated recurrent unit (GRU) neural networks, in International Midwest Symposium on Circuits and Systems (MWSCAS) (IEEE Press, Boston, USA, 2017), 1597-1600.
S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural computation 9(8) (1997) 1735-1780.
J. Devlin, M. W. Chang, K. Lee and K. Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding, arxiv preprint arxiv:1810.04805 (2018).
J. Zhong, Z. Xuan, K. Wang and Z. Cheng, A BERT-Span model for Chinese named entity recognition in rehabilitation medicine, PeerJ Computer science 9(2023) e1535.
J. Pennington, R. Socher, and C. D. Manning, Glove: Global vectors for word representation, in Empirical Methods in Natural Language Processing (EMNLP) (ACL Press, Doha, QA, 2014), 1532-1543.
B. K. K., P. Jens-Michalis, G. Paul, B. Florian, A. L. C., L. Leonhard, B. Felix, X. Lina, L. J. P., N. S. M., A. Moritz, G. Lennart, M. M. R., A. H. J.W.L., and L. Alexander, medBERT.de: A comprehensive German BERT model for the medical domain Expert Systems with Applications 237 (2024) 121598.
Y. Sun, S. Wang, Y. Li, S. Feng, X. Chen, H. Zhang, X. Tian, D. Zhu, H. Tian and H. Wu, Ernie: Enhanced representation through knowledge integration, arxiv preprint arxiv:1904.09223 (2019).
Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi, D. Chen, O. Levy, M. Lewis, L. Zettlemoyer and V. Stoyanov, Roberta: A robustly optimized bert pretraining approach, arxiv preprint arxiv:1907.11692 (2019).
S. K. Sienčnik, Adapting word2vec to named entity recognition, in Nordic Conference of Computational Linguistics (NODALIDA) (ACL Press, Vilnius, Lithuania, 2015), pp. 239-243.
M. Gridach, Character-level neural network for biomedical named entity recognition, Journal of Biomedical Informatics 70(2017) 85-91.
X. Wang, Y. Zhang, X. Ren, Y. H. Zhang, M. Zitnik, J. B. Shang, C. Langlotz and J. W. Han, Cross-type biomedical named entity recognition with deep multi-task learning, Bioinformatics 35(10) (2019) 1745-1752.
B. Ji, R. Liu, S. S. Li, J. Yu, Q. B. Wu, Y. S. Tan and J. J. Wu, A hybrid approach for named entity recognition in Chinese electronic medical record, BMC Medical Informatics and Decision Making 19(2019) 149-158.
J. Su, A. Murtadha, S. Pan, J. Hou, J. Sun, W. Huang, B. Wen and Y. Liu, Global pointer: Novel efficient span-based approach for named entity recognition, arxiv preprint arxiv:2208.03054 (2022).
P. Zhang and W. Liang, Medical name entity recognition based on lexical enhancement and global pointer, International Journal of Advanced Computer Science Applications 14(3) (2023).
J. Wei and K. Zou, Eda: Easy data augmentation techniques for boosting performance on text classification tasks, arxiv preprint arxiv:1901.11196 (2019).
T. Yan, X. Zhang and Z. Luo, LTACL: Long-tail awareness contrastive learning for distantly supervised relation extraction, Complex & Intelligent Systems 10(1) (2023) 1551-1563.
W. Zhang and J. Jiang, Bootstrap-Based resampling methods for software reliability measurement under small sample condition, Journal of Circuits, Systems and Computers 33(9) (2023) 2450161.
J. Su, M. Ahmed, Y. Lu, S. Pan, W. Bo and Y. Liu, Roformer: Enhanced transformer with rotary position embedding, Neurocomputing 568 (2024) 127063.
K. Yan, Optimizing an English text reading recommendation model by integrating collaborative filtering algorithm and FastText classification method, Heliyon 10(9) (2024) e30413.
W. K. Li, W. Li and Y. F. Wu, Combination methods of Chinese character and word embeddings in deep learning, Journal of Chinese Information Processing 31(06) (2017) 140-146.
J. Su, M. Zhu, A Murtadha, S. Pan, B. Wen and Y. Liu, Zlpr: A novel loss for multi-label classification, arxiv preprint arxiv:2208.02955 (2022).
DOI: https://doi.org/10.31449/inf.v49i19.8043

This work is licensed under a Creative Commons Attribution 3.0 License.