De-identification of Clinical Text via Bi-LSTM-CRF with Neural Language Models.

Researchers

Buzhou Tang Dehuan Jiang Jun Yan Qingcai Chen Xiaolong Wang Ying Shen

Journal

AMIA ... Annual Symposium proceedings. AMIA Symposium

Modalities

Models

Abstract

De-identification of clinical text, the prerequisite of electronic clinical data reuse, is a typical named entity recogni tion (NER) problem. A number of state-of-the-art deep learning methods for NER, such as Bi-LSTM-CRF (bidirec tional long-short-term-memory conditional random fields), have been applied for de-identification. Neural language models used for language representation bring great improvement in lots of NLP tasks when they are integrated with other deep learning methods. In this paper, we introduce Bi-LSTM-CRF with neural language models for de- identification of clinical text, and evaluate it on the de-identification datasets of the i2b2 2014 and the CEGS N- GRID 2016 challenges. Four neural language models of three types individually integrated with Bi-LSTM-CRF are compared in this study. Bi-LSTM-CRF with neural language models achieves the highest “strict” micro-averaged F1-score of 95.50% on the i2b2 2014 dataset and 91.82% on the CEGS N-GRID 2016 dataset, becoming new benchmark results on these two datasets respectively Keywords: De-identification, Named entity recognition, Bidirectional long-short-term-memory, Conditional ran dom fields, Neural language models.
©2019 AMIA – All rights reserved.

Show Full Text

De-identification of Clinical Text via Bi-LSTM-CRF with Neural Language Models.

Researchers

Journal

Modalities

Models

Abstract

Bathroom activities monitoring for older adults by a wrist-mounted accelerometer using a hybrid deep learning model.

Heart disease risk factors detection from electronic health records using advanced NLP and deep learning techniques.

From Free-text Drug Labels to Structured Medication Terminology with BERT and GPT.

Research and Application of Artificial Intelligence Based on Electronic Health Records of Patients With Cancer: Systematic Review.

Extracting chemical-protein relations with ensembles of SVM and deep learning models.

Application of specialized word embeddings and named entity and attribute recognition to the problem of unsupervised automated clinical coding.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply