Biomedical Informatics | Clinical Decision Support

Natural language generation for electronic health records.

January 31, 2019 Biomedical Informatics, Clinical Decision Support

Researchers

Scott H Lee

Journal

NPJ digital medicine

Modalities

Models

Encoder-decoder model Generative Adversarial Networks (GANs)

Abstract

One broad goal of biomedical informatics is to generate fully-synthetic, faithfully representative electronic health records (EHRs) to facilitate data sharing between healthcare providers and researchers and promote methodological research. A variety of methods existing for generating synthetic EHRs, but they are not capable of generating unstructured text, like emergency department (ED) chief complaints, history of present illness, or progress notes. Here, we use the encoder-decoder model, a deep learning algorithm that features in many contemporary machine translation systems, to generate synthetic chief complaints from discrete variables in EHRs, like age group, gender, and discharge diagnosis. After being trained end-to-end on authentic records, the model can generate realistic chief complaint text that appears to preserve the epidemiological information encoded in the original record-sentence pairs. As a side effect of the model’s optimization goal, these synthetic chief complaints are also free of relatively uncommon abbreviation and misspellings, and they include none of the personally identifiable information (PII) that was in the training data, suggesting that this model may be used to support the de-identification of text in EHRs. When combined with algorithms like generative adversarial networks (GANs), our model could be used to generate fully-synthetic EHRs, allowing healthcare providers to share faithful representations of multimodal medical data without compromising patient privacy. This is an important advance that we hope will facilitate the development of machine-learning methods for clinical decision support, disease surveillance, and other data-hungry applications in biomedical informatics.

Show Full Text

Natural language generation for electronic health records.

Researchers

Journal

Modalities

Models

Abstract

Contrast-enhanced MRI synthesis using dense-dilated residual convolutions based 3D network toward elimination of gadolinium in neuro-oncology.

Pretreatment DCE-MRI-Based Deep Learning Outperforms Radiomics Analysis in Predicting Pathologic Complete Response to Neoadjuvant Chemotherapy in Breast Cancer.

Assisting scalable diagnosis automatically via CT images in the combat against COVID-19.

Application of deep learning to the classification of uterine cervical squamous epithelial lesion from colposcopy images.

Research and Implementation of Robot Vision Scanning Tracking Algorithm Based on Deep Learning.

Phasetime: Deep Learning Approach to Detect Nuclei in Time Lapse Phase Images.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply