A comparison study on creating simulated patient data for individuals suffering from chronic coronary disorders.

Researchers

Angela Koloi, Antonis Sakellarios, Costas Papaloukas, Dimitrios Fotiadis, Jakub Kazmierski, Jos A Bosch, Karina Nowakowska, Nikolaos Tachos, Rick Quax, Vasileios S Loukas

Journal

Annual International Conference of the IEEE Engineering in Medicine and Biology Society

Models

Gaussian Copula Model, Generative Adversarial Network (GAN), Variational Autoencoder (VAE)

Abstract

Virtual population (VP) and synthetic data generation is an emerging area of data science that has recently gained attention. It could significantly affect the healthcare industry by providing a means to augment clinical research databases that have too few subjects. The current study compares five distinct approaches for creating virtual data populations from real patient data. The dataset comprised clinical data collected from patients scheduled for elective coronary artery bypass graft (CABG) surgery. The five computational techniques used to augment this dataset were: (i) Tabular Preset, (ii) Gaussian Copula Model, (iii) a Generative Adversarial Network (GAN)-based deep learning data synthesizer (CTGAN), (iv) a variation of the CTGAN model (Copula GAN), and (v) a variational autoencoder (VAE)-based deep learning data synthesizer (TVAE). The techniques were assessed on how effectively they produce high-quality virtual data. For this purpose, dataset correlation matrices, cosine similarity, density histograms, and kernel density estimation were used to compare each attribute with its synthetic equivalent. Our findings demonstrate that the Gaussian Copula Model performs best at creating virtual data with consistent distributions (Kolmogorov-Smirnov (KS) and Chi-Squared (CS) test scores of 0.9 and 0.98, respectively) and correlation patterns (average cosine similarity of 0.95).

Clinical Relevance: The use of a VP has been shown to increase the predictive performance of a machine learning (ML) model compared with using a smaller, non-augmented population.
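The two numeric checks named in the abstract, a KS-based distribution score and the cosine similarity between correlation matrices, can be illustrated with a minimal standard-library sketch. It is not the authors' code. It assumes the KS score is reported as 1 minus the KS statistic (the abstract does not say how the scores are normalized), and the column names `age` and `bmi` and all values are hypothetical toy data.

```python
import math
from bisect import bisect_right


def ks_complement(real, synthetic):
    """Return 1 - KS statistic for two numeric samples (1.0 means identical ECDFs).

    Assumption: the score is reported on this 0-to-1 scale; the source does not confirm it.
    """
    real_sorted = sorted(real)
    synth_sorted = sorted(synthetic)
    max_gap = 0.0
    # The gap between the two empirical CDFs only changes at observed values.
    for x in set(real_sorted) | set(synth_sorted):
        f_real = bisect_right(real_sorted, x) / len(real_sorted)
        f_synth = bisect_right(synth_sorted, x) / len(synth_sorted)
        max_gap = max(max_gap, abs(f_real - f_synth))
    return 1.0 - max_gap


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def correlation_matrix(columns):
    """Pairwise Pearson correlations for a dict of column name -> values."""
    names = list(columns)
    return [[pearson(columns[a], columns[b]) for b in names] for a in names]


def cosine_similarity(matrix_a, matrix_b):
    """Cosine similarity between two matrices, each flattened to a vector."""
    u = [value for row in matrix_a for value in row]
    v = [value for row in matrix_b for value in row]
    dot = sum(p * q for p, q in zip(u, v))
    norm_u = math.sqrt(sum(p * p for p in u))
    norm_v = math.sqrt(sum(q * q for q in v))
    return dot / (norm_u * norm_v)


# Hypothetical toy data standing in for real and synthetic patient attributes.
real = {"age": [54, 61, 67, 70, 58, 73], "bmi": [24.1, 27.5, 29.0, 31.2, 25.3, 30.4]}
synthetic = {"age": [55, 60, 66, 71, 59, 72], "bmi": [24.8, 27.0, 28.4, 30.9, 25.9, 30.1]}

for name in real:
    print(name, "KS complement:", round(ks_complement(real[name], synthetic[name]), 3))

similarity = cosine_similarity(correlation_matrix(real), correlation_matrix(synthetic))
print("correlation cosine similarity:", round(similarity, 3))
```

On both scales, values close to 1 indicate that the synthetic data reproduce the real per-attribute distributions and the real correlation structure. Categorical attributes would need a separate check, such as the Chi-Squared test mentioned in the abstract, which this sketch does not cover.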
