How does deep learning/machine learning perform in comparison to radiologists in distinguishing glioblastomas (or grade IV astrocytomas) from primary CNS lymphomas?: a meta-analysis and systematic review.
Researchers
Journal
Modalities
Models
Abstract
Several studies have been published comparing deep learning (DL)/machine learning (ML) to radiologists in differentiating PCNSLs from GBMs with equivocal results. We aimed to perform this meta-analysis to evaluate the diagnostic accuracy of ML/DL versus radiologists in classifying PCNSL versus GBM using MRI.The study was performed in accordance with PRISMA guidelines. Data was extracted and interpreted by two researchers with 12 and 23 years’ experience, respectively, and QUADAS-2 tool was used for quality and risk-bias assessment. We constructed contingency tables to derive sensitivity, specificity accuracy, summary receiver operating characteristic (SROC) curve, and the area under the curve (AUC).Our search identified 11 studies, of which 8 satisfied our inclusion criteria and restricted the analysis in each study to reporting the model showing highest accuracy, with a total sample size of 1159 patients. The random effects model showed a pooled sensitivity of 0.89 [95% CI:0.84-0.92] for ML and 0.82 [95% CI:0.76-0.87] for radiologists. Pooled specificity was 0.88 [95% CI: 0.84-0.91] for ML and 0.90 [95% CI: 0.81-0.95] for radiologists. Pooled accuracy was 0.88 [95% CI: 0.86-0.90] for ML and 0.86 [95% CI: 0.78-0.91] for radiologists. Pooled AUC of ML was 0.94 [95% CI:0.92-0.96]and for radiologists, it was 0.90 [95% CI: 0.84-0.93].MRI-based ML/DL techniques can complement radiologists to improve the accuracy of classifying GBMs from PCNSL, possibly reduce the need for a biopsy, and avoid any unwanted neurosurgical resection of a PCNSL.Copyright © 2024 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.