Data set terminology of deep learning in medicine: a historical review and recommendation.

Researchers

Akifumi Hagiwara Daiju Ueda Hiroshi Seki Hirotaka Takita Rintaro Ito Shannon L Walston Shingo Sato Shouhei Hanaoka Yasuhito Mitsuyama Yukio Miki

Journal

Japanese journal of radiology

Modalities

Models

deep learning

Abstract

Medicine and deep learning-based artificial intelligence (AI) engineering represent two distinct fields each with decades of published history. The current rapid convergence of deep learning and medicine has led to significant advancements, yet it has also introduced ambiguity regarding data set terms common to both fields, potentially leading to miscommunication and methodological discrepancies. This narrative review aims to give historical context for these terms, accentuate the importance of clarity when these terms are used in medical deep learning contexts, and offer solutions to mitigate misunderstandings by readers from either field. Through an examination of historical documents, including articles, writing guidelines, and textbooks, this review traces the divergent evolution of terms for data sets and their impact. Initially, the discordant interpretations of the word ‘validation’ in medical and AI contexts are explored. We then show that in the medical field as well, terms traditionally used in the deep learning domain are becoming more common, with the data for creating models referred to as the ‘training set’, the data for tuning of parameters referred to as the ‘validation (or tuning) set’, and the data for the evaluation of models as the ‘test set’. Additionally, the test sets used for model evaluation are classified into internal (random splitting, cross-validation, and leave-one-out) sets and external (temporal and geographic) sets. This review then identifies often misunderstood terms and proposes pragmatic solutions to mitigate terminological confusion in the field of deep learning in medicine. We support the accurate and standardized description of these data sets and the explicit definition of data set splitting terminologies in each publication. These are crucial methods for demonstrating the robustness and generalizability of deep learning applications in medicine. This review aspires to enhance the precision of communication, thereby fostering more effective and transparent research methodologies in this interdisciplinary field.© 2024. The Author(s) under exclusive licence to Japan Radiological Society.

Show Full Text

Data set terminology of deep learning in medicine: a historical review and recommendation.

Researchers

Journal

Modalities

Models

Abstract

The Use of Deep Learning Model for Effect Analysis of Conventional Friction Power Confinement.

Object Detection at Level Crossing Using Deep Learning.

Field rice panicle detection and counting based on deep learning.

SASA-Net: A Spatial-aware Self-attention Mechanism for Building Protein 3D Structure Directly from Inter-residue Distances.

Continuous cuffless and non-invasive measurement of arterial blood pressure-concepts and future perspectives.

Nonlinear Network Speech Recognition Structure in a Deep Learning Algorithm.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply