
Understanding Double Descent Using VC-Theoretical Framework


Abstract

In spite of many successful applications of deep learning (DL) networks, theoretical understanding of their generalization capabilities and limitations remains limited. We present an analysis of the generalization performance of DL networks for classification under the VC-theoretical framework. In particular, we analyze the so-called “double descent” phenomenon, in which large overparameterized networks generalize well even when they perfectly memorize all available training data. This appears to contradict the conventional statistical view that optimal model complexity should reflect an optimal balance between underfitting and overfitting, i.e., the bias-variance trade-off. We present a VC-theoretical explanation of the double descent phenomenon in the classification setting. Our theoretical explanation is supported by empirical modeling of double descent curves, using analytic VC-bounds, for several learning methods: support vector machine (SVM), least squares (LS), and multilayer perceptron classifiers. The proposed VC-theoretical approach enables a better understanding of overparameterized estimators during the second descent.
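For context, the analytic VC-bounds mentioned in the abstract follow, in their classical form, Vapnik's bound on the expected risk of a classifier; the exact variant used in the paper may differ. For a classifier with VC dimension h trained on n samples, with probability at least 1 − η,

$$ R(\alpha) \;\le\; R_{\mathrm{emp}}(\alpha) \;+\; \sqrt{\frac{h\left(\ln\frac{2n}{h} + 1\right) - \ln\frac{\eta}{4}}{n}} $$

where R_emp(α) is the empirical (training) risk. The confidence term grows with h, which is why a naive reading of the bound seems to rule out good generalization for interpolating models; the paper's analysis concerns how VC-theory nonetheless accounts for the second descent.

The double descent curve itself is straightforward to reproduce. The sketch below is an illustration under assumed settings, not the paper's experiments: a least-squares classifier on random ReLU features, one of the classic settings where double descent appears, with the number of features swept across the interpolation threshold. The dataset, feature map, and all parameters are hypothetical choices.

```python
# Minimal double descent sketch: least-squares classification on random
# ReLU features. Illustrative only; not the experiments from the paper.
import numpy as np

rng = np.random.default_rng(0)
n_train, n_test, d = 100, 2000, 10

def make_data(n):
    """Two noisy Gaussian classes in d dimensions, labels in {-1, +1}."""
    y = rng.integers(0, 2, size=n) * 2 - 1
    X = rng.normal(size=(n, d)) + y[:, None] * 1.5 / np.sqrt(d)
    return X, y

X_tr, y_tr = make_data(n_train)
X_te, y_te = make_data(n_test)

def test_error(n_features):
    # Random ReLU feature map, shared by the train and test sets.
    W = rng.normal(size=(d, n_features)) / np.sqrt(d)
    phi_tr = np.maximum(X_tr @ W, 0.0)
    phi_te = np.maximum(X_te @ W, 0.0)
    # Least-squares fit on the features. When n_features > n_train, lstsq
    # returns the minimum-norm solution that interpolates the training
    # labels -- the overparameterized regime of the second descent.
    w, *_ = np.linalg.lstsq(phi_tr, y_tr.astype(float), rcond=None)
    return np.mean(np.sign(phi_te @ w) != y_te)

# Sweep model size across the interpolation threshold (n_features == n_train).
for p in (10, 50, 90, 100, 110, 200, 500, 2000):
    print(f"features={p:5d}   test 0/1 error = {test_error(p):.3f}")
```

Test error in such a sweep typically rises toward the interpolation threshold, where the fit is most sensitive to noise, and then descends again as the model grows further, mirroring the double descent curve the abstract describes.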
