Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks.

Researchers

Journal

IEEE transactions on pattern analysis and machine intelligence

Modalities

Models

Abstract

In this paper, we introduce a novel methodology for characterising the performance of deep learning networks (ResNets and DenseNet) with respect to training convergence and generalisation as a function of mini-batch size and learning rate for image classification. This methodology is based on novel measurements derived from the eigenvalues of the approximate Fisher information matrix, which can be efficiently computed even for high capacity deep models. Our proposed measurements can help practitioners to monitor and control the training process (by actively tuning the mini-batch size and learning rate) to allow for good training convergence and generalisation. Furthermore, the proposed measurements also allow us to show that it is possible to optimise the training process with a new dynamic sampling training approach that continuously and automatically change the mini-batch size and learning rate during the training process. Finally, we show that the proposed dynamic sampling training approach has a faster training time and a competitive classification accuracy compared to the current state of the art.

Show Full Text

Approximate Fisher Information Matrix to Characterise the Training of Deep Neural Networks.

Researchers

Journal

Modalities

Models

Abstract

Epiretinal Membrane Detection at the Ophthalmologist Level using Deep Learning of Optical Coherence Tomography.

Species identification of phlebotomine sandflies using deep learning and wing interferential pattern (WIP).

Harnessing the power of hybrid deep learning algorithm for the estimation of global horizontal irradiance.

A deep learning algorithm to detect chronic kidney disease from retinal photographs in community-based populations.

A Deep Learning-Based Satellite Target Recognition Method Using Radar Data.

In-vivo verified anatomically aware deep learning for real-time electric field simulation.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply