Separation of scales and a thermodynamic description of feature learning in some CNNs

Abstract

Deep neural networks (DNNs) are powerful tools for compressing and distilling information. Their scale and complexity, often involving billions of inter-dependent parameters, render direct microscopic analysis difficult. Under such circumstances, a common strategy is to identify slow variables that average the erratic behavior of the fast microscopic variables. Here, we identify a similar separation of scales occurring in fully trained finitely over-parameterized deep convolutional neural networks (CNNs) and fully connected networks (FCNs). Specifically, we show that DNN layers couple only through the second cumulant (kernels) of their activations and pre-activations. Moreover, the latter fluctuates in a nearly Gaussian manner. For infinite width DNNs, these kernels are inert, while for finite ones they adapt to the data and yield a tractable data-aware Gaussian Process. The resulting thermodynamic theory of deep learning yields accurate predictions in various settings. In addition, it provides new ways of analyzing and understanding DNNs in general.
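The objects the abstract refers to, layer-wise kernels defined as second cumulants of pre-activations, can be illustrated with a toy computation. The sketch below is not the paper's method; it is a minimal Monte Carlo estimate of the pre-activation kernel of a random ReLU fully connected network, compared against the standard infinite-width (NNGP) recursion in which that kernel is inert. The width, depth, and weight scale sigma_w are arbitrary illustrative choices.

```python
# Minimal sketch (illustration only): the layer-wise kernel as the second
# moment of pre-activations in a random ReLU fully connected network,
# estimated by Monte Carlo and compared with the infinite-width (NNGP)
# arc-cosine recursion, where the kernel is a fixed, data-independent object.
import numpy as np

rng = np.random.default_rng(0)
d, n, depth = 16, 256, 3          # input dim, hidden width, number of weight layers
sigma_w, n_draws = 1.4, 200       # weight scale and number of random networks

x1, x2 = rng.standard_normal(d), rng.standard_normal(d)
X = np.stack([x1, x2])            # two inputs, shape (2, d)

def mc_kernel():
    """Average h(x) h(x') over hidden units and over random weight draws."""
    K = np.zeros((2, 2))
    for _ in range(n_draws):
        # layer-1 pre-activations: weights ~ N(0, sigma_w^2 / fan_in)
        h = X @ rng.normal(0.0, sigma_w / np.sqrt(d), size=(d, n))
        for _ in range(depth - 1):
            h = np.maximum(h, 0.0) @ rng.normal(0.0, sigma_w / np.sqrt(n), size=(n, n))
        K += h @ h.T / n
    return K / n_draws

def nngp_kernel():
    """Infinite-width recursion for ReLU (arc-cosine kernel of order 1)."""
    K = sigma_w**2 * (X @ X.T) / d
    for _ in range(depth - 1):
        sq = np.sqrt(np.outer(np.diag(K), np.diag(K)))
        theta = np.arccos(np.clip(K / sq, -1.0, 1.0))
        K = sigma_w**2 * sq * (np.sin(theta) + (np.pi - theta) * np.cos(theta)) / (2 * np.pi)
    return K

print("Monte Carlo kernel (finite width):\n", mc_kernel())
print("Infinite-width (NNGP) kernel:\n", nngp_kernel())
```

At large width the Monte Carlo estimate converges to the fixed NNGP kernel; the paper's point is that at finite over-parameterization the corresponding kernels instead adapt to the training data, giving a data-aware Gaussian Process description.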
