Compact Neural Architecture Designs by Tensor Representations.

Abstract

We propose a framework of tensorial neural networks (TNNs) extending existing linear layers on low-order tensors to multilinear operations on higher-order tensors. TNNs have three advantages over existing networks: First, TNNs naturally apply to higher-order data without flattening, which preserves their multi-dimensional structures. Second, compressing a pre-trained network into a TNN results in a model with similar expressive power but fewer parameters. Finally, TNNs interpret advanced compact designs of network architectures, such as bottleneck modules and interleaved group convolutions. To learn TNNs, we derive their backpropagation rules using a novel suite of generalized tensor algebra. With backpropagation, we can either learn TNNs from scratch or pre-trained models using knowledge distillation. Experiments on VGG, ResNet, and Wide-ResNet demonstrate that TNNs outperform the state-of-the-art low-rank methods on a wide range of backbone networks and datasets.Copyright © 2022 Su, Li, Liu, Ranadive, Coley, Tuan and Huang.

Show Full Text

Compact Neural Architecture Designs by Tensor Representations.

Researchers

Journal

Modalities

Models

Abstract

Complementary label learning based on knowledge distillation.

An ultra-fast deep-learning-based dose engine for prostate VMAT via knowledge distillation framework with limited patient data.

Knowledge distillation in deep learning and its applications.

Feature decoupled knowledge distillation enabled lightweight image transmission through multimode fibers.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply