Other

Unified Architecture Adaptation for Compressed Domain Semantic Inference.

August 7, 2023 Other

Abstract

Advances in both lossy image compression and semantic content understanding have been greatly fueled by deep learning techniques, yet these two tasks have been developed separately for the past decades. In this work, we address the problem of directly executing semantic inference from quantized latent features in the deep compressed domain without pixel reconstruction. Although different methods have been proposed for this problem setting, they either are restrictive to a specific architecture, or are sub-optimal in terms of compressed domain task accuracy. In contrast, we propose a lightweight, plug-and-play solution which is generally compliant with popular learned image coders and deep vision models, making it attractive to vast applications. Our method adapts prevalent pixel domain neural models that are deployed for various vision tasks to directly accept quantized latent features (other than pixels). We further suggest training the compressed domain model by transferring knowledge from its corresponding pixel domain counterpart. Experiments show that our method is compliant with popular learned image coders and vision task models. Under fair comparison, our approach outperforms a baseline method by a) more than 3% top-1 accuracy for compressed domain classification, and b) more than 7% mIoU for compressed domain semantic segmentation, at various data rates.

Show Full Text

Unified Architecture Adaptation for Compressed Domain Semantic Inference.

Researchers

Journal

Modalities

Models

Abstract

Computer-aided diagnosis of low grade endometrial stromal sarcoma (LGESS).

Computer-aided Cervical Cancer Diagnosis using Time-lapsed Colposcopic Images.

Adopting low-shot deep learning for the detection of conjunctival melanoma using ocular surface images.

Joint Cancer Segmentation and PI-RADS Classification on Multiparametric MRI Using MiniSegCaps Network.

Rapid and precise detection of cancers via label-free SERS and deep learning.

Historical-crack18-19: A dataset of annotated images for non-invasive surface crack detection in historical buildings.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply