
Pre-gating and contextual attention gate – A new fusion method for multi-modal data tasks.

Abstract

Multi-modal representation learning has received significant attention across diverse research domains due to its ability to model a scenario comprehensively. Learning cross-modal interactions is essential to combining multi-modal data into a joint representation. However, conventional cross-attention mechanisms can produce noisy, non-meaningful values when no useful cross-modal interactions exist among the input features, thereby introducing uncertainty into the feature representation. These factors can degrade the performance of downstream tasks. This paper introduces a novel Pre-gating and Contextual Attention Gate (PCAG) module for multi-modal learning, comprising two gating mechanisms that operate at distinct information-processing levels within the deep learning model. The first gate filters out interactions that lack informativeness for the downstream task, while the second gate reduces the uncertainty introduced by the cross-attention module. Experimental results on eight multi-modal classification tasks spanning various domains show that the multi-modal fusion model with PCAG outperforms state-of-the-art multi-modal fusion models. Additionally, we elucidate how PCAG effectively processes cross-modal interactions.

Copyright © 2024 The Authors. Published by Elsevier Ltd. All rights reserved.
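To make the two-gate idea concrete, here is a minimal NumPy sketch of gated cross-attention fusion between two modalities. This is not the authors' implementation; the gate parameterizations (`W_pre`, `W_ctx`), dimensions, and the exact placement of the gates are illustrative assumptions based only on the abstract's description: one gate suppressing uninformative interaction scores before attention, and one gate down-weighting uncertain cross-attention outputs afterwards.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)

# Toy dimensions: two modalities with n_a / n_b tokens of width d.
n_a, n_b, d = 4, 5, 8
A = rng.normal(size=(n_a, d))   # modality A features (e.g. text)
B = rng.normal(size=(n_b, d))   # modality B features (e.g. image)

# Cross-attention: A queries attend over B keys/values.
scores = A @ B.T / np.sqrt(d)           # (n_a, n_b)

# "Pre-gate" (assumed form): a sigmoid gate that scales interaction
# scores, so uninformative queries contribute weaker attention.
W_pre = rng.normal(size=(d, 1)) * 0.1
pre_gate = sigmoid(A @ W_pre)           # (n_a, 1), per-query gate
gated_scores = scores * pre_gate

attn = softmax(gated_scores, axis=-1)   # rows sum to 1
cross = attn @ B                        # (n_a, d) cross-modal features

# "Contextual attention gate" (assumed form): blends the cross-attention
# output with the original unimodal features, so uncertain cross-modal
# values are down-weighted rather than passed through unchanged.
W_ctx = rng.normal(size=(2 * d, d)) * 0.1
ctx_gate = sigmoid(np.concatenate([A, cross], axis=-1) @ W_ctx)  # (n_a, d)
fused = ctx_gate * cross + (1.0 - ctx_gate) * A                  # (n_a, d)

print(fused.shape)
```

Both gates are sigmoids in (0, 1), so when a gate saturates toward 0 the fused representation falls back to the original unimodal features, which matches the abstract's motivation of limiting noise from absent cross-modal interactions.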
