Learning to embed semantic similarity for joint image-text retrieval.

Researchers

Journal

IEEE transactions on pattern analysis and machine intelligence

Modalities

Models

Abstract

We present a deep learning approach for learning the joint semantic embeddings of images and captions in a Euclidean space, such that the semantic similarity is approximated by the L distances in the embedding space. For that, we introduce a metric learning scheme that utilizes multitask learning to learn the embedding of identical semantic concepts using a center loss. By introducing a differentiable quantization scheme into the end-to-end trainable network, we derive a semantic embedding of semantically similar concepts in Euclidean space. We also propose a novel metric learning formulation using an adaptive margin hinge loss, that is refined during the training phase. The proposed scheme was applied to the MS-COCO, Flicke30K and Flickr8K datasets, and was shown to compare favorably with contemporary state-of-the-art approaches.

Show Full Text

Learning to embed semantic similarity for joint image-text retrieval.

Researchers

Journal

Modalities

Models

Abstract

Hardware-software co-design of an open-source automatic multimodal whole slide histopathology imaging system.

Model Fusion and Multiscale Feature Learning for Fault Diagnosis of Industrial Processes.

Navigating the Future: A Comprehensive Review of Artificial Intelligence Applications in Gastrointestinal Cancer.

EEG-based motor imagery channel selection and classification using hybrid optimization and two-tier deep learning.

Detection of Diabetic Retinopathy Using Bichannel Convolutional Neural Network.

Spike detection and sorting with deep learning.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply