Pooling region learning of visual word for image classification using bag-of-visual-words model.

Abstract

In the problem where there is not enough data to use Deep Learning, Bag-of-Visual-Words (BoVW) is still a good alternative for image classification. In BoVW model, many pooling methods are proposed to incorporate the spatial information of local feature into the image representation vector, but none of the methods devote to making each visual word have its own pooling regions. The practice of designing the same pooling regions for all the words restrains the discriminability of image representation, since the spatial distributions of the local features indexed by different visual words are not same. In this paper, we propose to make each visual word have its own pooling regions, and raise a simple yet effective method for learning pooling region. Concretely, a kind of small window named observation window is used to obtain its responses to each word over the whole image region. The pooling regions of each word are organized by a kind of tree structure, in which each node indicates a pooling region. For each word, its pooling regions are learned by constructing a tree with its labelled coordinate data. The labelled coordinate data consist of the coordinates of responses and image class labels. The effectiveness of our method is validated by observing if there is an obvious classification accuracy improvement after applying our method. Our experimental results on four small datasets (i.e., Scene-15, Caltech-101, Caltech-256 and Corel-10) show that, the classification accuracy is improved by about 1% to 2.5%. We experimentally demonstrate that the practice of making each word have its own pooling regions is beneficial to image classification task, which is the significance of our work.

Show Full Text

Pooling region learning of visual word for image classification using bag-of-visual-words model.

Researchers

Journal

Modalities

Models

Abstract

ScanGuard-YOLO: Enhancing X-ray Prohibited Item Detection with Significant Performance Gains.

Progress Indication for Deep Learning Model Training: A Feasibility Demonstration.

A weight optimization-based transfer learning approach for plant disease detection of New Zealand vegetables.

Central reading of ulcerative colitis clinical trial videos using neural networks.

A Long-Term Video Tracking Method for Group-Housed Pigs.

Dynamic polarization fusion network (DPFN) for imaging in different scattering systems.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply