Two-Stage Pedestrian Detection Model Using a New Classification Head for Domain Generalization.

December 9, 2023 Artificial Intelligence, Computer Science, Machine Learning

Abstract

Pedestrian detection based on deep learning methods has achieved great success in the past few years, with several possible real-world applications including autonomous driving, robotic navigation, and video surveillance. In this work, we present a two-stage neural network pedestrian detector with a new custom classification head that adds the triplet loss function to the standard bounding box regression and classification losses. The goal is to improve the domain generalization capabilities of existing pedestrian detectors by explicitly maximizing inter-class distance and minimizing intra-class distance. The triplet loss is applied to the features generated by the region proposal network, with the aim of clustering pedestrian samples together in the feature space. We used Faster R-CNN and Cascade R-CNN with the HRNet backbone pre-trained on ImageNet, replacing the standard classification head for Faster R-CNN and replacing one of the three heads for Cascade R-CNN. The best results were obtained using a progressive training pipeline, starting from a dataset that is further away from the target domain and progressively fine-tuning on datasets closer to the target domain. We obtained state-of-the-art results on the CityPersons benchmark, with a log-average miss rate (MR-2) of 9.9, 11.0, and 36.2 for the reasonable, small, and heavy subsets respectively, and outstanding performance on the heavy subset, the most difficult one.
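
To make the role of the triplet loss concrete, the following is a minimal sketch of how such a term could be computed and added to the two standard detection losses. It is not the authors' implementation: the Euclidean distance, the margin of 1.0, the weighting factor trip_weight, and all function names are assumptions chosen for illustration, since the abstract does not specify them.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge-style triplet loss: max(0, d(a, p) - d(a, n) + margin).

    anchor and positive are features of the same class (e.g. pedestrian),
    negative is a feature of the other class (e.g. background).
    The margin value is an assumption, not taken from the paper.
    """
    d_pos = euclidean(anchor, positive)  # intra-class distance, to be minimized
    d_neg = euclidean(anchor, negative)  # inter-class distance, to be maximized
    return max(0.0, d_pos - d_neg + margin)


def total_loss(cls_loss, reg_loss, trip_loss, trip_weight=1.0):
    """Sum of classification, box regression, and weighted triplet terms.

    trip_weight is a hypothetical balancing factor.
    """
    return cls_loss + reg_loss + trip_weight * trip_loss


# A well-separated triplet contributes no loss; a violated one is penalized.
easy = triplet_loss([0.0, 0.0], [0.0, 1.0], [3.0, 4.0])
hard = triplet_loss([0.0, 0.0], [3.0, 4.0], [0.0, 1.0])
combined = total_loss(0.5, 0.25, hard, trip_weight=0.5)
```

In this sketch the loss is zero once the negative sample is farther from the anchor than the positive sample by at least the margin, which is the behavior that pulls pedestrian features together and pushes non-pedestrian features away.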


