Voxel Transformer with Density-Aware Deformable Attention for 3D Object Detection.

August 26, 2023

Researchers

Joohee Kim, Taeho Kim

Journal

Sensors (Basel, Switzerland)

Models

Transformer

Abstract

The Voxel Transformer (VoTr) is a prominent model for 3D object detection that uses a transformer-based architecture to capture long-range relationships between voxels through self-attention. Although VoTr enlarges the receptive field, its flexibility is limited because that receptive field is predefined. In this paper, we present the Voxel Transformer with Density-Aware Deformable Attention (VoTr-DADA), a novel approach to 3D object detection. VoTr-DADA uses density-guided deformable attention to obtain a more adaptable receptive field. It uses density features to identify key areas in the input efficiently, combining the strengths of VoTr and deformable attention. We introduce the Density-Aware Deformable Attention (DADA) module, which is designed to focus on these key areas while adaptively extracting more informative features. Experimental results on the KITTI dataset and the Waymo Open Dataset show that the proposed method outperforms the baseline VoTr model in 3D object detection while maintaining fast inference speed.
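The abstract does not give the equations of the DADA module, so the following is only a rough sketch of the general idea of density-guided deformable attention, not the authors' implementation. It assumes that sampling offsets are predicted from the query voxel's feature concatenated with its point density, that sampling uses nearest-voxel rounding, and that attention is plain scaled dot-product. All names (`density_guided_attention`, `offset_weights`, the toy grid) are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def density_guided_attention(query_pos, feats, density, offset_weights, grid_shape):
    """Attend from one query voxel to K offset-sampled voxels.

    feats:          dict (x, y, z) -> feature list of length C (non-empty voxels only)
    density:        dict (x, y, z) -> number of points in that voxel
    offset_weights: K x 3 x (C + 1) weights; stands in for a learned projection (assumption)
    """
    q = feats[query_pos]
    # Assumption: offsets depend on the query feature and its density.
    inp = q + [float(density[query_pos])]
    sampled = []
    for rows in offset_weights:
        offset = [sum(w * v for w, v in zip(row, inp)) for row in rows]
        # Nearest-voxel sampling, clipped to the grid bounds.
        pos = tuple(
            min(max(int(round(p + o)), 0), n - 1)
            for p, o, n in zip(query_pos, offset, grid_shape)
        )
        sampled.append(pos)
    # Keep only non-empty voxels; fall back to the query voxel itself.
    keys = [p for p in sampled if p in feats] or [query_pos]
    scale = math.sqrt(len(q))
    scores = [sum(a * b for a, b in zip(q, feats[p])) / scale for p in keys]
    weights = softmax(scores)
    out = [sum(w * feats[p][c] for w, p in zip(weights, keys)) for c in range(len(q))]
    return out, keys, weights


# Toy data: a 4 x 4 x 4 grid with a checkerboard of non-empty voxels.
random.seed(0)
GRID = (4, 4, 4)
C, K = 4, 3
feats = {
    (x, y, z): [random.uniform(-1, 1) for _ in range(C)]
    for x in range(4) for y in range(4) for z in range(4)
    if (x + y + z) % 2 == 0
}
density = {p: random.randint(1, 20) for p in feats}
offset_weights = [
    [[random.uniform(-0.1, 0.1) for _ in range(C + 1)] for _ in range(3)]
    for _ in range(K)
]
out, keys, weights = density_guided_attention((0, 0, 0), feats, density, offset_weights, GRID)
```

In this sketch the sampled key locations move with the input rather than following a fixed pattern, which is the contrast the abstract draws with a predefined receptive field. A real module would likely use learned layers, multiple heads, and interpolated sampling.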

Similar Posts

Multiview High Dynamic Range Image Synthesis Using Fuzzy Broad Learning System.

Deep Supervised Multi-View Learning with Graph Priors.

Data-driven discovery of coordinates and governing equations.

Network-based protein structural classification.

Student Loss: Towards the Probability Assumption in Inaccurate Supervision.

Dendritic Computing: Branching Deeper into Machine Learning.
