Enhanced multi view 3D reconstruction with improved MVSNet.

Abstract

Although 3D reconstruction has been widely used in many fields as a key component of environment perception, existing technologies still have the potential for further improvement in 3D scene reconstruction. We propose an improved reconstruction algorithm based on the MVSNet network architecture. To glean richer pixel details from images, we suggest deploying a DE module integrated with a residual framework, which supplants the prevailing feature extraction mechanism. The DE module uses ECA-Net and dilated convolution to expand the receptive field range, performing feature splicing and fusion through the residual structure to retain the global information of the original image. Moreover, harnessing attention mechanisms refines the 3D cost volume’s regularization process, bolstering the integration of information across multi-scale feature volumes, consequently enhancing depth estimation precision. When assessed our model using the DTU dataset, findings highlight the network’s 3D reconstruction scoring a completeness (comp) of 0.411 mm and an overall quality of 0.418 mm. This performance is higher than that of traditional methods and other deep learning-based methods. Additionally, the visual representation of the point cloud model exhibits marked advancements. Trials on the Blended MVS dataset signify that our network exhibits commendable generalization prowess.© 2024. The Author(s).

Show Full Text

Enhanced multi view 3D reconstruction with improved MVSNet.

Researchers

Journal

Modalities

Models

Abstract

Evolutionary neural architecture search combining multi-branch ConvNet and improved transformer.

Unifying Visual Attribute Learning with Object Recognition in a Multiplicative Framework.

A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future.

A Novel Outdoor Positioning Technique Using LTE Network Fingerprints.

Evaluation of Mixed Deep Neural Networks for Reverberant Speech Enhancement.

BX2S-Net: Learning to reconstruct 3D spinal structures from bi-planar X-ray images.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply