Remote intelligent perception system for multi-object detection.

Researchers

Abdulwahab Alazeb Ahmad Jalal Bisma Riaz Chughtai Hanan Aljuaid Hui Liu Mohammed Alonazi Naif Al Mudawi Yahya Alqahtani

Journal

Modalities

Discrete Wavelet Transform Image Analysis kernel convolution local binary pattern analysis Semantic segmentation Sobel and Laplacian visual sensor technology

Models

AlexNet Deep Belief Network Unet Segmentation

Abstract

During the last few years, a heightened interest has been shown in classifying scene images depicting diverse robotic environments. The surge in interest can be attributed to significant improvements in visual sensor technology, which has enhanced image analysis capabilities.Advances in vision technology have a major impact on the areas of multiple object detection and scene understanding. These tasks are an integral part of a variety of technologies, including integrating scenes in augmented reality, facilitating robot navigation, enabling autonomous driving systems, and improving applications in tourist information. Despite significant strides in visual interpretation, numerous challenges persist, encompassing semantic understanding, occlusion, orientation, insufficient availability of labeled data, uneven illumination including shadows and lighting, variation in direction, and object size and changing background. To overcome these challenges, we proposed an innovative scene recognition framework, which proved to be highly effective and yielded remarkable results. First, we perform preprocessing using kernel convolution on scene data. Second, we perform semantic segmentation using UNet segmentation. Then, we extract features from these segmented data using discrete wavelet transform (DWT), Sobel and Laplacian, and textual (local binary pattern analysis). To recognize the object, we have used deep belief network and then find the object-to-object relation. Finally, AlexNet is used to assign the relevant labels to the scene based on recognized objects in the image.The performance of the proposed system was validated using three standard datasets: PASCALVOC-12, Cityscapes, and Caltech 101. The accuracy attained on the PASCALVOC-12 dataset exceeds 96% while achieving a rate of 95.90% on the Cityscapes dataset.Furthermore, the model demonstrates a commendable accuracy of 92.2% on the Caltech 101 dataset. This model showcases noteworthy advancements beyond the capabilities of current models.Copyright © 2024 Alazeb, Chughtai, Al Mudawi, AlQahtani, Alonazi, Aljuaid, Jalal and Liu.

Show Full Text

Remote intelligent perception system for multi-object detection.

Researchers

Journal

Modalities

Models

Abstract

Fall prediction, control, and recovery of quadruped robots.

Differentiation of Benign from Malignant Pulmonary Nodules by Using a Convolutional Neural Network to Determine Volume Change at Chest CT.

Automated spheroid generation, drug application and efficacy screening using a deep learning classification: a feasibility study.

DeepLC can predict retention times for peptides that carry as-yet unseen modifications.

Artificial intelligence performance in image-based ovarian cancer identification: A systematic review and meta-analysis.

Data Efficiency Semi-Supervised Meta-Learning Elucidates Understudied Interspecies Molecular Interactions.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply