|

eXnet: An Efficient Approach for EmotionRecognition in the Wild.

Researchers

Journal

Modalities

Models

Abstract

Facial expression recognition has been well studied for its great importance in the areasof human-computer interaction and social sciences. With the evolution of deep learning, therehave been significant advances in this area that also surpass human-level accuracy. Althoughthese methods have achieved good accuracy, they are still suffering from two constraints (high computational power and memory), which are incredibly critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture eXnet (Expression Net) based on parallel feature extraction which surpasses current methodsin accuracy and contains a much smaller number of parameters (eXnet: 4.57 million, VGG19:14.72 million), making it more efficient and lightweight for real-time systems. Several moderndata augmentation techniques are applied for generalization of eXnet; these techniques improvethe accuracy of the network by overcoming the problem of overfitting while containing the same size. We provide an extensive evaluation of our network against key methods on Facial ExpressionRecognition 2013 (FER-2013), Extended Cohn-Kanade Dataset (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform ablation evaluation to show the importance of different components of our architecture. To evaluate the efficiency of eXnet on embedded systems,we  deploy it on Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotionrecognition in the wild in terms of accuracy, the number of parameters, and size on disk.

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *