Image and video based emotion recognition using deep learning /

Emotion recognition utilizing pictures, videos, or speech as input is considered an intriguing issue in the research field over certain years. The introduction of deep learning procedures like the Convolutional Neural Networks (CNN) has made emotion recognition achieve promising outcomes. Since hum...

Full description

Saved in:
Bibliographic Details
Main Author: Arselan Ashraf (Author)
Format: Thesis
Language:English
Published: Kuala Lumpur : Kulliyah of Engineering,International Islamic University Malaysia, 2021
Subjects:
Online Access:http://studentrepo.iium.edu.my/handle/123456789/10766
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 044500000a22004090004500
008 210811s2021 my a f m 000 0 eng d
040 |a UIAM  |b eng  |e rda 
041 |a eng 
043 |a a-my--- 
050 0 0 |a Q325.73 
100 0 |a Arselan Ashraf,  |e author  |9 4443 
245 1 0 |a Image and video based emotion recognition using deep learning /  |c by Arselan Ashraf 
264 1 |a Kuala Lumpur :  |b Kulliyah of Engineering,International Islamic University Malaysia,  |c 2021 
300 |a xvi, 108 leaves :  |b colour illustrations ;  |c 30cm. 
336 |2 rdacontent  |a text 
337 |2 rdamedia  |a unmediated 
337 |2 rdmedia  |a computer 
338 |2 rdacarrier  |a volume 
338 |2 rdacarrier  |a online resource 
347 |2 rdaft  |a text file  |b PDF 
500 |a Abstracts in English and Arabic. 
500 |a "A dissertation submitted in fulfilment of the requirement for the degree of Master of Science (Computer and Information Engineering)." --On title page. 
502 |a Thesis (MSCIE)--International Islamic University Malaysia, 2021. 
504 |a Includes bibliographical references (leaves 95-101). 
520 |a Emotion recognition utilizing pictures, videos, or speech as input is considered an intriguing issue in the research field over certain years. The introduction of deep learning procedures like the Convolutional Neural Networks (CNN) has made emotion recognition achieve promising outcomes. Since human facial appearances are considered vital in understanding one's feelings, many research studies have been carried out in this field. However, it still lacks in developing a visual-based emotion recognition model with good accuracy and uncertainty in determining influencing features, type, the number of emotions under consideration, and algorithms. This research is carried out to develop an image and video-based emotion recognition model using CNN for automatic feature extraction and classification. The optimum CNN configuration was found to be having three convolutional layers with max-pooling attached to each layer. The third convolutional layer was followed by a batch normalization layer connected with two fully connected layers. This CNN configuration was selected because it minimized the risk of overfitting along with produced a normalized output. Five emotions are considered for recognition: angry, happy, neutral, sad, and surprised, to compare with previous algorithms. The construction of the emotion recognition model is carried out on two datasets: an image dataset, namely “Warsaw Set of Emotional Facial Expression Pictures (WSEFEP)” and a video dataset, namely “Amsterdam Dynamic Facial Expression Set – Bath Intensity Variations (ADFES-BIV).” Different pre-processing steps have been carried over data samples, followed by the popular and efficient Viola-Jones algorithm for face detection. CNN has been used for feature extraction and classification. Evaluating results using confusion matrix, accuracy, F1-score, precision, and recall shows that video-based datasets obtained more promising results than image-based datasets. The recognition accuracy, F1 score, precision, and recall for the video dataset came out to be 99.38%, 99.22%, 99.4%, 99.38, and that of the image dataset came out to be 83.33%, 79.1%, 84.46%, 80%, respectively. The proposed algorithm has been benchmarked with two other CNN-based algorithms, and the accuracy performs better around 5.33% and 3.33%, respectively, for the image dataset, while 4.38% for the video dataset. The outcome of this research provides the productivity and usability of the proposed system in visual-based emotion recognition. 
650 0 |a Deep learning (Machine learning)  |9 4444 
650 0 |a Emotion recognition  |x Computer simulation  |9 4445 
655 7 |a Theses, IIUM local 
690 |a Dissertations, Academic  |x Department of Electrical and Computer Engineering  |z IIUM  |9 4446 
700 1 |a Teddy Surya Gunawan,  |e degree supervisor  |9 4447 
700 0 |a Farah Diyana Abdul Rahman,  |e degree supervisor  |9 4448 
710 2 |a International Islamic University Malaysia.  |b Department of Electrical and Computer Engineering  |9 4449 
856 4 |u http://studentrepo.iium.edu.my/handle/123456789/10766 
900 |a sz-asbh 
942 |2 lcc 
999 |c 439418  |d 472712 
952 |0 0  |1 0  |2 lcc  |4 0  |6 T Q 00325.00073 A00781I 02021  |7 7  |8 IIUMTHESIS  |9 762464  |a IIUM  |b IIUM  |c THESIS  |d 2022-06-21  |g 0.00  |o t Q 325.73 A781I 2021  |p 11100392662  |r 1900-01-02  |t 1  |v 0.00  |y THESIS