Improved CNN-based mouth position and status detection

A mouth position and status detection system plays an important role in auto-feeding systems for paralyzed people. By identifying whether the mouth is open or closed and locating the open mouth, the system can choose the correct moment to feed patients with r...

Bibliographic Details
Main Author: Chok, Yong Sheng
Format: Thesis
Language: English
Published: 2022
Subjects:
Online Access: http://eprints.utm.my/id/eprint/99383/1/ChokYongShengMSKE2022.pdf
id my-utm-ep.99383
record_format uketd_dc
spelling my-utm-ep.99383 2023-02-27T03:01:27Z Improved CNN-based mouth position and status detection 2022 Chok, Yong Sheng TK Electrical engineering. Electronics Nuclear engineering
2022 Thesis http://eprints.utm.my/id/eprint/99383/ http://eprints.utm.my/id/eprint/99383/1/ChokYongShengMSKE2022.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:149993 masters Universiti Teknologi Malaysia, Faculty of Engineering - School of Electrical Engineering Faculty of Engineering - School of Electrical Engineering
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic TK Electrical engineering
Electronics Nuclear engineering
description A mouth position and status detection system plays an important role in auto-feeding systems for paralyzed people. By identifying whether the mouth is open or closed and locating the open mouth, the system can choose the correct moment to feed patients with robotic arms. Two major problems motivate this project. First, existing mouth status recognition networks are built and executed on high-end, costly hardware. Second, existing CNN-based mouth status detection systems are not accurate enough: the highest accuracy in the reviewed work is only 86.8% for three-state mouth status detection. Accordingly, the project has two research objectives. The first is to develop a high-accuracy, lightweight CNN-based model for mouth status detection on the Python platform. The second is to shorten the inference time of the CNN-based model by resizing some of the convolution layers. In the methodology, the primary task is to train a CNN model for mouth status detection with high accuracy. The face image datasets used for training are diverse, covering different human races and shooting angles. YOLOv5 is chosen as the pre-trained network because of its outstanding performance. The convolution layers of the YOLOv5 backbone are resized to shorten the inference time and reduce the model size. The developed CNN-based model achieved the targeted performance of 96.8% and improved inference time by 21.90% and model size by 13.20% compared with the original model before enhancement.
format Thesis
qualification_level Master's degree
author Chok, Yong Sheng
title Improved CNN-based mouth position and status detection
granting_institution Universiti Teknologi Malaysia, Faculty of Engineering - School of Electrical Engineering
granting_department Faculty of Engineering - School of Electrical Engineering
publishDate 2022
url http://eprints.utm.my/id/eprint/99383/1/ChokYongShengMSKE2022.pdf
_version_ 1776100594603261952
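
The abstract states that resizing the YOLOv5 backbone convolution layers reduces both model size and inference time. The record does not say which layers were resized or by how much, so the following is only a minimal sketch of why narrowing convolution layers shrinks a model. The channel widths, the 0.75 width multiplier, the rounding to a multiple of 8, and all function names are assumptions chosen for illustration; they are not taken from the thesis and do not reproduce its reported figures.

```python
import math


def conv_params(c_in, c_out, kernel=3, bias=False):
    """Weight count of one 2D convolution layer (bias optional)."""
    return kernel * kernel * c_in * c_out + (c_out if bias else 0)


def scale_channels(channels, width_multiple, divisor=8):
    """Scale a channel count and round up to a multiple of `divisor`.

    Rounding to a hardware-friendly multiple is a common convention;
    the divisor of 8 here is an assumption for this sketch.
    """
    return math.ceil(channels * width_multiple / divisor) * divisor


def stack_params(c_in, widths, kernel=3):
    """Total weights of a plain stack of convolutions with given output widths."""
    total = 0
    for c_out in widths:
        total += conv_params(c_in, c_out, kernel)
        c_in = c_out
    return total


def relative_reduction(before, after):
    """Percentage reduction of `after` relative to `before`."""
    return (before - after) / before * 100.0


# Hypothetical four-layer stack on a 3-channel (RGB) input.
baseline_widths = [64, 128, 256, 512]
slim_widths = [scale_channels(c, 0.75) for c in baseline_widths]

baseline_total = stack_params(3, baseline_widths)
slim_total = stack_params(3, slim_widths)

print("baseline widths:", baseline_widths, "weights:", baseline_total)
print("resized widths: ", slim_widths, "weights:", slim_total)
print("reduction: %.2f%%" % relative_reduction(baseline_total, slim_total))
```

Because the weight count of a convolution grows with the product of its input and output channels, scaling both widths by a factor shrinks the layer roughly by the square of that factor. The same `relative_reduction` helper expresses how figures such as the 21.90% and 13.20% improvements in the abstract would be computed from before and after measurements.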