Improved CNN-based mouth position and status detection
Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with r...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/99383/1/ChokYongShengMSKE2022.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-utm-ep.99383 |
---|---|
record_format |
uketd_dc |
spelling |
my-utm-ep.993832023-02-27T03:01:27Z Improved CNN-based mouth position and status detection 2022 Chok, Yong Sheng TK Electrical engineering. Electronics Nuclear engineering Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with robotic arms. There are two major problems that urge the proposal of this project. First, the existing mouth status recognition networks are built and executed on high-end and costly hardware. Second, the existing CNN mouth status related detection systems are less accurate, the highest accuracy in the researched work is only 86.8% for 3 states mouth status detection. Based on the problems, there are two research objectives that are strived to be achieved. First, to develop a high-accuracy and light CNN-based model for mouth status detection on Python platform. Second, to shorten the inference time of the CNN-based model by resizing some of the convolution layers. For methodology, the primary task is to train a mouth status detection CNN model with high accuracy. The face picture datasets fed to the model during CNN model training are diverse, covering different human races and shooting angles. YOLOv5 is chosen to be the pre-trained network due to its outstanding performance. The YOLOv5 backbone convolution layers are resized to shorten the inference time and reduce the model size. The developed CNN-based model achieved the targeted performance which is 96.8%, successfully improved inference time by 21.90% and model size by 13.20% as compared to the original model before enhancement. 2022 Thesis http://eprints.utm.my/id/eprint/99383/ http://eprints.utm.my/id/eprint/99383/1/ChokYongShengMSKE2022.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:149993 masters Universiti Teknologi Malaysia, Faculty of Engineering - School of Electrical Engineering Faculty of Engineering - School of Electrical Engineering |
institution |
Universiti Teknologi Malaysia |
collection |
UTM Institutional Repository |
language |
English |
topic |
TK Electrical engineering Electronics Nuclear engineering |
spellingShingle |
TK Electrical engineering Electronics Nuclear engineering Chok, Yong Sheng Improved CNN-based mouth position and status detection |
description |
Mouth position and status detection system plays an important role in the auto-feeding system for paralyzed people. Through identifying the mouth status, whether it is open or close, and obtain the location of the open mouth, the system will be able to pick the correct timing to feed patients with robotic arms. There are two major problems that urge the proposal of this project. First, the existing mouth status recognition networks are built and executed on high-end and costly hardware. Second, the existing CNN mouth status related detection systems are less accurate, the highest accuracy in the researched work is only 86.8% for 3 states mouth status detection. Based on the problems, there are two research objectives that are strived to be achieved. First, to develop a high-accuracy and light CNN-based model for mouth status detection on Python platform. Second, to shorten the inference time of the CNN-based model by resizing some of the convolution layers. For methodology, the primary task is to train a mouth status detection CNN model with high accuracy. The face picture datasets fed to the model during CNN model training are diverse, covering different human races and shooting angles. YOLOv5 is chosen to be the pre-trained network due to its outstanding performance. The YOLOv5 backbone convolution layers are resized to shorten the inference time and reduce the model size. The developed CNN-based model achieved the targeted performance which is 96.8%, successfully improved inference time by 21.90% and model size by 13.20% as compared to the original model before enhancement. |
format |
Thesis |
qualification_level |
Master's degree |
author |
Chok, Yong Sheng |
author_facet |
Chok, Yong Sheng |
author_sort |
Chok, Yong Sheng |
title |
Improved CNN-based mouth position and status detection |
title_short |
Improved CNN-based mouth position and status detection |
title_full |
Improved CNN-based mouth position and status detection |
title_fullStr |
Improved CNN-based mouth position and status detection |
title_full_unstemmed |
Improved CNN-based mouth position and status detection |
title_sort |
improved cnn-based mouth position and status detection |
granting_institution |
Universiti Teknologi Malaysia, Faculty of Engineering - School of Electrical Engineering |
granting_department |
Faculty of Engineering - School of Electrical Engineering |
publishDate |
2022 |
url |
http://eprints.utm.my/id/eprint/99383/1/ChokYongShengMSKE2022.pdf |
_version_ |
1776100594603261952 |