Offline text-independent Chinese writer identification method with two-tier image retrieval

Writer identification is essential today to identify the authenticity of a document in forensic expert decision-making. However, handwriting in various languages specifically Chinese poses a different challenge in identifying the writer. The main challenge faced by current researchers is that they f...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Gloria Jennis
Format: Thesis
Language:English
Published: 2019
Subjects:
Online Access:http://eprints.utm.my/id/eprint/96211/1/TanGloriaJennisPSC2019.pdf.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.96211
record_format uketd_dc
spelling my-utm-ep.962112022-07-05T03:07:20Z Offline text-independent Chinese writer identification method with two-tier image retrieval 2019 Tan, Gloria Jennis QA75 Electronic computers. Computer science Writer identification is essential today to identify the authenticity of a document in forensic expert decision-making. However, handwriting in various languages specifically Chinese poses a different challenge in identifying the writer. The main challenge faced by current researchers is that they fail to adopt traditional methods over an offline text independent Chinese writer identification scheme due to the complexity of Chinese writing structure and style. Furthermore, the previous method relies heavily on the selection of window size, which causes an ambiguity and leads to inconsistent results if the previous method is applied on a large image repository while finding the best-matched document from the database. Thus, much uncertainty still exists about the insurmountable searching space and the method has failed to show the effectiveness in searching relevant documents from a large image repository. This research attempted to solve problems by developing a new identification scheme for offline text-independent Chinese writer identification with the enhancement of feature extraction method and two-tier image retrieval mechanism to reduce search space and increase identification rates. The technique involved three essential steps. Firstly, the first-tier phase used Slantlet Transform based Local Binary Pattern (SLT-LBP) to bring out fine details. Then, sixty matching handwriting images were short-listed for the second-tier phase using Hierarchical Centroid (HC) of image pixels method for feature extraction. Finally, thirty shortlisted images were used as the input in the identification phase using Gray-Level Difference Method (GLDM) features. Experiment results had remarkably improved as compared to the previous method and the increase was from 95.4% to 96.68% in terms of identification rate as reported in the HIT-MW dataset. The contribution of this study is that it highlights the importance of using a two-tier retrieval mechanism to reduce search space in a large database in order to achieve higher accuracy. Besides, the development of a size-independent writer identification mechanism is a novelty as it can corroborate real-world application. 2019 Thesis http://eprints.utm.my/id/eprint/96211/ http://eprints.utm.my/id/eprint/96211/1/TanGloriaJennisPSC2019.pdf.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:142139 phd doctoral Universiti Teknologi Malaysia Faculty of Engineering - School of Computing
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic QA75 Electronic computers
Computer science
spellingShingle QA75 Electronic computers
Computer science
Tan, Gloria Jennis
Offline text-independent Chinese writer identification method with two-tier image retrieval
description Writer identification is essential today to identify the authenticity of a document in forensic expert decision-making. However, handwriting in various languages specifically Chinese poses a different challenge in identifying the writer. The main challenge faced by current researchers is that they fail to adopt traditional methods over an offline text independent Chinese writer identification scheme due to the complexity of Chinese writing structure and style. Furthermore, the previous method relies heavily on the selection of window size, which causes an ambiguity and leads to inconsistent results if the previous method is applied on a large image repository while finding the best-matched document from the database. Thus, much uncertainty still exists about the insurmountable searching space and the method has failed to show the effectiveness in searching relevant documents from a large image repository. This research attempted to solve problems by developing a new identification scheme for offline text-independent Chinese writer identification with the enhancement of feature extraction method and two-tier image retrieval mechanism to reduce search space and increase identification rates. The technique involved three essential steps. Firstly, the first-tier phase used Slantlet Transform based Local Binary Pattern (SLT-LBP) to bring out fine details. Then, sixty matching handwriting images were short-listed for the second-tier phase using Hierarchical Centroid (HC) of image pixels method for feature extraction. Finally, thirty shortlisted images were used as the input in the identification phase using Gray-Level Difference Method (GLDM) features. Experiment results had remarkably improved as compared to the previous method and the increase was from 95.4% to 96.68% in terms of identification rate as reported in the HIT-MW dataset. The contribution of this study is that it highlights the importance of using a two-tier retrieval mechanism to reduce search space in a large database in order to achieve higher accuracy. Besides, the development of a size-independent writer identification mechanism is a novelty as it can corroborate real-world application.
format Thesis
qualification_name Doctor of Philosophy (PhD.)
qualification_level Doctorate
author Tan, Gloria Jennis
author_facet Tan, Gloria Jennis
author_sort Tan, Gloria Jennis
title Offline text-independent Chinese writer identification method with two-tier image retrieval
title_short Offline text-independent Chinese writer identification method with two-tier image retrieval
title_full Offline text-independent Chinese writer identification method with two-tier image retrieval
title_fullStr Offline text-independent Chinese writer identification method with two-tier image retrieval
title_full_unstemmed Offline text-independent Chinese writer identification method with two-tier image retrieval
title_sort offline text-independent chinese writer identification method with two-tier image retrieval
granting_institution Universiti Teknologi Malaysia
granting_department Faculty of Engineering - School of Computing
publishDate 2019
url http://eprints.utm.my/id/eprint/96211/1/TanGloriaJennisPSC2019.pdf.pdf
_version_ 1747818648261623808