Text extraction from invariant complex image

Great progress has been made in Optical Character Recognition (OCR) technology. Most current OCRs, however, can only read characters printed on sheets of paper according to some rigid format restrictions. For that, the detection and extraction of text regions in an image are well known problems in C...

Full description

Saved in:
Bibliographic Details
Main Author: Al Hashi, Nouri Ali Al Mabrouk
Format: Thesis
Language:English
Published: 2009
Subjects:
Online Access:http://eprints.utm.my/id/eprint/12767/1/NouriAliAlMabroukMFSKSM2009.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-utm-ep.12767
record_format uketd_dc
spelling my-utm-ep.127672018-06-25T08:59:43Z Text extraction from invariant complex image 2009 Al Hashi, Nouri Ali Al Mabrouk QA76 Computer software Great progress has been made in Optical Character Recognition (OCR) technology. Most current OCRs, however, can only read characters printed on sheets of paper according to some rigid format restrictions. For that, the detection and extraction of text regions in an image are well known problems in Computer Vision research area. The goal of this project is to extract and recognize the text from an image by using the edge-based and fuzzy logic algorithm respectively. The algorithms are implemented and evaluated by using a set of images of natural scenes that vary along its’ size, scale and orientation. Various kernels can be used for this operation ,the whole set of 8 kernels is produced by taking one of kernels and rotating its coefficient circularly and edgedetection operator is calculated by forming matrix centered on pixel chosen as center of matrix area, then Localization involves further enhancing regions by eliminating nontext regions. Edge-detection works quite well for digital image corrupted with multiscale and multi-orientation whereas its performance of this operator cannot be used in practical image which generally corrupted other types and edge-detection for detection of edge in digital image is that image should contain sharp intensity transition and low noise of the type is present. Moreover the image is colored image .Then, edge detect at eight edges and convolve with Gaussian after that select the strong edge was suitable of detect the text. As known be the project in complex image by using eight kernels to accomplish the task .Then, we used identified pixel of determine the character with use fuzzy logic. 2009 Thesis http://eprints.utm.my/id/eprint/12767/ http://eprints.utm.my/id/eprint/12767/1/NouriAliAlMabroukMFSKSM2009.pdf application/pdf en public masters Universiti Teknologi Malaysia, Faculty of Computer Science and Information Systems Faculty of Computer Science and Information Systems
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic QA76 Computer software
spellingShingle QA76 Computer software
Al Hashi, Nouri Ali Al Mabrouk
Text extraction from invariant complex image
description Great progress has been made in Optical Character Recognition (OCR) technology. Most current OCRs, however, can only read characters printed on sheets of paper according to some rigid format restrictions. For that, the detection and extraction of text regions in an image are well known problems in Computer Vision research area. The goal of this project is to extract and recognize the text from an image by using the edge-based and fuzzy logic algorithm respectively. The algorithms are implemented and evaluated by using a set of images of natural scenes that vary along its’ size, scale and orientation. Various kernels can be used for this operation ,the whole set of 8 kernels is produced by taking one of kernels and rotating its coefficient circularly and edgedetection operator is calculated by forming matrix centered on pixel chosen as center of matrix area, then Localization involves further enhancing regions by eliminating nontext regions. Edge-detection works quite well for digital image corrupted with multiscale and multi-orientation whereas its performance of this operator cannot be used in practical image which generally corrupted other types and edge-detection for detection of edge in digital image is that image should contain sharp intensity transition and low noise of the type is present. Moreover the image is colored image .Then, edge detect at eight edges and convolve with Gaussian after that select the strong edge was suitable of detect the text. As known be the project in complex image by using eight kernels to accomplish the task .Then, we used identified pixel of determine the character with use fuzzy logic.
format Thesis
qualification_level Master's degree
author Al Hashi, Nouri Ali Al Mabrouk
author_facet Al Hashi, Nouri Ali Al Mabrouk
author_sort Al Hashi, Nouri Ali Al Mabrouk
title Text extraction from invariant complex image
title_short Text extraction from invariant complex image
title_full Text extraction from invariant complex image
title_fullStr Text extraction from invariant complex image
title_full_unstemmed Text extraction from invariant complex image
title_sort text extraction from invariant complex image
granting_institution Universiti Teknologi Malaysia, Faculty of Computer Science and Information Systems
granting_department Faculty of Computer Science and Information Systems
publishDate 2009
url http://eprints.utm.my/id/eprint/12767/1/NouriAliAlMabroukMFSKSM2009.pdf
_version_ 1747814956349259776