Statistical byte frequency analysis for identifying JPEG file segments

File carving is a file recovery technique based on file structure, without the assistance of file system metadata. The important concern here is how file recovery can take place for the file segments that cannot be linked to an existing image header. This project focuses on identifying JPEG file for...

Bibliographic Details
Main Author: Abdul Kadir, Nur Fasihah
Format: Thesis
Language: English
Published: 2015
Subjects: QA75 Electronic computers. Computer science
Online Access: http://eprints.utm.my/id/eprint/77989/1/NurFasihahAbdulMFC20151.pdf
id my-utm-ep.77989
record_format uketd_dc
spelling my-utm-ep.77989 2018-07-18T07:49:58Z Statistical byte frequency analysis for identifying JPEG file segments 2015-01 Abdul Kadir, Nur Fasihah QA75 Electronic computers. Computer science
2015-01 Thesis http://eprints.utm.my/id/eprint/77989/ http://eprints.utm.my/id/eprint/77989/1/NurFasihahAbdulMFC20151.pdf application/pdf en public http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:85325 masters Universiti Teknologi Malaysia, Faculty of Computing Faculty of Computing
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic QA75 Electronic computers
Computer science
description File carving is a file recovery technique based on file structure, without the assistance of file system metadata. The important concern here is how file recovery can take place for the file segments that cannot be linked to an existing image header. This project focuses on identifying the JPEG file format in hard disk storage. Digital images are broadly used in most industries. They play a vital role in advertising, education, filming activities, etc. In the business world, an image acts as an instant means of communication for presenting products and services promptly to the market. Rapid advancements in image processing technology make images more interactive and more modifiable to comply with particular preferences. However, this kind of adjustment will disturb the originality of the raw data. In previous works, researchers mostly focused on recovering file segments with the assistance of file markers, which might sometimes be corrupted. Thus, the statistical byte frequency technique is proposed as an alternative that addresses these limitations. In this study, the proposed solution was evaluated on its accuracy and efficiency in identifying the distributed segments. The simulation process involved four different JPEG file formats. The simulation indicates that the proposed technique gives better performance, in terms of accuracy, for the files to be carved. During the simulation, most of the segments were identified, with only a small gap between the four JPEG file formats. The results were obtained from a k-means clustering evaluation tool. For computational speed, it takes a shorter response time to find file patterns, which might be due to the smaller number of file segments. The results might be helpful for future reference in file carving programs.
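The general idea named in the description, byte frequency analysis followed by k-means clustering, can be illustrated with a minimal sketch. This is not the thesis's implementation, which this record does not describe in detail. The function names (`byte_histogram`, `kmeans`, `demo_labels`), the 4096-byte segment size, the farthest-first initialization, and the synthetic "text-like" and "high-entropy" segments (standing in for non-JPEG and compressed JPEG data) are all assumptions made for illustration.

```python
import random
from collections import Counter


def byte_histogram(segment: bytes) -> list[float]:
    """Return the relative frequency of each of the 256 byte values."""
    counts = Counter(segment)
    total = len(segment) or 1  # avoid division by zero for an empty segment
    return [counts.get(value, 0) / total for value in range(256)]


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, iterations=10):
    """Plain k-means with deterministic farthest-first initialization (an assumption)."""
    centroids = [vectors[0]]
    while len(centroids) < k:
        # Pick the vector that is farthest from every centroid chosen so far.
        centroids.append(
            max(vectors, key=lambda v: min(squared_distance(v, c) for c in centroids))
        )
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: each vector joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in vectors
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, label in zip(vectors, labels) if label == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def demo_labels(seed=0, segment_size=4096):
    """Cluster synthetic segments: 4 text-like, then 4 high-entropy (hypothetical data)."""
    rng = random.Random(seed)
    alphabet = b"abcdefghijklmnopqrstuvwxyz "
    text_like = [
        bytes(rng.choice(alphabet) for _ in range(segment_size)) for _ in range(4)
    ]
    high_entropy = [
        bytes(rng.randrange(256) for _ in range(segment_size)) for _ in range(4)
    ]
    vectors = [byte_histogram(s) for s in text_like + high_entropy]
    return kmeans(vectors, k=2)
```

Calling `demo_labels()` should place the four text-like segments in one cluster and the four high-entropy segments in the other, since their byte histograms differ far more between groups than within a group. Real JPEG segments would need to be read from a disk image, and distinguishing them from other compressed formats is likely to be harder than this toy case suggests.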
format Thesis
qualification_level Master's degree
author Abdul Kadir, Nur Fasihah
title Statistical byte frequency analysis for identifying JPEG file segments
granting_institution Universiti Teknologi Malaysia, Faculty of Computing
granting_department Faculty of Computing
publishDate 2015
url http://eprints.utm.my/id/eprint/77989/1/NurFasihahAbdulMFC20151.pdf
_version_ 1747817880105254912