Comparison of different automatic text summarization systems using standard performance evaluations

Many automatic summarization systems can produce a summary from a single text document. Different systems select different content for their summaries, even when the percentage of sentences out of the whole singl...

Bibliographic Details
Main Author: Abd Munir, Nur Hafizah
Format: Thesis
Language: English
Published: 2009
Subjects: QA75 Electronic computers. Computer science
Online Access: http://eprints.utm.my/id/eprint/18202/1/NurhafizahAbdMunirMFSKSM2009.pdf
id my-utm-ep.18202
record_format uketd_dc
spelling my-utm-ep.18202 2018-06-25T09:00:08Z Comparison of different automatic text summarization systems using standard performance evaluations 2009-04 Abd Munir, Nur Hafizah QA75 Electronic computers. Computer science 2009-04 Thesis http://eprints.utm.my/id/eprint/18202/ http://eprints.utm.my/id/eprint/18202/1/NurhafizahAbdMunirMFSKSM2009.pdf application/pdf en public masters Universiti Teknologi Malaysia, Faculty of Computer Science and Information System Faculty of Computer Science and Information System
institution Universiti Teknologi Malaysia
collection UTM Institutional Repository
language English
topic QA75 Electronic computers. Computer science
description Many automatic summarization systems can produce a summary from a single text document. Different systems select different content for their summaries, even when the percentage of sentences to extract from the document is set to the same value. In this study, three automatic summarization systems are used to produce summaries: Microsoft Word Automatic Summarization, Shvoong Summarization and Simple Text Summarization in PHP. The resulting summaries are measured using standard performance evaluation metrics: recall, precision and F-measure. The dataset consists of about 50 articles on the Iskandar Regional Development Authority (IRDA), collected from the online editions of the New Straits Times and The Star. Two of the systems, Microsoft Word Automatic Summarization and Shvoong Summarization, already existed; the third, Simple Text Summarization in PHP, was coded in PHP. The coded system applies several operations, including stop-word removal, stemming, normalization, weighted term-frequency creation and application of the summarization technique. The results from all three systems are stored in a database. The systems are compared using standard performance evaluation, carried out entirely without a human evaluator. A program written in Perl analyzes the performance and produces statistics for all summary results from the three systems. From the experimental results, it can be concluded that Shvoong Summarization is the most effective automatic summarization system for single text documents.
format Thesis
qualification_level Master's degree
author Abd Munir, Nur Hafizah
title Comparison of different automatic text summarization systems using standard performance evaluations
granting_institution Universiti Teknologi Malaysia, Faculty of Computer Science and Information System
granting_department Faculty of Computer Science and Information System
publishDate 2009
url http://eprints.utm.my/id/eprint/18202/1/NurhafizahAbdMunirMFSKSM2009.pdf
_version_ 1747815217584144384
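
The description names recall, precision and F-measure as the evaluation metrics, but this record does not give their formulas. A minimal sketch of how they could be computed for an extractive summary, assuming sentence-level matching against a reference selection. The record does not say how the reference was obtained; the function name and the sample sentence indices are hypothetical, and Python is used for illustration rather than the Perl of the thesis program.

```python
def precision_recall_f1(system_sentences, reference_sentences):
    """Score a system summary against a reference, both given as
    collections of sentence identifiers (hypothetical representation)."""
    system = set(system_sentences)
    reference = set(reference_sentences)
    overlap = len(system & reference)  # sentences chosen by both

    # Precision: share of the system's sentences that are in the reference.
    precision = overlap / len(system) if system else 0.0
    # Recall: share of the reference sentences that the system found.
    recall = overlap / len(reference) if reference else 0.0

    # F-measure: harmonic mean of precision and recall (0 when both are 0).
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Illustrative sentence indices only; not data from the thesis.
    p, r, f = precision_recall_f1([0, 1, 2, 3], [1, 2, 5])
    print(f"precision={p:.3f} recall={r:.3f} f1={f:.3f}")
```

Averaging these three scores over every article for each system would give one comparable set of figures per system, which is the kind of statistic the description attributes to the analysis program.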