Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic
The rapid development of the Internet has increased the number of blog writers. Some blog sites focus on solving important problems. Finding specific blogs is hard for users because many blogs contain useless information such as online advertisements, notices and nois...
Main Author: | Mohammed, Athraa Jasim |
---|---|
Format: | Thesis |
Language: | eng eng |
Published: | 2012 |
Subjects: | QA76.76 Fuzzy System. |
Online Access: | https://etd.uum.edu.my/3304/1/ATHRAA_JASIM_MOHAMMED.pdf https://etd.uum.edu.my/3304/3/ATHRAA_JASIM_MOHAMMED.pdf |
id |
my-uum-etd.3304 |
---|---|
record_format |
uketd_dc |
institution |
Universiti Utara Malaysia |
collection |
UUM ETD |
language |
eng eng |
advisor |
Husni, Husniza |
topic |
QA76.76 Fuzzy System. |
spellingShingle |
QA76.76 Fuzzy System. Mohammed, Athraa Jasim Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic |
description |
The rapid development of the Internet has increased the number of blog writers. Some blog sites focus on solving important problems. Finding specific blogs is hard for users because many blogs contain useless information such as online advertisements, notices and noise, which lowers the rank of the blog site. Retrieving more relevant blogs is a further problem that lowers search performance. This study proposes a blog search engine that adopts RSS syndication and uses fuzzy logic. The search engine consists of three main phases: crawling with an RSS feeds algorithm, indexing with a weblog indexing algorithm, and searching with a fuzzy logic technique. In the crawling phase, RSS feeds are gathered to extract useful information such as title, link, publish time and description. The indexing phase uses the links to retrieve the blog sites for text processing and to construct the index database. To retrieve the information a user needs, a user interface accepts keywords with an importance degree, and the system computes the density of each keyword from the index database. The rank of the pages is computed from the fuzzy weighted average value. A prototype was built in Visual Basic 2008 to validate the proposed blog search engine; it is a Windows application that uses the HTTP connection protocol. The system was evaluated with two performance measures, precision and mean average precision. The precision parameters were determined by respondents, who identified the total retrieved links and the total relevant links in the search result for each keyword. Five pairs of keywords were used to test the system. The experimental results show a mean average precision of 81.7% for the whole system. Of the respondents, 80% know and use blogs and 20% have no knowledge of them. The execution time reported by respondents was between 3 and 5 minutes for 70% of them and less than 3 minutes for 30%. This result is good considering that 80% of respondents were satisfied with the system and 20% were strongly satisfied. |
format |
Thesis |
qualification_name |
masters |
qualification_level |
Master's degree |
author |
Mohammed, Athraa Jasim |
author_facet |
Mohammed, Athraa Jasim |
author_sort |
Mohammed, Athraa Jasim |
title |
Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic |
title_short |
Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic |
title_full |
Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic |
title_fullStr |
Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic |
title_full_unstemmed |
Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic |
title_sort |
blogs search engine adopting rss syndication using fuzzy logic |
granting_institution |
Universiti Utara Malaysia |
granting_department |
Awang Had Salleh Graduate School of Arts & Sciences |
publishDate |
2012 |
url |
https://etd.uum.edu.my/3304/1/ATHRAA_JASIM_MOHAMMED.pdf https://etd.uum.edu.my/3304/3/ATHRAA_JASIM_MOHAMMED.pdf |
_version_ |
1747827542739386368 |
spelling |
my-uum-etd.3304 2016-04-27T03:57:59Z Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic 2012 Mohammed, Athraa Jasim Husni, Husniza Awang Had Salleh Graduate School of Arts & Sciences Awang Had Salleh Graduate School of Arts and Sciences QA76.76 Fuzzy System. 2012 Thesis https://etd.uum.edu.my/3304/ https://etd.uum.edu.my/3304/1/ATHRAA_JASIM_MOHAMMED.pdf text eng validuser https://etd.uum.edu.my/3304/3/ATHRAA_JASIM_MOHAMMED.pdf text eng public masters masters Universiti Utara Malaysia |
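
The ranking and evaluation steps named in the description can be illustrated with a minimal Python sketch. It is not the thesis's Visual Basic 2008 implementation. It assumes keyword density means the share of words on a page that match the keyword, it replaces the fuzzy weighted average with a plain (crisp) weighted average over importance degrees, and it uses the standard information-retrieval definition of mean average precision, which may differ from the one used in the thesis. All function names and the sample data are hypothetical.

```python
from typing import Dict, List


def keyword_density(text: str, keyword: str) -> float:
    """Share of words in `text` equal to `keyword` (assumed definition)."""
    words = text.lower().split()
    if not words:
        return 0.0
    return words.count(keyword.lower()) / len(words)


def weighted_average_score(text: str, importance: Dict[str, float]) -> float:
    """Crisp weighted average of keyword densities.

    `importance` maps each keyword to its importance degree. This is a
    simplification: a true fuzzy weighted average operates on fuzzy numbers.
    """
    total_weight = sum(importance.values())
    if total_weight == 0:
        return 0.0
    weighted = sum(w * keyword_density(text, kw) for kw, w in importance.items())
    return weighted / total_weight


def rank_pages(pages: Dict[str, str], importance: Dict[str, float]) -> List[str]:
    """Return page links ordered from highest to lowest score."""
    return sorted(
        pages,
        key=lambda link: weighted_average_score(pages[link], importance),
        reverse=True,
    )


def precision(relevant_retrieved: int, total_retrieved: int) -> float:
    """Relevant retrieved links divided by total retrieved links."""
    return relevant_retrieved / total_retrieved if total_retrieved else 0.0


def average_precision(judgments: List[bool]) -> float:
    """Mean of precision@k over the ranks k that hold a relevant result."""
    hits = 0
    precisions = []
    for rank, relevant in enumerate(judgments, start=1):
        if relevant:
            hits += 1
            precisions.append(hits / rank)
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(queries: List[List[bool]]) -> float:
    """Average of `average_precision` over all queries."""
    if not queries:
        return 0.0
    return sum(average_precision(q) for q in queries) / len(queries)


if __name__ == "__main__":
    # Hypothetical indexed pages and importance degrees.
    pages = {"a": "fuzzy logic fuzzy search", "b": "rss feed search engine"}
    importance = {"fuzzy": 0.8, "rss": 0.2}
    print(rank_pages(pages, importance))
    print(mean_average_precision([[True, False, True], [False, True]]))
```

In this sketch the relevance judgments passed to `mean_average_precision` stand in for the respondents' assessments of which retrieved links are relevant.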