Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic

The rapid development of Internet increases the writers of blog sites. Sometimes these blog sites focused on solving some important problems. To find specific blogs are hard problem for the users because a lot of these blogs contain unuseful information such as online advertisements, notice and nois...

Full description

Saved in:
Bibliographic Details
Main Author: Mohammed, Athraa Jasim
Format: Thesis
Language:eng
eng
Published: 2012
Subjects:
Online Access:https://etd.uum.edu.my/3304/1/ATHRAA_JASIM_MOHAMMED.pdf
https://etd.uum.edu.my/3304/3/ATHRAA_JASIM_MOHAMMED.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-uum-etd.3304
record_format uketd_dc
institution Universiti Utara Malaysia
collection UUM ETD
language eng
eng
advisor Husni, Husniza
topic QA76.76 Fuzzy System.
spellingShingle QA76.76 Fuzzy System.
Mohammed, Athraa Jasim
Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic
description The rapid development of Internet increases the writers of blog sites. Sometimes these blog sites focused on solving some important problems. To find specific blogs are hard problem for the users because a lot of these blogs contain unuseful information such as online advertisements, notice and noise which minimize the rank of blog site. Furthermore to retrieve more relevant blogs is another problem which lowering the search performance. This study proposes blogs search engine adopting RSS syndication using Fuzzy logic. The blogs search engine consists of three main phases which are crawling using RSS feeds algorithm, indexing weblogs algorithm and searching technique with Fuzzy logic. In RSS crawling process RSS feeds need to be gathered to extract useful information such as title, links, publish time and description. Indexing weblogs use the links to retrieve the blogs sites for text processing and construct indexing database. In order to retrieve such information needed by any user, there is user interface to search for keyword with importance degree and compute the density of keyword from the indexing database. The rank of the pages is computed based on fuzzy weighted average value. A prototype is built using visual basic 2008 to validate the proposed blogs search engine. It is a windows application with http connection protocol. In system evaluation used two measurement performances which are precision and mean average precision. The parameters of precision determine based on respondents whom determine the total retrieved links and the total relevant links for the keyword search result. The number of keywords that used in testing system is five pairs keywords. The experimental results show that the mean average precision is 81.7% of the whole system performance. The percent of respondents is 80% who knows and uses the blogs and 20% don’t have knowledge. The execution time of the system based on respondents is 70% between 3-5 minute and 30% less than 3 minute. This percentage is good considering the rate of satisfaction for system is 80% satisfied and 20% strongly satisfied.
format Thesis
qualification_name masters
qualification_level Master's degree
author Mohammed, Athraa Jasim
author_facet Mohammed, Athraa Jasim
author_sort Mohammed, Athraa Jasim
title Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic
title_short Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic
title_full Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic
title_fullStr Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic
title_full_unstemmed Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic
title_sort blogs search engine adopting rss syndication using fuzzy logic
granting_institution Universiti Utara Malaysia
granting_department Awang Had Salleh Graduate School of Arts & Sciences
publishDate 2012
url https://etd.uum.edu.my/3304/1/ATHRAA_JASIM_MOHAMMED.pdf
https://etd.uum.edu.my/3304/3/ATHRAA_JASIM_MOHAMMED.pdf
_version_ 1747827542739386368
spelling my-uum-etd.33042016-04-27T03:57:59Z Blogs Search Engine Adopting RSS Syndication Using Fuzzy Logic 2012 Mohammed, Athraa Jasim Husni, Husniza Awang Had Salleh Graduate School of Arts & Sciences Awang Had Salleh Graduate School of Arts and Sciences QA76.76 Fuzzy System. The rapid development of Internet increases the writers of blog sites. Sometimes these blog sites focused on solving some important problems. To find specific blogs are hard problem for the users because a lot of these blogs contain unuseful information such as online advertisements, notice and noise which minimize the rank of blog site. Furthermore to retrieve more relevant blogs is another problem which lowering the search performance. This study proposes blogs search engine adopting RSS syndication using Fuzzy logic. The blogs search engine consists of three main phases which are crawling using RSS feeds algorithm, indexing weblogs algorithm and searching technique with Fuzzy logic. In RSS crawling process RSS feeds need to be gathered to extract useful information such as title, links, publish time and description. Indexing weblogs use the links to retrieve the blogs sites for text processing and construct indexing database. In order to retrieve such information needed by any user, there is user interface to search for keyword with importance degree and compute the density of keyword from the indexing database. The rank of the pages is computed based on fuzzy weighted average value. A prototype is built using visual basic 2008 to validate the proposed blogs search engine. It is a windows application with http connection protocol. In system evaluation used two measurement performances which are precision and mean average precision. The parameters of precision determine based on respondents whom determine the total retrieved links and the total relevant links for the keyword search result. The number of keywords that used in testing system is five pairs keywords. The experimental results show that the mean average precision is 81.7% of the whole system performance. The percent of respondents is 80% who knows and uses the blogs and 20% don’t have knowledge. The execution time of the system based on respondents is 70% between 3-5 minute and 30% less than 3 minute. This percentage is good considering the rate of satisfaction for system is 80% satisfied and 20% strongly satisfied. 2012 Thesis https://etd.uum.edu.my/3304/ https://etd.uum.edu.my/3304/1/ATHRAA_JASIM_MOHAMMED.pdf text eng validuser https://etd.uum.edu.my/3304/3/ATHRAA_JASIM_MOHAMMED.pdf text eng public masters masters Universiti Utara Malaysia Bouras,C., Poulopoulos,V., & Silintziris,P. (2009). Personalized news search in www: adapting on user’s behaviour, fourth international conference on internet and web applications and services, 125-130, doi: 10.1109/ICIW. 2009.25. Bracewell,D., Gustafson,S., Moitra,A., & Steuben,G. (2010). WISDOM from light-weight information retrieval, IEEE international conference on social computing / IEEE international conference on privacy, security, risk and trust, 347-354, doi: 10.1109/SocialCom.2010.57. Chong,T. (2010). A kind of algorithm for page ranking based on classified tree in search engine, International conference on computer application and system modeling (ICCASM 2010), 04 November 2010, v13-538 - v13-541, doi: 10. 1109/ICCASM.2010.5622891. Ding,L., Finin,T., Joshi,A., Pan,P., Cost,R., Peng,Y., Reddivari,P., Doshi,V., & Sachs,J. (2004).Swoogle: a search and metadata engine for the semantic web, CIKM’04, November 8–13, 2004, Washington, DC, USA, ACM,652-659, doi: 10.1145/ 1031171.1031289. Gao,W., Tian,Y., Huang,T., & Yang,Q. (2010). Vlogging: a survey of videoblogging technology on the web, ACM Comput. Surv. 42(4), Article 15 (June 2010), 57 pages, doi:10.1145/1749603.1749606. Gulli,A. (2005). The anatomy of a news search engine, ACM, May 10–14,2005, Chiba, Japan,880-881, doi: 10.1145/1062745. 1062778. Hirokawa,S., Yin,C. , & Nakatoh,T. (2011). Component-based search engine for blogs, 2011 IEEE international conference on fuzzy systems, June 27-30, 2011, Taipei, Taiwan, 1074- 1078, doi: 10.1109/FUZZY.2011.6007650. Jiang,Z., & Deng,X. (2010). A personalized search engine model based on RSS user’s interest, 2010 2nd international conference on future computer and communication, V2-196 - V2-199, doi: 10.1109/ICFCC.2010.5497371. Keong,B., & Anthony,P. (2011). Meta search engine powered by DBpedia, 2011 international conference on semantic technology and information retrieval, 28-29 June 2011, Putrajaya, Malaysia, 89-93,doi: 10.1109/STAIR.2011.5995770. Kim,K., & Cho,S. (2001). A personalized web search engine using fuzzy concept network with link structure, IFSA world congress and 20th NAFIPS international conference, 2001. Joint 9th, 81-86 vol.1, doi: 10.1109/NAFIPS.2001.944231. Lai,L., Wu,C., Lin,P., & Huang,L. (2011).Developing a fuzzy search engine based on fuzzy ontology and semantic search, 2011 IEEE international conference on fuzzy systems, June 27-30, 2011, Taipei, Taiwan, 2684-2689, doi: 10.1109/FUZZY. 2011.6007378. Laughlin,A., Olson,J., Simpson,D., & Inoue,A. (2011). Page ranking refinement using fuzzy sets and logic, Proceedings of The 22nd Midwest artificial intelligence and cognitive science conference 2011,Cincinnati, USA, April 16-17, 2011, 40-46. Lee,W., Jung-Hoon,J., Kim,Y., & Kai-Sang,C. (2009). Anchor- Woman: top-k structured mobile web search engine, CIKM’09, November 2–6, 2009, Hong Kong, China, ACM, 2089-2090, doi: 10.1145/1645953.1646317. Li,G., Ji,S., Li,C., Wang,J.,& Feng,J. (2010). Efficient fuzzy type-ahead search in TASTIER, ICDE conference 2010, IEEE, 1105-1108, doi: 10.1109/ICDE.2010.5447804. Lin,Y., Lai,L., Wu,C.,& Huang,L. (2010). A self-adaptation approach to fuzzy-go search engine, Computer symposium (ICS), 2010 International IEEE, 1020-1025, doi: 10.1109/ COMPSYM.2010.5685543. Matsumoto,T., & Hung,E. (2010). Fuzzy clustering and relevance ranking of web search results with differentiating cluster label generation, Fuzzy Systems (FUZZ), IEEE international conference on2010, 23 September 2010, 1–8, doi: 10.1109/fuzzy.2010.5584771. Meghabghab,G., & Kandel,A. (2008). Search engines, link analysis, and user's web behaviour, Berlin Heidelberg: Springer-Verlag. Negnevitsky,M. (2011). Artificial intelligence: a guide to intelligent systems, third edition, Wesley. Park,J., Shin,Y., Kim,K., & Chung,B. (2010). Searching the long tail of social media streams on the web, IEEE intelligent systems, 09 November 2010, doi: 10.1109/MIS.2010 .115. Pavlacka,O.,& Talasova,J. (2006). Application of the fuzzy weighted average of fuzzy numbers in decision making models. Phoey Lee,T., Abdul Ghani,A., Ibrahim,H., & Atan,R. (2009). Coalescence of XML-based Really Simple Syndication(RSS) aggregator for blogosphere, Proceedings of the 7th International Conference on Advances in Mobile Computing and Multimedia, December 14–16, 2009, Kuala Lumpur, Malaysia, ACM, 530-534, doi: 10.1145/1821748.1821850. Shang,W., Wang,T., & Lv,R. (2011). The key technology research of intelligent information syndication, 2011 fourth international joint conference on computational sciences and optimization, 865-867, doi: 10.1109/cso.2011. 275. Snasel,V., Kromer,P., Nyongesa,H., Musilek,P., & Husek,D. (2007). Fuzzy modeling of user needs for improvement of web search queries, Fuzzy information processing society, 2007. NAFIPS '07. annual meeting of the north American, 24-27 June 2007, 446 – 451,doi: 10.1109/nafips.2007.383881. Topac,V. (2010).Efficient fuzzy search enabled hash map,4th international workshop on soft computing applications, 15-17 July, 2010 -Arad, Romania, IEEE, 39-44, doi: 10.1109/ SOFA.2010.5565628. Xu,G., Zhang,Y., & Li,L. (2011). Web mining and social networking, techniques and application, New York, Springer, doi 10.1007/978-4419-7735-9. Yang,S., Zi-tao,L., Cheng,L., & Ye,L. (2009). Research on social network based on meta-search engine, 2009 sixth web information systems and applications conference, 179-183, doi 10.1109/wisa.2009.21. Zhang,X., Xu,C., Cheng,J., Lu,H., & Ma,S. (2009). Effective annotation and search for video blogs with integration of context and content analysis. IEEE transactions on multimedia, 11(2), 272-285, doi: 10.1109/TMM.2008.2009689. Zhou,Y., Chen,X., & Wang,C. (2006).A self-organizing search engine for RSS syndicated web contents, Proceedings of the 22nd international conference on data engineering workshops (ICDEW'06), 24 April 2006, Atlanta, GA, USA, IEEE Computer Society, doi: 10.1109/ICDEW.2006.19. Zhu,J., & Wang,H. (2010). Application of E-commerce personality searching based on RSS, 2010 2nd IEEE international conference on information management and engineering (ICIME), 197– 199, doi: 10.1109/ICIME.2010.54780 85.