Malaysia household incomes classification prediction with k-means clustering and fuzzy inference system

The economy level of the citizen has become a main concern for Malaysia as a developing country to improve the living status. On this point of view, the household income data would be a very useful information to measure the economic status of the population in Malaysia. This study aims to build a c...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Hamzah, Nur Atiqah
التنسيق: أطروحة
اللغة:English
English
English
منشور في: 2018
الموضوعات:
الوصول للمادة أونلاين:http://eprints.uthm.edu.my/20/1/24p%20NUR%20ATIQAH%20BINTI%20HAMZAH.pdf
http://eprints.uthm.edu.my/20/2/NUR%20ATIQAH%20BINTI%20HAMZAH%20COPYRIGHT%20DECLARATION.pdf
http://eprints.uthm.edu.my/20/3/NUR%20ATIQAH%20BINTI%20HAMZAH%20WATERMARK.pdf
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
الوصف
الملخص:The economy level of the citizen has become a main concern for Malaysia as a developing country to improve the living status. On this point of view, the household income data would be a very useful information to measure the economic status of the population in Malaysia. This study aims to build a classification prediction of household incomes using fuzzy inference system (FIS) from the K-means clustering outputs. Thus, this study focuses on three main objectives which are (a) To apply K-means clustering on household incomes data, (b) To propose the prediction of household incomes classification using FIS, and (c) To analyze and validate the classification solution for household incomes and to compare with discriminant analysis. Initially, the number of groups in the household income data is determined by using K-means clustering. Accordingly, the outputs from K-means clustering are used to identify the membership functions, namely, triangle, trapezoidal and Gaussian membership functions. Furthermore, FIS models for each membership function are built for the household income class prediction based on clustering outputs. For verification, the root mean square error (RMSE) value for each FIS model is calculated and the percentage of data correctly classified using the FIS models built is compared with the discriminant analysis output. As a result, it is found that Mamdani FIS model with Gaussian membership function is the best model with the RMSE is 1.0396, while the percentage of data correctly classified is 64.9989%. In conclusion, the classification prediction of household incomes discussed in this thesis could identify the predicted class of household income in a tractable way and the efficiency of the technique used in this thesis for classification prediction of household income is highly recommended.