Robust diagnostic and estimation for binary logistic regression model in the presence of multicollinearity and high leverage points

The binary logistic regression model is widely used in medical data analysis. Despite its popularity, only a few robust methods are available for this model to counter the effects of high leverage points and multicollinearity. Failure to address model adequacy when a combination of hi...

Bibliographic Details
Main Author: Ariffin @ Mat Zin, Syaiba Balqish
Format: Thesis
Language: English
Published: 2018
Subjects: Mathematics - Research; Binary system (Mathematics)
Online Access: http://psasir.upm.edu.my/id/eprint/83721/1/FS%202019%2041%20-%20ir.pdf
id my-upm-ir.83721
record_format uketd_dc
spelling my-upm-ir.83721 2022-01-05T02:34:53Z 2018-12 Thesis http://psasir.upm.edu.my/id/eprint/83721/ text en public doctoral Universiti Putra Malaysia
institution Universiti Putra Malaysia
collection PSAS Institutional Repository
language English
advisor Midi, Habshah
topic Mathematics - Research
Binary system (Mathematics)

description The binary logistic regression model is widely used in medical data analysis. Despite its popularity, only a few robust methods are available for this model to counter the effects of high leverage points and multicollinearity. Failure to address model adequacy when a combination of high leverage points and multicollinearity exists in the data leads to misleading and incorrect inferences. This study aims to develop new robust diagnostic and estimation methods for logistic regression (overlap cases) and hidden logistic regression (non-overlap cases). A new robust diagnostic called the Logistic Influential Outlier Nominator (LION) is developed to identify influential outliers, and it successfully detects outliers in both the x and y directions. A second robust diagnostic, Diagnostic Influential Observations (DIO), is then developed specifically to identify high leverage influential observations (HLIO). The DIO has two stages: the initial stage employs the LION procedure, and the confirmation stage combines the Generalized Distance from the Mean and the Generalized Standardized Pearson Residual to flag the HLIO. Adjusted Weighted Bianco and Yohai (AWEBY) is an improvement on the Weighted Bianco and Yohai (WBY) robust estimator. The AWEBY is proposed to increase the efficiency of the WBY estimator by constructing a "smooth rejection" weight function to replace the "hard rejection" weight function. In the AWEBY, new robust weights are formulated based on the DIO and are found to properly reduce the effect of HLIO whilst protecting the good leverage points. For the combined problem of HLIO and multicollinearity in overlap cases, the AWEBY estimator is used to compute a robust ridge parameter, forming the Robust Ridge Logistic (RRL) iterative update scheme. With the updated robust weights, the impact of the HLIO and multicollinearity is greatly reduced.
Adjusted Weighted Maximum Estimated Likelihood (AWEMEL) in hidden logistic regression is proposed to rectify the HLIO in the separation problem. New robust weights in the AWEMEL are designed based on the DIO so that they downweight the HLIO but not the good leverage points. Finally, Robust Ridge Hidden Logistic (RRHL) is proposed to remedy both HLIO and multicollinearity in the separation problem. In each RRHL iteration, the AWEMEL estimator is employed to compute a robust ridge parameter that is resistant to the adverse effects of HLIO.
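The contrast the abstract draws between "hard rejection" and "smooth rejection" weights, and the use of such weights inside a ridge-penalized logistic fit, can be illustrated with a minimal Python sketch. This is not the thesis's AWEBY, DIO, or RRL procedure, whose formulas are not given in this record. The distance measure (`robust_distance`, a coordinate-wise median/MAD distance), the cutoff, the squared-ratio decay, and the fixed ridge parameter `k` are all assumptions chosen for illustration.

```python
import numpy as np


def robust_distance(X):
    """Hypothetical stand-in for a leverage diagnostic: the largest
    coordinate-wise |x - median| / (1.4826 * MAD) for each observation."""
    X = np.asarray(X, dtype=float)
    med = np.median(X, axis=0)
    mad = 1.4826 * np.median(np.abs(X - med), axis=0)
    return np.max(np.abs(X - med) / np.maximum(mad, 1e-12), axis=1)


def hard_rejection_weights(d, cutoff):
    """Weight 1 up to the cutoff, 0 beyond it (observation is discarded)."""
    d = np.asarray(d, dtype=float)
    return (d <= cutoff).astype(float)


def smooth_rejection_weights(d, cutoff):
    """Weight 1 up to the cutoff, then decaying as (cutoff / d)^2.
    The decay form is an assumption, not the thesis's weight function."""
    d = np.asarray(d, dtype=float)
    return np.minimum(1.0, (cutoff / np.maximum(d, 1e-12)) ** 2)


def weighted_ridge_logistic(X, y, w, k, n_iter=50, tol=1e-8):
    """Newton iterations for a weighted logistic log-likelihood with a
    ridge penalty (k / 2) * ||slopes||^2; the intercept is not penalized."""
    X1 = np.column_stack([np.ones(len(y)), np.asarray(X, dtype=float)])
    penalty = k * np.eye(X1.shape[1])
    penalty[0, 0] = 0.0  # leave the intercept unpenalized
    beta = np.zeros(X1.shape[1])
    for _ in range(n_iter):
        mu = 1.0 / (1.0 + np.exp(-(X1 @ beta)))
        grad = X1.T @ (w * (y - mu)) - penalty @ beta
        hess = X1.T @ (X1 * (w * mu * (1.0 - mu))[:, None]) + penalty
        step = np.linalg.solve(hess, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


# Distances 1 and 2 are within the cutoff of 2; distance 4 is beyond it.
print(hard_rejection_weights([1.0, 2.0, 4.0], cutoff=2.0))    # [1. 1. 0.]
print(smooth_rejection_weights([1.0, 2.0, 4.0], cutoff=2.0))  # [1. 1. 0.25]
```

In this sketch a hard rejection rule discards every point beyond the cutoff, whereas the smooth rule keeps it with a reduced weight, which is the general idea behind trading some robustness for efficiency. The weights would then be passed as `w` to `weighted_ridge_logistic`, where a larger `k` shrinks the slope estimates more strongly when predictors are nearly collinear.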
format Thesis
qualification_level Doctorate
author Ariffin @ Mat Zin, Syaiba Balqish
title Robust diagnostic and estimation for binary logistic regression model in the presence of multicollinearity and high leverage points
granting_institution Universiti Putra Malaysia
publishDate 2018
url http://psasir.upm.edu.my/id/eprint/83721/1/FS%202019%2041%20-%20ir.pdf
_version_ 1747813411955146752