Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks
Traditionally, the regression analysis for competing risks survival time is based on the cause-specific hazard that treat failures from causes other than the cause of interest as censored observations. That includes technique such as the Cox proportional hazard model. The modelling of hazard rat...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English English |
Published: |
2008
|
Subjects: | |
Online Access: | http://psasir.upm.edu.my/id/eprint/5418/1/IPM_2008_1.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-upm-ir.5418 |
---|---|
record_format |
uketd_dc |
spelling |
my-upm-ir.54182013-05-27T07:22:40Z Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks 2008 Kudus, Abdul Traditionally, the regression analysis for competing risks survival time is based on the cause-specific hazard that treat failures from causes other than the cause of interest as censored observations. That includes technique such as the Cox proportional hazard model. The modelling of hazard rate may or may not match the objective of investigator. It is often more desirable to investigate the subdistribution function, because cause-specific hazard doesn’t obviously give the information about proportion of individuals experiencing a cause of interest. Furthermore, the subdistribution and cause-specific hazard function are not interchangeable. Thus, if we intended to draw inference from subdistribution function, then we must model on subdistribution function directly or indirectly. Sometimes, we do not only intend to investigate the relationship between response and covariates through regression analysis, but also we want to identify the presence of subgroup of individuals in our data. We could then utilize tree-structured regression for this purpose. In this thesis, we developed statistical methods for competing risks data analysis through direct, indirect and parametric subdistribution modelling. Indirect model is employed via hazard of subdistribution. Evaluation of the performance of proposed methods is conducted through series of simulation studies as well as real data application. We developed four methods: 1) a method to categorize continuous covariate by considering the competing risks survival time outcome variables, called outcome-oriented categorization method, 2) a tree-structured competing risks regression to extract meaningful sub-groups of subjects determined by the value of covariates, 3) a hybrid model which boost the available subdistribution hazards regression by ugmenting it with tree-structured regression resulted from the previous step, 4) two kinds of parametric direct subdistribution model. These models are constructed based on non-mixture cure model. The first model is developed by taking into account the fraction of individuals who did not experience the event of interest in the long term. The second model is developed by reparameterizing the first model in order to mimic Gompertz distribution which allows no immune fraction. Research finding is as follows: 1) Method of outcome-oriented categorization based on deviance statistic is the best. The application of the method to contraceptive discontinuation data showed good result. 2) Regression tree for competing risks data can uncover the structure of data and yield the sub-group of individuals with a clear description based on their covariates. The application of the method to contraceptive discontinuation data showed good result. Extensive Monte Carlo simulation suggests the method has good performance in identifying the structure of data. 3) Application of the hybrid model to the contraceptive discontinuation data showed that the hybrid model is better than the available subdistribution regression in terms of AIC. 4) By using some well known kernel distribution, the parametric direct subdistribution models are developed. The maximum likelihood estimations are carried out simultaneously for all causes of event. In Bone Marrow Transplantation (BMT) data analysis, the first proposed model gave noticeably good fit to the nonparametric counterpart. The second proposed model is fitted to contraceptive discontinuation data and showed that Gompertz-like subdistribution with Gompertz kernel is the best fit. Mathematical analysis 2008 Thesis http://psasir.upm.edu.my/id/eprint/5418/ http://psasir.upm.edu.my/id/eprint/5418/1/IPM_2008_1.pdf application/pdf en public phd doctoral Universiti Putra Malaysia Mathematical analysis Institute Mathematical Research English |
institution |
Universiti Putra Malaysia |
collection |
PSAS Institutional Repository |
language |
English English |
topic |
Mathematical analysis |
spellingShingle |
Mathematical analysis Kudus, Abdul Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks |
description |
Traditionally, the regression analysis for competing risks survival
time is based on the cause-specific hazard that treat failures from
causes other than the cause of interest as censored observations.
That includes technique such as the Cox proportional hazard
model. The modelling of hazard rate may or may not match the
objective of investigator. It is often more desirable to investigate
the subdistribution function, because cause-specific hazard
doesn’t obviously give the information about proportion of
individuals experiencing a cause of interest. Furthermore, the
subdistribution and cause-specific hazard function are not
interchangeable. Thus, if we intended to draw inference from subdistribution function, then we must model on subdistribution
function directly or indirectly.
Sometimes, we do not only intend to investigate the relationship
between response and covariates through regression analysis, but
also we want to identify the presence of subgroup of individuals in
our data. We could then utilize tree-structured regression for this
purpose.
In this thesis, we developed statistical methods for competing risks
data analysis through direct, indirect and parametric
subdistribution modelling. Indirect model is employed via hazard
of subdistribution. Evaluation of the performance of proposed
methods is conducted through series of simulation studies as well
as real data application.
We developed four methods: 1) a method to categorize
continuous covariate by considering the competing risks
survival time outcome variables, called outcome-oriented
categorization method, 2) a tree-structured competing risks
regression to extract meaningful sub-groups of subjects
determined by the value of covariates, 3) a hybrid model which
boost the available subdistribution hazards regression by ugmenting it with tree-structured regression resulted from the
previous step, 4) two kinds of parametric direct subdistribution
model. These models are constructed based on non-mixture
cure model. The first model is developed by taking into account
the fraction of individuals who did not experience the event of
interest in the long term. The second model is developed by
reparameterizing the first model in order to mimic Gompertz
distribution which allows no immune fraction.
Research finding is as follows: 1) Method of outcome-oriented
categorization based on deviance statistic is the best. The
application of the method to contraceptive discontinuation data
showed good result. 2) Regression tree for competing risks data
can uncover the structure of data and yield the sub-group of
individuals with a clear description based on their covariates.
The application of the method to contraceptive discontinuation
data showed good result. Extensive Monte Carlo simulation
suggests the method has good performance in identifying the
structure of data. 3) Application of the hybrid model to the
contraceptive discontinuation data showed that the hybrid
model is better than the available subdistribution regression in
terms of AIC. 4) By using some well known kernel distribution,
the parametric direct subdistribution models are developed. The maximum likelihood estimations are carried out simultaneously
for all causes of event. In Bone Marrow Transplantation (BMT)
data analysis, the first proposed model gave noticeably good fit
to the nonparametric counterpart. The second proposed model
is fitted to contraceptive discontinuation data and showed that
Gompertz-like subdistribution with Gompertz kernel is the best
fit. |
format |
Thesis |
qualification_name |
Doctor of Philosophy (PhD.) |
qualification_level |
Doctorate |
author |
Kudus, Abdul |
author_facet |
Kudus, Abdul |
author_sort |
Kudus, Abdul |
title |
Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks |
title_short |
Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks |
title_full |
Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks |
title_fullStr |
Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks |
title_full_unstemmed |
Tree-Structured and Direct Parametric Regression Models for the Subdistribution of Competing Risks |
title_sort |
tree-structured and direct parametric regression models for the subdistribution of competing risks |
granting_institution |
Universiti Putra Malaysia |
granting_department |
Institute Mathematical Research |
publishDate |
2008 |
url |
http://psasir.upm.edu.my/id/eprint/5418/1/IPM_2008_1.pdf |
_version_ |
1747810420666662912 |