Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance
Hotelling T² control chart is an effective tool in statistical process control for multivariate environment. However, the performance of traditional Hotelling T² control chart using classical location and scatter estimators is usually marred by the masking and swamping effects. In order to alleviate...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | eng eng |
Published: |
2013
|
Subjects: | |
Online Access: | https://etd.uum.edu.my/3870/1/s91512.pdf https://etd.uum.edu.my/3870/7/s91512.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my-uum-etd.3870 |
---|---|
record_format |
uketd_dc |
institution |
Universiti Utara Malaysia |
collection |
UUM ETD |
language |
eng eng |
advisor |
Syed Yahaya, Sharipah Soaad Omar, Zurni |
topic |
QA273-280 Probabilities Mathematical statistics |
spellingShingle |
QA273-280 Probabilities Mathematical statistics Hazlina, Ali Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance |
description |
Hotelling T² control chart is an effective tool in statistical process control for multivariate environment. However, the performance of traditional Hotelling T² control chart using classical location and scatter estimators is usually marred by the masking and swamping effects. In order to alleviate the problem, robust estimators are recommended. The most popular and widely used robust estimator in the Hotelling T² control chart is the minimum covariance determinant (MCD). Recently, a new robust estimator known as minimum vector variance (MVV) was introduced. This estimator possesses high breakdown point, affine equivariance and is superior in terms of computational efficiency. Due to these nice properties, this study proposed to replace the classical estimators with the MVV location and scatter estimators in the construction of Hotelling T² control chart for individual observations in Phase II analysis. Nevertheless, some drawbacks such as inconsistency under normal distribution, biased for small sample size and low efficiency under high breakdown point were discovered. To improve the MVV estimators in terms of consistency and unbiasedness, the MVV scatter estimator was multiplied by consistency and correction factors respectively. To maintain the high breakdown point while having high statistical efficiency, a reweighted version of MVV estimator (RMVV) was proposed. Subsequently, the RMVV estimators were applied in the construction of Hotelling T² control chart. The new robust Hotelling T² chart produced positive impact in detecting outliers while simultaneously controlling false alarm rates. Apart from analysis of simulated data, analysis of real data also found that the new robust Hotelling T² chart was able to detect out of control observations better than the other charts investigated in this study. Based on the good performance on both simulated and real data analysis, the new robust Hotelling T² chart is a good alternative to the existing Hotelling T² charts. |
format |
Thesis |
qualification_name |
Ph.D. |
qualification_level |
Doctorate |
author |
Hazlina, Ali |
author_facet |
Hazlina, Ali |
author_sort |
Hazlina, Ali |
title |
Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance |
title_short |
Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance |
title_full |
Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance |
title_fullStr |
Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance |
title_full_unstemmed |
Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance |
title_sort |
efficient and highly robust hotelling t² control charts using reweighted mininum vector variance |
granting_institution |
Universiti Utara Malaysia |
granting_department |
Awang Had Salleh Graduate School of Arts & Sciences |
publishDate |
2013 |
url |
https://etd.uum.edu.my/3870/1/s91512.pdf https://etd.uum.edu.my/3870/7/s91512.pdf |
_version_ |
1747827657951674368 |
spelling |
my-uum-etd.38702022-10-06T07:06:04Z Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance 2013 Hazlina, Ali Syed Yahaya, Sharipah Soaad Omar, Zurni Awang Had Salleh Graduate School of Arts & Sciences Awang Had Salleh Graduate School of Arts and Sciences QA273-280 Probabilities. Mathematical statistics Hotelling T² control chart is an effective tool in statistical process control for multivariate environment. However, the performance of traditional Hotelling T² control chart using classical location and scatter estimators is usually marred by the masking and swamping effects. In order to alleviate the problem, robust estimators are recommended. The most popular and widely used robust estimator in the Hotelling T² control chart is the minimum covariance determinant (MCD). Recently, a new robust estimator known as minimum vector variance (MVV) was introduced. This estimator possesses high breakdown point, affine equivariance and is superior in terms of computational efficiency. Due to these nice properties, this study proposed to replace the classical estimators with the MVV location and scatter estimators in the construction of Hotelling T² control chart for individual observations in Phase II analysis. Nevertheless, some drawbacks such as inconsistency under normal distribution, biased for small sample size and low efficiency under high breakdown point were discovered. To improve the MVV estimators in terms of consistency and unbiasedness, the MVV scatter estimator was multiplied by consistency and correction factors respectively. To maintain the high breakdown point while having high statistical efficiency, a reweighted version of MVV estimator (RMVV) was proposed. Subsequently, the RMVV estimators were applied in the construction of Hotelling T² control chart. The new robust Hotelling T² chart produced positive impact in detecting outliers while simultaneously controlling false alarm rates. Apart from analysis of simulated data, analysis of real data also found that the new robust Hotelling T² chart was able to detect out of control observations better than the other charts investigated in this study. Based on the good performance on both simulated and real data analysis, the new robust Hotelling T² chart is a good alternative to the existing Hotelling T² charts. 2013 Thesis https://etd.uum.edu.my/3870/ https://etd.uum.edu.my/3870/1/s91512.pdf text eng public https://etd.uum.edu.my/3870/7/s91512.pdf text eng public Ph.D. doctoral Universiti Utara Malaysia Alfaro, J. L., & Ortega, J. F. (2009). A comparison of robust alternatives to Hotelling's T2 control chart. Journal of Applied Statistics, 36(12),1385-1396. Ali, H., Djauhari, M. A., & Syed-Yahaya, S. S. (2008). On the distribution of FMCD-based robust mahalanobis distance. Publish in proceeding of the 3rd International Conference on Mathematics and Statistics (ICoMS-3), Institut Pertanian Bogor, Indonesia, 134-1506. Alt, F. B. (1985). Multivariate quality control. Encyclopedia of Statistical Sciences (Vol. 6, Kotz S., Johnson N. L., eds.). New York: Wiley. Angiulli, F., & Pizzuti, C. (2005). Outlier mining and large high-dimensional data sets.IEEE Transaction Knowledge Data Engineering, 17(2), 203–215. Atkinson, A. C., & Mulira, H. M. (1993). The stalactite plot for the detection of multivariate outliers. Statistics and Computing,3, 27-35. Barnett, V., & Lewis, T. (1994). Outliers in Statistical Data. New York: John Wiley. Beckman, R., & Cook, R. (1983). Outlier.......s. Technometrics, 25, 119–149. Billor, N., Hadi, A. S., & Velleman, P. F. (2000). BACON: blocked adaptive computationally efficient outlier nominators. Computational Statistics and Data Analysis, 34, 279-298. Bradley, J. V. (1978). Robustness? British Journal of Mathematical and Statistical Psyschology, 31, 144-152. Butler, R. W., Davies, P. L., & Jhun, M. (1993).Asymptotics for the minimum covariance determinant estimator. The Annals of Statistics, 21, 1385-1400. Campbell, N. A.(1980). Robust procedure in multivariate analysis I robust covariance estimation. Applied Statistics, 29, 231-237. Cerioli, A., Riani, M., & Atkinson, A. C. (2008). Controlling the size of multivariate outlier tests with the MCD estimator of scatter. Statistics and Computing. Doi:10.1007/s11222-008-9096-5 Chang, S. C., & Bai, D. S. (2004). A multivariate T2 control chartc for skewed populations using weighted standard deviations. Qual. Reliab. Engng. Int., 20, 31-46. Chen, T. (2010). On reducing false alarms in multivariate statistical process control. Chemical Engineering Research and Design, 88 (4). 430-436. ISSN 0263-8762. Chenouri, S., Steiner, S. H., & Mulayath, A. (2009). A multivariate robust control chart for individual observations, Journal of Quality Technology, 41(3), 259-271. Chou, Y. M., Mason, R. L. & Young, J. C.(2001). The control chart for individual observations from a multivariate non-normal distribution. Communications in Statistics-Theory and Methods, 30(8), 1937-1949. Cleroux, R., & Ducharme, G. R. (1989). Vector correlation for elliptical distributions.Commun.Statist.Theor.Meth. 18(4):1441-1454. Croux, C., & Haesbroeck, G. (1999). Influence function and efficiency of the minimum covariance determinant scattermatrix estimator, Journal of Multivariate Analysis 71, 161-190. Croux, C., & Rousseeuw, P. J.(1992). A class of high-breakdown scale estimators based on subranges. Communication statistics–Theory meth., 21(7), 1935-1951. Davies, P. L. (1987). Asymptotic behavior of S-estimates of multivariate location parameters and dispersion matrices. The Annals of Statistics, 15,1269-1292. Davies, P. L., & Gather, U. (1993). The identification of multiple outliers. Journal of the American Statistical Association, 88, 782-792. Djauhari, M. A., Mashuri, M., & Herwindiati, D. E. (2008). Multivariate process variability monitoring, Communications in Statistics-Theory and Methods, 37(11), 1742-1754. Djauhari, M. A. (2007). A measure of multivariate data concentration. Journal of Applied Probability and Statistics 2, 139-155. Djauhari, M. A., Adnan, R., Lee, M. H., & Ali, H. An Equivalent Objective Function of Fast MCD. unpublished manuscript. Fauconnier, C., & Haesbroeck, G. (2009). Outliers detection with the minimum covariance determinant estimator in practice. Statistical Methodology, 6(4), 363-379. Ferrel, O. C. & Hartline, M. D. (2008). Marketing Strategy (4rd ed.).South Western: Edition Thomson Learning Inc. Garrett, R. G. (1989). The chi-square plot: A tool for multivariate outlier recognition. Journal of Geochemical Exploration, 32, 319-341. Gather, U., & Becker, C. (1997). Convergence Rates in Multivariate Robust Outlier Identification. In: Mathematics 34, 101-107. Grubel, R., & Rocke, D. M. (1990). On the cumulants of affine equivariant estimators in elliptical families. Journal of Multivariate Analysis 35, 203- 222. doi:10.1016/0047-259X(90)90025-D Guo, Jiin-Huarng and Luh, Wei-Ming, (2000). An invertible transformation two-sample trimmed t-statistic under heterogeneity and nonnormality. Statistic & Probability letters. 49: 1-7. Hadi, A. S. (1992). Identifying multivariate outlier in multivariate data. Journal of Royal Statistical Society B,53, 761-771. Hample, F. R. (1985). The breakdown points of the mean combined with some rejection rules. Technometrics, 27, 95-107. Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J., & Stahel, W. A. (1986). Robust statistics: The approach based on influence functions. New York: Wiley. Hampel, F. R. (1971). A general qualitative definition of robustness. Annals of MathematicsStatistics, 42, 1887-1896. Hampel, F. R. (1974). The influence curve and its role in robust estimation. Journal of theAmerican Statistical Association, 69, 382-393. Hawkins, D. (1980). Identification of Outliers. Chapman and Hall. Hawkins, D.M. (1994). The feasible solution algorithm for the minimum covariance determinant estimator in multivariate data. Computational Statistics and Data Analysis, 17, 197-210. Hawkins, D. M., & Olive, D. J. (1999) . Improved feasible solution algorithm for high breakdown estimation. Computational Statistics and Data Analysis, 30, 1-11. Herwindiati, D. E. (2006). A new criterion in robust estimation for location and covariance matrix, and its application for outlier labeling. Unpublished Ph.D thesis, Institut Teknologi Bandung. Herwindiati, D. E., Djauhari, M. A., & Mashuri, M. (2007). Robust multivariate outlier labeling. Communication in Statistics-Computation and Simulation, 36: 1287-1294. Hotelling, H. (1947). Techniques of Statistical Analysis. InC. Eisenhart, M.W. Hastay, & W.A. Wallis, Multivariate quality control(pp. 111-184). New York: McGraw-Hill. Hubert, M., Rousseeuw, P., & Branden, K. V. (2005). Robpca: A new approach to robust principal components analysis. Technometrics, 47, 64-79. Huber, P. J. (1964). Robust estimation of a location parameter. Ann. Math.Statist, 35, 73-101. Huber, P. J. (1977). Robust covariances. In Statistical Decision Theory and Related Topics (Eds. S. S. Gupta and D. S. Moore). Academic Press, NewYork. 165-191. Jackson, J. E. (1985). Multivariate quality control. Communications in statistics: Theory and Methods,11, 2657-2688. Jensen,W. A., Birch, J. B. & Woodall, W. H. (2007). High breakdown estimation methods for Phase I multivariate control charts, Qual. Reliab. Eng. Int. 23, 615–629. Johnson, R. (1992). Applied Multivariate Statistical Analysis. Prentice Hall. Khoo, M. B. C. & Quah, S. H. (2003). Multivariate control chart for process dispersion based on individual observations. Quality Engineering, 15, 639-642. Khoo, M. B. C. & Quah, S. H. (2004). Alternatives to multivariate control chart for process dispersion. Quality Engineering, 16, 423-435. Lopuhaä, H. P., & Rousseeuw, P. J. (1991). Breakdown points of affine equivariance estimators of multivariate location and covariance matrices. Annal of Statistics, 19, 229-248. Lopuhaä, H. P. (1999). Asymptotics of reweighted estimators of multivariate location and scatter. Ann. Statist., 27, 1638-1665. Lowry, C. A. & Montgomery, D. C. (1995). A review of multivariate control charts. IIE Transactions, 27, 800-810. Levinson, W. A., Holmes, D. S., & Mergen, A. E. (2002). Variation charts for multivariate processes. Quality Engineering, 14, 539-545. Mardia, K. V., Kent, J. T., & Bibby, J. M. (2000). Multivariate Analysis (7rd ed.). London: Academic Press. Maronna, R. A., Stahel, W. A., & Yohai, V. J. (1992). Bias-Robust Estimators of Multivariate Scatter Based on Projections. Journal of Multivariate Analysis, 42, 141-161. Maronna, R. A., & Zamar, R. (2002). Robust estimation of location and dispersion for high- dimensional datasets. Technometrics, 44, 307-317. Maronna, R.A., Martin, R. D, & Yohai, V. J. (2006)Robust Statistics: Theory and Methods. New York, NY: John Wiley & Sons. Mason, R. L., Champ, C. W., Tracy, N. D., Wierda, S. J. & Young, J. C. (1997) Assessment of multivariate process control techniques. Journal of Quality Technology, 29 (2), 140-143. Mason, R. L., & Young, J. C. (2002). Multivariate Statistical Process Control with Industrial Applications. Philadelphia: ASA-SIAM. Montgomery, D.C. (2005)Introduction to Statistical Quality Control (5rd ed.). New York: Wiley. Pena, D., & Prieto, J. F. (2001). Multivariate outlier detection and robust covariance matrix estimation. Technometrics, 3, 286-322. Pena, D., & Rodriguez, J. (2003). Descriptive measures of multivariate scatter and linear dependence. Journal of multivariate analysis, 85, 361-374. Pison, G., van Aelst, S., & Willems, G. (2002). Small sample corrections for LTS and MCD. Metrika, 55, 111- 123. Pison, G., & van Aelst, S. (2004). Diagnostic plots for robust multivariate methods. J. Comput. Graph. Stat., 13, 310–329. Prins, J., & Mader, D. (1997). Multivariate control charts for grouped and individual observations. Qual. Eng., 10(1), 49-57. Quesenberry, C. P. (2001). The multivariate short-run snapshot q chart. Quality Engineering, 13, 679-683. Ramaker, H., van Sprang, E. N. M., Westerhuis, J. A., & Smilde, A. K. (2004).The effect of the size of the training set and number of principal components on the false alarm rate in statistical process monitoring. Chemometrics and Intelligent Laboratory Systems, 73, 181-187. Rocke, D., and Woodruff, D. (1996). Identification of outliers in multivariate data. Journal of the American Statistical Association, 91, 1047-1061. Roelant, E., van Aelst, S., and Willems, G. (2009). The minimum weighted covariance determinant estimator. Metrika, 70(2), 177-204. doi: 10.1007/500184-008-0186-3. Rousseeuw, P. J., van Driessen, K., van Aelst, S., & Agullo , J. (2004). Robustmultivariateregression.Technometrics, 46(3).doi:10.1198/004017004000000329. Rousseeuw, P. J. (1984).Least median of squares regression. Journal of the American Statistical Association, 79,871-880. Rousseeuw, P.J. (1985). Mathematical Statistics and Applications, B. In W. Grossman, G. Pflug, I. Vincze, & W. Wertz,Multivariate estimation with high breakdown point. (pp. 283-297). D. Reidel Publishing Company. Rousseeuw, P.J. (1994). Unconventional features of positive-breakdown estimators, Statist. Probab. Lett., 19, 417-431. Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection. New York: John Wiley. Rousseeuw, P. J., & van Driessen, K. (1999). A Fast algorithm for the minimum covariance determinant estimator.Technometrics, 41, 212-223. Rousseeuw, P. J., and van Zomeren, B. C. (1990). Unmasking multivariate outliers and leverage points. Journal of American Statistical Association, 85(411), 633-639. Ryan, T.P. (1989) Statistical methods for quality improvement. New York: John Wiley & Sons. Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics. New York: John Wiley. Sullivan, J. H., & Woodall, W. H. (1996). A comparison of multivariate control charts for individual measurements. J. Qual. Technol, 28 (4), 398-408. Sullivan, J. H., & Woodall, W. H. (1998). Adapting control charts for the preliminary analysis of multivariate observations, Commun. Stat. Simulation Comput. 27, 953-979. Tracy, N. D., Young, J. C. & Mason, R. L. (1992). Multivariate control charts for individual observations. Journal of Quality Technology, 24, 88-95. Vargas, J. A. (2003). Robust estimation in multivariate control charts for individual observation, J. Qual. Technol. 35, 367-376. Wierda, S. J. (1994). Multivariate statistical process control-recent results and directions for future research, Statistica Neerlandica, 48,147-168. Willems, G., Pison, G., Rousseeuw, P. J. & van Alest, S. (2002). A robust Hotelling test. Metrika, 55, 125-138. Williams, J. D., Woodall, W. H., Birch, J. B. & Sullivan, J. H. (2006). Distribution of Hotelling's T2 statistic based on the successive differences estimator, Journal of Quality Technology, 38(3), 217-229. Woodruff, D. L., Rocke, D. M. (1994). Computable robust estimation of multivariate location and shape in high dimension using compound estimators. Journal of the American Statistical Association, 89, 888-896 Woodall, W. H., Spitzner, D. J., Montgomery, D. C., & Gupta, S. (2004). Using control charts to monitor process and product profiles. Journal of Quality Technology, 36, 309-320. Woodall, W. H., & Montgomery, D. C. (1999). Research issues and ideas in statistical processcontrol. Journal of Quality Technology, 31(4), 376. Xu, J. (2003). Multivariate Outlier detection and process monitoring. Unpublished Ph.D dissertation, University of Waterloo, Canada. Ye, N., Emran, S. M., Chen, Q. & Vilbert, S. (2002). Multivariate statistical analysis of audit trails for host-based intrusion detection. IEEE transactions on computers, 51(7), 810-820. Zuo, Y. (2006). The Frontiers in Statistics. In Peter Bickel on his 65th Birthday, Robust location and scatter estimators in multivariate analysis. Imperial College Press. |