Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance

Hotelling T² control chart is an effective tool in statistical process control for multivariate environment. However, the performance of traditional Hotelling T² control chart using classical location and scatter estimators is usually marred by the masking and swamping effects. In order to alleviate...

Full description

Saved in:
Bibliographic Details
Main Author: Hazlina, Ali
Format: Thesis
Language:eng
eng
Published: 2013
Subjects:
Online Access:https://etd.uum.edu.my/3870/1/s91512.pdf
https://etd.uum.edu.my/3870/7/s91512.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my-uum-etd.3870
record_format uketd_dc
institution Universiti Utara Malaysia
collection UUM ETD
language eng
eng
advisor Syed Yahaya, Sharipah Soaad
Omar, Zurni
topic QA273-280 Probabilities
Mathematical statistics
spellingShingle QA273-280 Probabilities
Mathematical statistics
Hazlina, Ali
Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance
description Hotelling T² control chart is an effective tool in statistical process control for multivariate environment. However, the performance of traditional Hotelling T² control chart using classical location and scatter estimators is usually marred by the masking and swamping effects. In order to alleviate the problem, robust estimators are recommended. The most popular and widely used robust estimator in the Hotelling T² control chart is the minimum covariance determinant (MCD). Recently, a new robust estimator known as minimum vector variance (MVV) was introduced. This estimator possesses high breakdown point, affine equivariance and is superior in terms of computational efficiency. Due to these nice properties, this study proposed to replace the classical estimators with the MVV location and scatter estimators in the construction of Hotelling T² control chart for individual observations in Phase II analysis. Nevertheless, some drawbacks such as inconsistency under normal distribution, biased for small sample size and low efficiency under high breakdown point were discovered. To improve the MVV estimators in terms of consistency and unbiasedness, the MVV scatter estimator was multiplied by consistency and correction factors respectively. To maintain the high breakdown point while having high statistical efficiency, a reweighted version of MVV estimator (RMVV) was proposed. Subsequently, the RMVV estimators were applied in the construction of Hotelling T² control chart. The new robust Hotelling T² chart produced positive impact in detecting outliers while simultaneously controlling false alarm rates. Apart from analysis of simulated data, analysis of real data also found that the new robust Hotelling T² chart was able to detect out of control observations better than the other charts investigated in this study. Based on the good performance on both simulated and real data analysis, the new robust Hotelling T² chart is a good alternative to the existing Hotelling T² charts.
format Thesis
qualification_name Ph.D.
qualification_level Doctorate
author Hazlina, Ali
author_facet Hazlina, Ali
author_sort Hazlina, Ali
title Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance
title_short Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance
title_full Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance
title_fullStr Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance
title_full_unstemmed Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance
title_sort efficient and highly robust hotelling t² control charts using reweighted mininum vector variance
granting_institution Universiti Utara Malaysia
granting_department Awang Had Salleh Graduate School of Arts & Sciences
publishDate 2013
url https://etd.uum.edu.my/3870/1/s91512.pdf
https://etd.uum.edu.my/3870/7/s91512.pdf
_version_ 1747827657951674368
spelling my-uum-etd.38702022-10-06T07:06:04Z Efficient and Highly Robust Hotelling T² Control Charts Using Reweighted Mininum Vector Variance 2013 Hazlina, Ali Syed Yahaya, Sharipah Soaad Omar, Zurni Awang Had Salleh Graduate School of Arts & Sciences Awang Had Salleh Graduate School of Arts and Sciences QA273-280 Probabilities. Mathematical statistics Hotelling T² control chart is an effective tool in statistical process control for multivariate environment. However, the performance of traditional Hotelling T² control chart using classical location and scatter estimators is usually marred by the masking and swamping effects. In order to alleviate the problem, robust estimators are recommended. The most popular and widely used robust estimator in the Hotelling T² control chart is the minimum covariance determinant (MCD). Recently, a new robust estimator known as minimum vector variance (MVV) was introduced. This estimator possesses high breakdown point, affine equivariance and is superior in terms of computational efficiency. Due to these nice properties, this study proposed to replace the classical estimators with the MVV location and scatter estimators in the construction of Hotelling T² control chart for individual observations in Phase II analysis. Nevertheless, some drawbacks such as inconsistency under normal distribution, biased for small sample size and low efficiency under high breakdown point were discovered. To improve the MVV estimators in terms of consistency and unbiasedness, the MVV scatter estimator was multiplied by consistency and correction factors respectively. To maintain the high breakdown point while having high statistical efficiency, a reweighted version of MVV estimator (RMVV) was proposed. Subsequently, the RMVV estimators were applied in the construction of Hotelling T² control chart. The new robust Hotelling T² chart produced positive impact in detecting outliers while simultaneously controlling false alarm rates. Apart from analysis of simulated data, analysis of real data also found that the new robust Hotelling T² chart was able to detect out of control observations better than the other charts investigated in this study. Based on the good performance on both simulated and real data analysis, the new robust Hotelling T² chart is a good alternative to the existing Hotelling T² charts. 2013 Thesis https://etd.uum.edu.my/3870/ https://etd.uum.edu.my/3870/1/s91512.pdf text eng public https://etd.uum.edu.my/3870/7/s91512.pdf text eng public Ph.D. doctoral Universiti Utara Malaysia Alfaro, J. L., & Ortega, J. F. (2009). A comparison of robust alternatives to Hotelling's T2 control chart. Journal of Applied Statistics, 36(12),1385-1396. Ali, H., Djauhari, M. A., & Syed-Yahaya, S. S. (2008). On the distribution of FMCD-based robust mahalanobis distance. Publish in proceeding of the 3rd International Conference on Mathematics and Statistics (ICoMS-3), Institut Pertanian Bogor, Indonesia, 134-1506. Alt, F. B. (1985). Multivariate quality control. Encyclopedia of Statistical Sciences (Vol. 6, Kotz S., Johnson N. L., eds.). New York: Wiley. Angiulli, F., & Pizzuti, C. (2005). Outlier mining and large high-dimensional data sets.IEEE Transaction Knowledge Data Engineering, 17(2), 203–215. Atkinson, A. C., & Mulira, H. M. (1993). The stalactite plot for the detection of multivariate outliers. Statistics and Computing,3, 27-35. Barnett, V., & Lewis, T. (1994). Outliers in Statistical Data. New York: John Wiley. Beckman, R., & Cook, R. (1983). Outlier.......s. Technometrics, 25, 119–149. Billor, N., Hadi, A. S., & Velleman, P. F. (2000). BACON: blocked adaptive computationally efficient outlier nominators. Computational Statistics and Data Analysis, 34, 279-298. Bradley, J. V. (1978). Robustness? British Journal of Mathematical and Statistical Psyschology, 31, 144-152. Butler, R. W., Davies, P. L., & Jhun, M. (1993).Asymptotics for the minimum covariance determinant estimator. The Annals of Statistics, 21, 1385-1400. Campbell, N. A.(1980). Robust procedure in multivariate analysis I robust covariance estimation. Applied Statistics, 29, 231-237. Cerioli, A., Riani, M., & Atkinson, A. C. (2008). Controlling the size of multivariate outlier tests with the MCD estimator of scatter. Statistics and Computing. Doi:10.1007/s11222-008-9096-5 Chang, S. C., & Bai, D. S. (2004). A multivariate T2 control chartc for skewed populations using weighted standard deviations. Qual. Reliab. Engng. Int., 20, 31-46. Chen, T. (2010). On reducing false alarms in multivariate statistical process control. Chemical Engineering Research and Design, 88 (4). 430-436. ISSN 0263-8762. Chenouri, S., Steiner, S. H., & Mulayath, A. (2009). A multivariate robust control chart for individual observations, Journal of Quality Technology, 41(3), 259-271. Chou, Y. M., Mason, R. L. & Young, J. C.(2001). The control chart for individual observations from a multivariate non-normal distribution. Communications in Statistics-Theory and Methods, 30(8), 1937-1949. Cleroux, R., & Ducharme, G. R. (1989). Vector correlation for elliptical distributions.Commun.Statist.Theor.Meth. 18(4):1441-1454. Croux, C., & Haesbroeck, G. (1999). Influence function and efficiency of the minimum covariance determinant scattermatrix estimator, Journal of Multivariate Analysis 71, 161-190. Croux, C., & Rousseeuw, P. J.(1992). A class of high-breakdown scale estimators based on subranges. Communication statistics–Theory meth., 21(7), 1935-1951. Davies, P. L. (1987). Asymptotic behavior of S-estimates of multivariate location parameters and dispersion matrices. The Annals of Statistics, 15,1269-1292. Davies, P. L., & Gather, U. (1993). The identification of multiple outliers. Journal of the American Statistical Association, 88, 782-792. Djauhari, M. A., Mashuri, M., & Herwindiati, D. E. (2008). Multivariate process variability monitoring, Communications in Statistics-Theory and Methods, 37(11), 1742-1754. Djauhari, M. A. (2007). A measure of multivariate data concentration. Journal of Applied Probability and Statistics 2, 139-155. Djauhari, M. A., Adnan, R., Lee, M. H., & Ali, H. An Equivalent Objective Function of Fast MCD. unpublished manuscript. Fauconnier, C., & Haesbroeck, G. (2009). Outliers detection with the minimum covariance determinant estimator in practice. Statistical Methodology, 6(4), 363-379. Ferrel, O. C. & Hartline, M. D. (2008). Marketing Strategy (4rd ed.).South Western: Edition Thomson Learning Inc. Garrett, R. G. (1989). The chi-square plot: A tool for multivariate outlier recognition. Journal of Geochemical Exploration, 32, 319-341. Gather, U., & Becker, C. (1997). Convergence Rates in Multivariate Robust Outlier Identification. In: Mathematics 34, 101-107. Grubel, R., & Rocke, D. M. (1990). On the cumulants of affine equivariant estimators in elliptical families. Journal of Multivariate Analysis 35, 203- 222. doi:10.1016/0047-259X(90)90025-D Guo, Jiin-Huarng and Luh, Wei-Ming, (2000). An invertible transformation two-sample trimmed t-statistic under heterogeneity and nonnormality. Statistic & Probability letters. 49: 1-7. Hadi, A. S. (1992). Identifying multivariate outlier in multivariate data. Journal of Royal Statistical Society B,53, 761-771. Hample, F. R. (1985). The breakdown points of the mean combined with some rejection rules. Technometrics, 27, 95-107. Hampel, F. R., Ronchetti, E. M., Rousseeuw, P. J., & Stahel, W. A. (1986). Robust statistics: The approach based on influence functions. New York: Wiley. Hampel, F. R. (1971). A general qualitative definition of robustness. Annals of MathematicsStatistics, 42, 1887-1896. Hampel, F. R. (1974). The influence curve and its role in robust estimation. Journal of theAmerican Statistical Association, 69, 382-393. Hawkins, D. (1980). Identification of Outliers. Chapman and Hall. Hawkins, D.M. (1994). The feasible solution algorithm for the minimum covariance determinant estimator in multivariate data. Computational Statistics and Data Analysis, 17, 197-210. Hawkins, D. M., & Olive, D. J. (1999) . Improved feasible solution algorithm for high breakdown estimation. Computational Statistics and Data Analysis, 30, 1-11. Herwindiati, D. E. (2006). A new criterion in robust estimation for location and covariance matrix, and its application for outlier labeling. Unpublished Ph.D thesis, Institut Teknologi Bandung. Herwindiati, D. E., Djauhari, M. A., & Mashuri, M. (2007). Robust multivariate outlier labeling. Communication in Statistics-Computation and Simulation, 36: 1287-1294. Hotelling, H. (1947). Techniques of Statistical Analysis. InC. Eisenhart, M.W. Hastay, & W.A. Wallis, Multivariate quality control(pp. 111-184). New York: McGraw-Hill. Hubert, M., Rousseeuw, P., & Branden, K. V. (2005). Robpca: A new approach to robust principal components analysis. Technometrics, 47, 64-79. Huber, P. J. (1964). Robust estimation of a location parameter. Ann. Math.Statist, 35, 73-101. Huber, P. J. (1977). Robust covariances. In Statistical Decision Theory and Related Topics (Eds. S. S. Gupta and D. S. Moore). Academic Press, NewYork. 165-191. Jackson, J. E. (1985). Multivariate quality control. Communications in statistics: Theory and Methods,11, 2657-2688. Jensen,W. A., Birch, J. B. & Woodall, W. H. (2007). High breakdown estimation methods for Phase I multivariate control charts, Qual. Reliab. Eng. Int. 23, 615–629. Johnson, R. (1992). Applied Multivariate Statistical Analysis. Prentice Hall. Khoo, M. B. C. & Quah, S. H. (2003). Multivariate control chart for process dispersion based on individual observations. Quality Engineering, 15, 639-642. Khoo, M. B. C. & Quah, S. H. (2004). Alternatives to multivariate control chart for process dispersion. Quality Engineering, 16, 423-435. Lopuhaä, H. P., & Rousseeuw, P. J. (1991). Breakdown points of affine equivariance estimators of multivariate location and covariance matrices. Annal of Statistics, 19, 229-248. Lopuhaä, H. P. (1999). Asymptotics of reweighted estimators of multivariate location and scatter. Ann. Statist., 27, 1638-1665. Lowry, C. A. & Montgomery, D. C. (1995). A review of multivariate control charts. IIE Transactions, 27, 800-810. Levinson, W. A., Holmes, D. S., & Mergen, A. E. (2002). Variation charts for multivariate processes. Quality Engineering, 14, 539-545. Mardia, K. V., Kent, J. T., & Bibby, J. M. (2000). Multivariate Analysis (7rd ed.). London: Academic Press. Maronna, R. A., Stahel, W. A., & Yohai, V. J. (1992). Bias-Robust Estimators of Multivariate Scatter Based on Projections. Journal of Multivariate Analysis, 42, 141-161. Maronna, R. A., & Zamar, R. (2002). Robust estimation of location and dispersion for high- dimensional datasets. Technometrics, 44, 307-317. Maronna, R.A., Martin, R. D, & Yohai, V. J. (2006)Robust Statistics: Theory and Methods. New York, NY: John Wiley & Sons. Mason, R. L., Champ, C. W., Tracy, N. D., Wierda, S. J. & Young, J. C. (1997) Assessment of multivariate process control techniques. Journal of Quality Technology, 29 (2), 140-143. Mason, R. L., & Young, J. C. (2002). Multivariate Statistical Process Control with Industrial Applications. Philadelphia: ASA-SIAM. Montgomery, D.C. (2005)Introduction to Statistical Quality Control (5rd ed.). New York: Wiley. Pena, D., & Prieto, J. F. (2001). Multivariate outlier detection and robust covariance matrix estimation. Technometrics, 3, 286-322. Pena, D., & Rodriguez, J. (2003). Descriptive measures of multivariate scatter and linear dependence. Journal of multivariate analysis, 85, 361-374. Pison, G., van Aelst, S., & Willems, G. (2002). Small sample corrections for LTS and MCD. Metrika, 55, 111- 123. Pison, G., & van Aelst, S. (2004). Diagnostic plots for robust multivariate methods. J. Comput. Graph. Stat., 13, 310–329. Prins, J., & Mader, D. (1997). Multivariate control charts for grouped and individual observations. Qual. Eng., 10(1), 49-57. Quesenberry, C. P. (2001). The multivariate short-run snapshot q chart. Quality Engineering, 13, 679-683. Ramaker, H., van Sprang, E. N. M., Westerhuis, J. A., & Smilde, A. K. (2004).The effect of the size of the training set and number of principal components on the false alarm rate in statistical process monitoring. Chemometrics and Intelligent Laboratory Systems, 73, 181-187. Rocke, D., and Woodruff, D. (1996). Identification of outliers in multivariate data. Journal of the American Statistical Association, 91, 1047-1061. Roelant, E., van Aelst, S., and Willems, G. (2009). The minimum weighted covariance determinant estimator. Metrika, 70(2), 177-204. doi: 10.1007/500184-008-0186-3. Rousseeuw, P. J., van Driessen, K., van Aelst, S., & Agullo , J. (2004). Robustmultivariateregression.Technometrics, 46(3).doi:10.1198/004017004000000329. Rousseeuw, P. J. (1984).Least median of squares regression. Journal of the American Statistical Association, 79,871-880. Rousseeuw, P.J. (1985). Mathematical Statistics and Applications, B. In W. Grossman, G. Pflug, I. Vincze, & W. Wertz,Multivariate estimation with high breakdown point. (pp. 283-297). D. Reidel Publishing Company. Rousseeuw, P.J. (1994). Unconventional features of positive-breakdown estimators, Statist. Probab. Lett., 19, 417-431. Rousseeuw, P. J., & Leroy, A. M. (1987). Robust regression and outlier detection. New York: John Wiley. Rousseeuw, P. J., & van Driessen, K. (1999). A Fast algorithm for the minimum covariance determinant estimator.Technometrics, 41, 212-223. Rousseeuw, P. J., and van Zomeren, B. C. (1990). Unmasking multivariate outliers and leverage points. Journal of American Statistical Association, 85(411), 633-639. Ryan, T.P. (1989) Statistical methods for quality improvement. New York: John Wiley & Sons. Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics. New York: John Wiley. Sullivan, J. H., & Woodall, W. H. (1996). A comparison of multivariate control charts for individual measurements. J. Qual. Technol, 28 (4), 398-408. Sullivan, J. H., & Woodall, W. H. (1998). Adapting control charts for the preliminary analysis of multivariate observations, Commun. Stat. Simulation Comput. 27, 953-979. Tracy, N. D., Young, J. C. & Mason, R. L. (1992). Multivariate control charts for individual observations. Journal of Quality Technology, 24, 88-95. Vargas, J. A. (2003). Robust estimation in multivariate control charts for individual observation, J. Qual. Technol. 35, 367-376. Wierda, S. J. (1994). Multivariate statistical process control-recent results and directions for future research, Statistica Neerlandica, 48,147-168. Willems, G., Pison, G., Rousseeuw, P. J. & van Alest, S. (2002). A robust Hotelling test. Metrika, 55, 125-138. Williams, J. D., Woodall, W. H., Birch, J. B. & Sullivan, J. H. (2006). Distribution of Hotelling's T2 statistic based on the successive differences estimator, Journal of Quality Technology, 38(3), 217-229. Woodruff, D. L., Rocke, D. M. (1994). Computable robust estimation of multivariate location and shape in high dimension using compound estimators. Journal of the American Statistical Association, 89, 888-896 Woodall, W. H., Spitzner, D. J., Montgomery, D. C., & Gupta, S. (2004). Using control charts to monitor process and product profiles. Journal of Quality Technology, 36, 309-320. Woodall, W. H., & Montgomery, D. C. (1999). Research issues and ideas in statistical processcontrol. Journal of Quality Technology, 31(4), 376. Xu, J. (2003). Multivariate Outlier detection and process monitoring. Unpublished Ph.D dissertation, University of Waterloo, Canada. Ye, N., Emran, S. M., Chen, Q. & Vilbert, S. (2002). Multivariate statistical analysis of audit trails for host-based intrusion detection. IEEE transactions on computers, 51(7), 810-820. Zuo, Y. (2006). The Frontiers in Statistics. In Peter Bickel on his 65th Birthday, Robust location and scatter estimators in multivariate analysis. Imperial College Press.