GROUP DIAGNOSTIC MEASURES OF DIFFERENT TYPES OF OUTLIERS IN MULTIPLE LINEAR REGRESSION MODEL
Main Article Content
Abstract
The topic of detection outliers is one of the crucial topics that have been of interest to researchers in many scientific fields. The presence of outliers in the dataset may lead to the breakdown of the estimator of the method in use. The statistical literature has shown that several types of outliers occur according to the type and nature of the data. Therefore, the researchers concentrated on identifying the type of outliers of statistical models by using two diagnostic procedures, individual and grouped. Unfortunately, the first procedure neglects the effect of the phenomenon of (masking and swamping). In contrast, the second procedure has not been able to eliminate this phenomenon ideally but rather reduce the rates of its appearance. This paper seeks to suggest improving one of the well-known group diagnostic methods (DRGP) by using an RMVN location and scale matrix instead of MVE to reduce the effect of (swamping). A newly proposed method denoted as DRGP(RMVN) is tested with a simulation study and real data. The results have shown that the performance of our proposed method is more efficient than (DRGP.MVE) to reduce the swamping points.
Downloads
Article Details
Licensee MJS, Universiti Malaya, Malaysia. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
References
A.H.M.R, Imon, Identifying multiple high leverage points in linear regression. Journal of Statistical Studies, Special Volume in Honour of Professor Mir Masoom Ali. 3(2002), 207–218.
Devlin, Susan J, Gnanadesikan, Ramanathan, & Kettenring, Jon R, Robust estimation of dispersion matrices and principal components, J J. AM. STAT. ASSOC., 76 (1981), 354-362.
F. R.Hampel, E. M. Ronchetti, P. Rousseeuw and W. A. Stahel, Robust Statistics,Wiley, New York,(1986).
H.Midi, N. Ramli, A.H.M.RImon, The performance of Diagnostic-Robust Generalized Potentials to identify multiple high leverage points in linear regression, J. APPL STAT. 36(2009): 507-520.
H. S. Uraibi, H. Midi, On Robust Bivariate and Multivariate Correlation Coefficient, Economic Computation & EconomicCybernetics Studies & Research, 53(2019), 2.
H. S. Uraibi, S. A. Alhussieny, Improvise Group Diagnostic Potential Measure for Multivariate Normal Data, Al-Qadisiyah Journal for Administrative and Economic Sciences, 23,(2021),2.
I-Cheng Yeh, "Modeling of strength of high performance concrete using artificial neural networks," Cement and Concrete Research, Vol. 28, No. 12, pp. 1797-1808 (1998).
Olive, David J., A resistant estimator of multivariate location and dispersion, Computational statistics & data analysis,,46,(2004), 93-102.
Olive, David J, & Hawkins, Douglas M., Robust multivariate location and dispersion. Preprint, (2010) (www. math.siu. Edu/olive/preprints. htm).
P.J. Huber, Robust Statistics, Wiley, New York, (1981).
P.j. Rousseeuw and A. M. Leroy, Robust Regression and Outlier Detection, Wiley, New York, (1987).
P. j. Rousseeuw and B. Van Zomeren, Unmasking multivariate outliers and leverage points, J. Am. STAT. ASSOC., 85(1990), 633-639
P. J., Rousseeuw, Least median of squares regression, J. AM. STAT. ASSOC., 79(1984), 871–880.
R.A. Maronna, R. D. Martin and V.J. Yohai, Robust Statistics Theory and Methods. New York: Willy and sons, (2006).