Identification of multiple outliers in a generalized linear model with continuous variables

In the statistical analysis of data, a model might be awfully fitted with the presence of outliers. Besides, it has been well established to use residuals for identification of outliers. The asymptotic properties of residuals can be utilized to contribute diagnostic tools. However, it is now evident...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: Loo, Yee Peng, Midi, Habshah, Rana, Md. Sohel, Fitrianto, Anwar
التنسيق: مقال
اللغة:English
منشور في: Hindawi Publishing Corporation 2016
الوصول للمادة أونلاين:http://psasir.upm.edu.my/id/eprint/54484/1/Identification%20of%20multiple%20outliers%20in%20a%20generalized%20linear%20model%20with%20continuous%20variables.pdf
http://psasir.upm.edu.my/id/eprint/54484/
https://www.hindawi.com/journals/mpe/2016/5840523/abs/
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
الوصف
الملخص:In the statistical analysis of data, a model might be awfully fitted with the presence of outliers. Besides, it has been well established to use residuals for identification of outliers. The asymptotic properties of residuals can be utilized to contribute diagnostic tools. However, it is now evident that most of the existing diagnostic methods have failed in identifying multiple outliers. Therefore, this paper proposed a diagnostic method for the identification of multiple outliers in GLM, where traditionally used outlier detection methods are effortless as they undergo masking or swamping dilemma. Hence, an investigation was carried out to determine the capability of the proposed GSCPR method. The findings obtained from the numerical examples indicated that the performance of the proposed method was satisfactory for the identification of multiple outliers. Meanwhile, in the simulation study, two scenarios were considered to assess the validity of the proposed method. The proposed method consistently displayed higher percentage of correct detection, as well as lower rates of swamping and masking, regardless of the sample size and the contamination levels.