Comparing three methods of handling multicollinearity using simulation approach

In regression, the objective is to explain the variation in one or more response variables, by associating this variation with proportional variation in one or more explanatory variables. A frequent obstacle is that several of the explanatory variables will vary in rather similar ways. This phenomen...

Full description

Saved in:
Bibliographic Details
Main Author: Adnan, Norliza
Format: Thesis
Language:en
Published: 2006
Subjects:
Online Access:http://eprints.utm.my/5335/1/NorlizaAdnanMFS2006.pdf
http://eprints.utm.my/5335/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1845471352503402496
author Adnan, Norliza
author_facet Adnan, Norliza
author_sort Adnan, Norliza
building UTM Library
collection Institutional Repository
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
continent Asia
country Malaysia
description In regression, the objective is to explain the variation in one or more response variables, by associating this variation with proportional variation in one or more explanatory variables. A frequent obstacle is that several of the explanatory variables will vary in rather similar ways. This phenomenon called multicollinearity, is a common problem in regression analysis. Handling multicollinearity problem in regression analysis is important because least squares estimations assume that predictor variables are not correlated with each other. The performances of ridge regression (RR), principal component regression (PCR) and partial least squares regression (PLSR) in handling multicollinearity problem in simulated data sets are compared to help and give future researchers a comprehensive view about the best procedure to handle multicollinearity problems. PCR is a combination of principal component analysis (PCA) and ordinary least squares regression (OLS) while PLSR is an approach similar to PCR because a component that can be used to reduce the number of variables need to be constructed. RR on the other hand is the modified least square method that allows a biased but more precise estimator. The algorithm is described and for the purpose of comparing the three methods, simulated data sets where the number of cases was less than the number of observations were used. The goal was to develop a linear equation that relates all the predictor variables to a response variable. For comparison purposes, mean square errors (MSE) were calculated. A Monte Carlo simulation study was used to evaluate the effectiveness of these three procedure. The analysis including all simulations and calculations were done using statistical package S-Plus 2000 software.
format Thesis
id my.utm.eprints-5335
institution Universiti Teknologi Malaysia
language en
publishDate 2006
record_format eprints
spelling my.utm.eprints-53352018-03-07T20:57:32Z http://eprints.utm.my/5335/ Comparing three methods of handling multicollinearity using simulation approach Adnan, Norliza QA Mathematics In regression, the objective is to explain the variation in one or more response variables, by associating this variation with proportional variation in one or more explanatory variables. A frequent obstacle is that several of the explanatory variables will vary in rather similar ways. This phenomenon called multicollinearity, is a common problem in regression analysis. Handling multicollinearity problem in regression analysis is important because least squares estimations assume that predictor variables are not correlated with each other. The performances of ridge regression (RR), principal component regression (PCR) and partial least squares regression (PLSR) in handling multicollinearity problem in simulated data sets are compared to help and give future researchers a comprehensive view about the best procedure to handle multicollinearity problems. PCR is a combination of principal component analysis (PCA) and ordinary least squares regression (OLS) while PLSR is an approach similar to PCR because a component that can be used to reduce the number of variables need to be constructed. RR on the other hand is the modified least square method that allows a biased but more precise estimator. The algorithm is described and for the purpose of comparing the three methods, simulated data sets where the number of cases was less than the number of observations were used. The goal was to develop a linear equation that relates all the predictor variables to a response variable. For comparison purposes, mean square errors (MSE) were calculated. A Monte Carlo simulation study was used to evaluate the effectiveness of these three procedure. The analysis including all simulations and calculations were done using statistical package S-Plus 2000 software. 2006-05 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/5335/1/NorlizaAdnanMFS2006.pdf Adnan, Norliza (2006) Comparing three methods of handling multicollinearity using simulation approach. Masters thesis, Universiti Teknologi Malaysia, Faculty of Science.
spellingShingle QA Mathematics
Adnan, Norliza
Comparing three methods of handling multicollinearity using simulation approach
title Comparing three methods of handling multicollinearity using simulation approach
title_full Comparing three methods of handling multicollinearity using simulation approach
title_fullStr Comparing three methods of handling multicollinearity using simulation approach
title_full_unstemmed Comparing three methods of handling multicollinearity using simulation approach
title_short Comparing three methods of handling multicollinearity using simulation approach
title_sort comparing three methods of handling multicollinearity using simulation approach
topic QA Mathematics
url http://eprints.utm.my/5335/1/NorlizaAdnanMFS2006.pdf
http://eprints.utm.my/5335/
url_provider http://eprints.utm.my/