Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE

Accurate house price prediction is crucial for buyers, investors, and policymakers to make informed decisions. However, real estate datasets often contain high-dimensional features, including redundant and irrelevant attributes, which can negatively impact model performance. This study proposes a hy...

Full description

Saved in:
Bibliographic Details
Main Author: Low, Jun Liang
Format: Final Year Project / Dissertation / Thesis
Published: 2025
Subjects:
Online Access:http://eprints.utar.edu.my/6142/1/Final_Year_Project__Low_Jun_Liang_%2D_JUN_LIANG_LOW.pdf
http://eprints.utar.edu.my/6142/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1848452686805991424
author Low, Jun Liang
author_facet Low, Jun Liang
author_sort Low, Jun Liang
building UTAR Library
collection Institutional Repository
content_provider Universiti Tunku Abdul Rahman
content_source UTAR Institutional Repository
continent Asia
country Malaysia
description Accurate house price prediction is crucial for buyers, investors, and policymakers to make informed decisions. However, real estate datasets often contain high-dimensional features, including redundant and irrelevant attributes, which can negatively impact model performance. This study proposes a hybrid feature selection approach that combines Information Gain (IG) and Support Vector Machine Recursive Feature Elimination to enhance predictive accuracy. The proposed hybrid method significantly improves model performance, achieving a 22.2% reduction in Root Mean Squared Error (RMSE) (from 185,518.52 to 154,403.70) and a 22.7% increase in R-squared (from 0.6522 to 0.8008) compared to using IG alone. While IG is effective in ranking features based on their relevance to the target variable, it does not account for feature interactions and redundancy, which can lead to suboptimal feature selection. The addition of SVM-RFE addresses this limitation by iteratively refining the feature set, ensuring only the most informative attributes are retained. Furthermore, the hybrid approach demonstrated robustness even in the presence of artificially introduced noise. Hyperparameter tuning further optimized the best-performing model, yielding marginal improvements in accuracy. These findings highlight the effectiveness of combining filter and wrapper methods for real estate price prediction, demonstrating that hybrid feature selection leads to more reliable and interpretable models.
format Final Year Project / Dissertation / Thesis
id my-utar-eprints.6142
institution Universiti Tunku Abdul Rahman
publishDate 2025
record_format eprints
spelling my-utar-eprints.61422025-11-05T12:32:12Z Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE Low, Jun Liang H Social Sciences (General) Q Science (General) T Technology (General) Accurate house price prediction is crucial for buyers, investors, and policymakers to make informed decisions. However, real estate datasets often contain high-dimensional features, including redundant and irrelevant attributes, which can negatively impact model performance. This study proposes a hybrid feature selection approach that combines Information Gain (IG) and Support Vector Machine Recursive Feature Elimination to enhance predictive accuracy. The proposed hybrid method significantly improves model performance, achieving a 22.2% reduction in Root Mean Squared Error (RMSE) (from 185,518.52 to 154,403.70) and a 22.7% increase in R-squared (from 0.6522 to 0.8008) compared to using IG alone. While IG is effective in ranking features based on their relevance to the target variable, it does not account for feature interactions and redundancy, which can lead to suboptimal feature selection. The addition of SVM-RFE addresses this limitation by iteratively refining the feature set, ensuring only the most informative attributes are retained. Furthermore, the hybrid approach demonstrated robustness even in the presence of artificially introduced noise. Hyperparameter tuning further optimized the best-performing model, yielding marginal improvements in accuracy. These findings highlight the effectiveness of combining filter and wrapper methods for real estate price prediction, demonstrating that hybrid feature selection leads to more reliable and interpretable models. 2025-01 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/6142/1/Final_Year_Project__Low_Jun_Liang_%2D_JUN_LIANG_LOW.pdf Low, Jun Liang (2025) Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE. Final Year Project, UTAR. http://eprints.utar.edu.my/6142/
spellingShingle H Social Sciences (General)
Q Science (General)
T Technology (General)
Low, Jun Liang
Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE
title Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE
title_full Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE
title_fullStr Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE
title_full_unstemmed Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE
title_short Enhancing house price prediction using hybrid feature selection: A combination of information gain and SVM-RFE
title_sort enhancing house price prediction using hybrid feature selection: a combination of information gain and svm-rfe
topic H Social Sciences (General)
Q Science (General)
T Technology (General)
url http://eprints.utar.edu.my/6142/1/Final_Year_Project__Low_Jun_Liang_%2D_JUN_LIANG_LOW.pdf
http://eprints.utar.edu.my/6142/
url_provider http://eprints.utar.edu.my