A Developed Collaborative Filtering Similarity Method to Improve the Accuracy of Recommendations under Data Sparsity

This paper presented a new similarity method to improve the accuracy of traditional Collaborative Filtering (CF) method under sparse data issue. CF provides the user with items, that what they need, based on analyses the preferences of users who have a strong correlation to him/her preference. Howev...

Full description

Saved in:
Bibliographic Details
Main Authors: Al-Bashiri, Hael, Abdulgabber, Mansoor Abdullateef, Awanis, Romli, Norazuwa, Salehudin
Format: Article
Language:English
Published: The Science and Information (SAI) Organization Limited 2018
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/21368/1/A%20Developed%20Collaborative%20Filtering%20Similarity-fskkp-2018.pdf
http://umpir.ump.edu.my/id/eprint/21368/
http://dx.doi.org/10.14569/IJACSA.2018.090423
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presented a new similarity method to improve the accuracy of traditional Collaborative Filtering (CF) method under sparse data issue. CF provides the user with items, that what they need, based on analyses the preferences of users who have a strong correlation to him/her preference. However, the accuracy is influencing by the method that use to find neighbors. Pearson correlation coefficient and Cosine measures, as the most widely used methods, depending on the rating of only co-rated items to find the correlations between users. Consequently, these methods have lack of ability in addressing the sparsity. This paper presented a new proposed similarity method based on the global user preference to address the sparsity issue and improve the accuracy of recommendation. Thus, the novelty of this method is the ability to solve the similarity issue with a capability of finding the relationship among non-correlated users. Furthermore, to determine the right neighbors during the process of computing the similarity between a pair of users, the developed method considered two main factors (fairness and proportion of co-rated). The MovieLens 100K benchmark dataset is used to evaluate the developed method accuracy. The experiments’ result showed that the accuracy of the developed method is improved compared to the traditional CF similarity methods using a specific common CF evaluation metrics.