A sparse partial least squares algorithm based on sure independence screening method

Partial least squares (PLS) regression is a dimension reduction method used in many areas of scientific discoveries. However, it has been shown that the consistency property of the PLS algorithm does not extend to cases with very large number of variables p and small number of samples n (i.e., p>...

Full description

Saved in:
Bibliographic Details
Main Authors: Xu, X., Cheng, K. K., Deng, L., Dong, J.
Format: Article
Published: Elsevier B.V. 2017
Subjects:
Online Access:http://eprints.utm.my/id/eprint/75907/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85030102045&doi=10.1016%2fj.chemolab.2017.09.011&partnerID=40&md5=750cd790f05fd955b23f17f42610d5ef
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.75907
record_format eprints
spelling my.utm.759072018-05-30T04:09:09Z http://eprints.utm.my/id/eprint/75907/ A sparse partial least squares algorithm based on sure independence screening method Xu, X. Cheng, K. K. Deng, L. Dong, J. TP Chemical technology Partial least squares (PLS) regression is a dimension reduction method used in many areas of scientific discoveries. However, it has been shown that the consistency property of the PLS algorithm does not extend to cases with very large number of variables p and small number of samples n (i.e., p>>n). To overcome the issue, sparsity can be imposed to the dimension reduction step of the PLS algorithm. This leads to a sparse version of PLS (SPLS) algorithm which can achieve dimension reduction and variable selection simultaneously. Here, we present a new SPLS method called sure-independence-screening based sparse partial least squares (SIS-SPLS) algorithm, by incorporating both SIS method and extended Bayesian information criterion (BIC) into the PLS algorithm. The developed SIS-SPLS method was evaluated using a number of numerical studies including simulation and real datasets. The current results showed that the proposed SIS-SPLS method is efficient in variable selection. It offered low mean squared prediction errors with high sensitivity and specificity. The SIS-SPLS algorithm proposed in the current work may serve as an alternative SPLS method for the analysis of modern biological data. Elsevier B.V. 2017 Article PeerReviewed Xu, X. and Cheng, K. K. and Deng, L. and Dong, J. (2017) A sparse partial least squares algorithm based on sure independence screening method. Chemometrics and Intelligent Laboratory Systems, 170 . pp. 38-50. ISSN 0169-7439 https://www.scopus.com/inward/record.uri?eid=2-s2.0-85030102045&doi=10.1016%2fj.chemolab.2017.09.011&partnerID=40&md5=750cd790f05fd955b23f17f42610d5ef
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic TP Chemical technology
spellingShingle TP Chemical technology
Xu, X.
Cheng, K. K.
Deng, L.
Dong, J.
A sparse partial least squares algorithm based on sure independence screening method
description Partial least squares (PLS) regression is a dimension reduction method used in many areas of scientific discoveries. However, it has been shown that the consistency property of the PLS algorithm does not extend to cases with very large number of variables p and small number of samples n (i.e., p>>n). To overcome the issue, sparsity can be imposed to the dimension reduction step of the PLS algorithm. This leads to a sparse version of PLS (SPLS) algorithm which can achieve dimension reduction and variable selection simultaneously. Here, we present a new SPLS method called sure-independence-screening based sparse partial least squares (SIS-SPLS) algorithm, by incorporating both SIS method and extended Bayesian information criterion (BIC) into the PLS algorithm. The developed SIS-SPLS method was evaluated using a number of numerical studies including simulation and real datasets. The current results showed that the proposed SIS-SPLS method is efficient in variable selection. It offered low mean squared prediction errors with high sensitivity and specificity. The SIS-SPLS algorithm proposed in the current work may serve as an alternative SPLS method for the analysis of modern biological data.
format Article
author Xu, X.
Cheng, K. K.
Deng, L.
Dong, J.
author_facet Xu, X.
Cheng, K. K.
Deng, L.
Dong, J.
author_sort Xu, X.
title A sparse partial least squares algorithm based on sure independence screening method
title_short A sparse partial least squares algorithm based on sure independence screening method
title_full A sparse partial least squares algorithm based on sure independence screening method
title_fullStr A sparse partial least squares algorithm based on sure independence screening method
title_full_unstemmed A sparse partial least squares algorithm based on sure independence screening method
title_sort sparse partial least squares algorithm based on sure independence screening method
publisher Elsevier B.V.
publishDate 2017
url http://eprints.utm.my/id/eprint/75907/
https://www.scopus.com/inward/record.uri?eid=2-s2.0-85030102045&doi=10.1016%2fj.chemolab.2017.09.011&partnerID=40&md5=750cd790f05fd955b23f17f42610d5ef
_version_ 1643657193283649536
score 13.211869