TY - JOUR
T1 - Sparse classification with paired covariates
AU - Rauschenberger, Armin
AU - Ciocănea-Teodorescu, Iuliana
AU - Jonker, Marianne A.
AU - Menezes, Renée X.
AU - van de Wiel, Mark A.
N1 - Publisher Copyright:
© 2019, The Author(s).
PY - 2020/9/1
Y1 - 2020/9/1
N2 - This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from cran.
AB - This paper introduces the paired lasso: a generalisation of the lasso for paired covariate settings. Our aim is to predict a single response from two high-dimensional covariate sets. We assume a one-to-one correspondence between the covariate sets, with each covariate in one set forming a pair with a covariate in the other set. Paired covariates arise, for example, when two transformations of the same data are available. It is often unknown which of the two covariate sets leads to better predictions, or whether the two covariate sets complement each other. The paired lasso addresses this problem by weighting the covariates to improve the selection from the covariate sets and the covariate pairs. It thereby combines information from both covariate sets and accounts for the paired structure. We tested the paired lasso on more than 2000 classification problems with experimental genomics data, and found that for estimating sparse but predictive models, the paired lasso outperforms the standard and the adaptive lasso. The R package palasso is available from cran.
KW - Lasso regression
KW - Paired data
KW - Prediction
KW - Sparsity
UR - http://www.scopus.com/inward/record.url?scp=85075439602&partnerID=8YFLogxK
U2 - 10.1007/s11634-019-00375-6
DO - 10.1007/s11634-019-00375-6
M3 - Article
AN - SCOPUS:85075439602
SN - 1862-5347
VL - 14
SP - 571
EP - 588
JO - Advances in Data Analysis and Classification
JF - Advances in Data Analysis and Classification
IS - 3
ER -