TY - JOUR
T1 - Communication-Efficient Nonparametric Quantile Regression via Random Features
AU - Wang, Caixing
AU - Li, Tao
AU - Zhang, Xinyi
AU - Feng, Xingdong
AU - He, Xin
N1 - Publisher Copyright:
© 2024 American Statistical Association and Institute of Mathematical Statistics.
PY - 2024
Y1 - 2024
N2 - This article introduces a refined algorithm designed for distributed nonparametric quantile regression in a reproducing kernel Hilbert space (RKHS). Unlike existing nonparametric approaches that primarily address homogeneous data, our approach uses kernel-based quantile regression to effectively model heterogeneous data. Moreover, we integrate the concepts of random features (RF) and communication-efficient surrogate likelihood (CSL) to ensure accurate estimation and enhance computational efficiency in distributed settings. Specifically, we employ an embedding technique to map the original data into RF spaces, enabling us to construct an extended surrogate loss function. This function can be locally optimized using an iterative alternating direction method of multipliers (ADMM) algorithm, minimizing the need for extensive computation and communication within the distributed system. The article thoroughly investigates the asymptotic properties of the distributed estimator and provides convergence rates of the excess risk. These properties are established under mild technical conditions and are comparable to state-of-the-art results in the literature. Additionally, we validate the effectiveness of the proposed algorithm through a comprehensive set of synthetic examples and a real data study, effectively highlighting its advantages and practical utility. Supplementary materials for this article are available online.
AB - This article introduces a refined algorithm designed for distributed nonparametric quantile regression in a reproducing kernel Hilbert space (RKHS). Unlike existing nonparametric approaches that primarily address homogeneous data, our approach uses kernel-based quantile regression to effectively model heterogeneous data. Moreover, we integrate the concepts of random features (RF) and communication-efficient surrogate likelihood (CSL) to ensure accurate estimation and enhance computational efficiency in distributed settings. Specifically, we employ an embedding technique to map the original data into RF spaces, enabling us to construct an extended surrogate loss function. This function can be locally optimized using an iterative alternating direction method of multipliers (ADMM) algorithm, minimizing the need for extensive computation and communication within the distributed system. The article thoroughly investigates the asymptotic properties of the distributed estimator and provides convergence rates of the excess risk. These properties are established under mild technical conditions and are comparable to state-of-the-art results in the literature. Additionally, we validate the effectiveness of the proposed algorithm through a comprehensive set of synthetic examples and a real data study, effectively highlighting its advantages and practical utility. Supplementary materials for this article are available online.
KW - ADMM
KW - Distributed computing
KW - Distributed estimation
KW - Reproducing kernel Hilbert space
UR - http://www.scopus.com/inward/record.url?scp=85188230288&partnerID=8YFLogxK
U2 - 10.1080/10618600.2024.2308798
DO - 10.1080/10618600.2024.2308798
M3 - Article
AN - SCOPUS:85188230288
SN - 1061-8600
JO - Journal of Computational and Graphical Statistics
JF - Journal of Computational and Graphical Statistics
ER -