TY - JOUR
T1 - Online nonlinear estimation via iterative L2-space projections
T2 - Reproducing Kernel of subspace
AU - Ohnishi, Motoya
AU - Yukawa, Masahiro
N1 - Publisher Copyright:
Copyright © 2017, The Authors. All rights reserved.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2017/12/12
Y1 - 2017/12/12
N2 - We propose a novel online learning paradigm for nonlinear-function estimation tasks based on the iterative projections in the L2space with probability measure reflecting the stochastic property of input signals. The proposed learning algorithm exploits the reproducing kernel of the so-called dictionary subspace, based on the fact that any finite-dimensional space of functions has a reproducing kernel characterized by the Gram matrix. The L2-space geometry provides the best decorrelation property in principle. The proposed learning paradigm is significantly different from the conventional kernelbased learning paradigm in two senses: (i) the whole space is not a reproducing kernel Hilbert space and (ii) the minimum mean squared error estimator gives the best approximation of the desired nonlinear function in the dictionary subspace. It preserves efficiency in computing the inner product as well as in updating the Gram matrix when the dictionary grows. Monotone approximation, asymptotic optimality, and convergence of the proposed algorithm are analyzed based on the variable-metric version of adaptive projected subgradient method. Numerical examples show the efficacy of the proposed algorithm for real data over a variety of methods including the extended Kalman filter and many batch machine-learning methods such as the multilayer perceptron.
AB - We propose a novel online learning paradigm for nonlinear-function estimation tasks based on the iterative projections in the L2space with probability measure reflecting the stochastic property of input signals. The proposed learning algorithm exploits the reproducing kernel of the so-called dictionary subspace, based on the fact that any finite-dimensional space of functions has a reproducing kernel characterized by the Gram matrix. The L2-space geometry provides the best decorrelation property in principle. The proposed learning paradigm is significantly different from the conventional kernelbased learning paradigm in two senses: (i) the whole space is not a reproducing kernel Hilbert space and (ii) the minimum mean squared error estimator gives the best approximation of the desired nonlinear function in the dictionary subspace. It preserves efficiency in computing the inner product as well as in updating the Gram matrix when the dictionary grows. Monotone approximation, asymptotic optimality, and convergence of the proposed algorithm are analyzed based on the variable-metric version of adaptive projected subgradient method. Numerical examples show the efficacy of the proposed algorithm for real data over a variety of methods including the extended Kalman filter and many batch machine-learning methods such as the multilayer perceptron.
KW - Kernel adaptive filter
KW - Lspace
KW - Metric projection
KW - Online learning
KW - Recursive least squares
UR - http://www.scopus.com/inward/record.url?scp=85093422341&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093422341&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:85093422341
JO - Mathematical Social Sciences
JF - Mathematical Social Sciences
SN - 0165-4896
ER -