|
|
||||||||
Letter |
Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8552, Japan
Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Institute of Technology, Meguro-ku, Tokyo, 152-8552, Japan
The problem of model selection is considerably important for acquiring higher levels of generalization capability in supervised learning. In this article, we propose a new criterion for model selection, the subspace information criterion (SIC), which is a generalization of Mallows's CL. It is assumed that the learning target function belongs to a specified functional Hilbert space and the generalization error is defined as the Hilbert space squared norm of the difference between the learning result function and target function. SIC gives an unbiased estimate of the generalization error so defined. SIC assumes the availability of an unbiased estimate of the target function and the noise covariance matrix, which are generally unknown. A practical calculation method of SIC for least-mean-squares learning is provided under the assumption that the dimension of the Hilbert space is less than the number of training examples. Finally, computer simulations in two examples show that SIC works well even when the number of training examples is small.
This article has been cited by other articles:
![]() |
M. SUGIYAMA and K. SAKURAI Analytic Optimization of Shrinkage Parameters Based on Regularized Subspace Information Criterion IEICE Trans A: Fundamentals, August 1, 2006; E89-A(8): 2216 - 2225. [Abstract] [PDF] |
||||
![]() |
J.-M. Ye, X.-L. Zhu, and X.-D. Zhang Adaptive Blind Separation with an Unknown Number of Sources Neural Comput., August 1, 2004; 16(8): 1641 - 1660. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sugiyama, M. Kawanabe, and K.-R. Muller Trading Variance Reduction with Unbiasedness: The Regularized Subspace Information Criterion for Robust Model Selection in Kernel Regression Neural Comput., May 1, 2004; 16(5): 1077 - 1104. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| J COGNITIVE NEUROSCIENCE | NEURAL COMPUTATION | MIT PRESS JOURNALS |