Neural Comp. NEW Faster Access
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zhang, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zhang, J.
(Neural Computation. 2004;16:159-195.)
© 2004 The MIT Press


Letter

Divergence Function, Duality, and Convex Analysis

Jun Zhang

junz{at}umich.edu, Department of Psychology, University of Michigan, Ann Arbor, MI 48109, U. S. A

From a smooth, strictly convex function {Phi}: Rn -> R, a parametric family of divergence function ({alpha}){Phi} may be introduced:

for x, y int dom ({Phi}) Rn, and for {alpha} R, with {Phi}(± 1) defined through taking the limit of {alpha}. Each member is shown to induce an {alpha}-independent Riemannian metric, as well as a pair of dual {alpha}-connections, which are generally nonflat, except for {alpha} = ± 1. In the latter case, (± 1){Phi} reduces to the (nonparametric) Bregman divergence, which is representable using {Phi} and its convex conjugate {Phi}* and becomes the canonical divergence for dually flat spaces (Amari, 1982, 1985; Amari & Nagaoka, 2000). This formulation based on convex analysis naturally extends the information-geometric interpretation of divergence functions (Eguchi, 1983) to allow the distinction between two different kinds of duality: referential duality ({alpha} {leftrightarrow} - {alpha}) and representational duality ({Phi} {leftrightarrow} {Phi}*). When applied to (not necessarily normalized) probability densities, the concept of conjugated representations of densities is introduced, so that ±{alpha}-connections defined on probability densities embody both referential and representational duality and are hence themselves bidual. When restricted to a finite-dimensional affine submanifold, the natural parameters of a certain representation of densities and the expectation parameters under its conjugate representation form biorthogonal coordinates. The alpha representation (indexed by ß now, ß [-1,1]) is shown to be the only measure-invariant representation. The resulting two-parameter family of divergence functionals ({alpha}, ß), ({alpha}, ß) [-1,1] x [-1,1] induces identical Fisher information but bidual alpha-connection pairs; it reduces in form to Amari's alpha-divergence family when {alpha} = ± 1 or when ß=1, but to the family of Jensen difference (Rao, 1987) when ß= -1.




This article has been cited by other articles:


Home page
Neural Comput.Home page
S.-i. Amari
Integration of Stochastic Models by Minimizing {alpha}-Divergence
Neural Comput., October 1, 2007; 19(10): 2780 - 2796.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
J COGNITIVE NEUROSCIENCE NEURAL COMPUTATION MIT PRESS JOURNALS
Copyright © 2004 by The MIT Press.