ISIT 2009:

A Large-Deviation Analysis for the Maximum Likelihood Learning of Tree Structures

by Vincent Tan, Animashree Anandkumar, Lang Tong and Alan S. Willsky.

Abstract

 

The problem of maximum-likelihood learning of the structure of an unknown discrete distribution from samples is considered when the distribution is Markov on a tree. Large deviation analysis of the error in estimation of the set of edges of the tree is performed. Necessary and sufficient conditions are provided to ensure that this error probability decays exponentially. These conditions are based on the mutual information between each pair of variables being distinct from that of other pairs. The rate of error decay, or error exponent, is derived using the large-deviation principle. The error exponent is approximated using Euclidean information theory and is given by a ratio, to be interpreted as the signal-to-noise ratio (SNR) for learning .Numerical experiments show the SNR approximation is accurate.

 

Index Terms—Large-deviations, Tree structure learning, Error exponents, Euclidean Information Theory.

Paper

The proofs of the main results can be found in the journal version of this paper, which is posted on arXiv.