Automatic Relevance Determination for Nonnegative Matrix Factorization
Vincent Y. F. Tan and Cédric Févotte,
2009 Workshop on Signal Processing with Adaptive Sparse Structured Representations (SPARS’09), St Malo, France

Apr 6 – Apr 9, 2009

Abstract:

 

Nonnegative matrix factorization (NMF) has become a popular technique for data analysis and dimensionality reduction. However, it is often assumed that the number of latent dimensions (or components) is given. In practice, one must choose a suitable value depending on the data and/or setting. In this paper, we address this important issue by using a Bayesian approach to estimate the latent dimensionality, or equivalently, select the model order. This is achieved via automatic relevance determination (ARD), a technique that has been employed in Bayesian PCA and sparse Bayesian learning. We show via experiments on synthetic data that our technique is able to recover the correct number of components, while it is also able to recover an effective number of components from real datasets such as the MIT CBCL dataset.

 

---------------------------------------------------------------------------------------------------------------------------------------

 

The associated Matlab code for this paper is available here.

 

The zip file contains two files: nmf_kl_mos.m is the main function and sc_data_nmf.m is the test file.

 

Please contact the author if you have any questions. Thanks!