Title:
Sparse Nonnegative Matrix Factorization for Clustering
Sparse Nonnegative Matrix Factorization for Clustering
dc.contributor.author | Kim, Jingu | |
dc.contributor.author | Park, Haesun | |
dc.date.accessioned | 2008-02-21T22:54:42Z | |
dc.date.available | 2008-02-21T22:54:42Z | |
dc.date.issued | 2008 | |
dc.description.abstract | Properties of Nonnegative Matrix Factorization (NMF) as a clustering method are studied by relating its formulation to other methods such as K-means clustering. We show how interpreting the objective function of K-means as that of a lower rank approximation with special constraints allows comparisons between the constraints of NMF and K-means and provides the insight that some constraints can be relaxed from K-means to achieve NMF formulation. By introducing sparsity constraints on the coefficient matrix factor in NMF objective function, we in term can view NMF as a clustering method. We tested sparse NMF as a clustering method, and our experimental results with synthetic and text data shows that sparse NMF does not simply provide an alternative to K-means, but rather gives much better and consistent solutions to the clustering problem. In addition, the consistency of solutions further explains how NMF can be used to determine the unknown number of clusters from data. We also tested with a recently proposed clustering algorithm, Affinity Propagation, and achieved comparable results. A fast alternating nonnegative least squares algorithm was used to obtain NMF and sparse NMF. | en_US |
dc.identifier.uri | http://hdl.handle.net/1853/20058 | |
dc.language.iso | en_US | en_US |
dc.publisher | Georgia Institute of Technology | en_US |
dc.relation.ispartofseries | CSE Technical Reports ; GT-CSE-08-01 | en_US |
dc.subject | Affinity Propagation | en_US |
dc.subject | Clustering | en_US |
dc.subject | K-means | en_US |
dc.subject | Non-negative matrix factorization | en_US |
dc.title | Sparse Nonnegative Matrix Factorization for Clustering | en_US |
dc.type | Text | |
dc.type.genre | Technical Report | |
dspace.entity.type | Publication | |
local.contributor.author | Park, Haesun | |
local.contributor.corporatename | College of Computing | |
local.contributor.corporatename | School of Computational Science and Engineering | |
local.relation.ispartofseries | College of Computing Technical Report Series | |
local.relation.ispartofseries | School of Computational Science and Engineering Technical Report Series | |
relation.isAuthorOfPublication | 92013a6f-96b2-4ca8-9ef7-08f408ec8485 | |
relation.isOrgUnitOfPublication | c8892b3c-8db6-4b7b-a33a-1b67f7db2021 | |
relation.isOrgUnitOfPublication | 01ab2ef1-c6da-49c9-be98-fbd1d840d2b1 | |
relation.isSeriesOfPublication | 35c9e8fc-dd67-4201-b1d5-016381ef65b8 | |
relation.isSeriesOfPublication | 5a01f926-96af-453d-a75b-abc3e0f0abb3 |