Show simple item record

dc.contributor.advisorBhattacharyya, Chiranjib
dc.contributor.authorPani, Jagdeep
dc.date.accessioned2017-10-31T07:00:49Z
dc.date.accessioned2018-07-31T04:38:40Z
dc.date.available2017-10-31T07:00:49Z
dc.date.available2018-07-31T04:38:40Z
dc.date.issued2017-10-31
dc.date.submitted2016
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/2739
dc.identifier.abstracthttp://etd.iisc.ac.in/static/etd/abstracts/3566/G27836-Abs.pdfen_US
dc.description.abstractNonnegative matrix factorization (NMF) is an important data-analysis problem which concerns factoring a given d n matrix A with nonnegative entries into matrices B and C where B and C are d k and k n with nonnegative entries. It has numerous applications including Object recognition, Topic Modelling, Hyper-spectral imaging, Music transcription etc. In general, NMF is intractable and several heuristics exists to solve the problem of NMF. Recently there has been interest in investigating conditions under which NMF can be tractably recovered. We note that existing attempts make unrealistic assumptions and often the associated algorithms tend to be not scalable. In this thesis, we make three major contributions: First, we formulate a model of NMF with assumptions which are natural and is a substantial weakening of separability. Unlike requiring a bound on the error in each column of (A BC) as was done in much of previous work, our assumptions are about aggregate errors, namely spectral norm of (A BC) i.e. jjA BCjj2 should be low. This is a much weaker error assumption and the associated B; C would be much more resilient than existing models. Second, we describe a robust polynomial time SVD-based algorithm, UTSVD, with realistic provable error guarantees and can handle higher levels of noise than previous algorithms. Indeed, experimentally we show that existing NMF models, which are based on separability assumptions, degrade much faster than UTSVD, in the presence of noise. Furthermore, when the data has dominant features, UTSVD significantly outperforms existing models. On real life datasets we again see a similar outperformance of UTSVD on clustering tasks. Finally, under a weaker model, we prove a robust version of uniqueness of NMF, where again, the word \robust" refers to realistic error bounds.en_US
dc.language.isoen_USen_US
dc.relation.ispartofseriesG27836en_US
dc.subjectNon-negative Matrix Factorization (NMF)en_US
dc.subjectData Analysisen_US
dc.subjectNon-Negative Matrix Factorization Uniquenessen_US
dc.subjectNMF Algorithmsen_US
dc.subjectSingular Value Decompositon (SVD) Algorithmen_US
dc.subjectUTSVD NMF Algorithmen_US
dc.subjectMachine Learningen_US
dc.subject.classificationComputer Scienceen_US
dc.titleProvable Methods for Non-negative Matrix Factorizationen_US
dc.typeThesisen_US
dc.degree.nameMSc Enggen_US
dc.degree.levelMastersen_US
dc.degree.disciplineFaculty Of Engineeringen_US


Files in this item

This item appears in the following Collection(s)

Show simple item record