Metagenes and molecular pattern discovery using matrix factorization.
| Authors | |
| Keywords | |
| Abstract | We describe here the use of nonnegative matrix factorization (NMF), an algorithm based on decomposition by parts that can reduce the dimension of expression data from thousands of genes to a handful of metagenes. Coupled with a model selection mechanism, adapted to work for any stochastic clustering algorithm, NMF is an efficient method for identification of distinct molecular patterns and provides a powerful method for class discovery. We demonstrate the ability of NMF to recover meaningful biological information from cancer-related microarray data. NMF appears to have advantages over other methods such as hierarchical clustering or self-organizing maps. We found it less sensitive to a priori selection of genes or initial conditions and able to detect alternative or context-dependent patterns of gene expression in complex biological systems. This ability, similar to semantic polysemy in text, provides a general method for robust molecular pattern discovery. |
| Year of Publication | 2004
|
| Journal | Proc Natl Acad Sci U S A
|
| Volume | 101
|
| Issue | 12
|
| Pages | 4164-9
|
| Date Published | 2004 Mar 23
|
| ISSN | 0027-8424
|
| URL | |
| DOI | 10.1073/pnas.0308531101
|
| PubMed ID | 15016911
|
| PubMed Central ID | PMC384712
|
| Links |