ӳ��ý

Linear Recursive Feature Machines provably recover low-rank matrices.

Proceedings of the National Academy of Sciences of the United States of America

Authors	Adityanarayanan Radhakrishnan Mikhail Belkin Dmitriy Drusvyatskiy
Keywords	feature learning matrix sensing neural networks sparse recovery
Abstract	A fundamental problem in machine learning is to understand how neural networks make accurate predictions, while seemingly bypassing the curse of dimensionality. A possible explanation is that common training algorithms for neural networks implicitly perform dimensionality reduction-a process called feature learning. Recent work [A. Radhakrishnan, D. Beaglehole, P. Pandit, M. Belkin, , 1461-1467 (2024).] posited that the effects of feature learning can be elicited from a classical statistical estimator called the average gradient outer product (AGOP). The authors proposed Recursive Feature Machines (RFMs) as an algorithm that explicitly performs feature learning by alternating between 1) reweighting the feature vectors by the AGOP and 2) learning the prediction function in the transformed space. In this work, we develop theoretical guarantees for how RFM performs dimensionality reduction by focusing on the class of overparameterized problems arising in sparse linear regression and low-rank matrix recovery. Specifically, we show that RFM restricted to linear models (lin-RFM) reduces to a variant of the well-studied Iteratively Reweighted Least Squares (IRLS) algorithm. Furthermore, our results connect feature learning in neural networks and classical sparse recovery algorithms and shed light on how neural networks recover low rank structure from data. In addition, we provide an implementation of lin-RFM that scales to matrices with millions of missing entries. Our implementation is faster than the standard IRLS algorithms since it avoids forming singular value decompositions. It also outperforms deep linear networks for sparse linear regression and low-rank matrix completion.
Year of Publication	2025
Journal	Proceedings of the National Academy of Sciences of the United States of America
Volume	122
Issue	13
Pages	e2411325122
Date Published	04/2025
ISSN	1091-6490
DOI	10.1073/pnas.2411325122
PubMed ID	40153460
Links

Recent ӳ��ý Publications

Quantifying the fatal and non-fatal burden of disease associated with child growth failure, 2000-2023: a systematic analysis from the Global Burden of Disease Study 2023.

Association Between Maternal Genome-Wide Polygenic Scores for Psychiatric and Neurodevelopmental Disorders and Adverse Perinatal Events: A Danish Population-Based Study.

Multisite, Multiancestry Genome-Wide Association Study Meta-Analysis of Functional Seizure Disorder in a Hospital Sample of 675,680 Patients.

Plasma Proteins Invariant to Diet in Celiac Disease: Results From a Proteomics Study on the UK Biobank.

Response and Resistance to RAS Inhibition in Cancer.