
Wide and deep neural networks achieve consistency for classification.

Proceedings of the National Academy of Sciences of the United States of America

Authors	Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler
Keywords	classification, consistency, neural networks, neural tangent kernel
Abstract	While neural networks are used for classification tasks across domains, a long-standing open problem in machine learning is determining whether neural networks trained using standard procedures are consistent for classification, i.e., whether such models minimize the probability of misclassification for arbitrary data distributions. In this work, we identify and construct an explicit set of neural network classifiers that are consistent. Since effective neural networks in practice are typically both wide and deep, we analyze infinitely wide networks that are also infinitely deep. In particular, using the recent connection between infinitely wide neural networks and neural tangent kernels, we provide explicit activation functions that can be used to construct networks that achieve consistency. Interestingly, these activation functions are simple and easy to implement, yet differ from commonly used activations such as ReLU or sigmoid. More generally, we create a taxonomy of infinitely wide and deep networks and show that these models implement one of three well-known classifiers depending on the activation function used: 1) 1-nearest neighbor (model predictions are given by the label of the nearest training example); 2) majority vote (model predictions are given by the label of the class with the greatest representation in the training set); or 3) singular kernel classifiers (a set of classifiers containing those that achieve consistency). Our results highlight the benefit of using deep networks for classification tasks, in contrast to regression tasks, where excessive depth is harmful.
Year of Publication	2023
Journal	Proceedings of the National Academy of Sciences of the United States of America
Volume	120
Issue	14
Pages	e2208779120
Date Published	04/2023
ISSN	1091-6490
DOI	10.1073/pnas.2208779120
PubMed ID	36996114
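
The three classifiers named in the abstract can be illustrated with a minimal sketch. It assumes binary labels in {-1, +1} and Euclidean distance. The singular kernel shown here, with weights proportional to distance raised to the power `-alpha`, is one illustrative choice; the function names, the `alpha` default, and the toy data are assumptions for illustration and are not taken from the paper.

```python
import numpy as np


def one_nearest_neighbor(X_train, y_train, x):
    """Predict the label of the training example closest to x."""
    dists = np.linalg.norm(X_train - x, axis=1)
    return int(y_train[np.argmin(dists)])


def majority_vote(y_train):
    """Predict the most frequent training label, ignoring the input."""
    values, counts = np.unique(y_train, return_counts=True)
    return int(values[np.argmax(counts)])


def singular_kernel_classifier(X_train, y_train, x, alpha=2.0):
    """Predict the sign of a distance-weighted sum of labels.

    The weight dist ** (-alpha) diverges as x approaches a training
    point, which is what makes the kernel singular. The exponent
    alpha is a hypothetical parameter chosen for illustration.
    """
    dists = np.linalg.norm(X_train - x, axis=1)
    if np.any(dists == 0):
        # x coincides with a training point: return that point's label.
        return int(y_train[np.argmin(dists)])
    weights = dists ** (-alpha)
    return 1 if np.sum(weights * y_train) >= 0 else -1


# Toy data (hypothetical): three positive points near the origin
# and one negative point far away.
X_train = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0]])
y_train = np.array([1, 1, 1, -1])
x_query = np.array([4.0, 4.0])

pred_nn = one_nearest_neighbor(X_train, y_train, x_query)
pred_majority = majority_vote(y_train)
pred_singular = singular_kernel_classifier(X_train, y_train, x_query)
```

On this toy data the majority vote ignores the query point entirely, while the other two rules are driven by the nearby negative example, which shows how differently the three classifiers can behave on the same training set.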
