MathInstitutes Dev

Learning Over-Parameterized Neural Networks on Structured Data

Presenter

Yingyu Liang

November 27, 2018

Learning Over-Parameterized Neural Networks on Structured Data Thumbnail

Abstract

Yingyu Liang - University of Wisconsin-Madison Neural networks have many successful applications, while much less theoretical understanding has been gained. Towards bridging this gap, we study the problem of learning a two-layer overparameterized ReLU neural network for multi-class classification via stochastic gradient descent (SGD) from random initialization. In the overparameterized setting, when the data comes from mixtures of well-separated distributions, we prove that SGD learns a network with a small generalization error, albeit the network has enough capacity to fit arbitrary labels. Furthermore, the analysis provides interesting insights into several aspects of learning neural networks and can be verified based on empirical studies on synthetic data and on the MNIST dataset.

Abstract

Supplementary Materials

Learning Over-Parameterized Neural Networks on Structured DataLearning Over-Parameterized Neural Networks on Structured Data

Videos

Learning Over-Parameterized Neural Networks on Structured Data

Presenter

Abstract

Supplementary Materials