research ∙ 10/01/2019
How noise affects the Hessian spectrum in overparameterized neural networks
Stochastic gradient descent (SGD) forms the core optimization method for...
research ∙ 03/06/2019