Do We Need 2nd Order Methods in Machine Learning?
August 15, 2018, 5:00 PM - 5:30 PM
Martin Takáč, Lehigh University
In this talk, we address the question of whether and when second-order optimization methods are needed for training deep neural networks (DNNs), and when SGD-type algorithms are sufficient. We will further discuss some challenges of using stochastic and batch quasi-Newton methods for training DNNs. We will conclude the talk with preliminary numerical experiments.