Other The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima. February 23, 2021
Artificial Intelligence | Computer Science | Machine Learning Archetypal landscapes for deep neural networks. August 26, 2020