Gradient Analysis and Optimization

In these papers, we develop new methods for optimizing complex neural networks that differ from naive stochastic gradient descent. We also exploit the optimization trajectory to interpret the neural network.

Interpretability