An illustrated guide to automatic sparse differentiation (iclr-blogposts.github.io)
In numerous applications of machine learning, Hessians and Jacobians exhibit sparsity, a property that can be leveraged to vastly accelerate their computation.