Visualizing memorization in RNNsBy Distill / March 25, 2019 Inspecting gradient magnitudes in context can be a powerful tool to see when recurrent units use short-term or long-term contextual understanding.