Exploding Gradient
Gradient values become too large during backpropagation
Common in deep rnn → weights grow exponentially
Solutions:
- Gradient clipping
- lstm architecture
- Proper weight initialization
References
#ml-notes
Gradient values become too large during backpropagation
Common in deep rnn → weights grow exponentially
Solutions:
#ml-notes