Exploding Gradient

#deep-learning #interview

Exploding gradient occurs when the gradients of the weights become so large during training that the weight updates turn unstable; in the extreme case the values overflow to inf and then become NaN

Why Does Exploding Gradient Occur?

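During backpropagation, the gradient reaching an early layer is a product of per-layer factors (chain rule). If those factors are consistently greater than 1.0 in magnitude and the network is deep, the product grows exponentially with depth and becomes a very large number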
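
A minimal sketch of the growth described above, in plain Python. It is a toy model, not a real network: it assumes every layer scales the backpropagated gradient by the same constant, and the function name `backprop_gradient_magnitude` and the factor 1.5 are made up for illustration.

```python
def backprop_gradient_magnitude(per_layer_factor: float, depth: int) -> float:
    """Toy model: start with a gradient of 1.0 at the output and scale it
    by the same (assumed) factor once per layer on the way back."""
    grad = 1.0
    for _ in range(depth):
        grad *= per_layer_factor
    return grad


# With a factor of 1.5, the magnitude grows exponentially with depth.
for depth in (10, 100, 1000):
    print(depth, backprop_gradient_magnitude(1.5, depth))

# Deep enough, the value exceeds the float range and overflows to inf.
overflowed = backprop_gradient_magnitude(1.5, 10_000)
print(overflowed)               # inf

# Arithmetic on inf (for example inf - inf) then produces NaN.
print(overflowed - overflowed)  # nan
```

In a real network the per-layer factors vary and depend on the weights and activation derivatives, so this only shows why a product of factors above 1.0 blows up so quickly.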

How to identify Exploding Gradient?

How to solve Exploding Gradient?

Related Notes