Weight Initialization

If we don't initialize our weight and start with 0, then the derivatives for all the variables will be same. This makes the neurons to learn the same value.

Also, if the weights we initialize are not random, then all the nodes will represent the same feature.

Reviews:
  1. add different types of weight initializations

Related Notes