Search code examples
caffepycaffematcaffematconvnet

Does scaling the values to [0,1] affect the CNN learning procedure?


I am working on semantic segmentation using CNNs.

I have normalized the values of images to range [0,1]. I have trained my network many times, learning curve seems is learning well, however, the output is always black image. My question is does really scaling affects the learning or should the range of pixel values remains in 0-255 range?

Thanks a lot.


Solution

  • In general, it doesn't affect the training. In many situations, [0 1] is actually better than [0 255] for training a DNN since the latter may cause some numerical issues in learning the first layer weights if you don't use lower learning rates in the first layer since w' = y' * x, where ' denotes derivative.