Softmax Not So Hot? Try Temperature Scaling!
·46 words·1 min
So we all know that output of softmax is not real probabilities - we just pretend that they are. There is a method called temperature scaling that can calibrate this to actual probabilities (using validation set), works good in practice and easy to use: https://github.com/gpleiss/temperature_scaling