Softmax Not So Hot? Try Temperature Scaling!
So we all know that output of softmax is not real probabilities - we just pretend that they are. There is a method called temperature scaling that can calibrate this to actual probabilities (using validation set), works good in practice and easy to use: https://github.com/gpleiss/temperature_scaling