Kullback–Leibler divergence and Cross entropy: http://sens.tistory.com/412
KL divergence: https://blog.csdn.net/sallyyoung_sh/article/details/54406615
Linear Classification Loss Visualization: http://vision.stanford.edu/teaching/cs231n-demos/linear-classify/