softmax VS normalizatoin

为什么使用softmax,不用normalization?

“max” because amplifies probability of largest

“soft” because still assigns some probability to smaller 

猜你喜欢

转载自www.cnblogs.com/zhaopAC/p/10149698.html