Softmax temperature in machine learning. L=t-1 if you are doing self-attention).