What:
A function that takes a bunch of numbers and returns them so they all add up to , with each assigned a Probability ~relative to the size of it in the beginning.
The Formula:
Temperature (For Use in Large Language Models (LLMs))
By dividing the exponents by , it affects how we choose the words - the lower the more deterministic, the higher, the more random.