Top PAlso known as nucleus sampling, this parameter controls the diversity of the generated text. It determines which tokens are considered when the model predicts the next word, influencing the balance between coherence and randomness.
- 1 — the model considers all possible options for continuing the text and generates a more unpredictable response;
- 0 — the model will choose from the smallest possible set of words. This is necessary when you need to obtain more specific and focused information.
Ex.: when Top P is set to 0.1, the chatbot selects the more popular words.