Controls creativity. 0 = predictable responses, higher = more varied and unexpected results. View Documentation
Type
FLOAT
Default
1.00
Range
0.00 ~ 2.00
Response Diversitytop_p
Filters responses to the most likely words within a probability range. Lower values keep answers more focused. View Documentation
Type
FLOAT
Default
1.00
Range
0.00 ~ 1.00
New Idea Encouragementpresence_penalty
Encourages or discourages using new words. Higher values promote fresh ideas, while lower values allow more reuse. View Documentation
Type
FLOAT
Default
0.00
Range
-2.00 ~ 2.00
Repetition Controlfrequency_penalty
Adjusts how often the model repeats words. Higher values mean fewer repeats, while lower values allow more repetition. View Documentation
Type
FLOAT
Default
0.00
Range
-2.00 ~ 2.00
Response Length Limitmax_tokens
Sets the max length of responses. Increase for longer replies, decrease for shorter ones. View Documentation
Type
INT
Default
--
Reasoning Depthreasoning_effort
Determines how much effort the model puts into reasoning. Higher settings generate more thoughtful responses but take longer. View Documentation
Type
STRING
Default
--
Range
low ~ high
Related Models
Llama 3.1 8B
llama3.1
Llama 3.1 is a leading model launched by Meta, supporting up to 405B parameters, applicable in complex dialogues, multilingual translation, and data analysis.
Llama 3.1 70B
llama3.1:70b
70b.description
Llama 3.1 405B
llama3.1:405b
405b.description
Code Llama 7B
codellama
Code Llama is an LLM focused on code generation and discussion, combining extensive programming language support, suitable for developer environments.