powered by hugging face inference API
longer responses = more tokens
higher = more creative
nucleus sampling threshold
0 = unlimited history
tokens used: calculating...