tabbyAPI-ollama/endpoints/OAI
kingbri fb1d2f34c1 OAI: Add response_prefix and fix BOS token issues in chat completions
response_prefix is used to add a prefix before generating the next
message. This is used in many cases such as continuining a prompt
(see #96).

Also if a template has BOS token specified, add_bos_token will
append two BOS tokens. Add a check which strips a starting BOS token
from the prompt if it exists.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-04-25 00:54:43 -04:00
..
types OAI: Add response_prefix and fix BOS token issues in chat completions 2024-04-25 00:54:43 -04:00
utils OAI: Add response_prefix and fix BOS token issues in chat completions 2024-04-25 00:54:43 -04:00
router.py Templates: Migrate to class 2024-04-21 23:28:14 -04:00