tabbyAPI-ollama/backends/exllamav2
kingbri 0b4ca567f8 API: Persist request IDs and append full_text to finish chunk
Adding these to each generation chunk helps remove redundancy and
unecessary request ID operations.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-07-25 12:27:44 -04:00
..
grammar.py Tree: Format 2025-05-17 00:46:40 -04:00
model.py API: Persist request IDs and append full_text to finish chunk 2025-07-25 12:27:44 -04:00
utils.py Exl3: Add chunk size, cache size, and model info 2025-05-02 21:33:25 -04:00
vision.py Exl3: Add vision capability 2025-06-15 19:22:51 +02:00