tabbyAPI-ollama/backends/exllamav3
kingbri 2913ce29fc API: Add timings to usage stats
It's useful for the client to know what the T/s and total time for
generation are per-request.

Works with both completions and chat completions.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-06-17 22:54:51 -04:00
..
model.py API: Add timings to usage stats 2025-06-17 22:54:51 -04:00
sampler.py Model: Add Exllamav3 sampler 2025-05-02 21:33:25 -04:00
vision.py Exl3: Add vision capability 2025-06-15 19:22:51 +02:00