tabbyAPI-ollama/endpoints/OAI
kingbri f070587e9f Model: Add proper jobs cleanup and fix var calls
Jobs should be started and immediately cleaned up when calling the
generation stream. Expose a stream_generate function and append
this to the base class since it's more idiomatic than generate_gen.

The exl2 container's generate_gen function is now internal.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-04-24 21:30:55 -04:00
..
types Model + API: Migrate to use BaseSamplerParams 2025-04-16 00:50:05 -04:00
utils Model: Add proper jobs cleanup and fix var calls 2025-04-24 21:30:55 -04:00
router.py API: Fix chat completion formatting flow 2024-11-21 17:51:14 -05:00