tabbyAPI-ollama/endpoints/Kobold
kingbri f070587e9f Model: Add proper jobs cleanup and fix var calls
Jobs should be started and immediately cleaned up when calling the
generation stream. Expose a stream_generate function and append
this to the base class since it's more idiomatic than generate_gen.

The exl2 container's generate_gen function is now internal.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-04-24 21:30:55 -04:00
..
types Refactor the sampling class (#199) 2024-10-27 11:43:41 -04:00
utils Model: Add proper jobs cleanup and fix var calls 2025-04-24 21:30:55 -04:00
router.py Model: Make model params return a model card 2025-04-21 23:15:46 -04:00