tabbyAPI-ollama/backends
kingbri f070587e9f Model: Add proper jobs cleanup and fix var calls
Jobs should be started and immediately cleaned up when calling the
generation stream. Expose a stream_generate function and append
this to the base class since it's more idiomatic than generate_gen.

The exl2 container's generate_gen function is now internal.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-04-24 21:30:55 -04:00
..
exllamav2 Model: Add proper jobs cleanup and fix var calls 2025-04-24 21:30:55 -04:00
infinity Model: Add proper jobs cleanup and fix var calls 2025-04-24 21:30:55 -04:00
base_model_container.py Model: Add proper jobs cleanup and fix var calls 2025-04-24 21:30:55 -04:00