tabbyAPI-ollama/backends/exllamav3
kingbri 303e2dde12 Model: Correct exl3 generation, add concurrency, and cleanup
Fixes application of sampler parameters by adding a new sampler builder
interface. Also expose the generator class-wide and add wait_for_jobs.

Finally, allow inline loading to specify the backend.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
2025-05-02 21:33:25 -04:00
..
model.py Model: Correct exl3 generation, add concurrency, and cleanup 2025-05-02 21:33:25 -04:00