tabbyAPI-ollama/common
kingbri 0b25c208d6 API: Fix error reporting
Disconnect consistently on a load error. It is safer to
warn the user to run unload (or re-run load) if a model does not
load correctly.

Also don't log the traceback for request errors that don't have one.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-03-05 18:16:02 -05:00
..
args.py         Config: Add experimental torch cuda malloc backend     2024-02-14 21:45:56 -05:00
auth.py         API: Back to async                                     2024-03-04 23:21:40 -05:00
config.py       Launch: Make exllamav2 requirement more friendly       2024-02-02 23:36:17 -05:00
gen_logging.py  Tree: Refactor code organization                       2024-01-25 00:15:40 -05:00
generators.py   API: Fix issues with concurrent requests and queueing  2024-03-04 23:21:40 -05:00
logger.py       Tree: Refactor code organization                       2024-01-25 00:15:40 -05:00
sampling.py     Model: Add EBNF grammar support                        2024-02-24 23:40:11 -05:00
templating.py   API: Add template switching and unload endpoints       2024-01-25 00:15:40 -05:00
utils.py        API: Fix error reporting                               2024-03-05 18:16:02 -05:00