tabbyAPI-ollama/common
kingbri 7fded4f183 Tree: Switch to async generators
Async generation helps remove many roadblocks to managing tasks
using threads. It should allow for abortables and modern-day paradigms.

NOTE: Exllamav2 itself is not an asynchronous library. It's just
been added into tabby's async nature to allow for a fast and concurrent
API server. It's still being debated to run stream_ex in a separate
thread or manually manage it using asyncio.sleep(0)

Signed-off-by: kingbri <bdashore3@proton.me>
2024-03-16 23:23:31 -04:00
..
args.py Tree: Format 2024-03-13 00:02:55 -04:00
auth.py Tree: Format 2024-03-13 00:02:55 -04:00
concurrency.py Tree: Switch to async generators 2024-03-16 23:23:31 -04:00
config.py Tree: Update to cleanup globals 2024-03-12 23:59:30 -04:00
gen_logging.py Tree: Format 2024-03-13 23:33:18 -04:00
logger.py Logging: Escape rich markup sequences 2024-03-11 00:28:48 -04:00
model.py Tree: Switch to async generators 2024-03-16 23:23:31 -04:00
sampling.py API + Model: Add speculative ngram decoding 2024-03-13 23:32:11 -04:00
templating.py Tree: Format 2024-03-13 00:02:55 -04:00
utils.py Tree: Switch to async generators 2024-03-16 23:23:31 -04:00