|
args.py
|
Model: Add Tensor Parallel support
|
2024-08-22 14:15:19 -04:00 |
|
downloader.py
|
Downloader: Make timeout configurable
|
2024-07-23 21:42:38 -04:00 |
|
gen_logging.py
|
Model: Attach request ID to logs
|
2024-08-01 00:25:54 -04:00 |
|
logger.py
|
API: Add HuggingFace downloader
|
2024-04-29 01:15:02 -04:00 |
|
model.py
|
Model: Bypass lock checks when shutting down
|
2024-08-03 16:05:34 -04:00 |
|
networking.py
|
API: Add request logging
|
2024-07-22 21:40:00 -04:00 |
|
sampling.py
|
API: Add allowed_tokens support
|
2024-08-29 21:44:42 -04:00 |
|
signals.py
|
Model: Bypass lock checks when shutting down
|
2024-08-03 16:05:34 -04:00 |
|
templating.py
|
Templates: Switch to async jinja engine
|
2024-08-17 12:03:41 -04:00 |
|
transformers_utils.py
|
Tree: Format
|
2024-07-26 18:33:04 -04:00 |