Model: Add option to select backend

Changing the backend switches the container that's used.

Signed-off-by: kingbri <8082010+kingbri1@users.noreply.github.com>
This commit is contained in:
kingbri 2025-04-27 22:27:26 -04:00
parent 242f6b7d2a
commit 7c6a053747
4 changed files with 38 additions and 6 deletions

View file

@ -81,7 +81,10 @@ class ModelLoadRequest(BaseModel):
)
# Config arguments
backend: Optional[str] = Field(
description="Backend to use",
default="exllamav2",
)
max_seq_len: Optional[int] = Field(
description="Leave this blank to use the model's base sequence length",
default=None,