OAI: Add API-based model loading/unloading and auth routes
Models can now be loaded and unloaded through the API. Add authentication for both regular API use and administrator tasks; each uses its own key. Also fix the unload function so that it frees all of the VRAM it used.

Signed-off-by: kingbri <bdashore3@proton.me>
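The two-key scheme described in the commit message could be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the `AuthKeys` class, its field names, and the rule that the admin key is also accepted on regular routes are all assumptions made for the example.

```python
import hmac
import secrets
from dataclasses import dataclass, field


@dataclass
class AuthKeys:
    """Hypothetical key pair: one key for regular API use, one for admin tasks."""

    api_key: str = field(default_factory=lambda: secrets.token_hex(16))
    admin_key: str = field(default_factory=lambda: secrets.token_hex(16))

    def verify(self, candidate: str, admin_required: bool = False) -> bool:
        # Compare as bytes in constant time to avoid leaking key contents via timing.
        given = candidate.encode("utf-8")
        # Assumption: the admin key is accepted on every route.
        if hmac.compare_digest(given, self.admin_key.encode("utf-8")):
            return True
        # Admin-only routes (for example model load/unload) reject the regular API key.
        if admin_required:
            return False
        return hmac.compare_digest(given, self.api_key.encode("utf-8"))
```

Under these assumptions, a route that loads or unloads a model would call `verify(key, admin_required=True)`, while a completion route would call `verify(key)`.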
parent 47343e2f1a
commit b625bface9
11 changed files with 195 additions and 55 deletions
@@ -1,7 +1,7 @@
 import pathlib
-from OAI.models.completions import CompletionResponse, CompletionRespChoice
-from OAI.models.common import UsageStats
-from OAI.models.models import ModelList, ModelCard
+from OAI.types.completions import CompletionResponse, CompletionRespChoice
+from OAI.types.common import UsageStats
+from OAI.types.models import ModelList, ModelCard
 from typing import Optional
 
 def create_completion_response(text: str, index: int, model_name: Optional[str]):