API: Update inline load

- Add a config flag - Migrate support to /v1/completions - Unify the load function Signed-off-by: kingbri <bdashore3@proton.me>
2024-09-03 23:37:28 -04:00 · 2024-09-03 23:37:28 -04:00 · 21f14d4318
commit 21f14d4318
parent dd30d6592a
3 changed files with 60 additions and 48 deletions
--- a/config_sample.yml
+++ b/config_sample.yml
@ -83,6 +83,9 @@ model:
  # Enable this if the program is looking for a specific OAI model
  #use_dummy_models: False

+  # Allow direct loading of models from a completion or chat completion request
+  inline_model_loading: False
+
  # An initial model to load. Make sure the model is located in the model directory!
  # A model can be loaded later via the API.
  # REQUIRED: This must be filled out to load a model on startup!