Model: Add vision loading support

Adds the ability to load vision parts of text + image models. Requires an explicit flag in config because there isn't a way to automatically determine whether the vision tower should be used. Signed-off-by: kingbri <bdashore3@proton.me>
2024-11-11 12:04:40 -05:00 · 2024-11-11 12:04:40 -05:00 · 69ac0eb8aa
commit 69ac0eb8aa
parent cc2516790d
5 changed files with 42 additions and 5 deletions
--- a/common/config_models.py
+++ b/common/config_models.py
@ -270,6 +270,12 @@ class ModelConfig(BaseConfigModel):
            "NOTE: Only works with chat completion message lists!"
        ),
    )
+    vision: Optional[bool] = Field(
+        False,
+        description=(
+            "Enables vision support if the model supports it. (default: False)"
+        ),
+    )
    num_experts_per_token: Optional[int] = Field(
        None,
        description=(