Model: Fix no flash attention
Was being read with the wrong key from the config.

Signed-off-by: kingbri <bdashore3@proton.me>
parent ad8807a830
commit 95fd0f075e

1 changed file with 1 addition and 1 deletion
model.py

@@ -94,7 +94,7 @@ class ModelContainer:
         )

         # Turn off flash attention?
-        self.config.no_flash_attn = unwrap(kwargs.get("no_flash_attn"), False)
+        self.config.no_flash_attn = unwrap(kwargs.get("no_flash_attention"), False)

         # low_mem is currently broken in exllamav2. Don't use it until it's fixed.
         """
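For context, a minimal sketch of why the key name matters, assuming unwrap(value, default) is a fallback helper that returns value unless it is None (its actual definition is not shown in this diff). Because dict.get returns None for a missing key, reading the wrong key made the flag silently fall back to False no matter what the config passed in.

    from typing import Optional, TypeVar

    T = TypeVar("T")

    def unwrap(value: Optional[T], default: T) -> T:
        # Assumed helper: fall back to the default only when value is None.
        return value if value is not None else default

    # Hypothetical kwargs, as the config would pass them:
    kwargs = {"no_flash_attention": True}

    # Before the fix: wrong key, .get() yields None, flag is always False.
    print(unwrap(kwargs.get("no_flash_attn"), False))       # False

    # After the fix: the correct key is read, so the configured value is kept.
    print(unwrap(kwargs.get("no_flash_attention"), False))  # True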