Commit graph

1063 commits

Author SHA1 Message Date
kingbri
eee8b642bd OAI: Implement completion API endpoint
Add support for /v1/completions with the option to use streaming
if needed. Also rewrite API endpoints to use async when possible
since that improves request performance.

Model container parameter names also needed rewrites as well and
set fallback cases to their disabled values.

Signed-off-by: kingbri <bdashore3@proton.me>
2023-11-13 18:31:26 -05:00
turboderp
4fa4386275 Add new samplers 2023-11-12 08:12:08 +01:00
kingbri
a10c14d357 Config: Switch to YAML and add load progress
YAML is a more flexible format when it comes to configuration. Commandline
arguments are difficult to remember and configure especially for
an API with complicated commandline names. Rather than using half-baked
textfiles, implement a proper config solution.

Also add a progress bar when loading models in the commandline.

Signed-off-by: kingbri <bdashore3@proton.me>
2023-11-12 00:21:16 -05:00
kingbri
5d32aa02cd Tree: Update to use ModelContainer and args
Use command-line arguments to load an initial model if necessary.
API routes are broken, but we should be using the container from
now on as a primary interface with the exllama2 library.

Also these args should be turned into a YAML configuration file in
the future.

Signed-off-by: kingbri <bdashore3@proton.me>
2023-11-10 23:19:54 -05:00
turboderp
9d34479e3e Model container with generator logic, initial 2023-11-11 02:53:00 +01:00
turboderp
d2480bae28 Test 2023-11-10 23:57:41 +01:00
turboderp
5de2a4005f Test 2023-11-10 23:57:12 +01:00
kingbri
ef099cb55a Chore: Add gitignore and remove ignored files
Ignore any IDE specific configurations and extra files that can break
git indexing.

Signed-off-by: kingbri <bdashore3@proton.me>
2023-11-10 15:58:51 -05:00
Splice86
8e2671a265 Update to README and other minor changes 2023-11-10 01:37:24 -06:00
Splice86
ab84b01fdf
Updated readme 2023-11-10 00:39:08 -06:00
david
ca992f483f Update README.md 2023-11-09 23:45:21 -06:00
david
f844b1ee91 Update README.md
Updated readme
2023-11-09 23:44:28 -06:00
david
b967e2e604 Initial 2023-11-09 21:27:45 -06:00