Commit graph

  • d25310e55d Requirements: Update Flash Attention 2 kingbri 2023-11-21 22:12:55 -05:00
  • 71b9a53336 API: Add temperature_last support kingbri 2023-11-21 21:20:59 -05:00
  • 3337fe6acc Warning if unsupported samplers are used turboderp 2023-11-21 18:35:22 +01:00
  • a54de11cf3 Add new samplers turboderp 2023-11-21 18:16:53 +01:00
  • c92ee24bb4 Tree: Add batch script kingbri 2023-11-20 01:48:06 -05:00
  • 2aa9c145be Auth: Fix an oops with headers kingbri 2023-11-20 00:04:10 -05:00
  • 39ea730be5 Auth: Allow admin keys to work with api key routes kingbri 2023-11-19 23:53:07 -05:00
  • 8ef730f016 Merge pull request #11 from veden/patch-1 turboderp 2023-11-20 04:23:34 +01:00
  • f960fac8ff Fix incorrect ratio calculation for draft model Veden 2023-11-19 13:12:53 -08:00
  • 4cddd0400c Model: Fix draft model loading kingbri 2023-11-19 02:04:02 -05:00
  • 698b0b1976 Update README kingbri 2023-11-19 01:19:31 -05:00
  • 581e1fc219 Sample config: Remove unused value kingbri 2023-11-19 01:16:03 -05:00
  • e0e93c103b Sample config: Uncomment all parameters kingbri 2023-11-19 01:10:57 -05:00
  • 63762654f0 Update README kingbri 2023-11-19 01:05:20 -05:00
  • e46676cb08 Merge pull request #9 from city-unit/main Brian Dashore 2023-11-19 00:53:24 -05:00
  • e4a8848445 Auth: Log API and admin key on startup kingbri 2023-11-19 00:52:39 -05:00
  • 31bc418795 Model: Add context in response output kingbri 2023-11-19 00:49:32 -05:00
  • 80c69939ae Remove unneeded stuffs city_unit 2023-11-19 00:34:54 -05:00
  • f47919b1d3 API: Add draft model support kingbri 2023-11-19 00:32:25 -05:00
  • 6b22dc0119 Rename, fschat support city_unit 2023-11-19 00:32:14 -05:00
  • 99cf0b6d7b Add basic docker support city_unit 2023-11-19 00:01:17 -05:00
  • 6b9af58cc1 Tree: Fix extraneous bugs and update T/s print kingbri 2023-11-18 22:34:40 -05:00
  • a51889bdb8 Requirements: Update Flash Attention kingbri 2023-11-18 21:59:34 -05:00
  • b2410a0436 Merge pull request #4 from waldfee/config_samples Brian Dashore 2023-11-18 13:16:23 -05:00
  • 27ebec3b35 Model: Add speculative decoding support via config kingbri 2023-11-18 01:38:54 -05:00
  • 2ad79cb9ea Model: Add tokens in responses kingbri 2023-11-17 23:33:48 -05:00
  • 7f18ea1d7c Tree: Remove SillyTavern shim docs kingbri 2023-11-17 22:03:46 -05:00
  • 6f2078cbe4 Update README kingbri 2023-11-17 22:02:21 -05:00
  • d627d14385 API: Fix exceptions and defaults kingbri 2023-11-17 17:56:05 -05:00
  • 78a6587b95 add cache_mode and draft_model_dir to config_sample.yml waldfee 2023-11-17 22:08:31 +01:00
  • 4669e49ff0 API: Fix errors with token endpoint kingbri 2023-11-17 01:39:06 -05:00
  • 9dfa580b1e Model: Add tokens/second output kingbri 2023-11-17 01:16:20 -05:00
  • 021981fce0 API: Re-add depends endpoints kingbri 2023-11-17 00:47:51 -05:00
  • ac4e9c2277 API: Add CORS support kingbri 2023-11-16 22:19:32 -05:00
  • 08a183540b Config: Add warning on exceptions and clarify parameters kingbri 2023-11-16 21:48:08 -05:00
  • feef782dbf Update requirements.txt to include uvicorn Splice86 2023-11-16 22:50:27 +00:00
  • d5374c2c1f Create LICENSE Brian Dashore 2023-11-16 17:43:23 -05:00
  • 2cf93c092b Add SillyTavern instructions kingbri 2023-11-16 17:33:23 -05:00
  • b20e71dcd4 Requirements: Add Flash Attention 2 wheels kingbri 2023-11-16 17:25:00 -05:00
  • d5551352bf Model: Fix parsing of stop conditions kingbri 2023-11-16 17:15:33 -05:00
  • 282b5b2931 API: Fix responses and some params kingbri 2023-11-16 17:11:55 -05:00
  • d8d61fa19b API: Add fallback if model isn't loaded kingbri 2023-11-16 12:20:35 -05:00
  • c0525c042e Update README kingbri 2023-11-16 12:03:51 -05:00
  • 60eb076b43 Tree: Basic formatting and comments kingbri 2023-11-16 11:48:30 -05:00
  • 5defb1b0b4 Config: Fix errors when stuff doesn't exist kingbri 2023-11-16 11:41:03 -05:00
  • 03f45cb0a3 Tree: Update documentation and configs kingbri 2023-11-16 02:30:33 -05:00
  • 2248705c4a Requirements: Don't force fastchat installation kingbri 2023-11-16 01:26:46 -05:00
  • 5e8419ec0c OAI: Add chat completions endpoint kingbri 2023-11-16 01:06:07 -05:00
  • 593471a04d Auth: Fix init from YAML dict kingbri 2023-11-15 23:00:12 -05:00
  • 1f444c8fb7 Requirements: Add fastchat and override pydantic kingbri 2023-11-15 01:00:08 -05:00
  • bbb59d0747 Auth: Fix methods for writing and validation kingbri 2023-11-15 00:19:15 -05:00
  • cb8da7f092 Chore: Remove mistakenly committed file kingbri 2023-11-15 00:00:22 -05:00
  • d0b6b11068 OAI: Make freq and presence pen floats kingbri 2023-11-14 23:44:22 -05:00
  • 126afdfdc2 Model: Fix gpu split params kingbri 2023-11-14 23:20:14 -05:00
  • ea91d17a11 Api: Add ban_eos_token and add_bos_token support kingbri 2023-11-14 23:05:47 -05:00
  • 8fea5391a8 Api: Add token endpoints kingbri 2023-11-14 22:43:37 -05:00
  • 2d741653c3 Update .gitignore kingbri 2023-11-14 22:40:44 -05:00
  • fc14046318 Updated readme Splice86 2023-11-14 21:17:03 -06:00
  • 4fd7da8fb6 Updated readme Splice86 2023-11-14 21:16:24 -06:00
  • a0cf65e88f Updated readme Splice86 2023-11-14 21:13:36 -06:00
  • 4670a77c26 API: Don't use response_class kingbri 2023-11-14 21:56:15 -05:00
  • b625bface9 OAI: Add API-based model loading/unloading and auth routes kingbri 2023-11-14 01:17:19 -05:00
  • 47343e2f1a OAI: Add models support kingbri 2023-11-13 21:38:34 -05:00
  • eee8b642bd OAI: Implement completion API endpoint kingbri 2023-11-13 18:24:12 -05:00
  • 4fa4386275 Add new samplers turboderp 2023-11-12 08:12:08 +01:00
  • a10c14d357 Config: Switch to YAML and add load progress kingbri 2023-11-12 00:21:16 -05:00
  • 5d32aa02cd Tree: Update to use ModelContainer and args kingbri 2023-11-10 23:19:54 -05:00
  • 9d34479e3e Model container with generator logic, initial turboderp 2023-11-11 02:53:00 +01:00
  • d2480bae28 Test turboderp 2023-11-10 23:57:41 +01:00
  • 5de2a4005f Test turboderp 2023-11-10 23:57:12 +01:00
  • ef099cb55a Chore: Add gitignore and remove ignored files kingbri 2023-11-10 15:58:51 -05:00
  • 8e2671a265 Update to README and other minor changes Splice86 2023-11-10 01:37:24 -06:00
  • ab84b01fdf Updated readme Splice86 2023-11-10 00:39:08 -06:00
  • ca992f483f Update README.md david 2023-11-09 23:45:21 -06:00
  • f844b1ee91 Update README.md david 2023-11-09 23:44:28 -06:00
  • b967e2e604 Initial david 2023-11-09 21:27:45 -06:00