Commit graph

137 commits

Author SHA1 Message Date
oobabooga
9331ab4798
Read GGUF metadata (#3873) 2023-09-11 18:49:30 -03:00
oobabooga
cec8db52e5
Add max_tokens_second param (#3533) 2023-08-29 17:44:31 -03:00
oobabooga
c237ce607e
Move characters/instruction-following to instruction-templates 2023-08-06 17:50:32 -07:00
oobabooga
0af10ab49b
Add Classifier Free Guidance (CFG) for Transformers/ExLlama (#3325) 2023-08-06 17:22:48 -03:00
matatonic
8f98268252
extensions/openai: include content-length for json replies (#3416) 2023-08-03 16:10:49 -03:00
oobabooga
e931844fe2
Add auto_max_new_tokens parameter (#3419) 2023-08-02 14:52:20 -03:00
matatonic
9ae0eab989
extensions/openai: +Array input (batched) , +Fixes (#3309) 2023-08-01 22:26:00 -03:00
matatonic
90a4ab631c
extensions/openai: Fixes for: embeddings, tokens, better errors. +Docs update, +Images, +logit_bias/logprobs, +more. (#3122) 2023-07-24 11:28:12 -03:00
oobabooga
e202190c4f
lint 2023-07-12 11:33:25 -07:00
matatonic
3e7feb699c
extensions/openai: Major openai extension updates & fixes (#3049)
* many openai updates
* total reorg & cleanup.
* fixups
* missing import os for images
* +moderations, custom_stopping_strings, more fixes
* fix bugs in completion streaming
* moderation fix (flagged)
* updated moderation categories

---------

Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>
2023-07-11 18:50:08 -03:00
oobabooga
3443219cbc
Add repetition penalty range parameter to transformers (#2916) 2023-06-29 13:40:13 -03:00
oobabooga
c52290de50
ExLlama with long context (#2875) 2023-06-25 22:49:26 -03:00
matatonic
b45baeea41
extensions/openai: Major docs update, fix #2852 (critical bug), minor improvements (#2849) 2023-06-24 22:50:04 -03:00
matatonic
1e97aaac95
extensions/openai: docs update, model loader, minor fixes (#2557) 2023-06-17 19:15:24 -03:00
matatonic
4a17a5db67
[extensions/openai] various fixes (#2533) 2023-06-06 01:43:04 -03:00
matatonic
a6d3f010a5
extensions/openai: include all available models in Model.list (#2368)
Co-authored-by: Matthew Ashton <mashton-gitlab@zhero.org>
2023-05-30 22:13:37 -03:00
matatonic
e5b756ecfe
Fixes #2331, IndexError: string index out of range (#2383) 2023-05-30 22:07:40 -03:00
matatonic
ebcadc0042
extensions/openai: cross_origin + chunked_response (updated fix) (#2423) 2023-05-30 21:54:24 -03:00
matatonic
df50f077db
fixup missing tfs top_a params, defaults reorg (#2443) 2023-05-30 21:52:33 -03:00
matatonic
2b7ba9586f
Fixes #2326, KeyError: 'assistant' (#2382) 2023-05-29 14:19:57 -03:00
matatonic
9714072692
[extensions/openai] use instruction templates with chat_completions (#2291) 2023-05-23 19:58:41 -03:00
oobabooga
c0fd7f3257
Add mirostat parameters for llama.cpp (#2287) 2023-05-22 19:37:24 -03:00
oobabooga
8ac3636966
Add epsilon_cutoff/eta_cutoff parameters (#2258) 2023-05-21 15:11:57 -03:00
matatonic
78b2478d9c
assistant: space fix, system: prompt fix (#2219) 2023-05-20 23:32:34 -03:00
matatonic
ab08cf6465
[extensions/openai] clip extra leading space (#2042) 2023-05-14 12:57:52 -03:00
oobabooga
c746a5bd00
Add .rstrip(' ') to openai api 2023-05-12 14:40:48 -03:00
matatonic
f98fd01dcd
is_chat=False for /edits (#2011) 2023-05-11 19:15:11 -03:00
oobabooga
0d36c18f5d
Always return only the new tokens in generation functions 2023-05-11 17:07:20 -03:00
matatonic
c4f0e6d740
is_chat changes fix for openai extension (#2008) 2023-05-11 16:32:25 -03:00
matatonic
309b72e549
[extension/openai] add edits & image endpoints & fix prompt return in non --chat modes (#1935) 2023-05-11 11:06:39 -03:00
oobabooga
3913155c1f
Style improvements (#1957) 2023-05-09 22:49:39 -03:00
Jeffrey Lin
791a38bad1
[extensions/openai] Support undocumented base64 'encoding_format' param for compatibility with official OpenAI client (#1876) 2023-05-08 22:31:34 -03:00
oobabooga
8aafb1f796
Refactor text_generation.py, add support for custom generation functions (#1817) 2023-05-05 18:53:03 -03:00
Thireus ☠
4883e20fa7
Fix openai extension script.py - TypeError: '_Environ' object is not callable (#1753) 2023-05-03 09:51:49 -03:00
oobabooga
c31b0f15a7
Remove some spaces 2023-05-02 23:07:07 -03:00
oobabooga
320fcfde4e
Style/pep8 improvements 2023-05-02 23:05:38 -03:00
matatonic
7ac41b87df
add openai compatible api (#1475) 2023-05-02 22:49:53 -03:00