459 commits

Author SHA1 Message Date
Tom Jobbins
3c67fc0362 Allow groupsize 1024, needed for larger models eg 30B to lower VRAM usage (#1660) 2023-05-02 00:46:26 -03:00
oobabooga
a777c058af Precise prompts for instruct mode 2023-04-26 03:21:53 -03:00
oobabooga
f39c99fa14 Load more than one LoRA with --lora, fix a bug 2023-04-25 22:58:48 -03:00
oobabooga
b6af2e56a2 Add --character flag, add character to settings.json 2023-04-24 13:19:42 -03:00
oobabooga
caaa556159 Move extensions block definition to the bottom 2023-04-24 03:30:35 -03:00
oobabooga
b1ee674d75 Make interface state (mostly) persistent on page reload 2023-04-24 03:05:47 -03:00
oobabooga
47809e28aa Minor changes 2023-04-24 01:04:48 -03:00
Andy Salerno
654933c634 New universal API with streaming/blocking endpoints (#990)
Previous title: Add api_streaming extension and update api-example-stream to use it

* Merge with latest main

* Add parameter capturing encoder_repetition_penalty

* Change some defaults, minor fixes

* Add --api, --public-api flags

* remove unneeded/broken comment from blocking API startup. The comment is already correctly emitted in try_start_cloudflared by calling the lambda we pass in.

* Update on_start message for blocking_api, it should say 'non-streaming' and not 'streaming'

* Update the API examples

* Change a comment

* Update README

* Remove the gradio API

* Remove unused import

* Minor change

* Remove unused import

---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-23 15:52:43 -03:00
oobabooga
2dca8bb25e Sort imports 2023-04-21 17:20:59 -03:00
oobabooga
c238ba9532 Add a 'Count tokens' button 2023-04-21 17:18:34 -03:00
oobabooga
2d766d2e19 Improve notebook mode button sizes 2023-04-21 02:37:58 -03:00
oobabooga
b4af319fa2 Add a workaround for GALACTICA on some systems 2023-04-19 01:43:10 -03:00
oobabooga
61126f4674 Change the button styles 2023-04-19 00:56:24 -03:00
oobabooga
649e4017a5 Style improvements 2023-04-19 00:36:28 -03:00
oobabooga
c58c1d89bd Clean method to prevent gradio from phoning home 2023-04-18 03:56:20 -03:00
oobabooga
e1b80e6fe6 Comment the gradio patch 2023-04-18 01:57:59 -03:00
oobabooga
36f7c022f2 Rename a file 2023-04-18 01:38:33 -03:00
oobabooga
00186f76f4 Monkey patch gradio to prevent it from calling home 2023-04-18 01:13:16 -03:00
oobabooga
c3dc348d1c Don't show 'None' in the LoRA list 2023-04-17 13:52:23 -03:00
oobabooga
209fcd21d5 Reorganize Parameters tab 2023-04-17 00:33:22 -03:00
oobabooga
b937c9d8c2 Add skip_special_tokens checkbox for Dolly model (#1218) 2023-04-16 14:24:49 -03:00
oobabooga
a9c7ef4159 Exclude yaml files from model list 2023-04-16 12:47:30 -03:00
Mikel Bober-Irizar
16a3a5b039 Merge pull request from GHSA-hv5m-3rp9-xcpf
* Remove eval of API input

* Remove unnecessary eval/exec for security

* Use ast.literal_eval

* Use ast.literal_eval

---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-16 01:36:50 -03:00
oobabooga
ac189011cb Add "Save current settings for this model" button 2023-04-15 12:54:02 -03:00
oobabooga
b9dcba7762 Don't overwrite --gpu_memory on boot (#1237/#1235) 2023-04-15 11:59:31 -03:00
oobabooga
628f8e6168 Reorganize chat buttons 2023-04-14 23:17:15 -03:00
oobabooga
c4aa1a42b1 Fix chat history downloading 2023-04-14 19:38:30 -03:00
oobabooga
3a337cfded Use argparse defaults 2023-04-14 15:35:06 -03:00
oobabooga
43e01282b3 Don't override user initial wbits/groupsize 2023-04-14 15:24:03 -03:00
Alex "mcmonkey" Goodwin
64e3b44e0f initial multi-lora support (#1103)
---------

Co-authored-by: oobabooga <112222186+oobabooga@users.noreply.github.com>
2023-04-14 14:52:06 -03:00
oobabooga
ebb81eb176 Add Undo/Regenerate buttons to notebook mode 2023-04-14 14:34:56 -03:00
oobabooga
581f739b2f Reorganize 2023-04-14 11:38:26 -03:00
oobabooga
8e31f2bad4 Automatically set wbits/groupsize/instruct based on model name (#1167) 2023-04-14 11:07:28 -03:00
v0xie
9d66957207 Add --listen-host launch option (#1122) 2023-04-13 21:35:08 -03:00
oobabooga
c13e8651ad Suppress "TypedStorage is deprecated" warnings 2023-04-13 12:09:42 -03:00
oobabooga
17ce7c8671 Suppress annoying bitsandbytes welcome message 2023-04-13 12:04:39 -03:00
oobabooga
fbb448ce4f If only 1 model is available, load that model 2023-04-13 11:44:10 -03:00
oobabooga
5744b31593 Reorganize some buttons 2023-04-13 11:05:47 -03:00
Xan
6e19ae4b2f Fix gpt-j model type in UI (#1129) 2023-04-13 10:17:20 -03:00
oobabooga
ddbd237ec9 Better way to sort the models/loras 2023-04-12 22:56:32 -03:00
oobabooga
7dfbe54f42 Add --model-menu option 2023-04-12 21:24:26 -03:00
oobabooga
86c10c6f0c Add some labels 2023-04-12 18:39:21 -03:00
oobabooga
0baa50bcc4 Update a comment 2023-04-12 18:26:15 -03:00
oobabooga
5d1d0bd11f Add the GPU index to the label 2023-04-12 18:24:19 -03:00
oobabooga
13789fd200 Handle the no-GPU / multi-GPU cases 2023-04-12 18:21:14 -03:00
oobabooga
1566d8e344 Add model settings to the Models tab 2023-04-12 17:20:18 -03:00
oobabooga
80f4eabb2a Fix send_pictures extension 2023-04-12 10:27:06 -03:00
oobabooga
2289d3686f Update API example 2023-04-11 22:43:43 -03:00
oobabooga
f2be87235d Comment lines that were causing undefined behavior 2023-04-11 22:40:04 -03:00
oobabooga
8265d45db8 Add send dummy message/reply buttons
Useful for starting a new reply.
2023-04-11 22:21:41 -03:00