Ettore Di Giacinto
|
ae533cadef
|
feat: move gpt4all to a grpc service
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
|
2023-07-15 01:19:43 +02:00 |
|
Ettore Di Giacinto
|
58f6aab637
|
feat: move llama to a grpc
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
|
2023-07-15 01:19:43 +02:00 |
|
Ettore Di Giacinto
|
b816009db0
|
feat: add falcon ggllm via grpc client
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
|
2023-07-15 01:19:43 +02:00 |
|
mudler
|
dcf35dd25f
|
Fixup custom role encoding
Signed-off-by: mudler <mudler@localai.io>
|
2023-07-09 11:13:19 +02:00 |
|
mudler
|
e70322676c
|
Allow to customize no action behavior
Signed-off-by: mudler <mudler@localai.io>
|
2023-07-09 10:53:46 +02:00 |
|
mudler
|
b3f43ab938
|
Add a way to disable default action
|
2023-07-09 10:02:21 +02:00 |
|
mudler
|
bbc4468908
|
Make functions more compatible with OpenAI specs
|
2023-07-09 10:02:09 +02:00 |
|
mudler
|
55befe396a
|
Add grammar_json to the request parameters to facilitate JSON generation
|
2023-07-06 19:08:04 +02:00 |
|
mudler
|
483fddccf9
|
minor fixups
|
2023-07-06 11:55:19 +02:00 |
|
mudler
|
05aed255db
|
Customize function call in templates
|
2023-07-05 18:24:44 +02:00 |
|
mudler
|
0f1326b2bd
|
fixups
|
2023-07-04 23:40:22 +02:00 |
|
mudler
|
b722e7eb7e
|
feat: cleanups, small enhancements
Signed-off-by: mudler <mudler@localai.io>
|
2023-07-04 18:58:19 +02:00 |
|
mudler
|
f09ddd2983
|
feat: add grammar and functions call support
|
2023-07-04 18:58:19 +02:00 |
|
Luis López
|
a6839fd238
|
feat: [whisper] Partial support for verbose_json format in transcribe endpoint (#721)
|
2023-07-04 14:31:31 +02:00 |
|
Ettore Di Giacinto
|
3593cb0c87
|
feat: update llama, enable NUMA (#684)
|
2023-06-27 09:00:10 +02:00 |
|
Ettore Di Giacinto
|
02136531a3
|
fix: return index and delta in stream token (#680)
Signed-off-by: mudler <mudler@localai.io>
|
2023-06-26 18:49:36 +02:00 |
|
Ettore Di Giacinto
|
d3a486a4f8
|
feat: Add '/version' endpoint and display it in the CLI (#679)
|
2023-06-26 15:12:43 +02:00 |
|
Ettore Di Giacinto
|
2b957df56c
|
fix: rename /models/list to /models/available (#678)
|
2023-06-26 15:12:26 +02:00 |
|
Ettore Di Giacinto
|
78f3c3da48
|
refactor: consolidate usage of GetURI (#674)
Signed-off-by: mudler <mudler@localai.io>
|
2023-06-26 12:25:38 +02:00 |
|
Ettore Di Giacinto
|
60db5957d3
|
Gallery repository (#663)
Signed-off-by: mudler <mudler@localai.io>
|
2023-06-24 08:18:17 +02:00 |
|
Ettore Di Giacinto
|
a7bb029d23
|
feat: add tts with go-piper (#649)
Signed-off-by: mudler <mudler@localai.io>
|
2023-06-22 17:53:10 +02:00 |
|
Ettore Di Giacinto
|
2f5feb4841
|
Add LowVRAM option parameter (#642)
|
2023-06-20 20:33:47 +02:00 |
|
Ettore Di Giacinto
|
295f3030a9
|
feat: add typical_p to model parameters (#598)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-06-14 19:33:20 +02:00 |
|
Ettore Di Giacinto
|
10ddd72b58
|
fix: set default batch size (#597)
|
2023-06-14 19:09:27 +02:00 |
|
Ettore Di Giacinto
|
e37361985c
|
deps: update gpt4all bindings, fix search path on new versions (#592)
|
2023-06-14 13:24:53 +02:00 |
|
Ettore Di Giacinto
|
84946e9275
|
feat: display download progress when installing models (#543)
|
2023-06-08 21:33:18 +02:00 |
|
Ettore Di Giacinto
|
c9bbba4872
|
tests: add llama tests with openllama (#538)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-06-08 00:36:11 +02:00 |
|
Ettore Di Giacinto
|
5abbb134d9
|
feat: extend model configuration for llama.cpp (#536)
|
2023-06-07 21:46:19 +02:00 |
|
Ettore Di Giacinto
|
d62aef2016
|
feat: add experimental support for falcon-7b (#516)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-06-06 17:23:19 +02:00 |
|
Ettore Di Giacinto
|
b503725dc7
|
fix: downgrade gpt4all (#503)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-06-05 09:42:50 +02:00 |
|
Samuel Maynard
|
96794851b3
|
feat: add support for Stream: true to completionEndpoint (#465)
|
2023-06-03 00:27:03 +02:00 |
|
Ettore Di Giacinto
|
78ad4813df
|
feat: Update gpt4all, support multiple implementations in runtime (#472)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-06-01 23:38:52 +02:00 |
|
Aisuko
|
c8a4a4f4e9
|
feat: Add new test cases for LoadConfigs (#447)
Signed-off-by: Aisuko <urakiny@gmail.com>
|
2023-06-01 16:20:45 +02:00 |
|
Pavel Zloi
|
3ba07a5928
|
feat: add LangChainGo Huggingface backend (#446)
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2023-06-01 12:00:06 +02:00 |
|
Aisuko
|
49ce24984c
|
feat: Add more test-cases and remove dev container (#433)
Signed-off-by: Aisuko <urakiny@gmail.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2023-05-30 13:01:55 +02:00 |
|
Ettore Di Giacinto
|
f401181cb5
|
fix: switch back to upstream for rwkv bindings (#432)
|
2023-05-30 12:35:32 +02:00 |
|
Ettore Di Giacinto
|
aacb96df7a
|
fix: correctly handle errors from App constructor (#430)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-30 12:00:30 +02:00 |
|
Ettore Di Giacinto
|
217dbb448e
|
feat: allow to set a prompt cache path and enable saving state (#395)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-27 14:29:11 +02:00 |
|
Ettore Di Giacinto
|
76c881043e
|
feat: allow to preload models before startup via env var or configs (#391)
|
2023-05-27 09:26:33 +02:00 |
|
Ettore Di Giacinto
|
bf54b78270
|
feat: add /healthz and /readyz endpoints for kubernetes (#374)
|
2023-05-24 22:19:13 +02:00 |
|
Ettore Di Giacinto
|
9decd0813c
|
feat: update go-gpt2 (#359)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-23 21:47:47 +02:00 |
|
Robert Hambrock
|
4aa78843c0
|
fix: spec compliant instantiation and termination of streams (#341)
|
2023-05-21 15:24:04 +02:00 |
|
Ettore Di Giacinto
|
6f54cab3f0
|
feat: allow to set cors (#339)
|
2023-05-21 14:38:25 +02:00 |
|
Ettore Di Giacinto
|
05a3d569b0
|
feat: allow to override model config (#323)
|
2023-05-20 17:03:53 +02:00 |
|
Ettore Di Giacinto
|
4e381cbe92
|
feat: support shorter urls for github repositories (#314)
|
2023-05-20 09:06:30 +02:00 |
|
Ettore Di Giacinto
|
1fade53a61
|
feat: minor enhancements to /models/apply (#297)
|
2023-05-19 08:31:11 +02:00 |
|
Ettore Di Giacinto
|
cc9aa9eb3f
|
feat: add /models/apply endpoint to prepare models (#286)
|
2023-05-18 15:59:03 +02:00 |
|
Ettore Di Giacinto
|
3f739575d8
|
Minor fixes (#285)
|
2023-05-17 21:01:46 +02:00 |
|
Ettore Di Giacinto
|
9d051c5d4f
|
feat: add image generation with ncnn-stablediffusion (#272)
|
2023-05-16 19:32:53 +02:00 |
|
Ettore Di Giacinto
|
acd03d15f2
|
feat: add support for cublas/openblas in the llama.cpp backend (#258)
|
2023-05-16 16:26:25 +02:00 |
|
Ettore Di Giacinto
|
a035de2fdd
|
tests: add rwkv (#261)
|
2023-05-15 08:15:01 +02:00 |
|
Ettore Di Giacinto
|
2488c445b6
|
feat: bert.cpp token embeddings (#241)
|
2023-05-12 17:16:49 +02:00 |
|
Ettore Di Giacinto
|
b4241d0a0d
|
tests: enable whisper (#239)
|
2023-05-12 14:10:18 +02:00 |
|
Ettore Di Giacinto
|
8250391e49
|
Add support for gptneox/replit (#238)
|
2023-05-12 11:36:35 +02:00 |
|
Ettore Di Giacinto
|
fd1df4e971
|
whisper: add tests and allow to set upload size (#237)
|
2023-05-12 10:04:20 +02:00 |
|
Ettore Di Giacinto
|
4413defca5
|
feat: add starcoder (#236)
|
2023-05-11 20:20:07 +02:00 |
|
Ettore Di Giacinto
|
85f0f8227d
|
refactor: drop code dups (#234)
|
2023-05-11 16:34:16 +02:00 |
|
Ettore Di Giacinto
|
59e3c02002
|
make use of new bindings for gpt4all (#232)
|
2023-05-11 14:31:19 +02:00 |
|
Matthew Campbell
|
032dee256f
|
Keep whisper models in memory (#233)
|
2023-05-11 14:05:07 +02:00 |
|
Matthew Campbell
|
6b5e2b2bf5
|
Upload transcription API wasn't reading the data from the post (#229)
|
2023-05-11 10:43:05 +02:00 |
|
Ettore Di Giacinto
|
11675932ac
|
feat: add dolly/redpajama/bloomz models support (#214)
|
2023-05-11 01:12:58 +02:00 |
|
Ettore Di Giacinto
|
f8ee20991c
|
feat: add bert.cpp embeddings (#222)
|
2023-05-10 15:20:21 +02:00 |
|
Ettore Di Giacinto
|
9f426578cf
|
feat: add transcript endpoint (#211)
|
2023-05-09 11:43:50 +02:00 |
|
Ettore Di Giacinto
|
89dfa0f5fc
|
feat: add experimental support for embeddings as arrays (#207)
|
2023-05-08 19:31:18 +02:00 |
|
Dave
|
07ec2e441d
|
mini fix - OpenAI documentation url (#200)
|
2023-05-06 00:42:08 +02:00 |
|
mudler
|
8c8cf38d4d
|
tests: use 1 core
|
2023-05-05 23:29:34 +02:00 |
|
mudler
|
009ee47fe2
|
Don't allow 0 as thread count
|
2023-05-05 22:51:20 +02:00 |
|
mudler
|
ec2adc2c03
|
tests: use 3 cores
|
2023-05-05 22:07:01 +02:00 |
|
mudler
|
e62ee2bc06
|
fix: remove trailing 0s from embeddings
This happens when no max_tokens are set, so by default go-llama
allocates more space for the slice and padding happens.
|
2023-05-05 18:35:03 +02:00 |
|
mudler
|
b49721cdd1
|
fix: respect config from file for backends settings
|
2023-05-05 18:05:10 +02:00 |
|
mudler
|
64c0a7967f
|
fix: pass prediction options when using the model
|
2023-05-05 15:56:02 +02:00 |
|
mudler
|
e96eadab40
|
feat: support deprecated embeddings API
|
2023-05-05 15:55:19 +02:00 |
|
mudler
|
e73283121b
|
feat: support arrays for prompt and input
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-05 15:54:59 +02:00 |
|
mudler
|
857d13e8d6
|
debug: wire up go-fiber debugger
|
2023-05-05 15:53:57 +02:00 |
|
Ettore Di Giacinto
|
961cf29217
|
feat: expose mirostat to config (#193)
|
2023-05-05 13:45:37 +02:00 |
|
Ettore Di Giacinto
|
c839b334eb
|
feat: add embeddings for go-llama.cpp backend (#190)
|
2023-05-05 11:20:06 +02:00 |
|
Ettore Di Giacinto
|
714bfcd45b
|
fix: missing returning error and free callback stream (#187)
|
2023-05-04 19:49:43 +02:00 |
|
Ettore Di Giacinto
|
fdf75c6d0e
|
rwkv fixes and examples (#185)
|
2023-05-04 17:32:23 +02:00 |
|
Ettore Di Giacinto
|
c974dad799
|
Return usage in the API responses (#166)
|
2023-05-03 17:29:18 +02:00 |
|
Ettore Di Giacinto
|
67992a7d99
|
feat: support slices or strings in the prompt completion endpoint (#162)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-03 13:13:31 +02:00 |
|
Ettore Di Giacinto
|
751b7eca62
|
feat: add rwkv support (#158)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-03 11:45:22 +02:00 |
|
Ettore Di Giacinto
|
1ae7150810
|
feat: allow to specify default backend for model (#156)
Signed-off-by: mudler <mudler@c3os.io>
|
2023-05-03 00:31:28 +02:00 |
|
Ettore Di Giacinto
|
70caf9bf8c
|
feat: support stopwords both string and arrays (#154)
|
2023-05-02 23:30:00 +02:00 |
|
Dave
|
0b226ac027
|
Stop parameter of OpenAIRequest changed to String Array (#153)
|
2023-05-02 22:02:45 +02:00 |
|
Ettore Di Giacinto
|
220d6fd59b
|
feat: add stream events (#152)
|
2023-05-02 20:03:35 +02:00 |
|
Ettore Di Giacinto
|
156e15a4fa
|
Bump llama.cpp, downgrade gpt4all-j (#149)
|
2023-05-02 16:07:18 +02:00 |
|
Ettore Di Giacinto
|
92452d46da
|
feat: add new gpt4all-j binding (#142)
|
2023-05-01 20:00:15 +02:00 |
|
Ettore Di Giacinto
|
52f4d993c1
|
feat: add /edit endpoint (#119)
|
2023-04-29 09:22:09 +02:00 |
|
Ettore Di Giacinto
|
c806eae0de
|
feat: config files and SSE (#83)
Signed-off-by: mudler <mudler@mocaccino.org>
Signed-off-by: Tyler Gillson <tyler.gillson@gmail.com>
Co-authored-by: Tyler Gillson <tyler.gillson@gmail.com>
|
2023-04-26 21:18:18 -07:00 |
|
Ettore Di Giacinto
|
12d83a4184
|
feat: Return OpenAI errors and update docs (#80)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-04-24 23:42:03 +02:00 |
|
Ettore Di Giacinto
|
1c872ec326
|
feat: add CI/tests (#58)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-04-22 00:44:52 +02:00 |
|
Ettore Di Giacinto
|
79791438fe
|
Use the first available model if not specified (#55)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-04-21 22:54:43 +02:00 |
|
Ettore Di Giacinto
|
5cba71de70
|
Add stopwords, debug mode, and other API enhancements (#54)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-04-21 19:46:59 +02:00 |
|
Ettore Di Giacinto
|
f816dfae65
|
Add support for stablelm (#48)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-04-21 00:06:55 +02:00 |
|
Ettore Di Giacinto
|
1c4fbaae20
|
Add support for cerebras (#45)
Signed-off-by: mudler <mudler@c3os.io>
|
2023-04-20 19:33:36 +02:00 |
|
Ettore Di Giacinto
|
d517a54e28
|
Major API enhancements (#44)
|
2023-04-20 18:33:02 +02:00 |
|
Ettore Di Giacinto
|
80f50e6ccd
|
Rename project to LocalAI (#35)
Signed-off-by: mudler <mudler@c3os.io>
|
2023-04-19 18:43:10 +02:00 |
|
Ettore Di Giacinto
|
7fec26f5d3
|
Enhancements (#34)
Signed-off-by: mudler <mudler@c3os.io>
|
2023-04-19 17:10:29 +02:00 |
|
Ettore Di Giacinto
|
0b330d90ad
|
feat: drop embedded webui (#27)
Signed-off-by: mudler <mudler@c3os.io>
|
2023-04-16 10:46:20 +02:00 |
|
Ettore Di Giacinto
|
63601fabd1
|
feat: drop default model and llama-specific API (#26)
Signed-off-by: mudler <mudler@c3os.io>
|
2023-04-16 10:40:50 +02:00 |
|