Commit Graph

30 Commits

Author SHA1 Message Date
Ettore Di Giacinto
5abbb134d9
feat: extend model configuration for llama.cpp (#536) 2023-06-07 21:46:19 +02:00
Samuel Maynard
96794851b3
feat: add support for Stream: true to completionEndpoint (#465) 2023-06-03 00:27:03 +02:00
Robert Hambrock
4aa78843c0
fix: spec compliant instantiation and termination of streams (#341) 2023-05-21 15:24:04 +02:00
Ettore Di Giacinto
6f54cab3f0
feat: allow to set cors (#339) 2023-05-21 14:38:25 +02:00
Ettore Di Giacinto
cc9aa9eb3f
feat: add /models/apply endpoint to prepare models (#286) 2023-05-18 15:59:03 +02:00
Ettore Di Giacinto
3f739575d8
Minor fixes (#285) 2023-05-17 21:01:46 +02:00
Ettore Di Giacinto
9d051c5d4f
feat: add image generation with ncnn-stablediffusion (#272) 2023-05-16 19:32:53 +02:00
Ettore Di Giacinto
fd1df4e971
whisper: add tests and allow to set upload size (#237) 2023-05-12 10:04:20 +02:00
Ettore Di Giacinto
85f0f8227d
refactor: drop code dups (#234) 2023-05-11 16:34:16 +02:00
Matthew Campbell
032dee256f
Keep whisper models in memory (#233) 2023-05-11 14:05:07 +02:00
Matthew Campbell
6b5e2b2bf5
Upload transcription API wasn't reading the data from the post (#229) 2023-05-11 10:43:05 +02:00
Ettore Di Giacinto
9f426578cf
feat: add transcript endpoint (#211) 2023-05-09 11:43:50 +02:00
Ettore Di Giacinto
89dfa0f5fc
feat: add experimental support for embeddings as arrays (#207) 2023-05-08 19:31:18 +02:00
Dave
07ec2e441d
mini fix - OpenAI documentation url (#200) 2023-05-06 00:42:08 +02:00
mudler
009ee47fe2
Don't allow 0 as thread count 2023-05-05 22:51:20 +02:00
mudler
b49721cdd1
fix: respect config from file for backends settings 2023-05-05 18:05:10 +02:00
mudler
e96eadab40
feat: support deprecated embeddings API 2023-05-05 15:55:19 +02:00
mudler
e73283121b
feat: support arrays for prompt and input
Signed-off-by: mudler <mudler@mocaccino.org>
2023-05-05 15:54:59 +02:00
Ettore Di Giacinto
961cf29217
feat: expose mirostat to config (#193) 2023-05-05 13:45:37 +02:00
Ettore Di Giacinto
c839b334eb
feat: add embeddings for go-llama.cpp backend (#190) 2023-05-05 11:20:06 +02:00
Ettore Di Giacinto
714bfcd45b
fix: missing returning error and free callback stream (#187) 2023-05-04 19:49:43 +02:00
Ettore Di Giacinto
fdf75c6d0e
rwkv fixes and examples (#185) 2023-05-04 17:32:23 +02:00
Ettore Di Giacinto
c974dad799
Return usage in the API responses (#166) 2023-05-03 17:29:18 +02:00
Ettore Di Giacinto
67992a7d99
feat: support slices or strings in the prompt completion endpoint (#162)
Signed-off-by: mudler <mudler@mocaccino.org>
2023-05-03 13:13:31 +02:00
Ettore Di Giacinto
1ae7150810
feat: allow to specify default backend for model (#156)
Signed-off-by: mudler <mudler@c3os.io>
2023-05-03 00:31:28 +02:00
Ettore Di Giacinto
70caf9bf8c
feat: support stopwords both string and arrays (#154) 2023-05-02 23:30:00 +02:00
Dave
0b226ac027
Stop parameter of OpenAIRequest changed to String Array (#153) 2023-05-02 22:02:45 +02:00
Ettore Di Giacinto
220d6fd59b
feat: add stream events (#152) 2023-05-02 20:03:35 +02:00
Ettore Di Giacinto
52f4d993c1
feat: add /edit endpoint (#119) 2023-04-29 09:22:09 +02:00
Ettore Di Giacinto
c806eae0de
feat: config files and SSE (#83)
Signed-off-by: mudler <mudler@mocaccino.org>
Signed-off-by: Tyler Gillson <tyler.gillson@gmail.com>
Co-authored-by: Tyler Gillson <tyler.gillson@gmail.com>
2023-04-26 21:18:18 -07:00