Ettore Di Giacinto
|
10ddd72b58
|
fix: set default batch size (#597)
|
2023-06-14 19:09:27 +02:00 |
|
Ettore Di Giacinto
|
e37361985c
|
deps: update gpt4all bindings, fix search path on new versions (#592)
|
2023-06-14 13:24:53 +02:00 |
|
Ettore Di Giacinto
|
5abbb134d9
|
feat: extend model configuration for llama.cpp (#536)
|
2023-06-07 21:46:19 +02:00 |
|
Ettore Di Giacinto
|
d62aef2016
|
feat: add experimental support for falcon-7b (#516)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-06-06 17:23:19 +02:00 |
|
Pavel Zloi
|
3ba07a5928
|
feat: add LangChainGo Huggingface backend (#446)
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2023-06-01 12:00:06 +02:00 |
|
Ettore Di Giacinto
|
217dbb448e
|
feat: allow to set a prompt cache path and enable saving state (#395)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-27 14:29:11 +02:00 |
|
Ettore Di Giacinto
|
9decd0813c
|
feat: update go-gpt2 (#359)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-23 21:47:47 +02:00 |
|
Ettore Di Giacinto
|
9d051c5d4f
|
feat: add image generation with ncnn-stablediffusion (#272)
|
2023-05-16 19:32:53 +02:00 |
|
Ettore Di Giacinto
|
acd03d15f2
|
feat: add support for cublas/openblas in the llama.cpp backend (#258)
|
2023-05-16 16:26:25 +02:00 |
|
Ettore Di Giacinto
|
2488c445b6
|
feat: bert.cpp token embeddings (#241)
|
2023-05-12 17:16:49 +02:00 |
|
Ettore Di Giacinto
|
8250391e49
|
Add support for gptneox/replit (#238)
|
2023-05-12 11:36:35 +02:00 |
|
Ettore Di Giacinto
|
4413defca5
|
feat: add starcoder (#236)
|
2023-05-11 20:20:07 +02:00 |
|
Ettore Di Giacinto
|
59e3c02002
|
make use of new bindings for gpt4all (#232)
|
2023-05-11 14:31:19 +02:00 |
|
Ettore Di Giacinto
|
11675932ac
|
feat: add dolly/redpajama/bloomz models support (#214)
|
2023-05-11 01:12:58 +02:00 |
|
Ettore Di Giacinto
|
f8ee20991c
|
feat: add bert.cpp embeddings (#222)
|
2023-05-10 15:20:21 +02:00 |
|
Ettore Di Giacinto
|
89dfa0f5fc
|
feat: add experimental support for embeddings as arrays (#207)
|
2023-05-08 19:31:18 +02:00 |
|
mudler
|
e62ee2bc06
|
fix: remove trailing 0s from embeddings
This happens when no max_tokens are set, so by default go-llama
allocates more space for the slice and padding happens.
|
2023-05-05 18:35:03 +02:00 |
|
mudler
|
64c0a7967f
|
fix: pass prediction options when using the model
|
2023-05-05 15:56:02 +02:00 |
|
mudler
|
e73283121b
|
feat: support arrays for prompt and input
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-05 15:54:59 +02:00 |
|
Ettore Di Giacinto
|
961cf29217
|
feat: expose mirostat to config (#193)
|
2023-05-05 13:45:37 +02:00 |
|
Ettore Di Giacinto
|
c839b334eb
|
feat: add embeddings for go-llama.cpp backend (#190)
|
2023-05-05 11:20:06 +02:00 |
|
Ettore Di Giacinto
|
714bfcd45b
|
fix: missing returning error and free callback stream (#187)
|
2023-05-04 19:49:43 +02:00 |
|
Ettore Di Giacinto
|
fdf75c6d0e
|
rwkv fixes and examples (#185)
|
2023-05-04 17:32:23 +02:00 |
|
Ettore Di Giacinto
|
751b7eca62
|
feat: add rwkv support (#158)
Signed-off-by: mudler <mudler@mocaccino.org>
|
2023-05-03 11:45:22 +02:00 |
|
Ettore Di Giacinto
|
1ae7150810
|
feat: allow to specify default backend for model (#156)
Signed-off-by: mudler <mudler@c3os.io>
|
2023-05-03 00:31:28 +02:00 |
|
Ettore Di Giacinto
|
220d6fd59b
|
feat: add stream events (#152)
|
2023-05-02 20:03:35 +02:00 |
|
Ettore Di Giacinto
|
156e15a4fa
|
Bump llama.cpp, downgrade gpt4all-j (#149)
|
2023-05-02 16:07:18 +02:00 |
|
Ettore Di Giacinto
|
92452d46da
|
feat: add new gpt4all-j binding (#142)
|
2023-05-01 20:00:15 +02:00 |
|
Ettore Di Giacinto
|
52f4d993c1
|
feat: add /edit endpoint (#119)
|
2023-04-29 09:22:09 +02:00 |
|
Ettore Di Giacinto
|
c806eae0de
|
feat: config files and SSE (#83)
Signed-off-by: mudler <mudler@mocaccino.org>
Signed-off-by: Tyler Gillson <tyler.gillson@gmail.com>
Co-authored-by: Tyler Gillson <tyler.gillson@gmail.com>
|
2023-04-26 21:18:18 -07:00 |
|