LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

Author	SHA1	Message	Date
Dave	c6bf67f446	feat(llama2): add template for chat messages (#782) Co-authored-by: Aman Karmani <aman@tmm1.net> Lays some of the groundwork for LLAMA2 compatibility as well as other future models with complex prompting schemes. Started a small refactoring in pkg/model/loader.go regarding template loading. It is currently still a part of ModelLoader, but it should be easy to add template loading for situations other than overall prompt templates and the new chat-specific per-message templates. Adds support for new chat-endpoint-specific, per-message templates as an alternative to the existing Role: XYZ sprintf method. Includes a temporary prompt template as an example, since I have a few questions before we merge in the model-gallery side changes (see ). Minor debug logging changes.	2023-07-22 11:31:39 -04:00
Ettore Di Giacinto	c71c729bc2	debug	2023-07-21 10:53:26 +02:00
Ettore Di Giacinto	94916749c5	feat: add external grpc and model autoloading	2023-07-20 22:10:12 +02:00
Ettore Di Giacinto	47cc95fc9f	feat: add all backends to autoload Now that gRPC backends no longer crash the main thread, we can just greedily attempt all the backends we have available. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-20 00:40:28 +02:00
Ettore Di Giacinto	3feb632eb4	refactor: rename "llama-master" and "llama" (#776) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-20 00:36:16 +02:00
Ettore Di Giacinto	236497e331	feat: resolve JSONSchema refs (planners) (#774)	2023-07-19 22:56:13 +02:00
Ettore Di Giacinto	6352448b72	feat: add llama-master backend (#752) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-17 23:58:15 +02:00
Ettore Di Giacinto	1d0ed95a54	feat: move other backends to grpc This finally makes everything more consistent. Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00
Ettore Di Giacinto	5dcfdbe51d	feat: various refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00
Ettore Di Giacinto	f2f1d7fe72	feat: use gRPC for transformers Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00
Ettore Di Giacinto	ae533cadef	feat: move gpt4all to a grpc service Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00
Ettore Di Giacinto	58f6aab637	feat: move llama to a grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00
Ettore Di Giacinto	b816009db0	feat: add falcon ggllm via grpc client Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00
mudler	55befe396a	Add grammar_json to the request parameters to facilitate JSON generation	2023-07-06 19:08:04 +02:00
mudler	c0578031b5	Add tests Signed-off-by: mudler <mudler@localai.io>	2023-07-04 18:58:19 +02:00
mudler	b722e7eb7e	feat: cleanups, small enhancements Signed-off-by: mudler <mudler@localai.io>	2023-07-04 18:58:19 +02:00
mudler	f09ddd2983	feat: add grammar and functions call support	2023-07-04 18:58:19 +02:00
Luis López	a6839fd238	feat: [whisper] Partial support for verbose_json format in transcribe endpoint (#721)	2023-07-04 14:31:31 +02:00
Ettore Di Giacinto	bf5acf646e	fix: adapt whisper to bindings updates (#702) Signed-off-by: mudler <mudler@localai.io>	2023-06-29 11:26:07 +02:00
Ettore Di Giacinto	78f3c3da48	refactor: consolidate usage of GetURI (#674) Signed-off-by: mudler <mudler@localai.io>	2023-06-26 12:25:38 +02:00
mudler	d18f85df46	fix: add tags Signed-off-by: mudler <mudler@localai.io>	2023-06-25 23:03:58 +02:00
Ettore Di Giacinto	6213da330a	fix: add omitempty where needed (#671)	2023-06-25 22:51:02 +02:00
Ettore Di Giacinto	60db5957d3	Gallery repository (#663) Signed-off-by: mudler <mudler@localai.io>	2023-06-24 08:18:17 +02:00
Ettore Di Giacinto	a7bb029d23	feat: add tts with go-piper (#649) Signed-off-by: mudler <mudler@localai.io>	2023-06-22 17:53:10 +02:00
Ettore Di Giacinto	e37361985c	deps: update gpt4all bindings, fix search path on new versions (#592)	2023-06-14 13:24:53 +02:00
Ettore Di Giacinto	84946e9275	feat: display download progress when installing models (#543)	2023-06-08 21:33:18 +02:00
Ettore Di Giacinto	d62aef2016	feat: add experimental support for falcon-7b (#516) Signed-off-by: mudler <mudler@mocaccino.org>	2023-06-06 17:23:19 +02:00
Ettore Di Giacinto	b447a2a719	feat: support upscaled image generation with esrgan (#509)	2023-06-05 17:21:38 +02:00
Ettore Di Giacinto	78ad4813df	feat: Update gpt4all, support multiple implementations in runtime (#472) Signed-off-by: mudler <mudler@mocaccino.org>	2023-06-01 23:38:52 +02:00
Pavel Zloi	3ba07a5928	feat: add LangChainGo Huggingface backend (#446) Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-06-01 12:00:06 +02:00
Ettore Di Giacinto	9decd0813c	feat: update go-gpt2 (#359) Signed-off-by: mudler <mudler@mocaccino.org>	2023-05-23 21:47:47 +02:00
Ettore Di Giacinto	05a3d569b0	feat: allow to override model config (#323)	2023-05-20 17:03:53 +02:00
Ettore Di Giacinto	1fade53a61	feat: minor enhancements to /models/apply (#297)	2023-05-19 08:31:11 +02:00
Ettore Di Giacinto	cc9aa9eb3f	feat: add /models/apply endpoint to prepare models (#286)	2023-05-18 15:59:03 +02:00
Ettore Di Giacinto	9d051c5d4f	feat: add image generation with ncnn-stablediffusion (#272)	2023-05-16 19:32:53 +02:00
Ettore Di Giacinto	2a9d7474ce	fix(rwkv): load tokenizer file from model path (#255)	2023-05-14 17:49:10 +02:00
Ettore Di Giacinto	8250391e49	Add support for gptneox/replit (#238)	2023-05-12 11:36:35 +02:00
Ettore Di Giacinto	fd1df4e971	whisper: add tests and allow to set upload size (#237)	2023-05-12 10:04:20 +02:00
Ettore Di Giacinto	4413defca5	feat: add starcoder (#236)	2023-05-11 20:20:07 +02:00
Ettore Di Giacinto	85f0f8227d	refactor: drop code dups (#234)	2023-05-11 16:34:16 +02:00
Ettore Di Giacinto	59e3c02002	make use of new bindings for gpt4all (#232)	2023-05-11 14:31:19 +02:00
Matthew Campbell	032dee256f	Keep whisper models in memory (#233)	2023-05-11 14:05:07 +02:00
Ettore Di Giacinto	11675932ac	feat: add dolly/redpajama/bloomz models support (#214)	2023-05-11 01:12:58 +02:00
Ettore Di Giacinto	f8ee20991c	feat: add bert.cpp embeddings (#222)	2023-05-10 15:20:21 +02:00
Ettore Di Giacinto	9f426578cf	feat: add transcript endpoint (#211)	2023-05-09 11:43:50 +02:00
Ettore Di Giacinto	c839b334eb	feat: add embeddings for go-llama.cpp backend (#190)	2023-05-05 11:20:06 +02:00
Ettore Di Giacinto	714bfcd45b	fix: missing returning error and free callback stream (#187)	2023-05-04 19:49:43 +02:00
Ettore Di Giacinto	751b7eca62	feat: add rwkv support (#158) Signed-off-by: mudler <mudler@mocaccino.org>	2023-05-03 11:45:22 +02:00
Ettore Di Giacinto	1ae7150810	feat: allow to specify default backend for model (#156) Signed-off-by: mudler <mudler@c3os.io>	2023-05-03 00:31:28 +02:00
Ettore Di Giacinto	156e15a4fa	Bump llama.cpp, downgrade gpt4all-j (#149)	2023-05-02 16:07:18 +02:00

58 Commits