LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

Author	SHA1	Message	Date
Dave	255748bcba	MQTT Startup Refactoring Part 1: core/ packages part 1 (#1728 ) This PR specifically introduces a `core` folder and moves the following packages over, without any other changes: - `api/backend` - `api/config` - `api/options` - `api/schema` Once this is merged and we confirm there's no regressions, I can migrate over the remaining changes piece by piece to split up application startup, backend services, http, and mqtt as was the goal of the earlier PRs!	2024-02-21 01:21:19 +00:00
Ettore Di Giacinto	960d314e4f	feat(tools): Parallel function calling (#1726 ) feat(tools): support returning multiple tools choices Fixes: https://github.com/mudler/LocalAI/issues/1275	2024-02-20 21:58:45 +01:00
Steven Christou	01205fd4c0	Initial implementation of upload files api. (#1703 ) * Initial implementation of upload files api. * Move sanitize method to utils. * Save uploaded data to uploads folder. * Avoid loop if we do not have a purpose. * Minor cleanup of api and fix bug where deleting duplicate filename cause error. * Revert defer of saving config * Moved creation of directory to startup. * Make file names unique when storing on disk. * Add test for files api. * Update dependencies.	2024-02-18 10:12:02 +00:00
Ettore Di Giacinto	c72808f18b	feat(tools): support Tool calls in the API (#1715 ) * feat(tools): support Tools in the API Co-authored-by: =?UTF-8?q?Stephan=20A=C3=9Fmus?= <stephan.assmus@sap.com> * feat(tools): support function streaming * Adhere to new return types when using tools instead of functions * Keep backward compatibility with function calling * Evaluate function names in chat templates * Disable recovery with --debug * Correctly stream out the entire result * Detect when llm chooses to reply and to not perform any action in SSE * Feedback from code review --------- Co-authored-by: =?UTF-8?q?Stephan=20A=C3=9Fmus?= <stephan.assmus@sap.com>	2024-02-17 10:00:34 +01:00
Ettore Di Giacinto	53dbe36f32	feat(tts): respect YAMLs config file, add sycl docs/examples (#1692 ) * feat(refactor): refactor config and input reading * feat(tts): read config file for TTS * examples(kubernetes): Add simple deployment example * examples(kubernetes): Add simple deployment for intel arc * docs(sycl): add sycl example * feat(tts): do not always pick a first model * fixups to run vall-e-x on container * Correctly resolve backend	2024-02-10 21:37:03 +01:00
Ettore Di Giacinto	db926896bd	Revert "[Refactor]: Core/API Split" (#1550 ) Revert "[Refactor]: Core/API Split (#1506)" This reverts commit `ab7b4d5ee9`.	2024-01-05 18:04:46 +01:00
Dave	ab7b4d5ee9	[Refactor]: Core/API Split (#1506 ) Refactors api folder to core, creates firm split between backend code and api frontend.	2024-01-05 15:34:56 +01:00
JZacharie	24adf9cbcb	remove default to stablediffusion (#1500 )	2023-12-27 23:16:49 +00:00
Gianluca Boiano	cae7b197ec	feat: add tiny dream stable diffusion support (#1283 ) Signed-off-by: Gianluca Boiano <morf3089@gmail.com>	2023-12-24 19:27:24 +00:00
Ettore Di Giacinto	1fc3a375df	feat: inline templates and accept URLs in models (#1452 ) * feat: Allow inline templates * feat: Allow to specify url in model config files Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * feat: support 'huggingface://' format * style: reuse-code from gallery --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-12-18 18:58:44 +01:00
Ettore Di Giacinto	dd982acf2c	feat(img2vid,txt2vid): Initial support for img2vid,txt2vid (#1442 ) * feat(img2vid): Initial support for img2vid * doc(SD): fix SDXL Example * Minor fixups for img2vid * docs(img2img): fix example curl call * feat(txt2vid): initial support Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> * diffusers: be retro-compatible with CUDA settings * docs(img2vid, txt2vid): examples * Add notice on docs --------- Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>	2023-12-15 18:06:20 -05:00
Ettore Di Giacinto	66a558ff41	fix: respect OpenAI spec for response format (#1289 ) fix: properly respect OpenAI spec for response format Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-15 19:36:23 +01:00
Ettore Di Giacinto	0eae727366	🔥 add LaVA support and GPT vision API, Multiple requests for llama.cpp, return JSON types (#1254 ) * wip * wip * Make it functional Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * wip * Small fixups * do not inject space on role encoding, encode img at beginning of messages Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Add examples/config defaults * Add include dir of current source dir * cleanup * fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fixups * Revert "fixups" This reverts commit `f1a4731cca`. * fixes Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-11-11 13:14:59 +01:00
Jesús Espino	81a5ed9f31	fix(openai): Populate ID and Created fields in OpenAI compatible responses (#1164 ) Adding the extra ID and Created fields to any request to the OpenAI Compatible API to improve the compatibility. This PR fixes #1103	2023-10-12 02:00:08 +00:00
Ettore Di Giacinto	cc060a283d	fix: drop racy code, refactor and group API schema (#931 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-20 14:04:45 +02:00
Ettore Di Giacinto	1079b18ff7	feat(diffusers): be consistent with pipelines, support also depthimg2img (#926 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-18 22:06:24 +02:00
Dave	8cb1061c11	Usage Features (#863 )	2023-08-18 21:23:14 +02:00
Ettore Di Giacinto	2bacd0180d	feat(diffusers): add img2img and clip_skip, support more kernels schedulers (#906 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-17 23:38:59 +02:00
Ettore Di Giacinto	8c781a6a44	feat: Add Diffusers (#874 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-09 08:38:51 +02:00
Ettore Di Giacinto	3c8fc37c56	feat: Add UseFastTokenizer Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-08 01:10:05 +02:00
Ettore Di Giacinto	a843e64fc2	feat: add initial AutoGPTQ backend implementation	2023-08-07 22:53:28 +02:00
Ettore Di Giacinto	acd829a7a0	fix: do not break on newlines on function returns (#864 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-08-04 21:46:36 +02:00
Dave	7fb8b4191f	feat: "simple" chat/edit/completion template system prompt from config (#856 )	2023-08-03 00:19:55 +02:00
Dave	ce8e9dc690	feature: model list :: filter query string parameter (#830 )	2023-07-31 19:14:32 +02:00
Dave	8e8d474ae8	refactor: Remove remaining uses of depreciated package `io/ioutil` (#837 )	2023-07-30 11:23:43 +00:00
Ettore Di Giacinto	dde12b492b	fix: select function calls if 'name' is set in the request (#827 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-28 01:17:11 +02:00
Ettore Di Giacinto	569c1d1163	feat: add rope settings and negative prompt, drop grammar backend (#797 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-25 19:05:27 +02:00
Aman Gupta Karmani	12fe0932c4	feat: cancel stream generation if client disappears (#792 )	2023-07-24 23:10:54 +02:00
Dave	c6bf67f446	feat(llama2): add template for chat messages (#782 ) Co-authored-by: Aman Karmani <aman@tmm1.net> Lays some of the groundwork for LLAMA2 compatibility as well as other future models with complex prompting schemes. Started small refactoring in pkg/model/loader.go regarding template loading. Currently still a part of ModelLoader, but should be easy to add template loading for situations other than overall prompt templates and the new chat-specific per-message templates Adds support for new chat-endpoint-specific, per-message templates as an alternative to the existing Role: XYZ sprintf method. Includes a temporary prompt template as an example, since I have a few questions before we merge in the model-gallery side changes (see ) Minor debug logging changes.	2023-07-22 11:31:39 -04:00
Ettore Di Giacinto	94817b557c	fix: make completions endpoint more close to OpenAI specification (#790 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-22 00:53:52 +02:00
Ettore Di Giacinto	94916749c5	feat: add external grpc and model autoloading	2023-07-20 22:10:12 +02:00
Ettore Di Giacinto	d0e67cce75	fix: make last stream message to send empty content Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-16 00:09:28 +02:00
Ettore Di Giacinto	17294ae5e5	fix: make first stream message to send empty content (#751 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 22:50:52 +02:00
Ettore Di Giacinto	1d0ed95a54	feat: move other backends to grpc This finally makes everything more consistent Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00
Ettore Di Giacinto	5dcfdbe51d	feat: various refactorings Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-07-15 01:19:43 +02:00

35 Commits