LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

Ettore Di Giacinto c89271b2e4 feat(llama.cpp): add distributed llama.cpp inferencing (#2324)	2024-05-15 01:17:02 +02:00

* feat(llama.cpp): support distributed llama.cpp
  Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat: let tweak how chat messages are merged together
  Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* refactor
  Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Makefile: register to ALL_GRPC_BACKENDS
  Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* refactoring, allow disable auto-detection of backends
  Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* minor fixups
  Signed-off-by: mudler <mudler@localai.io>
* feat: add cmd to start rpc-server from llama.cpp
  Signed-off-by: mudler <mudler@localai.io>
* ci: add ccache
  Signed-off-by: mudler <mudler@localai.io>

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: mudler <mudler@localai.io>
CMakeLists.txt	deps(llama.cpp): update, support Gemma models (#1734)	2024-02-21 17:23:38 +01:00
grpc-server.cpp	feat(llama.cpp): add distributed llama.cpp inferencing (#2324)	2024-05-15 01:17:02 +02:00
json.hpp	🔥 add LaVA support and GPT vision API, Multiple requests for llama.cpp, return JSON types (#1254)	2023-11-11 13:14:59 +01:00
Makefile	feat: migrate python backends from conda to uv (#2215)	2024-05-10 15:08:08 +02:00
prepare.sh	feat(llama.cpp): do not specify backends to autoload and add llama.cpp variants (#2232)	2024-05-04 17:56:12 +02:00
utils.hpp	feat(sycl): Add support for Intel GPUs with sycl (#1647) (#1660)	2024-02-01 19:21:52 +01:00