LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

Author	SHA1	Message	Date
Ettore Di Giacinto	530bec9c64	feat(llama.cpp): do not specify backends to autoload and add llama.cpp variants (#2232 ) * feat(initializer): do not specify backends to autoload We can simply try to autoload the backends extracted in the asset dir. This will allow to build variants of the same backend (for e.g. with different instructions sets), so to have a single binary for all the variants. Signed-off-by: mudler <mudler@localai.io> * refactor(prepare): refactor out llama.cpp prepare steps Make it so are idempotent and that we can re-build Signed-off-by: mudler <mudler@localai.io> * [TEST] feat(build): build noavx version along Signed-off-by: mudler <mudler@localai.io> * build: make build parallel Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * build: do not override CMAKE_ARGS Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * build: add fallback variant Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Fixups Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(huggingface-langchain): fail if no token is set Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix(huggingface-langchain): rename Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: do not autoload local-store Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: give priority between the listed backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: mudler <mudler@localai.io> Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-05-04 17:56:12 +02:00
LocalAI [bot]	ac0f3d6e82	⬆️ Update ggerganov/whisper.cpp (#2230 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-03 22:16:26 +00:00
LocalAI [bot]	da0b6a89ae	⬆️ Update ggerganov/llama.cpp (#2229 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-03 21:39:28 +00:00
LocalAI [bot]	2cc1bd85af	⬆️ Update ggerganov/llama.cpp (#2224 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-02 21:23:40 +00:00
LocalAI [bot]	6a7a7996bb	⬆️ Update ggerganov/llama.cpp (#2213 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-01 21:19:44 +00:00
LocalAI [bot]	f90d56d371	⬆️ Update ggerganov/llama.cpp (#2203 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-30 21:53:31 +00:00
Chris Jowett	970cb3a219	chore: update go-stablediffusion to latest commit with Make jobserver fix Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-30 15:59:28 -05:00
LocalAI [bot]	29d7812344	⬆️ Update ggerganov/whisper.cpp (#2188 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-29 22:16:04 +00:00
cryptk	5fd46175dc	fix: ensure GNUMake jobserver is passed through to whisper.cpp build (#2187 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-29 16:40:50 -05:00
LocalAI [bot]	52a268c38c	⬆️ Update ggerganov/llama.cpp (#2189 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-29 21:36:30 +00:00
cryptk	93ca56086e	update go-tinydream to latest commit (#2182 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-29 15:17:09 +02:00
LocalAI [bot]	5fef3b0ff1	⬆️ Update ggerganov/whisper.cpp (#2177 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-28 22:32:45 +00:00
LocalAI [bot]	01860674c4	⬆️ Update ggerganov/llama.cpp (#2176 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-28 21:41:12 +00:00
cryptk	21974fe1d3	fix: swap to WHISPER_CUDA per deprecation message from whisper.cpp (#2170 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-28 17:51:53 +00:00
LocalAI [bot]	c3982212f9	⬆️ Update ggerganov/llama.cpp (#2159 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-27 21:32:43 +00:00
LocalAI [bot]	030d555995	⬆️ Update ggerganov/llama.cpp (#2150 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-27 02:18:28 +00:00
fakezeta	c9451cb604	Bump oneapi-basekit, optimum and openvino (#2139 ) * Bump oneapi-basekit, optimum and openvino * Changed PERFORMANCE HINT to CUMULATIVE_THROUGHPUT Minor latency change for first token but about 10-15% speedup on token generation.	2024-04-26 16:20:43 +02:00
LocalAI [bot]	365ef92530	⬆️ Update mudler/go-stable-diffusion (#2134 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-25 21:41:38 +00:00
LocalAI [bot]	5fceb876c4	⬆️ Update ggerganov/llama.cpp (#2133 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-25 21:40:41 +00:00
Ettore Di Giacinto	b664edde29	feat(rerankers): Add new backend, support jina rerankers API (#2121 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-25 00:19:02 +02:00
LocalAI [bot]	e16658b7ec	⬆️ Update ggerganov/llama.cpp (#2123 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-24 22:00:17 +00:00
LocalAI [bot]	d30280ed23	⬆️ Update ggerganov/whisper.cpp (#2122 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-24 21:55:30 +00:00
Ettore Di Giacinto	4fffc47e77	deps(llama.cpp): update, use better model for function call tests (#2119 ) deps(llama.cpp): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-24 18:44:04 +02:00
LocalAI [bot]	38c9abed8b	⬆️ Update ggerganov/llama.cpp (#2089 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-21 16:35:30 +00:00
Ettore Di Giacinto	284ad026b1	refactor(routes): split routes registration (#2077 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-21 01:19:57 +02:00
LocalAI [bot]	1e37101930	⬆️ Update ggerganov/llama.cpp (#2080 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-20 00:05:16 +00:00
LocalAI [bot]	e9448005a5	⬆️ Update ggerganov/llama.cpp (#2051 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-18 21:30:55 +00:00
cryptk	e9f090257c	fix: adjust some sources names to match the naming of their repositories (#2061 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-18 01:59:05 +00:00
Ettore Di Giacinto	af9e5a2d05	Revert #1963 (#2056 ) * Revert "fix(fncall): fix regression introduced in #1963 (#2048)" This reverts commit `6b06d4e0af`. * Revert "fix: action-tmate back to upstream, dead code removal (#2038)" This reverts commit `fdec8a9d00`. * Revert "feat(grpc): return consumed token count and update response accordingly (#2035)" This reverts commit `e843d7df0e`. * Revert "refactor: backend/service split, channel-based llm flow (#1963)" This reverts commit `eed5706994`. * feat(grpc): return consumed token count and update response accordingly Fixes: #1920 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-17 23:33:49 +02:00
LocalAI [bot]	af8c705ecd	⬆️ Update ggerganov/whisper.cpp (#2060 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-17 21:17:25 +00:00
LocalAI [bot]	5763dc1613	⬆️ Update ggerganov/whisper.cpp (#2050 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-16 21:37:50 +00:00
LocalAI [bot]	0cc1ad2188	⬆️ Update ggerganov/whisper.cpp (#2042 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-15 23:27:52 +00:00
LocalAI [bot]	cdece3879f	⬆️ Update ggerganov/llama.cpp (#2043 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-15 22:47:29 +00:00
LocalAI [bot]	de3a1a0a8e	⬆️ Update ggerganov/llama.cpp (#2033 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-14 23:35:44 +00:00
Ettore Di Giacinto	0fdff26924	feat(parler-tts): Add new backend (#2027 ) * feat(parler-tts): Add new backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parler-tts): try downgrade protobuf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parler-tts): add parler conda env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Revert "feat(parler-tts): try downgrade protobuf" This reverts commit bd5941d5cfc00676b45a99f71debf3c34249cf3c. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * deps: add grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: try to gen proto with same environment * workaround * Revert "fix: try to gen proto with same environment" This reverts commit `998c745e2f`. * Workaround fixup --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Dave <dave@gray101.com>	2024-04-13 18:59:21 +02:00
LocalAI [bot]	619f2517a4	⬆️ Update ggerganov/llama.cpp (#2028 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-13 13:47:39 +00:00
Dave	eed5706994	refactor: backend/service split, channel-based llm flow (#1963 ) Refactor: channel based llm flow and services split --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-13 09:45:34 +02:00
cryptk	1981154f49	fix: dont commit generated files to git (#1993 ) * fix: initial work towards not committing generated files to the repository Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: improve build docs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove unused folder from .dockerignore and .gitignore Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix extra backend tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix other tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more test fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: fix apple tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more extras tests fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add GOBIN to PATH in docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: extra tests and Dockerfile corrections Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove build dependency checks Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add golang protobuf compilers to tests-linux action Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: ensure protogen is run for extra backend installs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use newer protobuf Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more missing protoc binaries Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: missing dependencies during docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: don't install grpc compilers in the final stage if they aren't needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: python-grpc-tools in 22.04 repos is too old Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add a couple of extra build dependencies to Makefile Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: unbreak container rebuild functionality Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-13 09:37:32 +02:00
LocalAI [bot]	912d2dccfa	⬆️ Update ggerganov/llama.cpp (#2024 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-13 09:13:00 +02:00
LocalAI [bot]	677e20756b	⬆️ Update ggerganov/llama.cpp (#2014 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-12 00:49:41 +02:00
LocalAI [bot]	e152b07b74	⬆️ Update ggerganov/llama.cpp (#1991 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-11 09:22:07 +02:00
LocalAI [bot]	7e2f8bb408	⬆️ Update ggerganov/whisper.cpp (#1980 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-10 09:08:00 +02:00
LocalAI [bot]	951e39d36c	⬆️ Update ggerganov/llama.cpp (#1979 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-10 09:07:41 +02:00
LocalAI [bot]	195be10050	⬆️ Update ggerganov/llama.cpp (#1973 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 23:26:52 +02:00
LocalAI [bot]	efcca15d3f	⬆️ Update ggerganov/llama.cpp (#1970 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 08:38:47 +02:00
LocalAI [bot]	a153b628c2	⬆️ Update ggerganov/whisper.cpp (#1969 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 08:38:17 +02:00
LocalAI [bot]	ed13782986	⬆️ Update ggerganov/llama.cpp (#1964 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-07 10:32:10 +02:00
LocalAI [bot]	8aa5f5a660	⬆️ Update ggerganov/llama.cpp (#1960 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-06 19:15:25 +00:00
LocalAI [bot]	b2d9e3f704	⬆️ Update ggerganov/llama.cpp (#1959 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-05 08:41:55 +02:00
LocalAI [bot]	f744e1f931	⬆️ Update ggerganov/whisper.cpp (#1958 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-05 08:41:35 +02:00
LocalAI [bot]	3851b51d98	⬆️ Update ggerganov/llama.cpp (#1953 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-04 00:27:57 +02:00
LocalAI [bot]	4d4d76114d	⬆️ Update ggerganov/llama.cpp (#1941 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-02 09:16:04 +02:00
LocalAI [bot]	66f90f8dc1	⬆️ Update ggerganov/llama.cpp (#1937 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-01 08:59:23 +02:00
LocalAI [bot]	784657a652	⬆️ Update ggerganov/llama.cpp (#1934 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:38 +01:00
LocalAI [bot]	831efa8893	⬆️ Update ggerganov/whisper.cpp (#1933 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:16 +01:00
LocalAI [bot]	2bba62ca4d	⬆️ Update ggerganov/llama.cpp (#1928 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:52:01 +00:00
cryptk	93702e39d4	feat(build): adjust number of parallel make jobs (#1915 ) * feat(build): adjust number of parallel make jobs * fix: update make on MacOS from brew to support --output-sync argument * fix: cache grpc with version as part of key to improve validity of cache hits * fix: use gmake for tests-apple to use the updated GNU make version * fix: actually use the new make version for tests-apple * feat: parallelize tests-extra * feat: attempt to cache grpc build for docker images * fix: don't quote GRPC version * fix: don't cache go modules, we have limited cache space, better used elsewhere * fix: release with the same version of go that we test with * fix: don't fail on exporting cache layers * fix: remove deprecated BUILD_GRPC docker arg from Makefile	2024-03-29 22:32:40 +01:00
LocalAI [bot]	a7fc89c207	⬆️ Update ggerganov/whisper.cpp (#1927 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:29:50 +01:00
Ettore Di Giacinto	123a5a2e16	feat(swagger): Add swagger API doc (#1926 ) * makefile(build): add minimal and api build target * feat(swagger): Add swagger	2024-03-29 22:29:33 +01:00
LocalAI [bot]	ab2f403dd0	⬆️ Update ggerganov/whisper.cpp (#1924 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:59 +01:00
LocalAI [bot]	b9c5e14e2c	⬆️ Update ggerganov/llama.cpp (#1923 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:38 +01:00
LocalAI [bot]	07c49ee4b8	⬆️ Update ggerganov/whisper.cpp (#1914 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 22:53:13 +00:00
LocalAI [bot]	07c4bdda7c	⬆️ Update ggerganov/llama.cpp (#1913 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 21:57:59 +00:00
cryptk	0c0efc871c	fix(build): better CI logging and correct some build failure modes in Makefile (#1899 ) * feat: group make output by target when running parallelized builds in CI * fix: quote GO_TAGS in makefile to fix handling of whitespace in value * fix: set CPATH to find opencv2 in it's commonly installed location * fix: add missing go mod dropreplace for go-llama.cpp * chore: remove opencv symlink from github workflows	2024-03-27 21:12:19 +01:00
Gianluca Boiano	7ef5f3b473	⬆️ Update M0Rf30/go-tiny-dream (#1911 )	2024-03-27 21:12:04 +01:00
LocalAI [bot]	b500ceaf73	⬆️ Update ggerganov/llama.cpp (#1904 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 23:21:54 +00:00
LocalAI [bot]	1395e505cd	⬆️ Update ggerganov/llama.cpp (#1897 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:34:10 +01:00
LocalAI [bot]	42a4c86dca	⬆️ Update ggerganov/whisper.cpp (#1896 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:33:46 +01:00
LocalAI [bot]	3e293f1465	⬆️ Update ggerganov/llama.cpp (#1889 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 21:12:18 +00:00
LocalAI [bot]	0106c58181	⬆️ Update ggerganov/llama.cpp (#1885 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 14:54:01 +01:00
LocalAI [bot]	a922119c41	⬆️ Update ggerganov/llama.cpp (#1881 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-23 09:23:28 +01:00
Richard Palethorpe	643d85d2cc	feat(stores): Vector store backend (#1795 ) Add simple vector store backend Signed-off-by: Richard Palethorpe <io@richiejp.com>	2024-03-22 21:14:04 +01:00
Ettore Di Giacinto	4b1ee0c170	feat(aio): add tests, update model definitions (#1880 )	2024-03-22 21:13:11 +01:00
LocalAI [bot]	dd84c29a3d	⬆️ Update ggerganov/whisper.cpp (#1875 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:56 +01:00
LocalAI [bot]	07468c8786	⬆️ Update ggerganov/llama.cpp (#1874 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:42 +01:00
Ettore Di Giacinto	abc9360dc6	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Ettore Di Giacinto	e533dcf506	feat(functions/aio): all-in-one images, function template enhancements (#1862 ) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text	2024-03-21 01:12:20 +01:00
LocalAI [bot]	eeaf8c7ccd	⬆️ Update ggerganov/whisper.cpp (#1867 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:26:29 +00:00
LocalAI [bot]	7e34dfdae7	⬆️ Update ggerganov/llama.cpp (#1866 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:13:29 +00:00
LocalAI [bot]	e4bf51d5bd	⬆️ Update ggerganov/llama.cpp (#1864 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 09:05:53 +01:00
LocalAI [bot]	ead61bf9d5	⬆️ Update ggerganov/llama.cpp (#1857 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:03:17 +00:00
LocalAI [bot]	621541a92f	⬆️ Update ggerganov/whisper.cpp (#1508 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:44:23 +01:00
Dave	ed5734ae25	test/fix: OSX Test Repair (#1843 ) * test with gguf instead of ggml. Updates testPrompt to match? Adds debugging line to Dockerfile that I've found helpful recently. * fix testPrompt slightly * Sad Experiment: Test GH runner without metal? * break apart CGO_LDFLAGS * switch runner * upstream llama.cpp disables Metal on Github CI! * missed a dir from clean-tests * CGO_LDFLAGS * tmate failure + NO_ACCELERATE * whisper.cpp has a metal fix * do the exact opposite of the name of this branch, but keep it around for unrelated fixes? * add back newlines * add tmate to linux for testing * update fixtures * timeout for tmate	2024-03-18 19:19:43 +01:00
Ettore Di Giacinto	b202bfaaa0	deps(whisper.cpp): update, fix cublas build (#1846 ) fix(whisper.cpp): Add stubs and -lcuda	2024-03-18 15:56:53 +01:00
LocalAI [bot]	0eb0ac7dd0	⬆️ Update ggerganov/llama.cpp (#1848 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-18 08:57:58 +01:00
cryptk	020ce29cd8	fix(make): allow to parallelize jobs (#1845 ) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows	2024-03-17 15:39:20 +01:00
LocalAI [bot]	8967ed1601	⬆️ Update ggerganov/llama.cpp (#1840 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-16 11:25:41 +00:00
LocalAI [bot]	5826fb8e6d	⬆️ Update mudler/go-piper (#1844 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-15 23:51:03 +00:00
Dave	db199f61da	fix: osx build default.metallib (#1837 ) fix: osx build default.metallib (#1837) * port osx fix from refactor pr to slim pr * manually bump llama.cpp version to unstick CI?	2024-03-15 08:18:58 +00:00
LocalAI [bot]	44adbd2c75	⬆️ Update go-skynet/go-llama.cpp (#1835 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 23:06:42 +00:00
Dave	45d520f913	fix: OSX Build Files for llama.cpp (#1836 ) bot ate my changes, seperate branch	2024-03-14 23:07:47 +01:00
LocalAI [bot]	f82065703d	⬆️ Update ggerganov/llama.cpp (#1827 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 08:39:39 +01:00
LocalAI [bot]	5c5f07c1e7	⬆️ Update ggerganov/llama.cpp (#1821 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-13 10:05:46 +01:00
LocalAI [bot]	8e57f4df31	⬆️ Update ggerganov/llama.cpp (#1818 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-11 00:02:37 +01:00
LocalAI [bot]	a08cc5adbb	⬆️ Update ggerganov/llama.cpp (#1816 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-10 09:32:09 +01:00
LocalAI [bot]	595a73fce4	⬆️ Update ggerganov/llama.cpp (#1813 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-09 09:27:06 +01:00
LocalAI [bot]	dc919e08e8	⬆️ Update ggerganov/llama.cpp (#1811 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-08 08:21:25 +01:00
Ettore Di Giacinto	5d1018495f	feat(intel): add diffusers/transformers support (#1746 ) * feat(intel): add diffusers support * try to consume upstream container image * Debug * Manually install deps * Map transformers/hf cache dir to modelpath if not specified * fix(compel): update initialization, pass by all gRPC options * fix: add dependencies, implement transformers for xpu * base it from the oneapi image * Add pillow * set threads if specified when launching the API * Skip conda install if intel * defaults to non-intel * ci: add to pipelines * prepare compel only if enabled * Skip conda install if intel * fix cleanup * Disable compel by default * Install torch 2.1.0 with Intel * Skip conda on some setups * Detect python * Quiet output * Do not override system python with conda * Prefer python3 * Fixups * exllama2: do not install without conda (overrides pytorch version) * exllama/exllama2: do not install if not using cuda * Add missing dataset dependency * Small fixups, symlink to python, add requirements * Add neural_speed to the deps * correctly handle model offloading * fix: device_map == xpu * go back at calling python, fixed at dockerfile level * Exllama2 restricted to only nvidia gpus * Tokenizer to xpu	2024-03-07 14:37:45 +01:00
LocalAI [bot]	ad6fd7a991	⬆️ Update ggerganov/llama.cpp (#1805 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-06 23:28:31 +01:00
LocalAI [bot]	e022b5959e	⬆️ Update mudler/go-stable-diffusion (#1802 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 23:39:57 +00:00

1 2 3 4 5 ...

552 Commits