LocalAI

Commit Graph

Author	SHA1	Message	Date
LocalAI [bot]	da0b6a89ae	⬆️ Update ggerganov/llama.cpp (#2229 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-03 21:39:28 +00:00
LocalAI [bot]	2cc1bd85af	⬆️ Update ggerganov/llama.cpp (#2224 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-02 21:23:40 +00:00
LocalAI [bot]	6a7a7996bb	⬆️ Update ggerganov/llama.cpp (#2213 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-05-01 21:19:44 +00:00
LocalAI [bot]	f90d56d371	⬆️ Update ggerganov/llama.cpp (#2203 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-30 21:53:31 +00:00
Chris Jowett	970cb3a219	chore: update go-stablediffusion to latest commit with Make jobserver fix Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-30 15:59:28 -05:00
LocalAI [bot]	29d7812344	⬆️ Update ggerganov/whisper.cpp (#2188 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-29 22:16:04 +00:00
cryptk	5fd46175dc	fix: ensure GNUMake jobserver is passed through to whisper.cpp build (#2187 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-29 16:40:50 -05:00
LocalAI [bot]	52a268c38c	⬆️ Update ggerganov/llama.cpp (#2189 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-29 21:36:30 +00:00
cryptk	93ca56086e	update go-tinydream to latest commit (#2182 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-29 15:17:09 +02:00
LocalAI [bot]	5fef3b0ff1	⬆️ Update ggerganov/whisper.cpp (#2177 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-28 22:32:45 +00:00
LocalAI [bot]	01860674c4	⬆️ Update ggerganov/llama.cpp (#2176 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-28 21:41:12 +00:00
cryptk	21974fe1d3	fix: swap to WHISPER_CUDA per deprecation message from whisper.cpp (#2170 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-28 17:51:53 +00:00
LocalAI [bot]	c3982212f9	⬆️ Update ggerganov/llama.cpp (#2159 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-27 21:32:43 +00:00
LocalAI [bot]	030d555995	⬆️ Update ggerganov/llama.cpp (#2150 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-27 02:18:28 +00:00
fakezeta	c9451cb604	Bump oneapi-basekit, optimum and openvino (#2139 ) * Bump oneapi-basekit, optimum and openvino * Changed PERFORMANCE HINT to CUMULATIVE_THROUGHPUT Minor latency change for first token but about 10-15% speedup on token generation.	2024-04-26 16:20:43 +02:00
LocalAI [bot]	365ef92530	⬆️ Update mudler/go-stable-diffusion (#2134 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-25 21:41:38 +00:00
LocalAI [bot]	5fceb876c4	⬆️ Update ggerganov/llama.cpp (#2133 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-25 21:40:41 +00:00
Ettore Di Giacinto	b664edde29	feat(rerankers): Add new backend, support jina rerankers API (#2121 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-25 00:19:02 +02:00
LocalAI [bot]	e16658b7ec	⬆️ Update ggerganov/llama.cpp (#2123 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-24 22:00:17 +00:00
LocalAI [bot]	d30280ed23	⬆️ Update ggerganov/whisper.cpp (#2122 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-24 21:55:30 +00:00
Ettore Di Giacinto	4fffc47e77	deps(llama.cpp): update, use better model for function call tests (#2119 ) deps(llama.cpp): update Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-24 18:44:04 +02:00
LocalAI [bot]	38c9abed8b	⬆️ Update ggerganov/llama.cpp (#2089 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-21 16:35:30 +00:00
Ettore Di Giacinto	284ad026b1	refactor(routes): split routes registration (#2077 ) Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-21 01:19:57 +02:00
LocalAI [bot]	1e37101930	⬆️ Update ggerganov/llama.cpp (#2080 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-20 00:05:16 +00:00
LocalAI [bot]	e9448005a5	⬆️ Update ggerganov/llama.cpp (#2051 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-18 21:30:55 +00:00
cryptk	e9f090257c	fix: adjust some sources names to match the naming of their repositories (#2061 ) Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-18 01:59:05 +00:00
Ettore Di Giacinto	af9e5a2d05	Revert #1963 (#2056 ) * Revert "fix(fncall): fix regression introduced in #1963 (#2048)" This reverts commit `6b06d4e0af`. * Revert "fix: action-tmate back to upstream, dead code removal (#2038)" This reverts commit `fdec8a9d00`. * Revert "feat(grpc): return consumed token count and update response accordingly (#2035)" This reverts commit `e843d7df0e`. * Revert "refactor: backend/service split, channel-based llm flow (#1963)" This reverts commit `eed5706994`. * feat(grpc): return consumed token count and update response accordingly Fixes: #1920 Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2024-04-17 23:33:49 +02:00
LocalAI [bot]	af8c705ecd	⬆️ Update ggerganov/whisper.cpp (#2060 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-17 21:17:25 +00:00
LocalAI [bot]	5763dc1613	⬆️ Update ggerganov/whisper.cpp (#2050 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-16 21:37:50 +00:00
LocalAI [bot]	0cc1ad2188	⬆️ Update ggerganov/whisper.cpp (#2042 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-15 23:27:52 +00:00
LocalAI [bot]	cdece3879f	⬆️ Update ggerganov/llama.cpp (#2043 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-15 22:47:29 +00:00
LocalAI [bot]	de3a1a0a8e	⬆️ Update ggerganov/llama.cpp (#2033 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-14 23:35:44 +00:00
Ettore Di Giacinto	0fdff26924	feat(parler-tts): Add new backend (#2027 ) * feat(parler-tts): Add new backend Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parler-tts): try downgrade protobuf Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat(parler-tts): add parler conda env Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Revert "feat(parler-tts): try downgrade protobuf" This reverts commit bd5941d5cfc00676b45a99f71debf3c34249cf3c. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * deps: add grpc Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * fix: try to gen proto with same environment * workaround * Revert "fix: try to gen proto with same environment" This reverts commit `998c745e2f`. * Workaround fixup --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Co-authored-by: Dave <dave@gray101.com>	2024-04-13 18:59:21 +02:00
LocalAI [bot]	619f2517a4	⬆️ Update ggerganov/llama.cpp (#2028 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-13 13:47:39 +00:00
Dave	eed5706994	refactor: backend/service split, channel-based llm flow (#1963 ) Refactor: channel based llm flow and services split --------- Signed-off-by: Dave Lee <dave@gray101.com>	2024-04-13 09:45:34 +02:00
cryptk	1981154f49	fix: dont commit generated files to git (#1993 ) * fix: initial work towards not committing generated files to the repository Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * feat: improve build docs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove unused folder from .dockerignore and .gitignore Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix extra backend tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: attempt to fix other tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more test fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: fix apple tests Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more extras tests fixes Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add GOBIN to PATH in docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: extra tests and Dockerfile corrections Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: remove build dependency checks Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add golang protobuf compilers to tests-linux action Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: ensure protogen is run for extra backend installs Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: use newer protobuf Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: more missing protoc binaries Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: missing dependencies during docker build Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: don't install grpc compilers in the final stage if they aren't needed Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: python-grpc-tools in 22.04 repos is too old Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: add a couple of extra build dependencies to Makefile Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> * fix: unbreak container rebuild functionality Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com> --------- Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com>	2024-04-13 09:37:32 +02:00
LocalAI [bot]	912d2dccfa	⬆️ Update ggerganov/llama.cpp (#2024 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-13 09:13:00 +02:00
LocalAI [bot]	677e20756b	⬆️ Update ggerganov/llama.cpp (#2014 ) Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>	2024-04-12 00:49:41 +02:00
LocalAI [bot]	e152b07b74	⬆️ Update ggerganov/llama.cpp (#1991 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-11 09:22:07 +02:00
LocalAI [bot]	7e2f8bb408	⬆️ Update ggerganov/whisper.cpp (#1980 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-10 09:08:00 +02:00
LocalAI [bot]	951e39d36c	⬆️ Update ggerganov/llama.cpp (#1979 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-10 09:07:41 +02:00
LocalAI [bot]	195be10050	⬆️ Update ggerganov/llama.cpp (#1973 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 23:26:52 +02:00
LocalAI [bot]	efcca15d3f	⬆️ Update ggerganov/llama.cpp (#1970 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 08:38:47 +02:00
LocalAI [bot]	a153b628c2	⬆️ Update ggerganov/whisper.cpp (#1969 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-08 08:38:17 +02:00
LocalAI [bot]	ed13782986	⬆️ Update ggerganov/llama.cpp (#1964 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-07 10:32:10 +02:00
LocalAI [bot]	8aa5f5a660	⬆️ Update ggerganov/llama.cpp (#1960 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-06 19:15:25 +00:00
LocalAI [bot]	b2d9e3f704	⬆️ Update ggerganov/llama.cpp (#1959 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-05 08:41:55 +02:00
LocalAI [bot]	f744e1f931	⬆️ Update ggerganov/whisper.cpp (#1958 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-05 08:41:35 +02:00
LocalAI [bot]	3851b51d98	⬆️ Update ggerganov/llama.cpp (#1953 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-04 00:27:57 +02:00
LocalAI [bot]	4d4d76114d	⬆️ Update ggerganov/llama.cpp (#1941 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-02 09:16:04 +02:00
LocalAI [bot]	66f90f8dc1	⬆️ Update ggerganov/llama.cpp (#1937 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-04-01 08:59:23 +02:00
LocalAI [bot]	784657a652	⬆️ Update ggerganov/llama.cpp (#1934 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:38 +01:00
LocalAI [bot]	831efa8893	⬆️ Update ggerganov/whisper.cpp (#1933 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-31 00:27:16 +01:00
LocalAI [bot]	2bba62ca4d	⬆️ Update ggerganov/llama.cpp (#1928 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:52:01 +00:00
cryptk	93702e39d4	feat(build): adjust number of parallel make jobs (#1915 ) * feat(build): adjust number of parallel make jobs * fix: update make on MacOS from brew to support --output-sync argument * fix: cache grpc with version as part of key to improve validity of cache hits * fix: use gmake for tests-apple to use the updated GNU make version * fix: actually use the new make version for tests-apple * feat: parallelize tests-extra * feat: attempt to cache grpc build for docker images * fix: don't quote GRPC version * fix: don't cache go modules, we have limited cache space, better used elsewhere * fix: release with the same version of go that we test with * fix: don't fail on exporting cache layers * fix: remove deprecated BUILD_GRPC docker arg from Makefile	2024-03-29 22:32:40 +01:00
LocalAI [bot]	a7fc89c207	⬆️ Update ggerganov/whisper.cpp (#1927 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 22:29:50 +01:00
Ettore Di Giacinto	123a5a2e16	feat(swagger): Add swagger API doc (#1926 ) * makefile(build): add minimal and api build target * feat(swagger): Add swagger	2024-03-29 22:29:33 +01:00
LocalAI [bot]	ab2f403dd0	⬆️ Update ggerganov/whisper.cpp (#1924 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:59 +01:00
LocalAI [bot]	b9c5e14e2c	⬆️ Update ggerganov/llama.cpp (#1923 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-29 00:13:38 +01:00
LocalAI [bot]	07c49ee4b8	⬆️ Update ggerganov/whisper.cpp (#1914 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 22:53:13 +00:00
LocalAI [bot]	07c4bdda7c	⬆️ Update ggerganov/llama.cpp (#1913 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-27 21:57:59 +00:00
cryptk	0c0efc871c	fix(build): better CI logging and correct some build failure modes in Makefile (#1899 ) * feat: group make output by target when running parallelized builds in CI * fix: quote GO_TAGS in makefile to fix handling of whitespace in value * fix: set CPATH to find opencv2 in it's commonly installed location * fix: add missing go mod dropreplace for go-llama.cpp * chore: remove opencv symlink from github workflows	2024-03-27 21:12:19 +01:00
Gianluca Boiano	7ef5f3b473	⬆️ Update M0Rf30/go-tiny-dream (#1911 )	2024-03-27 21:12:04 +01:00
LocalAI [bot]	b500ceaf73	⬆️ Update ggerganov/llama.cpp (#1904 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 23:21:54 +00:00
LocalAI [bot]	1395e505cd	⬆️ Update ggerganov/llama.cpp (#1897 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:34:10 +01:00
LocalAI [bot]	42a4c86dca	⬆️ Update ggerganov/whisper.cpp (#1896 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-26 00:33:46 +01:00
LocalAI [bot]	3e293f1465	⬆️ Update ggerganov/llama.cpp (#1889 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 21:12:18 +00:00
LocalAI [bot]	0106c58181	⬆️ Update ggerganov/llama.cpp (#1885 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-24 14:54:01 +01:00
LocalAI [bot]	a922119c41	⬆️ Update ggerganov/llama.cpp (#1881 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-23 09:23:28 +01:00
Richard Palethorpe	643d85d2cc	feat(stores): Vector store backend (#1795 ) Add simple vector store backend Signed-off-by: Richard Palethorpe <io@richiejp.com>	2024-03-22 21:14:04 +01:00
Ettore Di Giacinto	4b1ee0c170	feat(aio): add tests, update model definitions (#1880 )	2024-03-22 21:13:11 +01:00
LocalAI [bot]	dd84c29a3d	⬆️ Update ggerganov/whisper.cpp (#1875 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:56 +01:00
LocalAI [bot]	07468c8786	⬆️ Update ggerganov/llama.cpp (#1874 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-22 09:14:42 +01:00
Ettore Di Giacinto	abc9360dc6	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Ettore Di Giacinto	e533dcf506	feat(functions/aio): all-in-one images, function template enhancements (#1862 ) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text	2024-03-21 01:12:20 +01:00
LocalAI [bot]	eeaf8c7ccd	⬆️ Update ggerganov/whisper.cpp (#1867 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:26:29 +00:00
LocalAI [bot]	7e34dfdae7	⬆️ Update ggerganov/llama.cpp (#1866 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 22:13:29 +00:00
LocalAI [bot]	e4bf51d5bd	⬆️ Update ggerganov/llama.cpp (#1864 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-20 09:05:53 +01:00
LocalAI [bot]	ead61bf9d5	⬆️ Update ggerganov/llama.cpp (#1857 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:03:17 +00:00
LocalAI [bot]	621541a92f	⬆️ Update ggerganov/whisper.cpp (#1508 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-19 00:44:23 +01:00
Dave	ed5734ae25	test/fix: OSX Test Repair (#1843 ) * test with gguf instead of ggml. Updates testPrompt to match? Adds debugging line to Dockerfile that I've found helpful recently. * fix testPrompt slightly * Sad Experiment: Test GH runner without metal? * break apart CGO_LDFLAGS * switch runner * upstream llama.cpp disables Metal on Github CI! * missed a dir from clean-tests * CGO_LDFLAGS * tmate failure + NO_ACCELERATE * whisper.cpp has a metal fix * do the exact opposite of the name of this branch, but keep it around for unrelated fixes? * add back newlines * add tmate to linux for testing * update fixtures * timeout for tmate	2024-03-18 19:19:43 +01:00
Ettore Di Giacinto	b202bfaaa0	deps(whisper.cpp): update, fix cublas build (#1846 ) fix(whisper.cpp): Add stubs and -lcuda	2024-03-18 15:56:53 +01:00
LocalAI [bot]	0eb0ac7dd0	⬆️ Update ggerganov/llama.cpp (#1848 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-18 08:57:58 +01:00
cryptk	020ce29cd8	fix(make): allow to parallelize jobs (#1845 ) * fix: clean up Makefile dependencies to allow for parallel builds * refactor: remove old unused backend from Makefile * fix: finish removing legacy backend, update piper * fix: I broke llama... I fixed llama * feat: give the tests and builds a few threads * fix: ensure libraries are replaced before build, add dropreplace target * Fix image build workflows	2024-03-17 15:39:20 +01:00
LocalAI [bot]	8967ed1601	⬆️ Update ggerganov/llama.cpp (#1840 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-16 11:25:41 +00:00
LocalAI [bot]	5826fb8e6d	⬆️ Update mudler/go-piper (#1844 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-15 23:51:03 +00:00
Dave	db199f61da	fix: osx build default.metallib (#1837 ) fix: osx build default.metallib (#1837) * port osx fix from refactor pr to slim pr * manually bump llama.cpp version to unstick CI?	2024-03-15 08:18:58 +00:00
LocalAI [bot]	44adbd2c75	⬆️ Update go-skynet/go-llama.cpp (#1835 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 23:06:42 +00:00
Dave	45d520f913	fix: OSX Build Files for llama.cpp (#1836 ) bot ate my changes, seperate branch	2024-03-14 23:07:47 +01:00
LocalAI [bot]	f82065703d	⬆️ Update ggerganov/llama.cpp (#1827 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-14 08:39:39 +01:00
LocalAI [bot]	5c5f07c1e7	⬆️ Update ggerganov/llama.cpp (#1821 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-13 10:05:46 +01:00
LocalAI [bot]	8e57f4df31	⬆️ Update ggerganov/llama.cpp (#1818 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-11 00:02:37 +01:00
LocalAI [bot]	a08cc5adbb	⬆️ Update ggerganov/llama.cpp (#1816 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-10 09:32:09 +01:00
LocalAI [bot]	595a73fce4	⬆️ Update ggerganov/llama.cpp (#1813 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-09 09:27:06 +01:00
LocalAI [bot]	dc919e08e8	⬆️ Update ggerganov/llama.cpp (#1811 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-08 08:21:25 +01:00
Ettore Di Giacinto	5d1018495f	feat(intel): add diffusers/transformers support (#1746 ) * feat(intel): add diffusers support * try to consume upstream container image * Debug * Manually install deps * Map transformers/hf cache dir to modelpath if not specified * fix(compel): update initialization, pass by all gRPC options * fix: add dependencies, implement transformers for xpu * base it from the oneapi image * Add pillow * set threads if specified when launching the API * Skip conda install if intel * defaults to non-intel * ci: add to pipelines * prepare compel only if enabled * Skip conda install if intel * fix cleanup * Disable compel by default * Install torch 2.1.0 with Intel * Skip conda on some setups * Detect python * Quiet output * Do not override system python with conda * Prefer python3 * Fixups * exllama2: do not install without conda (overrides pytorch version) * exllama/exllama2: do not install if not using cuda * Add missing dataset dependency * Small fixups, symlink to python, add requirements * Add neural_speed to the deps * correctly handle model offloading * fix: device_map == xpu * go back at calling python, fixed at dockerfile level * Exllama2 restricted to only nvidia gpus * Tokenizer to xpu	2024-03-07 14:37:45 +01:00
LocalAI [bot]	ad6fd7a991	⬆️ Update ggerganov/llama.cpp (#1805 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-06 23:28:31 +01:00
LocalAI [bot]	e022b5959e	⬆️ Update mudler/go-stable-diffusion (#1802 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 23:39:57 +00:00
LocalAI [bot]	db7f4955a1	⬆️ Update ggerganov/llama.cpp (#1801 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 21:50:27 +00:00
LocalAI [bot]	c8e29033c2	⬆️ Update ggerganov/llama.cpp (#1794 ) Signed-off-by: GitHub <noreply@github.com> Co-authored-by: mudler <mudler@users.noreply.github.com>	2024-03-05 08:59:09 +01:00

1 2 3 4 5 ...

550 Commits