LocalAI
💡 Get help - ❓ FAQ 💭 Discussions 💬 Discord 📖 Documentation website
LocalAI is a drop-in replacement REST API compatible with the OpenAI API specifications for local inferencing. It allows you to run LLMs (and other models) locally or on-prem with consumer-grade hardware, supporting multiple model families compatible with the ggml format, PyTorch, and more. No GPU is required.
Follow LocalAI
Connect with the Creator
Share LocalAI Repository
In a nutshell:
- Local, OpenAI drop-in alternative REST API. You own your data. (See the client example after this list.)
- NO GPU required. NO Internet access is required either.
- Optional: GPU acceleration is available for llama.cpp-compatible LLMs. See also the build section.
- Supports multiple models
- 🏃 Once loaded the first time, it keeps models loaded in memory for faster inference
- ⚡ Doesn't shell out, but uses C++ bindings for faster inference and better performance.
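To illustrate the drop-in compatibility mentioned above, here is a minimal sketch that points an existing OpenAI Go client (github.com/sashabaranov/go-openai) at a local LocalAI instance instead of api.openai.com. The base URL, port, and model name are assumptions for illustration; substitute whatever model file you have placed in your models directory.

```go
package main

import (
	"context"
	"fmt"

	openai "github.com/sashabaranov/go-openai"
)

func main() {
	// Reuse a standard OpenAI client, but point it at LocalAI instead of api.openai.com.
	// Base URL and model name below are assumptions for illustration.
	cfg := openai.DefaultConfig("not-needed") // LocalAI does not require an API key by default
	cfg.BaseURL = "http://localhost:8080/v1"
	client := openai.NewClientWithConfig(cfg)

	resp, err := client.CreateChatCompletion(context.Background(), openai.ChatCompletionRequest{
		Model: "ggml-gpt4all-j", // hypothetical model name: any model file in your models directory
		Messages: []openai.ChatCompletionMessage{
			{Role: openai.ChatMessageRoleUser, Content: "How are you?"},
		},
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(resp.Choices[0].Message.Content)
}
```

Because the API surface mirrors OpenAI's, the same pattern should apply to most other OpenAI client libraries: only the base URL changes.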
LocalAI was created by Ettore Di Giacinto and is a community-driven project, focused on making AI accessible to anyone. Contributions, feedback, and PRs are welcome!
Note that this started as a fun weekend project to create the necessary pieces for a full AI assistant like ChatGPT: the community is growing fast and we are working hard to make it better and more stable. If you want to help, please consider contributing (see below)!
🔥🔥 Hot topics / Roadmap
🚀 Features
- 📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... and more)
- 🗣 Text to Audio
- 🔈 Audio to Text (audio transcription with whisper.cpp)
- 🎨 Image generation with stable diffusion
- 🔥 OpenAI functions 🆕
- 🧠 Embeddings generation for vector databases (see the sketch after this list)
- ✍️ Constrained grammars
- 🖼️ Download Models directly from Huggingface
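As a sketch of the embeddings feature listed above: LocalAI mirrors the OpenAI API shape, so an embeddings request can be issued against the /v1/embeddings endpoint. The port and model name below are assumptions for illustration; substitute an embeddings-capable model you have configured.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

func main() {
	// Minimal sketch: request an embedding from a local LocalAI instance.
	// URL and model name are assumptions for illustration.
	body, _ := json.Marshal(map[string]any{
		"model": "bert-embeddings", // hypothetical: use an embeddings-capable model you have configured
		"input": "A long time ago in a galaxy far, far away",
	})

	resp, err := http.Post("http://localhost:8080/v1/embeddings", "application/json", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	var out struct {
		Data []struct {
			Embedding []float64 `json:"embedding"`
		} `json:"data"`
	}
	if err := json.NewDecoder(resp.Body).Decode(&out); err != nil {
		panic(err)
	}
	fmt.Println("embedding length:", len(out.Data[0].Embedding))
}
```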
📖 🎥 Media, Blogs, Social
- Create a slackbot for teams and OSS projects that answer to documentation
- LocalAI meets k8sgpt
- Question Answering on Documents locally with LangChain, LocalAI, Chroma, and GPT4All
- Tutorial to use k8sgpt with LocalAI
💻 Usage
Check out the Getting started section in our documentation.
💡 Example: Use GPT4ALL-J model
See the documentation
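As a quick companion to the documentation, the following minimal sketch shows the raw OpenAI-style request LocalAI accepts for a GPT4All-J model. The endpoint, port, and the ggml-gpt4all-j model name are assumptions for illustration; they stand in for whatever model file you have placed in your models directory.

```go
package main

import (
	"bytes"
	"fmt"
	"io"
	"net/http"
)

func main() {
	// Minimal sketch of an OpenAI-style chat completion request against LocalAI.
	// Endpoint, port, and model name are assumptions for illustration.
	payload := []byte(`{
		"model": "ggml-gpt4all-j",
		"messages": [{"role": "user", "content": "How are you?"}],
		"temperature": 0.9
	}`)

	resp, err := http.Post("http://localhost:8080/v1/chat/completions", "application/json", bytes.NewReader(payload))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	out, _ := io.ReadAll(resp.Body)
	fmt.Println(string(out)) // prints the OpenAI-style JSON response
}
```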
🔗 Resources
Citation
If you use this repository or its data in a downstream project, please consider citing it with:
@misc{localai,
  author = {Ettore Di Giacinto},
  title = {LocalAI: The free, Open source OpenAI alternative},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/go-skynet/LocalAI}},
}
❤️ Sponsors
Do you find LocalAI useful?
Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.
A huge thank you to our generous sponsors who support this project:
Spectro Cloud kindly supports LocalAI by providing GPU and computing resources to run tests on Lambda Labs!
And a huge shout-out to the individuals sponsoring the project by donating hardware or backing it.
- Sponsor list
- JDAM00 (donating HW for the CI)
🌟 Star history
📖 License
LocalAI is a community-driven project created by Ettore Di Giacinto.
MIT - Author Ettore Di Giacinto
🙇 Acknowledgements
LocalAI couldn't have been built without the help of great software already available from the community. Thank you!
- llama.cpp
- https://github.com/tatsu-lab/stanford_alpaca
- https://github.com/cornelk/llama-go for the initial ideas
- https://github.com/antimatter15/alpaca.cpp
- https://github.com/EdVince/Stable-Diffusion-NCNN
- https://github.com/ggerganov/whisper.cpp
- https://github.com/saharNooby/rwkv.cpp
- https://github.com/rhasspy/piper
- https://github.com/cmp-nct/ggllm.cpp
🤗 Contributors
This is a community project, a special thanks to our contributors! 🤗