mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others

ai alpaca api api-rest bloom containers falcon gpt-neox gpt4all guanaco kubernetes llama llm rwkv stable-diffusion tts vicuna

Go to file

Ettore Di Giacinto c89271b2e4 feat(llama.cpp): add distributed llama.cpp inferencing (#2324 ) * feat(llama.cpp): support distributed llama.cpp Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: let tweak how chat messages are merged together Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactor Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Makefile: register to ALL_GRPC_BACKENDS Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * refactoring, allow disable auto-detection of backends Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * minor fixups Signed-off-by: mudler <mudler@localai.io> * feat: add cmd to start rpc-server from llama.cpp Signed-off-by: mudler <mudler@localai.io> * ci: add ccache Signed-off-by: mudler <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io> Signed-off-by: mudler <mudler@localai.io>		2024-05-15 01:17:02 +02:00
.github	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )	2024-05-15 01:17:02 +02:00
.vscode	feat: first pass at improving logging (#1956 )	2024-04-04 09:24:22 +02:00
aio	feat(aio): switch to llama3-based for LLM (#2225 )	2024-05-03 00:41:45 +02:00
backend	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )	2024-05-15 01:17:02 +02:00
configuration	refactor: move remaining api packages to core (#1731 )	2024-03-01 16:19:53 +01:00
core	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )	2024-05-15 01:17:02 +02:00
custom-ca-certs	feat(certificates): add support for custom CA certificates (#880 )	2023-11-01 20:10:14 +01:00
docs	Update openai-functions.md	2024-05-10 17:09:51 +02:00
embedded	fix: security scanner warning noise: error handlers part 1 (#2141 )	2024-04-26 10:34:31 +02:00
examples	docs: Update semantic-todo/README.md (#2294 )	2024-05-12 09:02:11 +02:00
gallery	models(gallery): add orthocopter (#2313 )	2024-05-13 18:45:58 +02:00
internal	feat: cleanups, small enhancements	2023-07-04 18:58:19 +02:00
models	Add docker-compose	2023-04-13 01:13:14 +02:00
pkg	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )	2024-05-15 01:17:02 +02:00
prompt-templates	Requested Changes from GPT4ALL to Luna-AI-Llama2 (#1092 )	2023-09-22 11:22:17 +02:00
swagger	feat(swagger): update swagger (#2302 )	2024-05-12 21:00:18 +00:00
tests	fix: security scanner warning noise: error handlers part 2 (#2145 )	2024-04-29 15:11:42 +02:00
.dockerignore	feat: migrate python backends from conda to uv (#2215 )	2024-05-10 15:08:08 +02:00
.editorconfig	feat(stores): Vector store backend (#1795 )	2024-03-22 21:14:04 +01:00
.env	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )	2024-05-15 01:17:02 +02:00
.gitattributes	Create .gitattributes to force git clone to keep the LF line endings on .sh files (#838 )	2023-07-30 15:27:43 +02:00
.gitignore	feat: migrate python backends from conda to uv (#2215 )	2024-05-10 15:08:08 +02:00
.gitmodules	docs/examples: enhancements (#1572 )	2024-01-18 19:41:08 +01:00
.yamllint	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
assets.go	feat: Update gpt4all, support multiple implementations in runtime (#472 )	2023-06-01 23:38:52 +02:00
CONTRIBUTING.md	Update CONTRIBUTING.md	2024-04-12 15:27:40 +02:00
docker-compose.yaml	fix(docker-compose): update docker compose file (#1824 )	2024-03-13 17:57:45 +01:00
Dockerfile	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )	2024-05-15 01:17:02 +02:00
Dockerfile.aio	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Earthfile	Rename project to LocalAI (#35 )	2023-04-19 18:43:10 +02:00
Entitlements.plist	Feat: OSX Local Codesigning (#1319 )	2023-11-23 15:22:54 +01:00
entrypoint.sh	fix: use exec in entrypoint scripts to fix signal handling (#1943 )	2024-04-02 09:15:44 +02:00
go.mod	feat: auto select llama-cpp cpu variant (#2305 )	2024-05-13 11:37:52 +02:00
go.sum	feat: auto select llama-cpp cuda runtime (#2306 )	2024-05-14 19:40:18 +02:00
LICENSE	docs/examples: enhancements (#1572 )	2024-01-18 19:41:08 +01:00
main.go	fix: security scanner warning noise: error handlers part 1 (#2141 )	2024-04-26 10:34:31 +02:00
Makefile	feat(llama.cpp): add distributed llama.cpp inferencing (#2324 )	2024-05-15 01:17:02 +02:00
README.md	Update README.md	2024-05-15 00:33:16 +02:00
renovate.json	ci: manually update deps	2023-05-04 15:01:29 +02:00
SECURITY.md	Create SECURITY.md	2024-02-29 19:53:04 +01:00

README.md

LocalAI

💡 Get help - ❓FAQ 💭Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 📣 News 🛫 Examples 🖼️ Models 🚀 Roadmap

LocalAI is the free, Open Source OpenAI alternative. LocalAI act as a drop-in replacement REST API that’s compatible with OpenAI (Elevenlabs, Anthropic... ) API specifications for local AI inferencing. It allows you to run LLMs, generate images, audio (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families. Does not require GPU. It is created and maintained by Ettore Di Giacinto.

🔥🔥 Hot topics / Roadmap

Roadmap

Distributed inferencing: https://github.com/mudler/LocalAI/pull/2324
Chat, TTS, and Image generation in the WebUI: https://github.com/mudler/LocalAI/pull/2222
Reranker API: https://github.com/mudler/LocalAI/pull/2121
Gallery WebUI: https://github.com/mudler/LocalAI/pull/2104
llama3: https://github.com/mudler/LocalAI/discussions/2076
Parler-TTS: https://github.com/mudler/LocalAI/pull/2027
Openvino support: https://github.com/mudler/LocalAI/pull/1892
Vector store: https://github.com/mudler/LocalAI/pull/1795
All-in-one container image: https://github.com/mudler/LocalAI/issues/1855

Hot topics (looking for contributors):

WebUI improvements: https://github.com/mudler/LocalAI/issues/2156
Backends v2: https://github.com/mudler/LocalAI/issues/1126
Improving UX v2: https://github.com/mudler/LocalAI/issues/1373
Assistant API: https://github.com/mudler/LocalAI/issues/1273
Moderation endpoint: https://github.com/mudler/LocalAI/issues/999
Vulkan: https://github.com/mudler/LocalAI/issues/1647

If you want to help and contribute, issues up for grabs: https://github.com/mudler/LocalAI/issues?q=is%3Aissue+is%3Aopen+label%3A%22up+for+grabs%22

💻 Getting started

For a detailed step-by-step introduction, refer to the Getting Started guide.

For those in a hurry, here's a straightforward one-liner to launch a LocalAI AIO(All-in-one) Image using docker:

docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-aio-cpu
# or, if you have an Nvidia GPU:
# docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-aio-gpu-nvidia-cuda-12

🚀 Features

📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... 📖 and more)
🗣 Text to Audio
🔈 Audio to Text (Audio transcription with whisper.cpp)
🎨 Image generation with stable diffusion
🔥 OpenAI functions 🆕
🧠 Embeddings generation for vector databases
✍️ Constrained grammars
🖼️ Download Models directly from Huggingface
🥽 Vision API
🆕 Reranker API

💻 Usage

Check out the Getting started section in our documentation.

🔗 Community and integrations

Build and deploy custom containers:

https://github.com/sozercan/aikit

WebUIs:

Model galleries

https://github.com/go-skynet/model-gallery

Other:

Helm chart https://github.com/go-skynet/helm-charts
VSCode extension https://github.com/badgooooor/localai-vscode-plugin
Terminal utility https://github.com/djcopley/ShellOracle
Local Smart assistant https://github.com/mudler/LocalAGI
Home Assistant https://github.com/sammcj/homeassistant-localai / https://github.com/drndos/hass-openai-custom-conversation
Discord bot https://github.com/mudler/LocalAGI/tree/main/examples/discord
Slack bot https://github.com/mudler/LocalAGI/tree/main/examples/slack
Telegram bot https://github.com/mudler/LocalAI/tree/master/examples/telegram-bot
Examples: https://github.com/mudler/LocalAI/tree/master/examples/

🔗 Resources

🆕 New! LLM finetuning guide
How to build locally
How to install in Kubernetes
Projects integrating LocalAI
How tos section (curated by our community)

Citation

If you utilize this repository, data in a downstream project, please consider citing it with:

@misc{localai,
  author = {Ettore Di Giacinto},
  title = {LocalAI: The free, Open source OpenAI alternative},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/go-skynet/LocalAI}},

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project:


Spectro Cloud
Spectro Cloud kindly supports LocalAI by providing GPU and computing resources to run tests on lamdalabs!

And a huge shout-out to individuals sponsoring the project by donating hardware or backing the project.

Sponsor list
JDAM00 (donating HW for the CI)

🌟 Star history

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project, a special thanks to our contributors! 🤗

README.md Unescape Escape