mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others

ai alpaca api api-rest bloom containers falcon gpt-neox gpt4all guanaco kubernetes llama llm rwkv stable-diffusion tts vicuna

Go to file

Sertaç Özercan a670318a9f feat: auto select llama-cpp cuda runtime (#2306 ) * auto select cpu variant Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * remove cuda target for now Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * fix metal Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * fix path Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * cuda Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * auto select cuda Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * update test Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * select CUDA backend only if present Signed-off-by: mudler <mudler@localai.io> * ci: keep cuda bin in path Signed-off-by: mudler <mudler@localai.io> * Makefile: make dist now builds also cuda Signed-off-by: mudler <mudler@localai.io> * Keep pushing fallback in case auto-flagset/nvidia fails There could be other reasons for which the default binary may fail. For example we might have detected an Nvidia GPU, however the user might not have the drivers/cuda libraries installed in the system, and so it would fail to start. We keep the fallback of llama.cpp at the end of the llama.cpp backends to try to fallback loading in case things go wrong Signed-off-by: mudler <mudler@localai.io> * Do not build cuda on MacOS Signed-off-by: mudler <mudler@localai.io> * cleanup Signed-off-by: Sertac Ozercan <sozercan@gmail.com> * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> --------- Signed-off-by: Sertac Ozercan <sozercan@gmail.com> Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Signed-off-by: mudler <mudler@localai.io> Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com> Co-authored-by: mudler <mudler@localai.io>		2024-05-14 19:40:18 +02:00
.github	feat: auto select llama-cpp cuda runtime (#2306 )	2024-05-14 19:40:18 +02:00
.vscode	feat: first pass at improving logging (#1956 )	2024-04-04 09:24:22 +02:00
aio	feat(aio): switch to llama3-based for LLM (#2225 )	2024-05-03 00:41:45 +02:00
backend	feat(llama.cpp): add `flash_attention` and `no_kv_offloading` (#2310 )	2024-05-13 19:07:51 +02:00
configuration	refactor: move remaining api packages to core (#1731 )	2024-03-01 16:19:53 +01:00
core	feat(functions): allow to set JSON matcher (#2319 )	2024-05-14 09:39:20 +02:00
custom-ca-certs	feat(certificates): add support for custom CA certificates (#880 )	2023-11-01 20:10:14 +01:00
docs	Update openai-functions.md	2024-05-10 17:09:51 +02:00
embedded	fix: security scanner warning noise: error handlers part 1 (#2141 )	2024-04-26 10:34:31 +02:00
examples	docs: Update semantic-todo/README.md (#2294 )	2024-05-12 09:02:11 +02:00
gallery	models(gallery): add orthocopter (#2313 )	2024-05-13 18:45:58 +02:00
internal	feat: cleanups, small enhancements	2023-07-04 18:58:19 +02:00
models	Add docker-compose	2023-04-13 01:13:14 +02:00
pkg	feat: auto select llama-cpp cuda runtime (#2306 )	2024-05-14 19:40:18 +02:00
prompt-templates	Requested Changes from GPT4ALL to Luna-AI-Llama2 (#1092 )	2023-09-22 11:22:17 +02:00
swagger	feat(swagger): update swagger (#2302 )	2024-05-12 21:00:18 +00:00
tests	fix: security scanner warning noise: error handlers part 2 (#2145 )	2024-04-29 15:11:42 +02:00
.dockerignore	feat: migrate python backends from conda to uv (#2215 )	2024-05-10 15:08:08 +02:00
.editorconfig	feat(stores): Vector store backend (#1795 )	2024-03-22 21:14:04 +01:00
.env	Update .env	2024-04-28 23:56:10 +02:00
.gitattributes	Create .gitattributes to force git clone to keep the LF line endings on .sh files (#838 )	2023-07-30 15:27:43 +02:00
.gitignore	feat: migrate python backends from conda to uv (#2215 )	2024-05-10 15:08:08 +02:00
.gitmodules	docs/examples: enhancements (#1572 )	2024-01-18 19:41:08 +01:00
.yamllint	fix: yamlint warnings and errors (#2131 )	2024-04-25 17:25:56 +00:00
assets.go	feat: Update gpt4all, support multiple implementations in runtime (#472 )	2023-06-01 23:38:52 +02:00
CONTRIBUTING.md	Update CONTRIBUTING.md	2024-04-12 15:27:40 +02:00
docker-compose.yaml	fix(docker-compose): update docker compose file (#1824 )	2024-03-13 17:57:45 +01:00
Dockerfile	feat: migrate python backends from conda to uv (#2215 )	2024-05-10 15:08:08 +02:00
Dockerfile.aio	feat(aio): entrypoint, update workflows (#1872 )	2024-03-21 22:09:04 +01:00
Earthfile	Rename project to LocalAI (#35 )	2023-04-19 18:43:10 +02:00
Entitlements.plist	Feat: OSX Local Codesigning (#1319 )	2023-11-23 15:22:54 +01:00
entrypoint.sh	fix: use exec in entrypoint scripts to fix signal handling (#1943 )	2024-04-02 09:15:44 +02:00
go.mod	feat: auto select llama-cpp cpu variant (#2305 )	2024-05-13 11:37:52 +02:00
go.sum	feat: auto select llama-cpp cuda runtime (#2306 )	2024-05-14 19:40:18 +02:00
LICENSE	docs/examples: enhancements (#1572 )	2024-01-18 19:41:08 +01:00
main.go	fix: security scanner warning noise: error handlers part 1 (#2141 )	2024-04-26 10:34:31 +02:00
Makefile	feat: auto select llama-cpp cuda runtime (#2306 )	2024-05-14 19:40:18 +02:00
README.md	Update readme: add ShellOracle to community integrations (#2254 )	2024-05-07 08:39:58 +02:00
renovate.json	ci: manually update deps	2023-05-04 15:01:29 +02:00
SECURITY.md	Create SECURITY.md	2024-02-29 19:53:04 +01:00

README.md

LocalAI

💡 Get help - ❓FAQ 💭Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 📣 News 🛫 Examples 🖼️ Models 🚀 Roadmap

LocalAI is the free, Open Source OpenAI alternative. LocalAI act as a drop-in replacement REST API that’s compatible with OpenAI (Elevenlabs, Anthropic... ) API specifications for local AI inferencing. It allows you to run LLMs, generate images, audio (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families. Does not require GPU. It is created and maintained by Ettore Di Giacinto.

🔥🔥 Hot topics / Roadmap

Roadmap

Chat, TTS, and Image generation in the WebUI: https://github.com/mudler/LocalAI/pull/2222
Reranker API: https://github.com/mudler/LocalAI/pull/2121
Gallery WebUI: https://github.com/mudler/LocalAI/pull/2104
llama3: https://github.com/mudler/LocalAI/discussions/2076
Parler-TTS: https://github.com/mudler/LocalAI/pull/2027
Openvino support: https://github.com/mudler/LocalAI/pull/1892
Vector store: https://github.com/mudler/LocalAI/pull/1795
All-in-one container image: https://github.com/mudler/LocalAI/issues/1855

Hot topics (looking for contributors):

WebUI improvements: https://github.com/mudler/LocalAI/issues/2156
Backends v2: https://github.com/mudler/LocalAI/issues/1126
Improving UX v2: https://github.com/mudler/LocalAI/issues/1373
Assistant API: https://github.com/mudler/LocalAI/issues/1273
Moderation endpoint: https://github.com/mudler/LocalAI/issues/999
Vulkan: https://github.com/mudler/LocalAI/issues/1647

If you want to help and contribute, issues up for grabs: https://github.com/mudler/LocalAI/issues?q=is%3Aissue+is%3Aopen+label%3A%22up+for+grabs%22

💻 Getting started

For a detailed step-by-step introduction, refer to the Getting Started guide.

For those in a hurry, here's a straightforward one-liner to launch a LocalAI AIO(All-in-one) Image using docker:

docker run -ti --name local-ai -p 8080:8080 localai/localai:latest-aio-cpu
# or, if you have an Nvidia GPU:
# docker run -ti --name local-ai -p 8080:8080 --gpus all localai/localai:latest-aio-gpu-nvidia-cuda-12

🚀 Features

📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... 📖 and more)
🗣 Text to Audio
🔈 Audio to Text (Audio transcription with whisper.cpp)
🎨 Image generation with stable diffusion
🔥 OpenAI functions 🆕
🧠 Embeddings generation for vector databases
✍️ Constrained grammars
🖼️ Download Models directly from Huggingface
🥽 Vision API
🆕 Reranker API

💻 Usage

Check out the Getting started section in our documentation.

🔗 Community and integrations

Build and deploy custom containers:

https://github.com/sozercan/aikit

WebUIs:

Model galleries

https://github.com/go-skynet/model-gallery

Other:

Helm chart https://github.com/go-skynet/helm-charts
VSCode extension https://github.com/badgooooor/localai-vscode-plugin
Terminal utility https://github.com/djcopley/ShellOracle
Local Smart assistant https://github.com/mudler/LocalAGI
Home Assistant https://github.com/sammcj/homeassistant-localai / https://github.com/drndos/hass-openai-custom-conversation
Discord bot https://github.com/mudler/LocalAGI/tree/main/examples/discord
Slack bot https://github.com/mudler/LocalAGI/tree/main/examples/slack
Telegram bot https://github.com/mudler/LocalAI/tree/master/examples/telegram-bot
Examples: https://github.com/mudler/LocalAI/tree/master/examples/

🔗 Resources

🆕 New! LLM finetuning guide
How to build locally
How to install in Kubernetes
Projects integrating LocalAI
How tos section (curated by our community)

Citation

If you utilize this repository, data in a downstream project, please consider citing it with:

@misc{localai,
  author = {Ettore Di Giacinto},
  title = {LocalAI: The free, Open source OpenAI alternative},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/go-skynet/LocalAI}},

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project:


Spectro Cloud
Spectro Cloud kindly supports LocalAI by providing GPU and computing resources to run tests on lamdalabs!

And a huge shout-out to individuals sponsoring the project by donating hardware or backing the project.

Sponsor list
JDAM00 (donating HW for the CI)

🌟 Star history

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project, a special thanks to our contributors! 🤗

README.md Unescape Escape