🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others

ai alpaca api api-rest bloom containers falcon gpt-neox gpt4all guanaco kubernetes llama llm rwkv stable-diffusion tts vicuna

Go to file

mudler b722e7eb7e feat: cleanups, small enhancements Signed-off-by: mudler <mudler@localai.io>		2023-07-04 18:58:19 +02:00
.github	feat: fix CUDA images and update go-llama to use full GPU offloading (#618 )	2023-06-18 08:27:29 +02:00
.vscode	feat: Add more test-cases and remove dev container (#433 )	2023-05-30 13:01:55 +02:00
api	feat: cleanups, small enhancements	2023-07-04 18:58:19 +02:00
examples	example(slack-qa-bot): Add slack QA bot example (#654 )	2023-06-22 18:07:15 +02:00
internal	feat: cleanups, small enhancements	2023-07-04 18:58:19 +02:00
models	Add docker-compose	2023-04-13 01:13:14 +02:00
pkg	feat: cleanups, small enhancements	2023-07-04 18:58:19 +02:00
prompt-templates	docs: enhancements (#133 )	2023-04-30 23:27:02 +02:00
tests	feat: update go-gpt2 (#359 )	2023-05-23 21:47:47 +02:00
.dockerignore	feat: add LangChainGo Huggingface backend (#446 )	2023-06-01 12:00:06 +02:00
.env	Update .env	2023-06-28 18:28:53 +02:00
.gitignore	fix: copy metal file from build (#564 )	2023-06-11 01:07:39 +02:00
Dockerfile	fix: copy git to correctly display version in /version	2023-07-04 18:58:19 +02:00
Earthfile	Rename project to LocalAI (#35 )	2023-04-19 18:43:10 +02:00
LICENSE	docs: update docs/license(clarification) and point to new website (#415 )	2023-05-29 23:09:19 +02:00
Makefile	fix: copy git to correctly display version in /version	2023-07-04 18:58:19 +02:00
README.md	Update README.md	2023-07-03 00:52:26 +02:00
assets.go	feat: Update gpt4all, support multiple implementations in runtime (#472 )	2023-06-01 23:38:52 +02:00
docker-compose.yaml	images: cleanup, drop .dev Dockerfile (#437 )	2023-05-30 15:58:10 +02:00
entrypoint.sh	fix: split build threads from running threads (#703 )	2023-06-29 11:57:09 +02:00
go.mod	fix(deps): update github.com/go-skynet/go-llama.cpp digest to 42ba448 (#698 )	2023-06-28 23:44:04 +02:00
go.sum	fix(deps): update github.com/go-skynet/go-llama.cpp digest to 42ba448 (#698 )	2023-06-28 23:44:04 +02:00
main.go	feat: cleanups, small enhancements	2023-07-04 18:58:19 +02:00
renovate.json	ci: manually update deps	2023-05-04 15:01:29 +02:00

README.md

LocalAI

Documentation website

LocalAI is a drop-in replacement REST API that's compatible with OpenAI API specifications for local inferencing. It allows you to run LLMs (and not only) locally or on-prem with consumer grade hardware, supporting multiple model families that are compatible with the ggml format. Does not require GPU.

For a list of the supported model families, please see the model compatibility table.

In a nutshell:

Local, OpenAI drop-in alternative REST API. You own your data.
NO GPU required. NO Internet access is required either
- Optional, GPU Acceleration is available in llama.cpp-compatible LLMs. See also the build section.
Supports multiple models:
- 📖 Text generation with GPTs (llama.cpp, gpt4all.cpp, ... and more)
- 🗣 Text to Audio 🎺🆕
- 🔈 Audio to Text (Audio transcription with whisper.cpp)
- 🎨 Image generation with stable diffusion
🏃 Once loaded the first time, it keep models loaded in memory for faster inference
⚡ Doesn't shell-out, but uses C++ bindings for a faster inference and better performance.

LocalAI was created by Ettore Di Giacinto and is a community-driven project, focused on making the AI accessible to anyone. Any contribution, feedback and PR is welcome!

See the Getting started and examples sections to learn how to use LocalAI. For a list of curated models check out the model gallery.

ChatGPT OSS alternative	Image generation

Telegram bot	Flowise

Hot topics / Roadmap

Support for embeddings
Support for audio transcription with https://github.com/ggerganov/whisper.cpp
Support for text-to-audio
GPU/CUDA support ( https://github.com/go-skynet/LocalAI/issues/69 )
Enable automatic downloading of models from a curated gallery
Enable automatic downloading of models from HuggingFace
Upstream our golang bindings to llama.cpp (https://github.com/ggerganov/llama.cpp/issues/351)
Enable gallery management directly from the webui.
🔥 OpenAI functions: https://github.com/go-skynet/LocalAI/issues/588

News

🔥🔥🔥 28-06-2023: v1.20.0: Added text to audio and gallery huggingface repositories! Release notes Changelog
🔥🔥🔥 19-06-2023: v1.19.0: CUDA support! Release notes Changelog
🔥🔥🔥 06-06-2023: v1.18.0: Many updates, new features, and much more 🚀, check out the Release notes!
29-05-2023: LocalAI now has a website, https://localai.io! check the news in the dedicated section!

For latest news, follow also on Twitter @LocalAI_API and @mudler_it

Contribute and help

To help the project you can:

Hacker news post - help us out by voting if you like this project.
If you have technological skills and want to contribute to development, have a look at the open issues. If you are new you can have a look at the good-first-issue and help-wanted labels.
If you don't have technological skills you can still help improving documentation or add examples or share your user-stories with our community, any help and contribution is welcome!

Usage

Check out the Getting started section. Here below you will find generic, quick instructions to get ready and use LocalAI.

The easiest way to run LocalAI is by using docker-compose (to build locally, see building LocalAI):


git clone https://github.com/go-skynet/LocalAI

cd LocalAI

# (optional) Checkout a specific LocalAI tag
# git checkout -b build <TAG>

# copy your models to models/
cp your-model.bin models/

# (optional) Edit the .env file to set things like context size and threads
# vim .env

# start with docker-compose
docker-compose up -d --pull always
# or you can build the images with:
# docker-compose up -d --build

# Now API is accessible at localhost:8080
curl http://localhost:8080/v1/models
# {"object":"list","data":[{"id":"your-model.bin","object":"model"}]}

curl http://localhost:8080/v1/completions -H "Content-Type: application/json" -d '{
     "model": "your-model.bin",            
     "prompt": "A long time ago in a galaxy far, far away",
     "temperature": 0.7
   }'

Example: Use GPT4ALL-J model

# Clone LocalAI
git clone https://github.com/go-skynet/LocalAI

cd LocalAI

# (optional) Checkout a specific LocalAI tag
# git checkout -b build <TAG>

# Download gpt4all-j to models/
wget https://gpt4all.io/models/ggml-gpt4all-j.bin -O models/ggml-gpt4all-j

# Use a template from the examples
cp -rf prompt-templates/ggml-gpt4all-j.tmpl models/

# (optional) Edit the .env file to set things like context size and threads
# vim .env

# start with docker-compose
docker-compose up -d --pull always
# or you can build the images with:
# docker-compose up -d --build
# Now API is accessible at localhost:8080
curl http://localhost:8080/v1/models
# {"object":"list","data":[{"id":"ggml-gpt4all-j","object":"model"}]}

curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
     "model": "ggml-gpt4all-j",
     "messages": [{"role": "user", "content": "How are you?"}],
     "temperature": 0.9 
   }'

# {"model":"ggml-gpt4all-j","choices":[{"message":{"role":"assistant","content":"I'm doing well, thanks. How about you?"}}]}

Build locally

In order to build the LocalAI container image locally you can use docker:

# build the image
docker build -t localai .
docker run localai

Or you can build the binary with make:

make build

See the build section in our documentation for detailed instructions.

Run LocalAI in Kubernetes

LocalAI can be installed inside Kubernetes with helm. See installation instructions.

Supported API endpoints

See the list of the supported API endpoints and how to configure image generation and audio transcription.

Frequently asked questions

See the FAQ section for a list of common questions.

Projects already using LocalAI to run local models

Feel free to open up a PR to get your project listed!

Star history

License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT

Author

Ettore Di Giacinto and others

Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!