🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs ggml, gguf, GPTQ, onnx, TF compatible models: llama, llama2, rwkv, whisper, vicuna, koala, cerebras, falcon, dolly, starcoder, and many others
Samuel Maynard deeef5fc24
fix(utf8): prevent multi-byte utf8 characters from being mangled (#981)
**Description**

This PR fixes #677 using the [suggested solution](https://github.com/go-skynet/LocalAI/issues/677#issuecomment-1695939097) from @yantoz: the bytes of an incomplete multi-byte UTF-8 character are buffered until the rest of the character arrives, instead of being streamed out as the replacement character `\ufffd` (see the sketch after the examples below).

before:
```
❯ curl -N http://localhost:57541/v1/completions -H "Content-Type: application/json" -d '{
     "model": "ggml-model-q4_0.bin",
     "prompt": "",
     "max_tokens": 32,
     "temperature": 0.7,
     "stream": true
   }'
data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"\ufffd"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":" |"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":" I"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"'"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"text":"m"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}
```

now:
```
❯ curl -N http://localhost:57541/v1/completions -H "Content-Type: application/json" -d '{
   "model": "ggml-model-q4_0.bin",
   "prompt": "",
   "max_tokens": 32,
   "temperature": 0.7,
   "stream": true
 }'
data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"😂"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":" "}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"|"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":" "}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"I"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"'"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}

data: {"object":"text_completion","model":"ggml-model-q4_0.bin","choices":[{"index":0,"text":"m"}],"usage":{"prompt_tokens":0,"completion_tokens":0,"total_tokens":0}}
```
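
To make the buffering idea concrete, here is a minimal, self-contained Go sketch of the technique. It is not the actual code from this PR; `tokenFlusher` and `Push` are hypothetical names used only for illustration:

```
package main

import (
	"fmt"
	"unicode/utf8"
)

// tokenFlusher holds back the trailing bytes of an incomplete UTF-8
// sequence, so a rune split across two tokens is never emitted as U+FFFD.
// Illustrative sketch only; not the type used in this PR.
type tokenFlusher struct {
	pending []byte
}

// Push appends a token's raw bytes and returns the longest prefix of
// the buffer that ends on a complete rune boundary.
func (f *tokenFlusher) Push(token string) string {
	f.pending = append(f.pending, token...)
	n := 0
	for n < len(f.pending) {
		r, size := utf8.DecodeRune(f.pending[n:])
		if r == utf8.RuneError && size == 1 && len(f.pending)-n < utf8.UTFMax {
			// Fewer than 4 bytes remain and they don't decode: assume the
			// rest of the rune is still in flight and hold these back.
			break
		}
		n += size
	}
	out := string(f.pending[:n])
	f.pending = f.pending[n:]
	return out
}

func main() {
	var f tokenFlusher
	emoji := "😂" // 4 bytes in UTF-8
	// Simulate the backend splitting the emoji across two tokens.
	fmt.Printf("%q\n", f.Push(emoji[:2])) // "" — incomplete, buffered
	fmt.Printf("%q\n", f.Push(emoji[2:])) // "😂" — flushed once complete
}
```

In the failing case above, each fragment of the 4-byte `😂` was decoded on its own, producing `\ufffd`; with buffering, the fragments are only decoded once they form a complete rune.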

**Notes for Reviewers**


**[Signed
commits](../CONTRIBUTING.md#signing-off-on-commits-developer-certificate-of-origin)**
- [X] Yes, I signed my commits.
 

<!--
Thank you for contributing to LocalAI! 

Contributing Conventions:

1. Include descriptive PR titles with [<component-name>] prepended.
2. Build and test your changes before submitting a PR. 
3. Sign your commits

By following the community's contribution conventions upfront, the review process will be accelerated and your PR merged more quickly.
-->

Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

LocalAI

💡 Get help - FAQ 💭 Discussions 💬 Discord 📖 Documentation website

💻 Quickstart 📣 News 🛫 Examples 🖼️ Models

LocalAI is a drop-in replacement REST API compatible with OpenAI API specifications for local inferencing. It allows you to run LLMs (and not only LLMs) locally or on-prem on consumer-grade hardware, supporting multiple model families compatible with the ggml format. No GPU is required.


In a nutshell:

  • Local, OpenAI drop-in alternative REST API. You own your data.
  • NO GPU required. NO Internet access is required either.
    • Optional: GPU acceleration is available for llama.cpp-compatible LLMs. See also the build section.
  • Supports multiple models
  • 🏃 Once loaded the first time, models are kept in memory for faster inference
  • Doesn't shell out; it uses C++ bindings for faster inference and better performance.

LocalAI was created by Ettore Di Giacinto and is a community-driven project, focused on making AI accessible to anyone. Contributions, feedback, and PRs are welcome!

Note that this started as a fun weekend project to build the necessary pieces for a full AI assistant like ChatGPT. The community is growing fast, and we are working hard to make it better and more stable. If you want to help, please consider contributing (see below)!

🔥🔥 Hot topics / Roadmap

🚀 Features

📖 🎥 Media, Blogs, Social

💻 Usage

Check out the Getting started section in our documentation.

💡 Example: Use GPT4ALL-J model

See the documentation
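
As a first request against a running instance, a chat completion call looks like the following. This example assumes LocalAI is listening on its default port 8080 and that a `ggml-gpt4all-j` model file has been placed in the models directory (see the documentation for the exact setup steps):

```
curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
     "model": "ggml-gpt4all-j",
     "messages": [{"role": "user", "content": "How are you?"}],
     "temperature": 0.9
   }'
```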

🔗 Resources

❤️ Sponsors

Do you find LocalAI useful?

Support the project by becoming a backer or sponsor. Your logo will show up here with a link to your website.

A huge thank you to our generous sponsors who support this project:

Spectro Cloud
Spectro Cloud kindly supports LocalAI by providing GPU and computing resources to run tests on Lambda Labs!

🌟 Star history

LocalAI Star history Chart

📖 License

LocalAI is a community-driven project created by Ettore Di Giacinto.

MIT - Author Ettore Di Giacinto

🙇 Acknowledgements

LocalAI couldn't have been built without the help of great software already available from the community. Thank you!

🤗 Contributors

This is a community project, a special thanks to our contributors! 🤗