coyzeng
|
d5d82ba344
|
feat(grpc): backend SPI pluggable in embedding mode (#1621)
* run server
* grpc backend embedded support
* backend providable
|
2024-01-23 08:56:36 +01:00 |
|
LocalAI [bot]
|
efe2883c5d
|
⬆️ Update ggerganov/llama.cpp (#1626)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-22 23:22:01 +01:00 |
|
LocalAI [bot]
|
47237c7c3c
|
⬆️ Update ggerganov/llama.cpp (#1623)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-22 08:54:06 +01:00 |
|
Ettore Di Giacinto
|
697c769b64
|
fix(llama.cpp): enable cont batching when parallel is set (#1622)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
|
2024-01-21 14:59:48 +01:00 |
|
Ettore Di Giacinto
|
94261b1717
|
Update gpt-vision.md
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-21 10:07:30 +01:00 |
|
Sebastian
|
eaf85a30f9
|
fix(llama.cpp): Enable parallel requests (#1616)
integrate changes from llama.cpp
Signed-off-by: Sebastian <tauven@gmail.com>
|
2024-01-21 09:56:14 +01:00 |
|
LocalAI [bot]
|
6a88b030ea
|
⬆️ Update ggerganov/llama.cpp (#1620)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-20 23:34:46 +01:00 |
|
LocalAI [bot]
|
f538416fb3
|
⬆️ Update docs version mudler/LocalAI (#1619)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-20 21:37:02 +00:00 |
|
Ettore Di Giacinto
|
06cd9ef98d
|
feat(extra-backends): Improvements, adding mamba example (#1618)
* feat(extra-backends): Improvements
vllm: add max_tokens, wire up stream event
mamba: fixups, adding examples for mamba-chat
* examples(mamba-chat): add
* docs: update
|
2024-01-20 17:56:08 +01:00 |
|
James Braza
|
f3d71f8819
|
Modernized LlamaIndex integration (#1613)
Updated LlamaIndex example
|
2024-01-20 10:06:32 +01:00 |
|
James Braza
|
b7127c2dc9
|
Expanded and interlinked Docker documentation (#1614)
* Corrected dockerhub to Docker Hub
* Consolidated two Docker examples
* Linked Container Images in Manual Images
|
2024-01-20 10:05:14 +01:00 |
|
LocalAI [bot]
|
b2dc5fbd7e
|
⬆️ Update ggerganov/llama.cpp (#1612)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-20 00:38:14 +01:00 |
|
Ettore Di Giacinto
|
9e653d6abe
|
feat: 🐍 add mamba support (#1589)
feat(mamba): Initial import
This is a first iteration of the mamba backend, loosely based on
mamba-chat(https://github.com/havenhq/mamba-chat).
|
2024-01-19 23:42:50 +01:00 |
|
Ettore Di Giacinto
|
52c9a7f45d
|
Update README.md
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-19 19:30:29 +01:00 |
|
Ettore Di Giacinto
|
ee42c9bfe6
|
docs: re-use original permalinks (#1610)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
|
2024-01-19 19:23:58 +01:00 |
|
Ettore Di Giacinto
|
e6c3e483a1
|
Update build.md
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-19 19:09:35 +01:00 |
|
Ettore Di Giacinto
|
3a253c6cd7
|
Makefile: allow to build without GRPC_BACKENDS (#1607)
|
2024-01-19 15:38:43 +01:00 |
|
Luna Midori
|
e9c3bbc6d7
|
Update README.md (#1601)
Signed-off-by: Luna Midori <118759930+lunamidori5@users.noreply.github.com>
|
2024-01-19 08:55:37 +01:00 |
|
LocalAI [bot]
|
23d64ac53a
|
⬆️ Update ggerganov/llama.cpp (#1604)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-18 21:20:50 +00:00 |
|
Ettore Di Giacinto
|
34f9f20ff4
|
Update quickstart.md
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-18 20:49:04 +01:00 |
|
Ettore Di Giacinto
|
a4a72a79ae
|
Update integrations.md
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-18 19:53:41 +01:00 |
|
Ettore Di Giacinto
|
6ca4d38a01
|
docs/examples: enhancements (#1572)
* docs: re-order sections
* fix references
* Add mixtral-instruct, tinyllama-chat, dolphin-2.5-mixtral-8x7b
* Fix link
* Minor corrections
* fix: models is a StringSlice, not a String
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* WIP: switch docs theme
* content
* Fix GH link
* enhancements
* enhancements
* Fixed how to link
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* fixups
* logo fix
* more fixups
* final touches
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
Co-authored-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
|
2024-01-18 19:41:08 +01:00 |
|
LocalAI [bot]
|
b5c93f176a
|
⬆️ Update ggerganov/llama.cpp (#1599)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-18 14:39:30 +01:00 |
|
LocalAI [bot]
|
1aaf88098d
|
⬆️ Update ggerganov/llama.cpp (#1597)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-17 09:27:02 +01:00 |
|
Dionysius
|
6f447e613d
|
docs: missing golang requirement for local build for debian (#1596)
docs: fix missing golang requirement for local build for debian
|
2024-01-17 09:26:43 +01:00 |
|
LocalAI [bot]
|
dfb7c3b1aa
|
⬆️ Update ggerganov/llama.cpp (#1594)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-16 14:47:57 +01:00 |
|
Dionysius
|
b41eb5e1f3
|
prepend built binaries in PATH for BUILD_GRPC_FOR_BACKEND_LLAMA (#1593)
prepend built binaries in PATH
|
2024-01-16 14:47:47 +01:00 |
|
LocalAI [bot]
|
9c2d264979
|
⬆️ Update ggerganov/llama.cpp (#1590)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-15 09:01:07 +01:00 |
|
LocalAI [bot]
|
b996c3198c
|
⬆️ Update ggerganov/llama.cpp (#1587)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-14 09:46:47 +00:00 |
|
Ettore Di Giacinto
|
f879c07c86
|
Update README.md
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-14 10:00:46 +01:00 |
|
Dionysius
|
441e2965ff
|
move BUILD_GRPC_FOR_BACKEND_LLAMA logic to makefile: errors in this section now immediately fail the build (#1576)
* move BUILD_GRPC_FOR_BACKEND_LLAMA option to makefile
* review: oversight, fixup cmake_args
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Signed-off-by: Dionysius <1341084+dionysius@users.noreply.github.com>
---------
Signed-off-by: Dionysius <1341084+dionysius@users.noreply.github.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-13 10:08:26 +01:00 |
|
LocalAI [bot]
|
cbe9a03e3c
|
⬆️ Update ggerganov/llama.cpp (#1583)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-12 23:04:04 +01:00 |
|
LocalAI [bot]
|
4ee7e73d00
|
⬆️ Update ggerganov/llama.cpp (#1578)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-12 16:04:33 +01:00 |
|
lunamidori5
|
1cca449726
|
Moving the how tos to self hosted (#1574)
* Update _index.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Delete docs/content/howtos/easy-setup-sd.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Delete docs/content/howtos/easy-setup-full.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Delete docs/content/howtos/easy-setup-embeddings.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Delete docs/content/howtos/easy-setup-docker.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Delete docs/content/howtos/easy-request.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Delete docs/content/howtos/easy-model.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Update _index.en.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Update README.md
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
* Delete docs/content/howtos directory
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
---------
Signed-off-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
|
2024-01-11 09:25:18 +01:00 |
|
LocalAI [bot]
|
faf7c1c325
|
⬆️ Update ggerganov/llama.cpp (#1573)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-11 08:41:32 +01:00 |
|
LocalAI [bot]
|
58288494d6
|
⬆️ Update ggerganov/llama.cpp (#1568)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-10 10:18:57 +01:00 |
|
Dionysius
|
72283dc744
|
minor: replace shell pwd in Makefile with CURDIR for better windows compatibility (#1571)
replace shell pwd in Makefile with CURDIR
|
2024-01-10 08:39:50 +00:00 |
|
LocalAI [bot]
|
b8240b4c18
|
⬆️ Update docs version mudler/LocalAI (#1567)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-09 21:56:12 +01:00 |
|
Ettore Di Giacinto
|
5309da40b7
|
Update Dockerfile
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-09 08:55:43 +01:00 |
|
Ettore Di Giacinto
|
08b90b4720
|
Update _index.en.md
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-09 08:50:19 +01:00 |
|
LocalAI [bot]
|
2e890b3838
|
⬆️ Update ggerganov/llama.cpp (#1563)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-09 08:48:40 +01:00 |
|
LocalAI [bot]
|
06656fc057
|
⬆️ Update docs version mudler/LocalAI (#1562)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-09 08:48:24 +01:00 |
|
LocalAI [bot]
|
574fa67bdc
|
⬆️ Update ggerganov/llama.cpp (#1558)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-08 00:38:03 +01:00 |
|
Ettore Di Giacinto
|
e19d7226f8
|
feat: more embedded models, coqui fixes, add model usage and description (#1556)
* feat: add model descriptions and usage
* remove default model gallery
* models: add embeddings and tts
* docs: update table
* docs: updates
* images: cleanup pip cache after install
* images: always run apt-get clean
* ux: improve gRPC connection errors
* ux: improve some messages
* fix: fix coqui when no AudioPath is passed by
* embedded: add more models
* Add usage
* Reorder table
|
2024-01-08 00:37:02 +01:00 |
|
LocalAI [bot]
|
0843fe6c65
|
⬆️ Update docs version mudler/LocalAI (#1557)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-07 09:36:21 +01:00 |
|
Ettore Di Giacinto
|
62a02cd1fe
|
deps(conda): use transformers environment with autogptq (#1555)
|
2024-01-06 15:30:53 +01:00 |
|
Ettore Di Giacinto
|
949da7792d
|
deps(conda): use transformers-env with vllm,exllama(2) (#1554)
* deps(conda): use transformers with vllm
* join vllm, exllama, exllama2, split petals
|
2024-01-06 13:32:28 +01:00 |
|
Ettore Di Giacinto
|
ce724a7e55
|
docs: improve getting started (#1553)
* docs: improve getting started
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
* cleanups
* Use dockerhub links
* Shrink command to minimum
---------
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2024-01-06 01:04:14 +01:00 |
|
LocalAI [bot]
|
0a06c80801
|
⬆️ Update ggerganov/llama.cpp (#1547)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
|
2024-01-05 23:27:51 +01:00 |
|
LocalAI [bot]
|
edc55ade61
|
⬆️ Update docs version mudler/LocalAI (#1546)
Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: mudler <mudler@users.noreply.github.com>
Co-authored-by: lunamidori5 <118759930+lunamidori5@users.noreply.github.com>
|
2024-01-05 23:27:30 +01:00 |
|