Commit Graph

1680 Commits

Author SHA1 Message Date
Ettore Di Giacinto 1370b4482f
📖 Add prompt-templates examples (#25)
Signed-off-by: mudler <mudler@c3os.io>
2023-04-16 10:24:15 +02:00
Ettore Di Giacinto b062f3142b
feat: enhance API, expose more parameters (#24)
Signed-off-by: mudler <mudler@c3os.io>
2023-04-16 10:16:48 +02:00
Marc R Kellerman c37175271f
feature: makefile & updates (#23)
Co-authored-by: mudler <mudler@c3os.io>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2023-04-15 16:39:07 -07:00
Ettore Di Giacinto e8eab66c30
Merge pull request #22 from go-skynet/update-llama.cpp
⬆️ Update go-llama.cpp to `llama.cpp-2f7c8e0`
2023-04-16 00:06:52 +02:00
mudler a73a497143 Update llama.cpp 2023-04-15 23:57:00 +02:00
Ettore Di Giacinto 6aea515e1d
Merge pull request #20 from go-skynet/mudler-patch-1
📖 Update README.md
2023-04-15 00:38:30 +02:00
Ettore Di Giacinto dfc2b7e02a
📖 Update README.md 2023-04-15 00:38:18 +02:00
Ettore Di Giacinto 040290971c
Merge pull request #19 from go-skynet/tags
Use tags for go-llama.cpp
2023-04-15 00:14:47 +02:00
mudler 553bad585e Use tags for go-llama.cpp 2023-04-15 00:07:39 +02:00
Ettore Di Giacinto f76b612506
Merge pull request #17 from go-skynet/mudler-patch-1
Fix comment typo
2023-04-13 15:21:13 +02:00
Ettore Di Giacinto c4e94c88d7
Fix comment typo
Thanks to @deadprogram for noticing it!
2023-04-13 15:20:51 +02:00
mudler a9cd6b3ca3 ci: Fix tag detection for 'latest' 2023-04-13 01:37:09 +02:00
mudler e786576b95 Update README 2023-04-13 01:28:15 +02:00
Ettore Di Giacinto d426571789
Merge pull request #16 from go-skynet/fix_arm
Drop armv7 builds
2023-04-13 01:21:58 +02:00
mudler a896a2b5ad Drop armv7 builds 2023-04-13 01:21:40 +02:00
Ettore Di Giacinto 8273cd5c04
Merge pull request #15 from go-skynet/docker-compose
Add docker-compose file
2023-04-13 01:17:44 +02:00
mudler 16f1281d38 Minor workflow fixes 2023-04-13 01:16:13 +02:00
mudler 8042e9a2d6 Add docker-compose
Fixes #14

Signed-off-by: mudler <mudler@c3os.io>
2023-04-13 01:13:14 +02:00
mudler 624092cb99 Update README 2023-04-12 00:07:30 +02:00
mudler a422a883ac Minor rephrasing 2023-04-12 00:04:15 +02:00
mudler 7858a97254 Update README 2023-04-12 00:02:47 +02:00
mudler 5556aa46dd Small refinements and refactors 2023-04-12 00:02:39 +02:00
mudler eb4257f946 Add .gitignore 2023-04-11 23:44:00 +02:00
mudler ae30bd346d Reorganize repository layout 2023-04-11 23:43:43 +02:00
mudler 93d8977ba2 Return model list 2023-04-10 12:02:40 +02:00
mudler f43aeeb4a1 Add both API endpoints (completion, chat) 2023-04-09 12:30:55 +02:00
mudler c17dcc5e9d Allow to inject prompt as part of the call 2023-04-09 09:36:19 +02:00
mudler 4a932483e1 Small fixup to template loading 2023-04-08 11:59:40 +02:00
mudler b710147b95 Add mutex on same models (parallel isn't supported yet) 2023-04-08 11:45:36 +02:00
mudler ba70363330 Use template input 2023-04-08 11:24:25 +02:00
mudler 9fb581739b Allow to template model prompts inputs 2023-04-08 10:46:51 +02:00
mudler 48aca246e3 Drop unused interactive mode 2023-04-07 11:31:14 +02:00
mudler 12eee097b7 Make it compatible with openAI api, support multiple models
Signed-off-by: mudler <mudler@c3os.io>
2023-04-07 11:30:59 +02:00
mudler b33d015b8c Use go-llama.cpp 2023-04-07 10:08:15 +02:00
Ettore Di Giacinto b7c0a108f5
Update README.md 2023-04-05 22:28:03 +02:00
Ettore Di Giacinto f694a89c28
Update README.md 2023-04-05 22:14:00 +02:00
Ettore Di Giacinto be682e6c2f
Update README.md
Add short-term roadmap and mention webui
2023-04-05 22:04:35 +02:00
mudler bf85a31f9e Don't set a default model path 2023-04-05 22:00:15 +02:00
Ettore Di Giacinto d69048e0b0
Update README.md 2023-04-05 00:41:02 +02:00
mudler 827f189163 Update README 2023-03-30 18:46:11 +02:00
mudler a23deb5ec7 Drop duplicate target 2023-03-29 19:44:41 +02:00
mudler 999676b106 Add gpt4all instructions 2023-03-29 18:58:54 +02:00
mudler c61b023bc8 Drop fat images, will document how to consume models 2023-03-29 18:55:24 +02:00
mudler 650a22aef1 Add compatibility to gpt4all models 2023-03-29 18:53:24 +02:00
mudler 17b1724f7c Update llama-go 2023-03-27 01:18:14 +02:00
mudler e860e62036 Add mutex, build only lite images 2023-03-27 01:01:38 +02:00
Ettore Di Giacinto 1f45ff8cd6
Update README.md 2023-03-26 23:37:26 +02:00
mudler abee34f60a Cleanup leftover 2023-03-25 01:10:50 +01:00
mudler dbc70dc13c Add a simple web-page as index of the API for helping with inference testing 2023-03-25 01:09:51 +01:00
mudler 55142065eb Update README with building instructions 2023-03-24 01:11:13 +01:00