LocalAI/backend
fakezeta 8210ffcb6c
feat: Token Stream support for Transformer, fix: missing package for OpenVINO (#1908)
* Streaming working

* Small fix for regression on CUDA and XPU

* use pip version of optimum[openvino]

* Update backend/python/transformers/transformers_server.py

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

* Token streaming support

fix optimum[openvino] package in install.sh

* Token Streaming support

---------

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2024-03-27 17:50:35 +01:00
..
cpp test/fix: OSX Test Repair (#1843) 2024-03-18 19:19:43 +01:00
go feat(stores): Vector store backend (#1795) 2024-03-22 21:14:04 +01:00
python feat: Token Stream support for Transformer, fix: missing package for OpenVINO (#1908) 2024-03-27 17:50:35 +01:00
backend.proto feat(stores): Vector store backend (#1795) 2024-03-22 21:14:04 +01:00
backend_grpc.pb.go transformers: correctly load automodels (#1643) 2024-01-26 00:13:21 +01:00