mirror of https://github.com/mudler/LocalAI.git (synced 2024-06-07 19:40:48 +00:00)
Commit 3882130911:
* fixes #1775 and #1774: add BitsAndBytes quantization and fix embedding on CUDA devices
* Manage 4-bit and 8-bit quantization: manage different BitsAndBytes options with the `quantization:` parameter in YAML
* fix compilation errors on non-CUDA environments

Name | ||
---|---|---|
backend_pb2_grpc.py | ||
backend_pb2.py | ||
Makefile | ||
README.md | ||
run.sh | ||
test_transformers_server.py | ||
test.sh | ||
transformers_server.py |
Creating a separate environment for the transformers project
make transformers
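
The commit message above mentions a `quantization:` parameter in the YAML that selects between BitsAndBytes options. The following is only a minimal sketch of how such a value might be mapped to keyword arguments for `transformers.BitsAndBytesConfig`. The option names `bnb_4bit` and `bnb_8bit` and the helper `bnb_kwargs` are assumptions made for illustration; they are not confirmed by this page, and the actual logic in `transformers_server.py` may differ.

```python
from typing import Any, Dict, Optional

# Hypothetical option names: the values the backend really accepts for
# `quantization:` are not shown on this page and may differ.
_QUANT_OPTIONS: Dict[str, Dict[str, Any]] = {
    # load_in_4bit / load_in_8bit are keyword arguments of
    # transformers.BitsAndBytesConfig.
    "bnb_4bit": {"load_in_4bit": True},
    "bnb_8bit": {"load_in_8bit": True},
}


def bnb_kwargs(quantization: Optional[str]) -> Optional[Dict[str, Any]]:
    """Map a quantization option to BitsAndBytesConfig keyword arguments.

    Returns None when no quantization is requested, and raises ValueError
    for an option this sketch does not know about.
    """
    if not quantization:
        return None
    if quantization not in _QUANT_OPTIONS:
        raise ValueError(f"unsupported quantization option: {quantization!r}")
    # Return a copy so callers cannot mutate the shared table.
    return dict(_QUANT_OPTIONS[quantization])


if __name__ == "__main__":
    print(bnb_kwargs("bnb_4bit"))
```

On a CUDA machine with `transformers` and `bitsandbytes` installed, the returned dictionary could be passed as `BitsAndBytesConfig(**kwargs)` and handed to `from_pretrained(..., quantization_config=...)`. Returning `None` for an empty value keeps the unquantized path explicit, which matters on non-CUDA environments where BitsAndBytes is unavailable.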