LocalAI/backend/python/exllama2
Ettore Di Giacinto 5d1018495f
feat(intel): add diffusers/transformers support (#1746)
* feat(intel): add diffusers support

* try to consume upstream container image

* Debug

* Manually install deps

* Map transformers/hf cache dir to modelpath if not specified

* fix(compel): update initialization, pass by all gRPC options

* fix: add dependencies, implement transformers for xpu

* base it from the oneapi image

* Add pillow

* set threads if specified when launching the API

* Skip conda install if intel

* defaults to non-intel

* ci: add to pipelines

* prepare compel only if enabled

* Skip conda install if intel

* fix cleanup

* Disable compel by default

* Install torch 2.1.0 with Intel

* Skip conda on some setups

* Detect python

* Quiet output

* Do not override system python with conda

* Prefer python3

* Fixups

* exllama2: do not install without conda (overrides pytorch version)

* exllama/exllama2: do not install if not using cuda

* Add missing dataset dependency

* Small fixups, symlink to python, add requirements

* Add neural_speed to the deps

* correctly handle model offloading

* fix: device_map == xpu

* go back at calling python, fixed at dockerfile level

* Exllama2 restricted to only nvidia gpus

* Tokenizer to xpu
2024-03-07 14:37:45 +01:00
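Several of the commits above ("Detect python", "Prefer python3", "Do not override system python with conda") concern picking an interpreter at install time. A minimal sketch of that kind of detection is below; the function name and fallback order are assumptions for illustration, not the actual `install.sh` logic:

```shell
#!/bin/sh
# Hypothetical sketch: prefer python3, fall back to python, fail otherwise.
# Not the actual install.sh implementation.
detect_python() {
    if command -v python3 >/dev/null 2>&1; then
        echo python3
    elif command -v python >/dev/null 2>&1; then
        echo python
    else
        echo "no python interpreter found" >&2
        return 1
    fi
}

PYTHON="$(detect_python)" || exit 1
"$PYTHON" --version
```

Using `command -v` keeps the check POSIX-portable, and echoing the interpreter name lets callers capture it instead of hard-coding a path.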
Name                 Last commit                                                              Last updated
backend_pb2_grpc.py  exllama(v2): fix exllamav1, add exllamav2 (#1384)                        2023-12-05 08:15:37 +01:00
backend_pb2.py       Bump vLLM version + more options when loading models in vLLM (#1782)    2024-03-01 22:48:53 +01:00
exllama2_backend.py  fix: exllama2 backend (#1484)                                            2023-12-24 08:32:12 +00:00
exllama2.yml         exllama(v2): fix exllamav1, add exllamav2 (#1384)                        2023-12-05 08:15:37 +01:00
install.sh           feat(intel): add diffusers/transformers support (#1746)                  2024-03-07 14:37:45 +01:00
Makefile             deps(conda): use transformers-env with vllm,exllama(2) (#1554)           2024-01-06 13:32:28 +01:00
run.sh               deps(conda): use transformers-env with vllm,exllama(2) (#1554)           2024-01-06 13:32:28 +01:00