LocalAI/embedded/models/bert-cpp.yaml

backend: bert-embeddings
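# embeddings exposes this model via the /embeddings endpoint; f16, gpu_layers
# and mmap tune how the weights are loaded (16-bit floats, GPU offloading,
# memory mapping).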
embeddings: true
f16: true
gpu_layers: 90
mmap: true
name: bert-cpp-minilm-v6
parameters:
  model: bert-MiniLM-L6-v2q4_0.bin
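# Files LocalAI downloads for this model; the sha256 checksum verifies the download.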
download_files:
- filename: "bert-MiniLM-L6-v2q4_0.bin"
  sha256: "a5a174d8772c8a569faf9f3136c441f2c3855b5bf35ed32274294219533feaad"
  uri: "https://huggingface.co/mudler/all-MiniLM-L6-v2/resolve/main/ggml-model-q4_0.bin"
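# Usage notes; the curl example assumes LocalAI's default port (8080).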
usage: |
  You can test this model with curl like this:

  curl http://localhost:8080/embeddings -X POST -H "Content-Type: application/json" -d '{
    "input": "Your text string goes here",
    "model": "bert-cpp-minilm-v6"
  }'