LocalAI/aio/gpu-8g/vision.yaml

backend: llama-cpp
context_size: 4096
f16: true

gpu_layers: 90
mmap: true
name: gpt-4-vision-preview

roles:
  user: "USER:"
  assistant: "ASSISTANT:"
  system: "SYSTEM:"

mmproj: llava-v1.6-7b-mmproj-f16.gguf
parameters:
  model: llava-v1.6-mistral-7b.Q5_K_M.gguf
  temperature: 0.2
  top_k: 40
  top_p: 0.95
  seed: -1

template:
  chat: |
    A chat between a curious human and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the human's questions.
    {{.Input}}
    ASSISTANT:

download_files:
- filename: llava-v1.6-mistral-7b.Q5_K_M.gguf
  uri: huggingface://cjpais/llava-1.6-mistral-7b-gguf/llava-v1.6-mistral-7b.Q5_K_M.gguf
- filename: llava-v1.6-7b-mmproj-f16.gguf
  uri: huggingface://cjpais/llava-1.6-mistral-7b-gguf/mmproj-model-f16.gguf

usage: |
    curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
        "model": "gpt-4-vision-preview",
        "messages": [{"role": "user", "content": [{"type":"text", "text": "What is in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00			`backend: llama-cpp`
			`context_size: 4096`
			`f16: true`

			`gpu_layers: 90`
			`mmap: true`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`name: gpt-4-vision-preview`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00
			`roles:`
			`user: "USER:"`
			`assistant: "ASSISTANT:"`
			`system: "SYSTEM:"`

feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`mmproj: llava-v1.6-7b-mmproj-f16.gguf`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00			`parameters:`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`model: llava-v1.6-mistral-7b.Q5_K_M.gguf`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00			`temperature: 0.2`
			`top_k: 40`
			`top_p: 0.95`
			`seed: -1`

			`template:`
			`chat: \|`
			`A chat between a curious human and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the human's questions.`
			`{{.Input}}`
			`ASSISTANT:`

			`download_files:`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`- filename: llava-v1.6-mistral-7b.Q5_K_M.gguf`
			`uri: huggingface://cjpais/llava-1.6-mistral-7b-gguf/llava-v1.6-mistral-7b.Q5_K_M.gguf`
			`- filename: llava-v1.6-7b-mmproj-f16.gguf`
			`uri: huggingface://cjpais/llava-1.6-mistral-7b-gguf/mmproj-model-f16.gguf`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00
			`usage: \|`
			`curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{`
feat(aio): add tests, update model definitions (#1880) 2024-03-22 20:13:11 +00:00			`"model": "gpt-4-vision-preview",`
feat(functions/aio): all-in-one images, function template enhancements (#1862) * feat(startup): allow to specify models from local files * feat(aio): add Dockerfile, make targets, aio profiles * feat(template): add Function and LastMessage * add hermes2-pro-mistral * update hermes2 definition * feat(template): add sprig * feat(template): expose FunctionCall * feat(aio): switch llm for text 2024-03-21 00:12:20 +00:00			`"messages": [{"role": "user", "content": [{"type":"text", "text": "What is in the image?"}, {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg" }}], "temperature": 0.9}]}'`