LocalAI/pkg
Ettore Di Giacinto 8ccf5b2044
feat(speculative-sampling): allow to specify a draft model in the model config (#1052)
**Description**

This PR fixes #1013.

It adds `draft_model` and `n_draft` to the model YAML config in order to
load models with speculative sampling. This should be compatible as well
with grammars.

example:

```yaml
backend: llama                                                                                                                                                                   
context_size: 1024                                                                                                                                                                        
name: my-model-name
parameters:
  model: foo-bar
n_draft: 16                                                                                                                                                                      
draft_model: model-name
```

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2023-09-14 17:44:16 +02:00
..
assets feat: Update gpt4all, support multiple implementations in runtime (#472) 2023-06-01 23:38:52 +02:00
backend feat(speculative-sampling): allow to specify a draft model in the model config (#1052) 2023-09-14 17:44:16 +02:00
gallery feat: Model Gallery Endpoint Refactor / Mutable Galleries Endpoints (#991) 2023-09-02 09:00:44 +02:00
grammar feat: update integer, number and string rules - allow primitives as root types (#862) 2023-08-03 23:32:30 +02:00
grpc feat(speculative-sampling): allow to specify a draft model in the model config (#1052) 2023-09-14 17:44:16 +02:00
langchain feat: add LangChainGo Huggingface backend (#446) 2023-06-01 12:00:06 +02:00
model feat: backend monitor shutdown endpoint, process based (#938) 2023-08-23 18:38:37 +02:00
stablediffusion feat: support upscaled image generation with esrgan (#509) 2023-06-05 17:21:38 +02:00
utils feat: Model Gallery Endpoint Refactor / Mutable Galleries Endpoints (#991) 2023-09-02 09:00:44 +02:00