LocalAI

Mirror/LocalAI

Fork 0

mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

Commit Graph

Author	SHA1	Message	Date
Ettore Di Giacinto	453e9c5da9	fix(vllm): set default top_p with vllm (#1078 ) Description This PR fixes vllm when called with a request with an empty top_p Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-19 18:10:23 +02:00
Ettore Di Giacinto	8ccf5b2044	feat(speculative-sampling): allow to specify a draft model in the model config (#1052 ) Description This PR fixes #1013. It adds `draft_model` and `n_draft` to the model YAML config in order to load models with speculative sampling. This should be compatible as well with grammars. example: ```yaml backend: llama context_size: 1024 name: my-model-name parameters: model: foo-bar n_draft: 16 draft_model: model-name ``` --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-14 17:44:16 +02:00
Ettore Di Giacinto	c0bb5c4bf6	feat(vllm): Initial vllm backend implementation Related to: https://github.com/go-skynet/LocalAI/issues/1015 Signed-off-by: Ettore Di Giacinto <mudler@localai.io>	2023-09-09 17:03:23 +02:00

Author

SHA1

Message

Date

Ettore Di Giacinto

453e9c5da9

fix(vllm): set default top_p with vllm (#1078 )

**Description**

This PR fixes vllm when called with a request with an empty top_p

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2023-09-19 18:10:23 +02:00

Ettore Di Giacinto

8ccf5b2044

feat(speculative-sampling): allow to specify a draft model in the model config (#1052 )

**Description**

This PR fixes #1013.

It adds `draft_model` and `n_draft` to the model YAML config in order to
load models with speculative sampling. This should be compatible as well
with grammars.

example:

```yaml
backend: llama                                                                                                                                                                   
context_size: 1024                                                                                                                                                                        
name: my-model-name
parameters:
  model: foo-bar
n_draft: 16                                                                                                                                                                      
draft_model: model-name
```

---------

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2023-09-14 17:44:16 +02:00

Ettore Di Giacinto

c0bb5c4bf6

feat(vllm): Initial vllm backend implementation

Related to: https://github.com/go-skynet/LocalAI/issues/1015

Signed-off-by: Ettore Di Giacinto <mudler@localai.io>

2023-09-09 17:03:23 +02:00

3 Commits