name: phi-2
context_size: 2048
f16: true
gpu_layers: 90
mmap: true
trimsuffix:
- "\n"
parameters:
  model: huggingface://TheBloke/phi-2-GGUF/phi-2.Q8_0.gguf
  temperature: 0.2
  top_k: 40
  top_p: 0.95
  seed: -1

mirostat: 2
mirostat_eta: 1.0
mirostat_tau: 1.0
template:
  chat: &template |-
    Instruct: {{.Input}}
    Output:
  completion: *template

usage: |
  To use this model, send a request to the API from another terminal, for instance with curl:
  curl http://localhost:8080/v1/chat/completions -H "Content-Type: application/json" -d '{
      "model": "phi-2",
      "messages": [{"role": "user", "content": "How are you doing?"}],
      "temperature": 0.1
  }'
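
  The same request can also be sketched in Python using only the standard
  library. This is a minimal sketch, not an official client: the helper names
  build_request and ask are hypothetical, the endpoint is the one from the
  curl example, and it assumes the server returns an OpenAI-style body with
  the reply under choices[0].message.content. It also assumes "temperature"
  is read as a top-level request field, as OpenAI-style chat endpoints do.

  ```python
  import json
  import urllib.request

  # Assumed local endpoint, copied from the curl example.
  API_URL = "http://localhost:8080/v1/chat/completions"


  def build_request(prompt, temperature=0.1, model="phi-2"):
      """Build (but do not send) a chat completion request."""
      payload = {
          "model": model,
          "messages": [{"role": "user", "content": prompt}],
          # Sampling options sit beside "messages", not inside a message.
          "temperature": temperature,
      }
      return urllib.request.Request(
          API_URL,
          data=json.dumps(payload).encode("utf-8"),
          headers={"Content-Type": "application/json"},
          method="POST",
      )


  def ask(prompt):
      """Send the request; assumes an OpenAI-style response body."""
      with urllib.request.urlopen(build_request(prompt)) as response:
          body = json.load(response)
      return body["choices"][0]["message"]["content"]
  ```

  With the server running, ask("How are you doing?") should return the
  model's reply as a string.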