LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2024-06-07 19:40:48 +00:00

History

Ettore Di Giacinto c56b6ddb1c fix(llama.cpp): disable infinite context shifting (#1704 ) Infinite context loop might as well trigger an infinite loop of context shifting if the model hallucinates and does not stop answering. This has the unpleasant effect that the predicion never terminates, which is the case especially on small models which tends to hallucinate. Workarounds https://github.com/mudler/LocalAI/issues/1333 by removing context-shifting. See also upstream issue: https://github.com/ggerganov/llama.cpp/issues/3969	2024-02-13 21:17:21 +01:00
..
grpc	move BUILD_GRPC_FOR_BACKEND_LLAMA logic to makefile: errors in this section now immediately fail the build (#1576 )	2024-01-13 10:08:26 +01:00
llama	fix(llama.cpp): disable infinite context shifting (#1704 )	2024-02-13 21:17:21 +01:00

Ettore Di Giacinto c56b6ddb1c

fix(llama.cpp): disable infinite context shifting (#1704 )

Infinite context loop might as well trigger an infinite loop of context
shifting if the model hallucinates and does not stop answering.
This has the unpleasant effect that the predicion never terminates,
which is the case especially on small models which tends to hallucinate.

Workarounds https://github.com/mudler/LocalAI/issues/1333 by removing
context-shifting.

See also upstream issue: https://github.com/ggerganov/llama.cpp/issues/3969

2024-02-13 21:17:21 +01:00

grpc

move BUILD_GRPC_FOR_BACKEND_LLAMA logic to makefile: errors in this section now immediately fail the build (#1576 )

2024-01-13 10:08:26 +01:00

llama

fix(llama.cpp): disable infinite context shifting (#1704 )

2024-02-13 21:17:21 +01:00