llama.cpp/prompts
Georgi Gerganov 0e89203b51
speculative : add tree-based sampling example (#3624)

* sampling : one sequence per sampling context

ggml-ci

* speculative : add tree-based sampling support

ggml-ci

* speculative : reuse the n_parallel CLI param

* speculative : refactor sampling

* examples : fix build after sampling refactoring

ggml-ci

* batched : fix n_seq_id

* sampling : fix malloc

ggml-ci

* swift : fix build

ggml-ci

* swift : try to fix build

ggml-ci

* prompts : add assistant.txt

* common : add llama_batch_add() and llama_batch_clear() helpers

* speculative : minor refactor

ggml-ci

* minor : comments + rename

ggml-ci

* speculative : fix off-by-one for n_drafted

* speculative : fix the n_drafted fix + p constants

2023-10-18 16:21:57 +03:00
alpaca.txt Revert "main : alternative instruct mode (Vicuna support, etc.) (#863)" (#982) 2023-04-14 22:58:43 +03:00
assistant.txt speculative : add tree-based sampling example (#3624) 2023-10-18 16:21:57 +03:00
chat-with-baichuan.txt feature : support Baichuan serial models (#3009) 2023-09-14 12:32:10 -04:00
chat-with-bob.txt Revert "main : alternative instruct mode (Vicuna support, etc.) (#863)" (#982) 2023-04-14 22:58:43 +03:00
chat-with-vicuna-v0.txt examples : read chat prompts from a template file (#1196) 2023-05-03 20:58:11 +03:00
chat-with-vicuna-v1.txt examples : read chat prompts from a template file (#1196) 2023-05-03 20:58:11 +03:00
chat.txt examples : read chat prompts from a template file (#1196) 2023-05-03 20:58:11 +03:00
dan-modified.txt prompts : model agnostic DAN (#1304) 2023-05-11 18:10:19 +03:00
dan.txt prompts : model agnostic DAN (#1304) 2023-05-11 18:10:19 +03:00
LLM-questions.txt parallel : add option to load external prompt file (#3416) 2023-10-06 16:16:38 +03:00
mnemonics.txt prompts : add mnemonics.txt 2023-10-12 09:35:30 +03:00
parallel-questions.txt prompts : fix editorconfig checks after #3416 2023-10-06 16:36:32 +03:00
reason-act.txt do not force the prompt file to end with a new line (#908) 2023-04-13 11:33:16 +02:00