llama.cpp

History

Marcus Dunn 5be6c803fa llama : remove token functions with `context` args in favor of `model` (#3720 ) * added `llama_model_token_` variants to all the `llama_token_` functions. * added `LLAMA_API` * formatting Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * removed old `llama_token` functions * changed 3 more functions to take in model - `llama_token_get_text` - `llama_token_get_score` - `llama_token_get_type` * added back docs * fixed main.cpp * changed token functions to use new model variants * changed token functions to use new model variants --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-10-23 22:40:03 +03:00
..
CMakeLists.txt	speculative : PoC for speeding-up inference via speculative sampling (#2926 )	2023-09-03 15:12:08 +03:00
speculative.cpp	llama : remove token functions with `context` args in favor of `model` (#3720 )	2023-10-23 22:40:03 +03:00

llama : remove token functions with context args in favor of model (#3720 )

* added `llama_model_token_*` variants to all the `llama_token_*` functions.

* added `LLAMA_API`

* formatting

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* removed old `llama_token` functions

* changed 3 more functions to take in model

- `llama_token_get_text`
- `llama_token_get_score`
- `llama_token_get_type`

* added back docs

* fixed main.cpp

* changed token functions to use new model variants

* changed token functions to use new model variants

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-10-23 22:40:03 +03:00

CMakeLists.txt speculative : PoC for speeding-up inference via speculative sampling (#2926 ) 2023-09-03 15:12:08 +03:00

speculative.cpp llama : remove token functions with context args in favor of model (#3720 ) 2023-10-23 22:40:03 +03:00