llama.cpp

History

Georgi Gerganov d0cee0d36d gguf : add 64-bit support (GGUF v2) (#2821 ) * gguf : bump version to 2 * gguf : add support for 64-bit (no backwards comp yet) * gguf : v1 backwards comp * gguf.py : bump GGUF version * gguf.py : uint64_t on all lengths, sizes and counts, enums still uint32_t * gguf.py : string lengths uint32_t * gguf : update all counts to 64-bit * gguf.py : string len uint64_t and n_dims uint32_t * gguf : fix typo * llama.cpp : print gguf version --------- Co-authored-by: klosax <131523366+klosax@users.noreply.github.com>		2023-08-27 14:19:54 +03:00
..
gguf	gguf : add 64-bit support (GGUF v2) (#2821 )	2023-08-27 14:19:54 +03:00
tests	gguf : make gguf pip-installable	2023-08-25 09:26:05 +03:00
LICENSE	gguf : make gguf pip-installable	2023-08-25 09:26:05 +03:00
pyproject.toml	gguf : export objects to user code (#2780 )	2023-08-25 12:43:41 +03:00
README.md	gguf : make gguf pip-installable	2023-08-25 09:26:05 +03:00

README.md

gguf

This is a Python package for writing binary files in the GGUF (GGML Universal File) format.

See convert-llama-hf-to-gguf.py as an example for its usage.

Installation

pip install gguf

Development

Maintainers who participate in development of this package are advised to install it in editable mode:

cd /path/to/llama.cpp/gguf-py

pip install --editable .

Note: This may require to upgrade your Pip installation, with a message saying that editable installation currently requires setup.py. In this case, upgrade Pip to the latest:

pip install --upgrade pip

Publishing

To publish the package, you need to have twine and build installed:

pip install build twine

Then, folow these steps to release a new version:

Update the version in pyproject.toml.
Build the package:

python -m build

Upload the generated distribution archives:

python -m twine upload dist/*

TODO

Add tests
Include conversion scripts as command line entry points in this package.
Add CI workflow for releasing the package.