llama.cpp/quantize.sh at 2d15d6c9a959749f954d4fbbf44d711e19c5bdff - haraldwolff/llama.cpp - Gitea: Git with a cup of tea

haraldwolff/llama.cpp

Pavol Rusnak d1f224712d

Add quantize script for batch quantization (#92 )

* Add quantize script for batch quantization

* Indentation

* README for new quantize.sh

* Fix script name

* Fix file list on Mac OS

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2023-03-13 18:15:20 +02:00

16 lines

309 B

Bash

Executable file

Raw Blame History

 #!/usr/bin/env bash
 if ! [[ "$1" =~ ^[0-9]{1,2}B$ ]]; then
     echo
     echo "Usage: quantize.sh 7B|13B|30B|65B [--remove-f16]"
     echo
     exit 1
 fi
 for i in `ls models/$1/ggml-model-f16.bin*`; do
     ./quantize "$i" "${i/f16/q4_0}" 2
     if [[ "$2" == "--remove-f16" ]]; then
         rm "$i"
     fi
 done