whisper.cpp/examples
Akash Mahajan c8d0f5fe98
whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058)
* add HuggingFace mirror to download  ggml model

* support tdrz via simple hack overriding solm tokens

* fix incorrect translate/transcribe token_ids that are not static const

* add apollo 13 sample for tdrz demo

* render [SPEAKER TURN] consistently in all terminal output using vocab.id_to_token

* extend whisper_segment with speaker_turn_next field and save in json output

* fix failing go build

* slipped in some python syntax whoops

* whisper : finalize tinydiarize support (add flag + fixes)

* whisper : tdrz support for word-level timestamps (respect max_len)

* java : try to fix tests after adding tdrz_enable flag

* main : remove TODO leftover

* java : fix params order list after adding "tdrz_enable"

* whisper : fix solm and add nosp token

* main : print tinydiarize help

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-07-04 09:45:00 +03:00
..
addon.node whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
bench bench : fix Windows linkage by moving ggml benches in whisper lib .. 2023-01-18 21:19:50 +02:00
bench.wasm whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
command Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)" 2023-07-02 21:53:52 +03:00
command.wasm examples : fix + refactor Levenshtein distance 2023-04-30 19:12:49 +03:00
main whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058) 2023-07-04 09:45:00 +03:00
quantize ggml : sync latest repo (mostly refactoring changes) 2023-07-02 21:46:09 +03:00
stream Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)" 2023-07-02 21:53:52 +03:00
stream.wasm whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
talk Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)" 2023-07-02 21:53:52 +03:00
talk-llama talk-llama : fix new rope interface 2023-07-03 19:24:01 +03:00
talk.wasm whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
whisper.android whisper.android : support decode wav file has 2 channels (#972) 2023-05-31 10:13:14 +03:00
whisper.nvim models : cd statements are quoted to allow spaces in path (#1041) 2023-06-25 15:27:28 +03:00
whisper.objc whisper.objc : enable Core ML in example & fix segmentation fault (#910) 2023-05-14 09:47:02 +03:00
whisper.swiftui whisper.swiftui : update README.md (#682) 2023-03-29 23:04:38 +03:00
whisper.wasm whisper : add memory sizes for Q8_0 (close #846) 2023-05-01 10:03:56 +03:00
CMakeLists.txt whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
common-ggml.cpp ggml : sync latest ggml lib 2023-06-25 14:30:44 +03:00
common-ggml.h whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
common-sdl.cpp examples : refactor in order to reuse code and reduce duplication (#482) 2023-02-15 19:28:10 +02:00
common-sdl.h examples : refactor in order to reuse code and reduce duplication (#482) 2023-02-15 19:28:10 +02:00
common.cpp ggml : sync latest repo (mostly refactoring changes) 2023-07-02 21:46:09 +03:00
common.h ggml : sync latest repo (mostly refactoring changes) 2023-07-02 21:46:09 +03:00
dr_wav.h refactoring : move main + stream in examples + other stuff 2022-10-25 20:53:48 +03:00
generate-karaoke.sh minor : add comment for using "generate_karaoke.sh" 2022-11-26 10:22:42 +02:00
helpers.js whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
livestream.sh livestream.sh : run main with model arg instead of default (#453) 2023-01-27 01:13:31 +02:00
twitch.sh twitch.sh : various fixes and polishing 2022-12-08 19:20:04 +02:00
yt-wsp.sh yt-wsp.sh : print help on empty args 2023-02-18 09:42:31 +02:00