whisper.cpp

History

Akash Mahajan c8d0f5fe98 whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058 ) * add HuggingFace mirror to download ggml model * support tdrz via simple hack overriding solm tokens * fix incorrect translate/transcribe token_ids that are not static const * add apollo 13 sample for tdrz demo * render [SPEAKER TURN] consistently in all terminal output using vocab.id_to_token * extend whisper_segment with speaker_turn_next field and save in json output * fix failing go build * slipped in some python syntax whoops * whisper : finalize tinydiarize support (add flag + fixes) * whisper : tdrz support for word-level timestamps (respect max_len) * java : try to fix tests after adding tdrz_enable flag * main : remove TODO leftover * java : fix params order list after adding "tdrz_enable" * whisper : fix solm and add nosp token * main : print tinydiarize help --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>		2023-07-04 09:45:00 +03:00
..
addon.node	whisper : add integer quantization support (#540 )	2023-04-30 18:51:57 +03:00
bench	bench : fix Windows linkage by moving ggml benches in whisper lib ..	2023-01-18 21:19:50 +02:00
bench.wasm	whisper : add integer quantization support (#540 )	2023-04-30 18:51:57 +03:00
command	Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027 )"	2023-07-02 21:53:52 +03:00
command.wasm	examples : fix + refactor Levenshtein distance	2023-04-30 19:12:49 +03:00
main	whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058 )	2023-07-04 09:45:00 +03:00
quantize	ggml : sync latest repo (mostly refactoring changes)	2023-07-02 21:46:09 +03:00
stream	Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027 )"	2023-07-02 21:53:52 +03:00
stream.wasm	whisper : add integer quantization support (#540 )	2023-04-30 18:51:57 +03:00
talk	Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027 )"	2023-07-02 21:53:52 +03:00
talk-llama	talk-llama : fix new rope interface	2023-07-03 19:24:01 +03:00
talk.wasm	whisper : add integer quantization support (#540 )	2023-04-30 18:51:57 +03:00
whisper.android	whisper.android : support decode wav file has 2 channels (#972 )	2023-05-31 10:13:14 +03:00
whisper.nvim	models : cd statements are quoted to allow spaces in path (#1041 )	2023-06-25 15:27:28 +03:00
whisper.objc	whisper.objc : enable Core ML in example & fix segmentation fault (#910 )	2023-05-14 09:47:02 +03:00
whisper.swiftui	whisper.swiftui : update README.md (#682 )	2023-03-29 23:04:38 +03:00
whisper.wasm	whisper : add memory sizes for Q8_0 (close #846 )	2023-05-01 10:03:56 +03:00
CMakeLists.txt	whisper : add integer quantization support (#540 )	2023-04-30 18:51:57 +03:00
common-ggml.cpp	ggml : sync latest ggml lib	2023-06-25 14:30:44 +03:00
common-ggml.h	whisper : add integer quantization support (#540 )	2023-04-30 18:51:57 +03:00
common-sdl.cpp	examples : refactor in order to reuse code and reduce duplication (#482 )	2023-02-15 19:28:10 +02:00
common-sdl.h	examples : refactor in order to reuse code and reduce duplication (#482 )	2023-02-15 19:28:10 +02:00
common.cpp	ggml : sync latest repo (mostly refactoring changes)	2023-07-02 21:46:09 +03:00
common.h	ggml : sync latest repo (mostly refactoring changes)	2023-07-02 21:46:09 +03:00
dr_wav.h	refactoring : move main + stream in examples + other stuff	2022-10-25 20:53:48 +03:00
generate-karaoke.sh	minor : add comment for using "generate_karaoke.sh"	2022-11-26 10:22:42 +02:00
helpers.js	whisper : add integer quantization support (#540 )	2023-04-30 18:51:57 +03:00
livestream.sh	livestream.sh : run main with model arg instead of default (#453 )	2023-01-27 01:13:31 +02:00
twitch.sh	twitch.sh : various fixes and polishing	2022-12-08 19:20:04 +02:00
yt-wsp.sh	yt-wsp.sh : print help on empty args	2023-02-18 09:42:31 +02:00