whisper.cpp/bindings
Akash Mahajan c8d0f5fe98
whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058)
* add HuggingFace mirror to download  ggml model

* support tdrz via simple hack overriding solm tokens

* fix incorrect translate/transcribe token_ids that are not static const

* add apollo 13 sample for tdrz demo

* render [SPEAKER TURN] consistently in all terminal output using vocab.id_to_token

* extend whisper_segment with speaker_turn_next field and save in json output

* fix failing go build

* slipped in some python syntax whoops

* whisper : finalize tinydiarize support (add flag + fixes)

* whisper : tdrz support for word-level timestamps (respect max_len)

* java : try to fix tests after adding tdrz_enable flag

* main : remove TODO leftover

* java : fix params order list after adding "tdrz_enable"

* whisper : fix solm and add nosp token

* main : print tinydiarize help

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2023-07-04 09:45:00 +03:00
..
go whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058) 2023-07-04 09:45:00 +03:00
ios@de46d9e781 release : v1.4.2 2023-05-14 19:06:45 +03:00
java whisper : support speaker segmentation (local diarization) of mono audio via tinydiarize (#1058) 2023-07-04 09:45:00 +03:00
javascript ggml : update WASM SIMD 2023-05-20 20:00:06 +03:00
ruby whisper : add whisper_state + default state on the whisper_context (#523) 2023-03-05 21:42:19 +02:00
CMakeLists.txt Node.js package (#260) 2022-12-12 20:17:27 +02:00