Commit Graph

259 Commits (c536ff40051502d5692cf3467d40add40a2ca45f)

Author SHA1 Message Date
Georgi Gerganov 6e29d8453c
Merge pull request #34 from tazz4843/master
Add static library make target
2022-10-10 08:05:57 +03:00
Topping1 50b5fe964c
Update main.cpp 2022-10-09 23:35:10 -05:00
0/0 64752acd27
add static library make target 2022-10-09 19:16:42 -06:00
Georgi Gerganov 7edaa7da4b
Merge pull request #31 from lkwq007/master
Add MinGW support
2022-10-09 17:52:46 +03:00
lnyan 4bbb8a587b Add MinGW support 2022-10-09 22:26:37 +08:00
Georgi Gerganov 4a6bf11db3 Minor 2022-10-08 18:13:26 +03:00
Georgi Gerganov 9bbca3110f ref #9 : add API documentation in whisper.h 2022-10-08 18:09:56 +03:00
Georgi Gerganov 5e563ef635 Fix Makefile for MacBook Intel 2022-10-08 17:35:55 +03:00
Georgi Gerganov 2ca8cc77b2 ref #17 : print whisper logs to stderr
Only the transcribed/translted text is printed to stdout.
This way, one can redirect the result to a file.
2022-10-08 17:28:06 +03:00
Georgi Gerganov 8c7c018893 ref #17 : add options to output result to file
Support for:

- plain text
- VTT
- SRT
2022-10-08 17:22:22 +03:00
Georgi Gerganov 4c4ab71d4d
Update README.md 2022-10-08 11:46:34 +03:00
Georgi Gerganov b43b36e006 Update tests 2022-10-08 11:43:42 +03:00
Georgi Gerganov 37110d693e ci : add base model tests to GH Actions 2022-10-08 11:43:42 +03:00
Georgi Gerganov 2d47693435 Update README.md 2022-10-08 11:43:42 +03:00
Georgi Gerganov a53e06757f Create README.md 2022-10-08 11:43:42 +03:00
Georgi Gerganov 0e3ba2f9fc Adding dummy models for testing purposes 2022-10-08 11:43:42 +03:00
Georgi Gerganov 2f069335ab Adding sanitizer tests 2022-10-08 11:43:42 +03:00
Georgi Gerganov 29b041f79b Cleanup CMakeLists.txt 2022-10-08 09:02:41 +03:00
Georgi Gerganov 4a732b2879 cmake : fixes 2022-10-08 09:02:41 +03:00
Georgi Gerganov 68f5962be6 ci : add cmake builds 2022-10-08 09:02:41 +03:00
Georgi Gerganov 332c9d77fe whisper : fix bug in token sampling logic
Could overflow buffer
2022-10-08 09:02:41 +03:00
Georgi Gerganov 877c058179 Add CMake support 2022-10-08 09:02:41 +03:00
Georgi Gerganov 481cd685d5
ref #10 : option to keep context in "stream" example
Seems the results become worse when we keep the context, so by default
this is not enabled
2022-10-07 22:30:44 +03:00
Georgi Gerganov 3f15bb8a08
ref #10 : add "step" argument for "stream" example
Controls how often we run the inference.
By default, we run it every 3 seconds.
2022-10-07 22:07:24 +03:00
Georgi Gerganov 7787b878e1
ref #16, #22 : add "offset" argument
Allows to start processing the input audio at some offset from the
beginning. Useful for splitting a long job into multiple tasks.
2022-10-07 22:00:40 +03:00
Georgi Gerganov e29a5dacc6
ref #11, #18, #26 : fix CACHE_LINE_SIZE constant 2022-10-07 21:56:44 +03:00
Georgi Gerganov 844d60b284 Add CI using Github Actions 2022-10-07 18:34:27 +03:00
Georgi Gerganov 700898e6ed
ref #22 : add option to provide multiple input .wav files 2022-10-05 23:44:10 +03:00
Georgi Gerganov 6b1c3cc198
Update README.md 2022-10-05 23:13:15 +03:00
Georgi Gerganov b8f713482e
Minor updates 2022-10-05 23:11:02 +03:00
Georgi Gerganov 167324584b wip : rpi4 support 2022-10-05 23:03:46 +03:00
Georgi Gerganov ce1fe95902 wip : improve makefile 2022-10-05 23:03:46 +03:00
Georgi Gerganov 74197ffc11
Merge pull request #20 from ArtyomZemlyak/master
Fix: main get language from cli args
2022-10-05 07:27:29 +03:00
Артём Земляк 495b81b367 Fix: main get n_threads from cli 2022-10-05 09:47:48 +07:00
Артём Земляк f007e186fe Fix: main get language from cli args 2022-10-05 09:24:53 +07:00
Georgi Gerganov e7a15876f8
Update README.md 2022-10-04 23:27:25 +03:00
Georgi Gerganov 6814cc9b02 Improve result printing 2022-10-04 23:18:15 +03:00
Georgi Gerganov eba33adadd Extend C-style API with full inference methods 2022-10-04 23:18:15 +03:00
Georgi Gerganov 6b77124e01 Initial C-style interface for whisper.cpp 2022-10-04 23:18:15 +03:00
Georgi Gerganov be8ba034f6
ref #10 : handle Ctrl+C in "stream" app 2022-10-02 20:11:17 +03:00
Georgi Gerganov d71e567656
Update README.md 2022-10-02 18:19:22 +03:00
Georgi Gerganov b6bf906730
ref #10 : quick-and-dirty attempt for real-time audio transciption
- Processes input in chunks of 3 seconds.
- Padding audio with silence
- Uses 1 second audio from previous pass
- No text context
2022-10-02 17:55:45 +03:00
Georgi Gerganov 77d929f603
Fix bug in FFT
The FFT routine does not work for odd N
Solution is to add DFT and use it when N is odd
2022-10-02 17:46:21 +03:00
Georgi Gerganov 6d654d192a
Fix reading of stereo WAV files 2022-10-01 08:41:57 +03:00
Georgi Gerganov 62897e8ae6
Update README.md 2022-10-01 00:01:04 +03:00
Georgi Gerganov 15b49e8baf
Bug fix
Longer prompts could cause out-of-bounds access
2022-09-30 20:37:29 +03:00
Georgi Gerganov 3bcdbdfc32
Reduce memory usage even more + better sampling
- The encode/decode memory buffers are now reused
- If the 30-sec segment goes for too long without a timestamp token, we
  force one. Improves transcription for large model
- Stereo support
- Add "micro-machines.wav" sample
2022-09-30 19:35:27 +03:00
Georgi Gerganov 310f4883d1
Update README.md 2022-09-29 23:48:01 +03:00
Georgi Gerganov fd3f3d748f
Update README.md 2022-09-29 23:37:59 +03:00
Georgi Gerganov 5877c3578e
ref #4 : added transcription timestamps
Can be turned off with "-nt" argument.
Performance has also improved.
2022-09-29 23:09:39 +03:00