Commit Graph

22 Commits

Author SHA1 Message Date
Jong Wook Kim 9d646db9d8 print '?' if a letter can't be encoded using the system default encoding (#859) 2023-01-17 23:28:36 -08:00
adamreis 70861c7ce3 Fix tiny transcribe() docstring typo (#857)
s/successfully/successively, which I believe was the intent.
2023-01-16 22:42:01 -08:00
Jong Wook Kim 02aa851a49 fix to return only the text token ids 2022-11-15 16:25:11 -08:00
Jong Wook Kim d18e9ea5dd transcribe() on English-only model won't complain when language="en" is not given 2022-10-09 02:40:12 -07:00
eudoxos 35713c66e0 Add --threads option to transcribe (#278)
* Add --threads option to transcribe

Torch on CPU uses by default number_of_cores/2. This option allows to
override this default.

* Update transcribe.py

Co-authored-by: Jong Wook Kim <ilikekjw@gmail.com>
2022-10-09 02:11:15 -07:00
Jibin Mathew 0b1ba3d46e Add model_dir to arguments (#202)
* Add model_dir to arguments

* minor formatting change

Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2022-09-30 14:45:51 -07:00
Jong Wook Kim 7cb4cc21bf allowing nonzero initial temperature 2022-09-29 18:05:12 -07:00
Vicki Anand 2b0c2971af Don't update duration if last timestamp is same as begin (#191) 2022-09-29 12:27:48 -07:00
Jong Wook Kim 62fe7f1009 patience definition to match the paper 2022-09-27 19:00:41 -07:00
Nick Konovalchuk b4308c4782 fix: transcribe verbosity (#140) 2022-09-26 11:46:21 -07:00
VulumeCode 2037b65f3f Context prompt (#128)
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 05:22:33 -07:00
EliEron fc0f40981d Write each sentence as a separate line for the txt output (#101)
* Write each sentence as a separate line for the txt output

Write each sentence as a separate line for the txt output

* Update utils.py

Co-authored-by: EliEron <example@example.com>
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 04:52:28 -07:00
fatih ead77fab97 add srt subtitle export utility (#102)
* add srt subtitle export utility

* simplifying

Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 03:50:26 -07:00
fatih 9e7e418ff1 add progress bar for transcribe loop (#100)
* add progress bar to transcribe loop

* improved warning message for English-only models

* add --condition_on_previous_text

* progressbar renames

Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 03:24:13 -07:00
Jong Wook Kim 5d8d3e75a4 add --condition_on_previous_text 2022-09-25 05:16:08 -07:00
Jong Wook Kim 2d3032de01 improved warning message for English-only models 2022-09-25 02:10:36 -07:00
Jong Wook Kim 15ab548263 nocaptions -> nospeech to match the paper figure 2022-09-23 15:45:32 +09:00
mj-kh 61989529b7 Fix possible mistake when loading model to device (#57)
Before this change, the model is loaded into GPU regardless of the value of "device" argument in CLI.

(e.g. whisper "test.wav" --device cpu loads into GPU anyway)
2022-09-23 15:21:47 +09:00
hanacchi c85eaaae29 Use UTF-8 encoding to save the txt and vtt files (#37)
Explicitly set the text encoding to UTF-8 in order to avoid UnicodeEncodeErrors

Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-23 12:10:55 +09:00
EliEron 759e8d47a8 Fix output_dir argument when audio file is a path (#45) 2022-09-23 11:38:37 +09:00
Jong Wook Kim 834f00a0ea making small model the default 2022-09-22 02:45:12 +09:00
Jong Wook Kim 6e3be77e1a initial commit 2022-09-22 01:09:43 +09:00