Jong Wook Kim
9d646db9d8
print '?' if a letter can't be encoded using the system default encoding ( #859 )
2023-01-17 23:28:36 -08:00
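A minimal sketch of the fallback this commit describes, assuming the substitution is done by re-encoding the text with Python's built-in errors="replace" handler. The helper name make_safe and its encoding parameter are illustrative and are not confirmed by this log.

```python
import sys
from typing import Optional


def make_safe(text: str, encoding: Optional[str] = None) -> str:
    """Replace characters the target encoding cannot represent with '?'."""
    # Assumption: use the interpreter's default encoding when none is given.
    encoding = encoding or sys.getdefaultencoding()
    # errors="replace" substitutes '?' for each unencodable character.
    return text.encode(encoding, errors="replace").decode(encoding)


# Force ASCII here only to make the replacement visible on any system.
print(make_safe("caf\u00e9", "ascii"))
```

Text that the encoding can already represent passes through unchanged, so the helper is safe to apply to every printed line.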
adamreis
70861c7ce3
Fix tiny transcribe() docstring typo ( #857 )
s/successfully/successively, which I believe was the intent.
2023-01-16 22:42:01 -08:00
Jong Wook Kim
02aa851a49
fix to return only the text token ids
2022-11-15 16:25:11 -08:00
Jong Wook Kim
d18e9ea5dd
transcribe() on English-only model won't complain when language="en" is not given
2022-10-09 02:40:12 -07:00
eudoxos
35713c66e0
Add --threads option to transcribe ( #278 )
* Add --threads option to transcribe
By default, Torch on CPU uses number_of_cores/2 threads. This option allows
overriding that default.
* Update transcribe.py
Co-authored-by: Jong Wook Kim <ilikekjw@gmail.com>
2022-10-09 02:11:15 -07:00
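A rough sketch of how a --threads option like this one could be wired up, not the repository's actual code. The function names parse_threads and apply_threads are hypothetical, and treating 0 as "keep the library default" is an assumption. The thread setter is passed in as a callable so the sketch runs without PyTorch installed.

```python
import argparse
from typing import Callable, List, Optional


def parse_threads(argv: Optional[List[str]] = None) -> int:
    """Parse a --threads value; 0 means 'keep the default' in this sketch."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--threads", type=int, default=0,
                        help="number of CPU threads; 0 keeps the default")
    return parser.parse_args(argv).threads


def apply_threads(threads: int, set_num_threads: Callable[[int], None]) -> bool:
    """Call the setter only when the user asked for a positive thread count."""
    if threads > 0:
        set_num_threads(threads)
        return True
    return False


# Demonstration with a stand-in setter that just records the requested value.
calls: List[int] = []
apply_threads(parse_threads(["--threads", "4"]), calls.append)
print(calls)
```

With PyTorch available, the setter passed in would be torch.set_num_threads, which controls the number of threads used for intra-op parallelism on CPU.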
Jibin Mathew
0b1ba3d46e
Add model_dir to arguments ( #202 )
* Add model_dir to arguments
* minor formatting change
Co-authored-by: Jong Wook Kim <jongwook@openai.com>
2022-09-30 14:45:51 -07:00
Jong Wook Kim
7cb4cc21bf
allowing nonzero initial temperature
2022-09-29 18:05:12 -07:00
Vicki Anand
2b0c2971af
Don't update duration if last timestamp is same as begin ( #191 )
2022-09-29 12:27:48 -07:00
Jong Wook Kim
62fe7f1009
patience definition to match the paper
2022-09-27 19:00:41 -07:00
Nick Konovalchuk
b4308c4782
fix: transcribe verbosity ( #140 )
2022-09-26 11:46:21 -07:00
VulumeCode
2037b65f3f
Context prompt ( #128 )
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 05:22:33 -07:00
EliEron
fc0f40981d
Write each sentence as a separate line for the txt output ( #101 )
* Write each sentence as a separate line for the txt output
Write each sentence as a separate line for the txt output
* Update utils.py
Co-authored-by: EliEron <example@example.com>
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 04:52:28 -07:00
fatih
ead77fab97
add srt subtitle export utility ( #102 )
* add srt subtitle export utility
* simplifying
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 03:50:26 -07:00
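For readers unfamiliar with the SRT subtitle format, here is an illustrative sketch of what an export utility of this kind has to produce: a running index, a "start --> end" line with comma-separated milliseconds, the text, and a blank line. The function names and the segment keys "start", "end", and "text" are assumptions made for this example, not taken from the commit.

```python
import io
from typing import Iterable, Mapping, TextIO


def format_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm, the timestamp style SRT uses."""
    total_ms = round(seconds * 1000)
    hours, total_ms = divmod(total_ms, 3_600_000)
    minutes, total_ms = divmod(total_ms, 60_000)
    secs, millis = divmod(total_ms, 1_000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def write_srt(segments: Iterable[Mapping], file: TextIO) -> None:
    """Write one numbered SRT cue per segment, each followed by a blank line."""
    for index, segment in enumerate(segments, start=1):
        start = format_timestamp(segment["start"])
        end = format_timestamp(segment["end"])
        text = segment["text"].strip()
        file.write(f"{index}\n{start} --> {end}\n{text}\n\n")


# Hypothetical segment data, written to an in-memory buffer for inspection.
buffer = io.StringIO()
write_srt([{"start": 0.0, "end": 1.5, "text": " Hello"}], buffer)
print(buffer.getvalue())
```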
fatih
9e7e418ff1
add progress bar for transcribe loop ( #100 )
* add progress bar to transcribe loop
* improved warning message for English-only models
* add --condition_on_previous_text
* progressbar renames
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-26 03:24:13 -07:00
Jong Wook Kim
5d8d3e75a4
add --condition_on_previous_text
2022-09-25 05:16:08 -07:00
Jong Wook Kim
2d3032de01
improved warning message for English-only models
2022-09-25 02:10:36 -07:00
Jong Wook Kim
15ab548263
nocaptions -> nospeech to match the paper figure
2022-09-23 15:45:32 +09:00
mj-kh
61989529b7
Fix possible mistake when loading model to device ( #57 )
Before this change, the model was loaded onto the GPU regardless of the value of the "device" argument in the CLI.
(e.g. whisper "test.wav" --device cpu loaded it onto the GPU anyway)
2022-09-23 15:21:47 +09:00
hanacchi
c85eaaae29
Use UTF-8 encoding to save the txt and vtt files ( #37 )
Explicitly set the text encoding to UTF-8 in order to avoid UnicodeEncodeErrors
Co-authored-by: Jong Wook Kim <jongwook@nyu.edu>
2022-09-23 12:10:55 +09:00
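A small sketch of the idea behind this fix, assuming the files are written with Python's built-in open(). The helper name save_text and the sample file name are hypothetical. Passing encoding="utf-8" explicitly means the write no longer depends on the platform's default encoding, which is where a UnicodeEncodeError can come from.

```python
import os
import tempfile


def save_text(path: str, text: str) -> None:
    """Write text as UTF-8 regardless of the platform's default encoding."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)


# Write a non-ASCII transcript to a temporary directory and read it back.
with tempfile.TemporaryDirectory() as tmp:
    sample_path = os.path.join(tmp, "sample.txt")  # hypothetical file name
    save_text(sample_path, "\u3053\u3093\u306b\u3061\u306f\n")
    with open(sample_path, "rb") as f:
        raw = f.read()
    print(len(raw))
```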
EliEron
759e8d47a8
Fix output_dir argument when audio file is a path ( #45 )
2022-09-23 11:38:37 +09:00
Jong Wook Kim
834f00a0ea
making small model the default
2022-09-22 02:45:12 +09:00
Jong Wook Kim
6e3be77e1a
initial commit
2022-09-22 01:09:43 +09:00