McCloud

joined 1 year ago
[–] [email protected] 1 points 1 year ago

Unfortunately stable-ts and whisper don’t obviously output which files it is working on, so you’re dependent on trying to decipher it from the logs. I tried to add prints to show which files it has queued and started, but with threading, the std-out sometimes gets lost or buffered in strange ways.

[–] [email protected] 1 points 1 year ago

I'm not sure yet. Faster-whisper has some benchmarks of the Largev2 model taking about 1 minute for 13 minutes of audio. Smaller models ought to be quicker. Unsure if the specs of the GPU will make much differenece.

[–] [email protected] 1 points 1 year ago (1 children)

It can only translate into English, but the source audio can be a foreign language.

[–] [email protected] 1 points 1 year ago

I just tried, Emby won't actually send out the webook on an action. I can use the test webhook, but it won't trigger off media actions. Documentation half-implies that it's a premiere options?

[–] [email protected] 1 points 1 year ago (1 children)

If I knew what the endpoints were, nothing would prohibit it. I can add it to my short list.

[–] [email protected] 1 points 1 year ago (1 children)

It should detect the foreign language and make english subtitles, but I haven't personally tried it.

I'm not using whisper.cpp anymore. I did some short comparisons between WhisperX and stable-ts and ultimately decided to go with stable-ts. Functionally, I'm sure they're very similar.