About Karaoke-style movie generation · ggml-org/whisper.cpp · Discussion #1206

ashyrv
Aug 25, 2023

Hey everyone! I have a question about the karaoke-style speech recognition. Do you think it will work in real time? It seems to work really well with recorded audio, but will it work with live speech? Any feedback or answers would be appreciated.

Thank you in advance!

Replies: 2 comments

ulatekh
Jun 4, 2024

Whisper is not real-time, nor is it likely to ever be real-time. It works by processing audio 30 seconds at a time, and that processing, even when GPU-accelerated, can take a significant fraction of a second, or even several seconds.
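To put rough numbers on that, here is a minimal sketch of the delay under a deliberately simplified model: audio is cut into fixed 30-second windows, a window is only processed once it is full, and processing takes a fixed time. The function names and the 2-second processing time are illustrative assumptions, not measurements and not part of the whisper.cpp API.

```python
import math

WINDOW_SECONDS = 30.0  # window length mentioned in the reply above


def window_end(spoken_at: float, window: float = WINDOW_SECONDS) -> float:
    """Return the end time of the fixed window that contains `spoken_at`."""
    return (math.floor(spoken_at / window) + 1) * window


def lag(spoken_at: float, processing_seconds: float,
        window: float = WINDOW_SECONDS) -> float:
    """Delay between a word being spoken and its text becoming available.

    Assumes (hypothetically) that the whole window must be recorded before
    processing starts, and that processing takes `processing_seconds`.
    """
    return window_end(spoken_at, window) + processing_seconds - spoken_at


if __name__ == "__main__":
    # Assumed processing time of 2 seconds per window (illustrative only).
    for t in (0.0, 15.0, 29.0):
        print(f"word at {t:5.1f}s -> lag {lag(t, 2.0):5.1f}s")
```

Under these assumptions, a word spoken at the start of a window waits about 32 seconds and a word near the end waits about 3 seconds, so the lag varies widely across a single window, which is what makes word-by-word karaoke highlighting of live speech hard.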

0 replies

ashyrv
Jun 11, 2024
Author

Thanks


0 replies
