@video-db VideoDB

dineshkumar181094
Oct 12, 2024

Hi VideoDB Team,

I was following the example at https://docs.videodb.io/adding-ai-generated-voiceovers-with-videodb-and-lovo-70

Here are a few points that are unclear from the document.

Indexing timeline, scene descriptions, and the LLM response:

I see that shot-based indexing created 85 scenes from 2.3 minutes of video. But when prompting the LLM, you did it in a single prompt, and the response I got by following the doc has only 41 shots.
Why don't we iterate over each scene and ask the LLM to generate a description that fills just that scene's slot in the timeline?
How can we be sure that the response given by the LLM fills the entire timeline of the video? It would be great if you could provide an explanation of this.
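To make the coverage question concrete, here is a minimal sketch of the kind of check I have in mind. It is plain Python and does not call VideoDB; the `(start, end)` spans in seconds, the `find_gaps` name, and the `tolerance` parameter are all assumptions for illustration, not something taken from the tutorial.

```python
from typing import List, Tuple

Span = Tuple[float, float]  # (start_seconds, end_seconds), an assumed layout


def find_gaps(script_spans: List[Span], video_length: float,
              tolerance: float = 0.5) -> List[Span]:
    """Return stretches of [0, video_length] that no script span covers.

    Gaps shorter than `tolerance` seconds are ignored.
    """
    gaps: List[Span] = []
    cursor = 0.0  # end of the timeline covered so far
    for start, end in sorted(script_spans):
        if start - cursor > tolerance:
            gaps.append((cursor, start))
        cursor = max(cursor, end)
    if video_length - cursor > tolerance:
        gaps.append((cursor, video_length))
    return gaps


if __name__ == "__main__":
    # Hypothetical spans parsed from an LLM response for a 60-second video.
    print(find_gaps([(0, 10), (10, 20), (30, 40)], video_length=60))
```

If the list that comes back is not empty, the LLM response skipped part of the video, which is what seems to happen when 85 scenes go in and only 41 shots come out.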

Replies: 2 comments

dineshkumar181094
Oct 12, 2024
Author

Just pointing out one more thing: if the audio shifts even slightly relative to the scene, the change in position could give it a completely different meaning.

0 replies

ashish-spext
Oct 24, 2024
Maintainer

Hi @dineshkumar181094, great observations!

Could the LLM be stopping due to a token limit? That might be one of the reasons, as nothing in the prompt instructs the LLM to restrict or stop its output. Ideally it should cover the whole input (85 scenes in your case).
In shot-based indexing, where the scene duration is very short (1-2 seconds), the output might not sound coherent. For better precision here, a better way may be to club scenes up to a certain threshold (x minutes) and generate audio for those clubbed chunks instead of the character-based chunks given in the tutorial.
In our experimentation, the prompt to generate a synced script based on the description writes a script whose sentences are roughly the same length as the timestamped duration of each scene in the description.
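A rough sketch of the clubbing idea, in case it helps. This is plain Python over dictionaries rather than the VideoDB SDK: the `start`, `end`, and `description` keys, the `club_scenes` helper, and the threshold value are assumptions for illustration, so adapt them to whatever your scene index actually returns.

```python
from typing import Dict, List


def club_scenes(scenes: List[Dict], min_duration: float) -> List[Dict]:
    """Merge consecutive short scenes until each chunk lasts at least
    `min_duration` seconds. Scenes are assumed to be sorted by start time."""
    chunks: List[Dict] = []
    current = None
    for scene in scenes:
        if current is None:
            # Start a new chunk from this scene.
            current = {
                "start": scene["start"],
                "end": scene["end"],
                "descriptions": [scene["description"]],
            }
        else:
            # Extend the open chunk with this scene.
            current["end"] = scene["end"]
            current["descriptions"].append(scene["description"])
        if current["end"] - current["start"] >= min_duration:
            chunks.append(current)
            current = None
    if current is not None:
        chunks.append(current)  # keep a trailing chunk even if it is short
    return chunks


if __name__ == "__main__":
    # Hypothetical shot-level scenes, times in seconds.
    shots = [
        {"start": 0.0, "end": 1.0, "description": "a"},
        {"start": 1.0, "end": 2.5, "description": "b"},
        {"start": 2.5, "end": 4.0, "description": "c"},
        {"start": 4.0, "end": 9.0, "description": "d"},
        {"start": 9.0, "end": 10.0, "description": "e"},
    ]
    for chunk in club_scenes(shots, min_duration=3.0):
        print(chunk["start"], chunk["end"], chunk["descriptions"])
```

Each chunk keeps its own start and end, so the voiceover for a chunk can still be placed at the chunk's start on the timeline instead of being split by character count.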

"Just pointing out one more thing if there is slight movement in audio with scene it could create a whole different meaning. by shifting the position." Can you please share an example of this if you have one handy? Good chunking should probably smooth out cases like this.

0 replies

Video DB Scene Index with LLM. #26

Uh oh!

{{title}}

Uh oh!

dineshkumar181094
Oct 12, 2024

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

dineshkumar181094
Oct 12, 2024
Author

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

ashish-spext
Oct 24, 2024
Maintainer

Select a reply

Uh oh!

@video-db VideoDB

Video DB Scene Index with LLM. #26

Uh oh!

dineshkumar181094 Oct 12, 2024

Replies: 2 comments

Uh oh!

dineshkumar181094 Oct 12, 2024 Author

Uh oh!

Uh oh!

ashish-spext Oct 24, 2024 Maintainer

dineshkumar181094
Oct 12, 2024

dineshkumar181094
Oct 12, 2024
Author

ashish-spext
Oct 24, 2024
Maintainer