I have an audio file with two speakers on 1 channel. I would like to separate the audio in 2 channels (one per speaker).
I was thinking of splitting on silences, or more complicated things like speaker diarization to i.e. to detect different speakers in an audio recording.
How would you do?
Jonas
131k103 gold badges330 silver badges408 bronze badges
-
1It's not a trivial problem. What about trying some of the open source tools mentioned in the Wikipedia article? If you want to roll your own, you need a solid background of information theory, statistical signal processing etc.Jussi Nurminen– Jussi Nurminen2022年04月28日 08:57:57 +00:00Commented Apr 28, 2022 at 8:57
-
Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer.Community– Community Bot2022年04月28日 19:26:18 +00:00Commented Apr 28, 2022 at 19:26
-
Seems to be a complicated speech signal processing problem, specified as blind source seperation. Traditional methods include PCA/ICA and NMF. Modern method introduces neural network.ZR Han– ZR Han2022年04月29日 06:25:11 +00:00Commented Apr 29, 2022 at 6:25
lang-py