speechlib discussion #6

Jan 21, 2024 · 7 comments · 1 reply

👋 Welcome!

We’re using Discussions as a place to connect with other members of our community. We hope that you:

  • Ask questions you’re wondering about.
  • Share ideas.
  • Engage with other community members.
  • Welcome others and are open-minded. Remember that this is a community we
    build together 💪.

Replies: 7 comments 1 reply


@Navodplayer1

Thanks for the interesting project.

Could you please tell me why speechlib can't recognize the speakers here?
(I don't need names; "speaker 1" and "speaker 2" will do :)
The voices folder is provided, but it is empty.)

Update: now that I look at the output, I see that speaker recognition took 0 seconds. I assumed that if no voice files are provided, it would at least distinguish speaker 1 / speaker 2 (without names, where the name is the folder name of each speaker). Or am I wrong?

Code and files:

from speechlib import Transcriptor
file = "/content/drive/MyDrive/Colab Notebooks/PaypalCEOAlexChrissW.mp3"
voices_folder = "/content/drive/MyDrive/Colab Notebooks/voices"
language = "en"
log_folder = "/content/drive/MyDrive/Colab Notebooks/logs"
modelSize = "large"
quantization = False
transcriptor = Transcriptor(file, log_folder, language, modelSize, voices_folder, quantization)
res = transcriptor.transcribe()

GitHub doesn't allow uploading mp3 files... let me see if I can find a way :)

Update: https://whyp.it/tracks/152176/paypalceoalexchrissw?token=qX9Th


By the way, I just wanted to transcribe an interview in Swedish. I provided the mp3 file but forgot to change the language (it was left as language = "en").

So speechlib did both transcription AND translation :)
and it seems accurate. That's amazing.


Still no correct speaker 1 / speaker 2 labels (all are "unknown").


@gety9 you can leave voices_folder as an empty string "". Then you will get SPEAKER_00, SPEAKER_01, etc. Let me know if this solves your issue.
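For reference, here is a minimal sketch of what that change could look like, reusing the positional argument order from the original post (file, log_folder, language, modelSize, voices_folder, quantization). The helper names `build_transcriptor_args` and `transcribe_with_generic_labels` are hypothetical and only for illustration, and the constructor signature is assumed to match the version used in this thread; other speechlib releases may differ.

```python
from typing import Any, Tuple


def build_transcriptor_args(
    file: str,
    log_folder: str,
    language: str = "en",
    model_size: str = "large",
    voices_folder: str = "",  # empty string, as suggested above
    quantization: bool = False,
) -> Tuple[Any, ...]:
    """Return positional arguments in the order used in the original post.

    Hypothetical helper: it only assembles the arguments and does not
    call speechlib itself.
    """
    return (file, log_folder, language, model_size, voices_folder, quantization)


def transcribe_with_generic_labels(file: str, log_folder: str, language: str = "en"):
    """Run a transcription with voices_folder left as an empty string.

    Assumption: Transcriptor accepts these six positional arguments,
    as in the code posted earlier in this thread.
    """
    # Imported lazily so this module still loads where speechlib is not installed.
    from speechlib import Transcriptor

    args = build_transcriptor_args(file, log_folder, language)
    return Transcriptor(*args).transcribe()
```

With the paths from the original post, the call would be `transcribe_with_generic_labels(file, log_folder, "en")`. The only difference from the original snippet is that voices_folder is "" rather than a path to an empty folder.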


speechlib is great!
Can we use WeSpeaker ONNX models for diarization? Right now I need wespeaker/voxblink2_samresnet100_ft.onnx


NavodPeiris Feb 27, 2025
Maintainer Author

Currently we don't support ONNX models.
