211 questions
0 votes, 0 answers, 29 views
Auto detect language in Azure Speech Services SpeechSynthesizer
In the microsoft-cognitiveservices-speech-sdk JavaScript SDK, SpeechSynthesizer.FromConfig takes an AutoDetectSourceLanguageConfig as input, but what does it actually do?
The documentation here Select ...
0 votes, 0 answers, 66 views
Azure Speech SDK ConversationTranscriber ends with EndOfStream unexpectedly when streaming WebM audio from browser using websocket
I am streaming live audio from a browser (MediaRecorder → WebSocket) into Azure Speech SDK’s ConversationTranscriber in Python.
The browser records with audio/webm;codecs=opus and sends 100 ms chunks. ...
0 votes, 0 answers, 31 views
Azure Speech Service Speaker Diarization: How to Optimize Real-Time Transcription Latency (Node.js + Angular)
I'm using Azure Speech-to-Text with speaker diarization in a real-time transcription app.
Backend: Node.js (v18), using microsoft-cognitiveservices-speech-sdk and a WebSocket server.
Frontend: Angular (...
0 votes, 0 answers, 36 views
Not able to access the custom neural voice trained model through API
I'm pretty new to the world of Azure, and I have been trying to understand the capabilities of Custom Neural Voice. After much trial and error, I was able to train a model and even deploy it, but I'...
0 votes, 0 answers, 99 views
Strange Speech-to-Text Transcription Issue: Earphones Work on One PC but Not Others
Issue Description: I've encountered a bizarre problem with speech-to-text transcription across my computers. I have three PCs, and I'm experiencing inconsistent behavior with earphones:
On my ...
1 vote, 1 answer, 265 views
Python streamlit realtime speech-to-text with Azure SDK
I'm trying to build real-time speech-to-text using Streamlit and the Azure Speech SDK.
I can easily transcribe audio/video files with no issues, but I want to integrate realtime transcription (from ...
0 votes, 1 answer, 83 views
Azure Python Speech SDK: get mstts:express-as styles from API
I have a Python Kivy app where you can use different TTS APIs. I use the basic SSML tags, but I want to add the Microsoft-exclusive styles of the mstts:express-as element. Is there any way to get the ...
2 votes, 1 answer, 264 views
Trying to build an Azure Speech program that can transcribe and diarize audio in real time. How do I do this in JavaScript/HTML? Can't find working examples
Specifically, I am trying to build an application that runs from an HTML/JavaScript file and can recognize speech input from a microphone, transcribe it, and assign it to a speaker, continuously ...
0 votes, 1 answer, 118 views
Why is Azure viseme data not completely generated on Linux?
I'm trying to generate speech and viseme data for my animation using Azure services, but I have a problem when I run the exact same code on Ubuntu 22.04.
When I run it on Windows 11 with ...
1 vote, 1 answer, 125 views
Web Audio API preprocessing not improving Azure Speech SDK recognition accuracy for real-time meeting transcription
I'm working on a real-time speech-to-text application where microphone input is processed through Web Audio API before being sent to Azure Speech SDK. The main issue is that some audio content is ...
0 votes, 1 answer, 147 views
How do I send transcribed text from speech directly to another endpoint or an Azure Function from my Speech resource?
I am using an Azure Speech service resource to transcribe real-time audio from my mic using microsoft-cognitiveservices-speech-sdk. I want to send the transcribed text to another endpoint (or to an Azure ...
-3 votes, 1 answer, 404 views
Facing too much delay in microsoft-cognitiveservices-speech-sdk speech to text
I'm using the Azure AI Speech Service for speech-to-text functionality with the Microsoft Cognitive Services Speech SDK in Angular. However, I'm experiencing a significant delay in receiving the ...
0 votes, 1 answer, 411 views
Azure speech service continuous speech recognition
I'm pretty new to the Azure Speech service, and I'm using a Twilio/Plivo service to connect a phone number to Azure STT and process it further after transcription.
My problem is that when I say something, it's ...
0 votes, 1 answer, 281 views
Azure Pronunciation Assessment Could not deserialize speech context error
I am trying to implement a pronunciation assessment system using Azure's JS SDK (see doc).
I get the following error in the console:
"Could not deserialize speech context. websocket error code: 1007...
0 votes, 1 answer, 562 views
Managed Identity Authentication for Azure AI Speech - WebSocket upgrade failed: Authentication error (401)
I am trying to connect to the Speech SDK using a managed identity; I don't want to use an API key. I followed this article: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-configure-...