Newest 'azure-speech' Questions

1. Home
2. Questions
3. AI Assist
4. Tags
5. Challenges
6. Chat
7. Articles
8. Users
9. Companies
11. Communities for your favorite technologies. Explore all Collectives
Stack Internal

Stack Overflow for Teams is now called Stack Internal. Bring the best of human thought and AI automation together at your work.
Try for free Learn more
Bring the best of human thought and AI automation together at your work. Learn more

211 questions

0 votes

0 answers

29 views

Auto detect language in Azure Speech Services SpeechSynthesizer

In the microsoft-cognitiveservices-speech-sdk JavaScript SDK SpeechSynthesizer.FromConfig takes AutoDetectSourceLanguageConfig as input but what does it actually do? The documentation here Select ...

Magnuti's user avatar

Magnuti

asked Nov 27 at 12:12

0 votes

0 answers

66 views

Azure Speech SDK ConversationTranscriber ends with EndOfStream unexpectedly when streaming WebM audio from browser using websocket

I am streaming live audio from a browser (MediaRecorder → WebSocket) into Azure Speech SDK’s ConversationTranscriber in Python. The browser records with audio/webm;codecs=opus and sends 100 ms chunks. ...

Subash Balakrishnan's user avatar

Subash Balakrishnan

asked Sep 27 at 10:12

0 votes

0 answers

31 views

Azure Speech Service Speaker Diarization: How to Optimize Real-Time Transcription Latency (Node.js + Angular)

I'm using Azure Speech-to-Text with speaker diarization in a real-time transcription app. Backend: Node.js (v18), using microsoft-cognitiveservices-speech-sdk and WebSocket server. Frontend: Angular (...

SGR's user avatar

SGR

2,375

asked Sep 5 at 10:30

0 votes

0 answers

36 views

Not able to access the custom neural voice trained model through API

Im pretty new to the world of Azure, and I have been trying to understand the capabilities of Custom neural voice. After multiple trial and error, I was able to train a model and even deploy it, but I'...

vivek.p.n manu's user avatar

vivek.p.n manu

asked Aug 27 at 10:28

0 votes

0 answers

99 views

Strange Speech-to-Text Transcription Issue: Earphones Work on One PC but Not Others Issue

Issue Description: I've encountered a bizarre problem with speech-to-text transcription across my computers. I have three PCs, and I'm experiencing inconsistent behavior with earphones: On my ...

Su Myat's user avatar

Su Myat

asked Apr 17 at 6:38

1 vote

1 answer

265 views

Python streamlit realtime speech-to-text with Azure SDK

I'm trying to create a real-time speech to text using streamlit and azure speech SDK. I can easily transcribe audio/video files with no issues, but I want to integrate realtime transcription (from ...

Raffa50's user avatar

Raffa50

asked Mar 19 at 7:32

0 votes

1 answer

83 views

Azure Python Speech Sdk get mstts:express-as styles from API

I have a Python Kivy App where you can use different TTS APIs. I use the basic SSML-Tags but I want to add the Microsoft exclusive styles of the mstts:express-as attribute. Is there any way to get the ...

Joe's user avatar

Joe

asked Mar 13 at 10:25

2 votes

1 answer

264 views

Trying to build azure speech program that can transcribe and diarize audio real-time, how do I do this on javascript/html? Can't find working examples

I specifically am trying to build an application that can run an html-javascript file that can recognize the speech input from a microphone, transcribe it, and assign it to a speaker, continuously ...

user29960912's user avatar

user29960912

asked Mar 10 at 20:04

0 votes

1 answer

118 views

Why azure viseme data is not completely generated in linux?

So Im trying to generate speech and viseme data for my animation using azure services. But I have a problem when I run the exact same code in the Linux ubuntu 22.04. When I run it in windows 11 with ...

Abstract's user avatar

Abstract

asked Feb 18 at 8:04

1 vote

1 answer

125 views

Web Audio API preprocessing not improving Azure Speech SDK recognition accuracy for real-time meeting transcription

I'm working on a real-time speech-to-text application where microphone input is processed through Web Audio API before being sent to Azure Speech SDK. The main issue is that some audio content is ...

Su Myat's user avatar

Su Myat

asked Feb 14 at 7:47

0 votes

1 answer

147 views

How do I send transcribed text from speech directly to another endpoint or an azure function from my speech resource?

I am using Azure Speech Service resource to transcribe real time audio from my mic using microsoft-cognitiveservices-speech-sdk. I want to send the transcribed text to another endpoint (or to an azure ...

Abdullah Nadeem's user avatar

Abdullah Nadeem

asked Feb 7 at 7:58

-3 votes

1 answer

404 views

Facing too much delay in microsoft-cognitiveservices-speech-sdk speech to text

I'm using the Azure AI Speech Service for speech-to-text functionality with the Microsoft Cognitive Services Speech SDK in Angular. However, I'm experiencing a significant delay in receiving the ...

aravind ks's user avatar

aravind ks

asked Feb 5 at 13:37

0 votes

1 answer

411 views

Azure speech service continuous speech recognition

I'm pretty new to Azure speech service and I'm using twilo/plivo service for connecting a number with azure stt and process it further after transcription. My problem is when I speak something, it's ...

Henven's user avatar

Henven

asked Jan 26 at 16:49

0 votes

1 answer

281 views

Azure Pronunciation Assessment Could not deserialize speech context error

I am trying to implement a pronunciation assessment system using Azure's JS SDK (see doc). I get the following error in console: "Could not deserialize speech context. websocket error code: 1007&...

nico_lrx's user avatar

nico_lrx

asked Jan 25 at 14:32

0 votes

1 answer

562 views

Managed Identity Authentication for Azure AI Speech - WebSocket upgrade failed: Authentication error (401)

I am trying to connect with Speech SDK using managed Identity, I dont want to use API key. I followed this article - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-configure-...

user25879's user avatar

user25879

asked Jan 21 at 15:45

15 30 50 per page

2 3 4 5

...

15 Next

CollectivesTM on Stack Overflow

Auto detect language in Azure Speech Services SpeechSynthesizer

Azure Speech SDK ConversationTranscriber ends with EndOfStream unexpectedly when streaming WebM audio from browser using websocket

Azure Speech Service Speaker Diarization: How to Optimize Real-Time Transcription Latency (Node.js + Angular)

Not able to access the custom neural voice trained model through API

Strange Speech-to-Text Transcription Issue: Earphones Work on One PC but Not Others Issue

Python streamlit realtime speech-to-text with Azure SDK

Azure Python Speech Sdk get mstts:express-as styles from API

Trying to build azure speech program that can transcribe and diarize audio real-time, how do I do this on javascript/html? Can't find working examples

Why azure viseme data is not completely generated in linux?

Web Audio API preprocessing not improving Azure Speech SDK recognition accuracy for real-time meeting transcription

How do I send transcribed text from speech directly to another endpoint or an azure function from my speech resource?

Facing too much delay in microsoft-cognitiveservices-speech-sdk speech to text

Azure speech service continuous speech recognition

Azure Pronunciation Assessment Could not deserialize speech context error

Managed Identity Authentication for Azure AI Speech - WebSocket upgrade failed: Authentication error (401)

Hot Network Questions