Skip to main content
Stack Overflow
  1. About
  2. For Teams
Filter by
Sorted by
Tagged with
0 votes
0 answers
54 views

I’m building a Python voice assistant using the speech_recognition library. Everything works perfectly when I run the code from PyCharm or the terminal, but when I convert it to an .exe using Auto Py ...
0 votes
1 answer
53 views

Consider this Kotlin code to init a Google speech recognizer: recognizerIntent = Intent(RecognizerIntent.ACTION_RECOGNIZE_SPEECH) .apply { putExtra( ...
0 votes
1 answer
45 views

I'm able to get text to speech to audio file using the following code for iOS 12 iPhone 8 to create a car file: audioFile = try AVAudioFile( forWriting: saveToURL, settings: pcmBuffer.format.settings, ...
1 vote
1 answer
81 views

Using Google Speech in Python, I'm able to get a transcript for each phrase spoken using result.alternatives[0].transcript, but when I try to look at the words for the phrase, result.alternatives[0]....
0 votes
0 answers
42 views

I am working on speech quality assessment and using PESQ (Perceptual Evaluation of Speech Quality) to calculate MOS scores for different audio samples. I tested PESQ by providing a reference and a ...
0 votes
0 answers
53 views

Iam having trouble with Streaming Speech to text using the Google Speech To Text API.It works great transcribing English and return the final_transcript very well. The problem is the other languages ...
0 votes
0 answers
20 views

I've been using PyObjC for recognizing text from audio file applying Speech framework. So I check documentation and create this small script, but he was returning an error. What's my error? import ...
1 vote
0 answers
65 views

I am developing a c++ DLL that is used by a c# app via interop. The DLL's purpose is to train the "Default" speech recognition profile. The c# app only sends Training Text and receives ...
Slip's user avatar
  • 43
2 votes
2 answers
413 views

I've encountered same problem which is described in this thread. Since iOS 18, when I use SFSpeechAudioBufferRecognitionRequest, returned not final SFSpeechRecognitionResult discards previously ...
0 votes
0 answers
166 views

In my Nodejs server am using Google's speech to text API to get the lyrics of song, but it doesn't seem to work well with music. I loose most part of words, so my question is, does this api work with ...
1 vote
0 answers
97 views

I'm working on a Laravel 11 application where I need to stream audio from the frontend to Google Cloud Speech-to-Text and then broadcast the transcriptions using Pusher. Frontend Code: let ...
-1 votes
1 answer
99 views

I have a huge text, coming from an academic paper, that I want to transform into audio. Because the audio is too big, I split it into 4096 characters chunks. Then I send it to the OpenAi tts api chunk ...
3 votes
1 answer
94 views

I have a lot of audio recordings for lectures where I say the same thing multiple times, mostly it's incomplete statements like: "this is the part" (and then retrying) "this is the part ...
2 votes
0 answers
30 views

I'm using the German political speech dataset to train the LDA model. My goal here is to categorize each speech into some topics. But the problem is that the generated topics are too similar, and all ...
1 vote
0 answers
48 views

I want to increase the amount of time Android speech recognition. I tried these 3 tags but it not working. TAG 1: EXTRA_SPEECH_INPUT_COMPLETE_SILENCE_LENGTH_MILLIS TAG 2: ...

15 30 50 per page
1
2 3 4 5
...
64

AltStyle によって変換されたページ (->オリジナル) /