malob/article-to-audio-cloud-function

Article to Audio Google Cloud Function

This is a Google Cloud Function I hacked together that takes a URL to an article on the web and generates an audio file of it using Google's new Cloud Text-To-Speech API, which has been updated with access to DeepMind's WaveNet voices.

I created it as part of a project to generate a personal podcast of articles I want to consume. To get the full thing working, see my other repository with the Cloud Function that generates the podcast RSS feed.

Sketch of how it works

The function accepts a POST request with JSON in the body.
- E.g. {"url": "http://example.com/somearticle"}
It then uses the free Mercury Web Parser API to get the body of the article and some metadata.
Since the body is returned as HTML, it then converts it to plain text. I also add some of the metadata at the top of the article, since I wanted this in the audio.
Then it splits the body into chunks of no more than 5,000 characters, since that's the limit on what the TTS API can handle per request.
From there it sends each chunk of text to Google's TTS API, which returns the audio encoded as MP3, and writes each result to a temporary location.
Since having multiple files for parts of the article is annoying, it then uses FFmpeg to concatenate the audio chunks into one file.
Finally, it stores the audio file as an object in a Google Cloud Storage bucket, along with some of the metadata.
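
The chunking step is the part most likely to need tweaking, so here is a minimal sketch of one way it could be done. The helper name `chunkText` and its `maxLen` parameter are hypothetical and are not taken from index.js; the sketch assumes it is acceptable to break at the last space before the limit, with a hard cut as a fallback when a single token is longer than the limit.

```javascript
// Hypothetical helper (not the code in index.js): split `text` into pieces of
// at most `maxLen` characters, preferring to break at a space so that words
// are not cut in half.
function chunkText(text, maxLen = 5000) {
  const chunks = [];
  let rest = text.trim();
  while (rest.length > maxLen) {
    // Find the last space at or before the limit.
    let cut = rest.lastIndexOf(' ', maxLen);
    // No space found (one very long token): fall back to a hard cut.
    if (cut <= 0) cut = maxLen;
    chunks.push(rest.slice(0, cut).trim());
    rest = rest.slice(cut).trim();
  }
  if (rest.length > 0) chunks.push(rest);
  return chunks;
}

// Each returned chunk can then be sent to the TTS API as its own request.
const parts = chunkText('some long article text ...');
console.log(parts.length);
```

Breaking at sentence boundaries instead of spaces would probably sound better in the resulting audio, at the cost of a slightly more involved search.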

Configuration details

To get this working you need a Google Cloud Project with a Cloud Storage bucket set up and the Cloud Text-To-Speech API enabled.

You'll then need to create a new Cloud Function (see configuration details below) and replace the undefined global constants in the code (gcpProjectID, gcpBucketName, and mercuryApiKey) with the appropriate values.
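
As a rough illustration, the three constants might look like the following once declared. The constant names come from this readme; the `missingConfig` helper and the warning are assumptions added here to make a forgotten value easier to spot, and are not part of index.js.

```javascript
// Replace each `undefined` with your own value before deploying.
const gcpProjectID = undefined;  // your Google Cloud project ID
const gcpBucketName = undefined; // the Cloud Storage bucket that receives the audio
const mercuryApiKey = undefined; // your Mercury Web Parser API key

// Hypothetical helper: return the names of settings that are still unset.
function missingConfig(config) {
  return Object.keys(config).filter(
    (name) => config[name] === undefined || config[name] === ''
  );
}

const missing = missingConfig({ gcpProjectID, gcpBucketName, mercuryApiKey });
if (missing.length > 0) {
  console.warn(`Missing configuration: ${missing.join(', ')}`);
}
```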

Cloud Function configuration

Trigger type: HTTP trigger
Memory allocated: 256 MB
Timeout: 240s
- I had to extend this from the default of 60s.
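
With an HTTP trigger, the deployed function can be called from any HTTP client. The sketch below assumes Node 18 or later (for the global `fetch` and `AbortSignal.timeout`); the function URL is a placeholder, the helper names `buildRequest` and `requestAudio` are hypothetical, and nothing is assumed about the response body beyond its HTTP status. The client-side timeout is set to match the 240s function timeout above so the caller does not give up first.

```javascript
// Hypothetical helper: build the POST options described in this readme,
// i.e. a JSON body of the form {"url": "..."}.
function buildRequest(articleUrl) {
  return {
    method: 'POST',
    headers: { 'Content-Type': 'application/json' },
    body: JSON.stringify({ url: articleUrl }),
  };
}

// Hypothetical helper: call the deployed function and return the HTTP status.
async function requestAudio(functionUrl, articleUrl) {
  const response = await fetch(functionUrl, {
    ...buildRequest(articleUrl),
    // Wait as long as the function itself is allowed to run (240s).
    signal: AbortSignal.timeout(240 * 1000),
  });
  return response.status;
}

// Example call (placeholder URL, replace with your function's trigger URL):
// requestAudio('https://REGION-PROJECT.cloudfunctions.net/FUNCTION_NAME',
//              'http://example.com/somearticle').then(console.log);
```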
