Using the Google Cloud Speech API

The Google Speech API uses a powerful machine learning model to convert audio to text. The API recognizes over 110 languages and can process them as a stream or from stored audio files. The Speech API can perform the conversion by three methods—synchronous recognition, asynchronous recognition, and stream recognition.

We'll perform a simple recipe to use the Speech API to convert a recorded message to text using the synchronous recognition method.

Get Google Cloud Platform Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.