Exploring the Cloud Speech API

Before you continue, please refer to the Setting up a rest client section in Chapter 3, Cloud Vision API, to set up a REST API client, either Postman or cURL.

Now that we have all the required setup done, let's get started with exploring the API. In this section, we are going to upload a single-channel, Linear16-encoded audio file with a sample rate of 44100 Hz, in base64 format, to the Cloud Speech API and get its transcription.

There are three ways we can convert audio to text using the Cloud Speech API:

  • Synchronous speech recognition
  • Asynchronous speech recognition
  • Streaming speech recognition
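
To make the upload described above more concrete, here is a minimal sketch of how the request body for synchronous recognition might be assembled. It assumes the v1 REST endpoint `speech:recognize` and the `config`/`audio` JSON fields of that API version; the helper name `build_recognize_request`, the `en-US` language code, and the placeholder audio bytes are illustrative assumptions, not something this section specifies.

```python
import base64
import json

# Assumed endpoint for synchronous recognition in the v1 REST API.
# An API key or OAuth token would still have to be supplied when calling it.
RECOGNIZE_URL = "https://speech.googleapis.com/v1/speech:recognize"


def build_recognize_request(audio_bytes, sample_rate_hertz=44100, language_code="en-US"):
    """Build the JSON body for a synchronous recognize call.

    Hypothetical helper: it only assembles the payload and sends nothing.
    """
    return {
        "config": {
            # Single-channel, 16-bit linear PCM, as described in the text.
            "encoding": "LINEAR16",
            "sampleRateHertz": sample_rate_hertz,
            # Assumption: the recording is US English.
            "languageCode": language_code,
        },
        "audio": {
            # The raw audio is sent inline as a base64 string.
            "content": base64.b64encode(audio_bytes).decode("ascii"),
        },
    }


# Placeholder bytes stand in for a real recording read from disk.
sample_request = build_recognize_request(b"\x00\x01" * 4)
print(json.dumps(sample_request, indent=2))
```

The resulting JSON could then be pasted into Postman or passed to cURL as the body of a POST request to the endpoint above.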
