OM Core Websocket Endpoints
Google Speech Recognition
Google Speech Recognition (ASR) API Reference
OpenMind proxies Google Speech Recognition (ASR) API to provide speech recognition capabilities. This endpoint allows you to interact with the Google ASR API to transcribe speech to text.
To minimize latency, the API endpoint utilizes WebSockets for efficient real-time communication.
Installation
Install the OM1
package:
install OM1
If you don’t have uv
installed, you can install it using the following command:
install OM1
Usage
The following example demonstrates how to interact with the Google ASR API using the OM1
package:
usage
The expected response from the Google ASR API will be in the following format:
response
You can also forward the base64 encoded audio data directly to the API endpoint using the following format:
request
The
rate
parameter is optional and defaults to 16000 if not provided. The rate
parameter specifies the sample rate of the audio data in Hz.