Task type: Automatic speech recognition
Services of the asr type take a byte array (the audio data) as input and return JSON in a fixed format. Streaming recognition mode is supported.
System name: asr
ASR stands for Automatic Speech Recognition: the conversion of an audio stream into text.
Data types
Predict request
Request format:
{
  "audio_base64": "audio data"
}
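The request body above can be assembled from raw audio bytes roughly as follows. This is a minimal sketch that assumes the audio_base64 value is the standard base64 encoding of the audio bytes; the helper name build_predict_request is hypothetical and not part of the service interface.

```python
import base64
import json


def build_predict_request(audio_bytes: bytes) -> str:
    """Return a JSON request body carrying the audio as base64 text.

    Assumption: "audio_base64" holds standard (RFC 4648) base64 of the raw bytes.
    """
    encoded = base64.b64encode(audio_bytes).decode("ascii")
    return json.dumps({"audio_base64": encoded})


# Placeholder bytes for illustration; in practice, read them from an audio file.
body = build_predict_request(b"RIFF")
print(body)
```

How the body is sent (URL, headers, authentication) depends on the particular service and is not shown here.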
Predict response
The response format varies considerably from service to service. The base interface defines only the required fields, which are the same for every service of this type.
Required response fields:
{
  "text": "Hello"
}
In addition to the required text field, the response contains further data specific to the particular service. For details on this data, see the service page.
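Because only the text field is guaranteed, a client can read it directly and treat everything else as optional. The sketch below illustrates one way to do this; the function name parse_predict_response and the "confidence" field in the sample are hypothetical and used only for illustration.

```python
import json
from typing import Any


def parse_predict_response(raw: str) -> tuple[str, dict[str, Any]]:
    """Split a response into the required text and any service-specific extras."""
    payload = json.loads(raw)
    if not isinstance(payload, dict) or "text" not in payload:
        raise ValueError("response does not contain the required 'text' field")
    # Everything except "text" is service-specific and may differ between services.
    extras = {key: value for key, value in payload.items() if key != "text"}
    return payload["text"], extras


# "confidence" is an invented extra field, not one defined by the base interface.
text, extras = parse_predict_response('{"text": "Hello", "confidence": 0.9}')
print(text)
print(extras)
```

Keeping the extras in a separate dictionary lets the same code work with any service of this type, while service-specific handling can be layered on top.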