Page updated Jan 16, 2024

Transcribe audio to text

Set up the backend

If you haven't already done so, run amplify init inside your project and then amplify add auth (we recommend selecting the default configuration).

Run amplify add predictions and select Convert. Then use the following answers:

1? What would you like to convert? (Use arrow keys)
2 Translate text into a different language
3 Generate speech audio from text
4> Transcribe text from audio
5
6? Who should have access? Auth and Guest users

Now run amplify push which will generate your amplifyconfiguration.json and create resources in the cloud. You can now either add this to your backend or skip and add more features to your app.

Services used: Amazon Transcribe

Working with the API

You can transcribe a PCM Audio byte buffer to Text, such as a recording from microphone.

1import { Predictions } from '@aws-amplify/predictions';
2
3Predictions.convert({
4 transcription: {
5 source: {
6 bytes
7 }
8 // language: "en-US",
9 }
10})
11.then(({ transcription: { fullText } }) => console.log({ fullText }))
12.catch((err) => console.log({ err }));

To view the complete list of all the supported languages and language specific features refer to the supported languages list. The language data input type has to support streaming for it to work with Amplify Predictions.