Speech to text
Last updated
Last updated
Integrating OpenAI's Speech-to-Text (STT) capabilities into your Unity project enables you to transcribe audio content into written text. This feature is powered by OpenAI's advanced speech recognition models, making it invaluable for applications that involve voice commands, audio content accessibility, or the processing of spoken user inputs.
For detailed information about the Speech-to-Text API, including the models available, parameter options, and best practices for audio files, refer to the .
Audio Transcription: Convert spoken words from audio files into accurate written text. This process facilitates the understanding and utilization of spoken language within your applications.
Audio Translation: Convert and translation spoken language into written text in English.
Transcribe audio content to text. You'll need to provide the audio file as a FormFile
(for API requests) or a AudioClip
object (within Unity).
Transcribe audio content to English text. You'll need to provide the audio file as a FormFile
(for API requests) or a AudioClip
object (within Unity).