Speech to Text (STT)
Convert spoken words from an AudioClip into text using powerful AI transcription models.
Ideal for voice commands, user feedback, subtitles, or audio-driven gameplay systems.
Basic Usage
The AudioClip can come from a microphone, a file, or any other runtime source.
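As one way to obtain such a clip at runtime, the sketch below records a few seconds from the default microphone using Unity's built-in Microphone class. The class name MicClipRecorder, its fields, and the 5-second / 16 kHz settings are illustrative assumptions, not part of this toolkit; the speech-to-text call itself is left as a comment.

```csharp
using System.Collections;
using UnityEngine;

// Hypothetical helper (not part of the toolkit): records a short clip
// from the default microphone so it can be handed to speech-to-text.
public class MicClipRecorder : MonoBehaviour
{
    // Illustrative defaults; adjust to your project.
    [SerializeField] private int recordSeconds = 5;
    [SerializeField] private int sampleRate = 16000;

    // The most recently recorded clip, or null if nothing was recorded.
    public AudioClip LastClip { get; private set; }

    public IEnumerator Record()
    {
        if (Microphone.devices.Length == 0)
        {
            Debug.LogWarning("No microphone device found.");
            yield break;
        }

        // Passing null selects the default device. With loop = false,
        // recording ends once the buffer of recordSeconds is full.
        AudioClip clip = Microphone.Start(null, false, recordSeconds, sampleRate);
        if (clip == null)
        {
            Debug.LogWarning("Microphone.Start failed.");
            yield break;
        }

        // Wait one frame at a time until recording has finished.
        while (Microphone.IsRecording(null))
        {
            yield return null;
        }

        LastClip = clip;
        // LastClip is now ready to be passed to the speech-to-text request.
    }
}
```

Attach the component to a GameObject and start it with StartCoroutine(recorder.Record()); when the coroutine finishes, LastClip holds the recorded audio.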
Configuration Options
SetLanguage(SystemLanguage): Optional hint to improve transcription accuracy.
SetModel(model): Choose which STT model to use (Whisper, Gemini STT, etc.).
SetOutputPath(path): Save the transcription to a file (optional).
Translation Mode
You can also translate speech into English using GENTranslation().
This takes the same audio input but produces English text instead of a transcript in the spoken language.
Example Result
Audio Input: "안녕하세요, 오늘 날씨 어때요?"
Transcript: "안녕하세요, 오늘 날씨 어때요?"
Translation: "Hello, how's the weather today?"
Tips
Works best with clean, mono audio at 16kHz or higher.
SetLanguage is optional; the model can auto-detect the language, but accuracy improves with a hint.
For multilingual games or voice input, pair this with Text Generation for natural responses.
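To act on the audio-quality tip above, a clip can be checked and downmixed before it is sent for transcription. The following is a minimal sketch using only Unity's AudioClip API; the class name SttAudioPrep and its members are hypothetical helpers, not part of this toolkit. It assumes the clip's sample data is readable with GetData, which is not the case for streamed clips.

```csharp
using UnityEngine;

// Hypothetical helper (not part of the toolkit): prepares a clip for
// speech-to-text by downmixing it to mono and checking its sample rate.
public static class SttAudioPrep
{
    // Lower bound taken from the tip above (16 kHz or higher).
    public const int RecommendedMinRate = 16000;

    // Averages interleaved channel samples into a single mono channel.
    public static float[] DownmixToMono(float[] interleaved, int channels)
    {
        if (channels <= 1) return interleaved;

        int frames = interleaved.Length / channels;
        var mono = new float[frames];
        for (int i = 0; i < frames; i++)
        {
            float sum = 0f;
            for (int c = 0; c < channels; c++)
            {
                sum += interleaved[i * channels + c];
            }
            mono[i] = sum / channels;
        }
        return mono;
    }

    // Returns a mono copy of the clip, or the original clip if it is
    // already mono or its samples cannot be read.
    public static AudioClip ToMonoClip(AudioClip source)
    {
        if (source.frequency < RecommendedMinRate)
        {
            Debug.LogWarning($"Clip '{source.name}' is {source.frequency} Hz; " +
                             "16 kHz or higher is recommended.");
        }

        if (source.channels == 1) return source;

        // AudioClip.samples is the length per channel, so the interleaved
        // buffer needs samples * channels entries.
        var interleaved = new float[source.samples * source.channels];
        if (!source.GetData(interleaved, 0))
        {
            Debug.LogWarning($"Could not read sample data from '{source.name}'.");
            return source;
        }

        float[] mono = DownmixToMono(interleaved, source.channels);
        AudioClip result = AudioClip.Create(source.name + "_mono", mono.Length, 1,
                                            source.frequency, false);
        result.SetData(mono, 0);
        return result;
    }
}
```

The sketch deliberately does not resample: raising the rate of a low-quality recording adds no information, so it only warns when the clip falls below 16 kHz.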