AI Dev Kit
API ReferencesDiscordGlitch9
  • Introduction
    • AI DevKit 3.7.0
    • Troubleshooting
    • FAQ
    • Update Logs
  • Provider Setup
    • OpenAI
    • Google Gemini
    • ElevenLabs
    • Ollama
    • OpenRouter
  • Editor Tools
    • Introduction
    • Editor Chat
    • Model Library
    • Voice Library
  • GEN Tasks
    • Introduction - Prefixes
    • Response Generation
    • Chat Session
    • Image Generation
    • Video Generation
    • Sound FX Generation
    • Text to Speech (TTS)
    • Speech to Text (STT)
    • Voice Change
    • Audio Isolation
  • Advanced APIs (Pro)
    • Assistants API
      • How it works
      • Creating custom functions
      • Creating assistants API
    • Realtime API
  • Legacy API
    • OpenAI
      • ๐Ÿ’ฌChat completions
      • ๐Ÿ–ผ๏ธImage operations
      • ๐Ÿ—ฃ๏ธText to speech
      • ๐ŸŽ™๏ธSpeech to text
        • Recording real-time in Unity
      • ๐Ÿ’พFiles
      • ๐Ÿ”ŽEmbeddings
      • ๐Ÿ›ก๏ธModerations
      • โš™๏ธFine-tuning
    • Google Gemini
      • ๐Ÿ“System instructions
      • ๐Ÿ’ฌText generation
      • โš™๏ธFine-tuning
      • โ–ถ๏ธFucntion calling
      • ๐Ÿ”ŽEmbeddings
      • ๐Ÿ›ก๏ธSafety
      • ๐Ÿ’ปCode execution
  • Legacy Documents
    • AI DevKit 1.0 - 2.0
      • AI DevKit 2.0
      • AI DevKit 1.0
      • Preperation
      • Event Handlers
      • Scriptable Toolkits
        • Chat Streamer
        • Image Generator
        • Voice Transcriber
        • Voice Generator
      • Editor Tools
      • Troubleshooting (Legacy)
        • โ—Build Error: The name 'UnityMenu' does not exist in the current context
        • โ—The type or namespace name 'AndroidJavaObject' could not be found
        • โ—The type or namaspace name 'Plastic' does not exist
        • โ—Build Error: The name 'Asset Database' does not exist in the current context
        • โ—'ModelData.Create(Provider, string, UnixTime?, string)': not all code paths return a value
      • Code Generators
        • C# Script Generator
        • Unity Component Generator
      • Generation Menu
      • Editor Chat
      • Editor Vision (TTI, ITI)
      • Editor Speech (TTS)
      • Management Tools
        • Prompt History Viewer
        • AI Model Manager
        • TTS Voice Manager
        • OpenAI File Manager
        • OpenAI Assistant Manager
        • ElevenLabs Voice Library
Powered by GitBook
On this page
  1. GEN Tasks

Audio Isolation

Use AI to extract clean vocals or speech from noisy audio.

This is useful for podcast cleanup, speech recognition preprocessing, or removing background noise in recorded clips. Powered by ElevenLabs Audio Isolation API.


โœ… Basic Usage

AudioClip rawRecording = Resources.Load<AudioClip>("NoisyInput");

AudioClip cleanSpeech = await rawRecording
    .GENAudioIsolation()
    .SetOutputFormat(OutputFormat.MPEG)
    .ExecuteAsync();

๐Ÿงผ The resulting clip will contain mostly vocal content, with noise removed or reduced.


โš™๏ธ Configuration Options

Method
Description

SetOutputFormat(format)

Choose output format (MP3, WAV, PCM, etc.)

SetOutputPath(path)

Save location for processed file


๐Ÿ“ฆ Example Use Case

Input: Phone recording with background street noise Output: Voice is isolated and easier to transcribe or play back cleanly


๐Ÿง  Tips

  • Pair this with Speech to Text to dramatically improve transcription accuracy.

  • If used in real-time input pipelines, consider pre-buffering audio and applying .AsAudioClip() conversion before playback.

PreviousVoice ChangeNextAssistants API

Last updated 27 days ago