AIDevKit - AI Suite for Unity

AI Dev Kit 3.7.0


Last updated 5 days ago

The Ultimate AI-Suite for Unity

AIDevKit lets developers, including beginners, integrate advanced AI features directly into Unity and simplifies the game development workflow.

With just a few clicks and simple text prompts, you can create code, generate images, produce sound effects, and even synthesize voices. AIDevKit offers broad API integrations, rich editor tools, extensive voice synthesis options, and unique audio generation capabilities.


1. Professional, Developer-Focused

Fluent AI Tasks for Unity — One Line, Any Content

Generate text, images, speech, or code using .GEN*() extension methods directly on Unity objects.

No boilerplate. No wrappers. Just write:

"Write a quest description."
    .GENResponse()
    .SetModel(OpenAIModel.GPT4o)
    .ExecuteAsync();

texture.GENInpaint("Add a wizard hat")
    .SetModel(OpenAIModel.DallE2)
    .ExecuteAsync();

audioClip.GENTranscript().ExecuteAsync();

"Hello there!"
    .GENSpeech()
    .SetVoice(ElevenLabsVoice.Rachel)
    .ExecuteAsync();

Supported host types:

  • string → text, speech, image, code, sound effects

  • Texture2D → image editing, video generation

  • AudioClip → transcription, translation, voice change, cleanup

Everything is powered by OpenAI, Google Gemini, ElevenLabs, and more — wrapped in a unified fluent API.

This workflow is ideal for rapid prototyping, tool development, and AI-native content pipelines.
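
To make the fluent pattern above more concrete, here is a minimal, self-contained sketch of how an extension method on string can start a chainable task. It is not AIDevKit's implementation: GENEcho, EchoTask, and the string-based SetModel are hypothetical names invented for this illustration, and the task only echoes its input instead of calling a provider.

```csharp
using System;
using System.Threading.Tasks;

// Hypothetical stand-in for a GEN task; NOT part of AIDevKit.
public sealed class EchoTask
{
    private readonly string _prompt;
    private string _model = "default-model";

    public EchoTask(string prompt) => _prompt = prompt;

    // Each setter returns the same task so calls can be chained.
    public EchoTask SetModel(string model)
    {
        _model = model;
        return this;
    }

    // A real task would call a provider API here; this sketch just echoes.
    public Task<string> ExecuteAsync()
        => Task.FromResult($"[{_model}] {_prompt}");
}

public static class EchoExtensions
{
    // The extension method is what lets a plain string start the chain.
    public static EchoTask GENEcho(this string prompt) => new EchoTask(prompt);
}

public static class Demo
{
    public static async Task Main()
    {
        string result = await "Write a quest description."
            .GENEcho()
            .SetModel("my-model")
            .ExecuteAsync();

        Console.WriteLine(result); // prints "[my-model] Write a quest description."
    }
}
```

The same shape would apply to other host types: an extension method on Texture2D or AudioClip could return a task object whose setters return the task itself, so the chain always ends in a single awaited call.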


2. Casual, Indie-Friendly

What if Unity objects could talk to AI?

Now they can.

"A wizard cat with a skateboard"
    .GENImage()
    .ExecuteAsync();

audioClip
    .GENTranscript()
    .ExecuteAsync();

"Play this line with a cool voice!"
    .GENSpeech()
    .ExecuteAsync();

No setup. No config. Just type .GEN*() and let the magic happen.

Whether you're building dialogue systems, generating assets, or automating pipelines — this is the fastest way to get started with generative AI in Unity.


3. Marketing-Friendly / Product Pitch

From Prompt to Output in One Line

AI Dev Kit gives you instant access to text, image, audio, and code generation — with zero boilerplate.

// Write AI-powered NPC backstories
"Describe a stoic robot farmer on Mars."
    .GENResponse()
    .ExecuteAsync();

// Generate stylized images instantly
"A rusty sci-fi door with bullet holes"
    .GENImage()
    .ExecuteAsync();

// Create voiceover from text
"Welcome back, Commander."
    .GENSpeech()
    .SetVoice(ElevenLabsVoice.Rachel)
    .ExecuteAsync();

Just call .GEN*() on your Unity objects and chain your desired behavior — it's fast, readable, and production-ready.

Works out of the box. Fully extensible. No extra SDKs required.


Supported Providers and APIs

OpenAI

One of the most widely used AI platforms, offering powerful models like GPT-4o. It supports a broad range of APIs for text, audio, image, and assistant functionality, which makes it well suited to building intelligent, multimodal applications.

Supported: All OpenAI APIs (2025.05.18)

  • 💬 Chat Completions, Streaming, Completions (Legacy)
  • 🏝️ Image Creation, Image Edit, Image Variation
  • 🗣️ Speech, Transcript, Translation
  • 🛡️ Text Moderations, Image Moderations
  • 📁 Files, Uploads
  • 🎤 Realtime, Session tokens, Client events, Server events (Pro-only)
  • 🧠 Embeddings
  • 🛠️ Models, Fine-Tuning
  • 📦 Batch
  • 🧪 Evals
  • 📝 Graders
  • 🗂️ Vector Stores, Vector Store Files, Vector Store File Batches
  • 🤖 Assistants, Threads, Messages, Runs, Run steps, Streaming (Pro-only)
  • 📡 Responses, Responses Streaming (Coming Soon for Pro)

Google Gemini

A cutting-edge multimodal AI service by Google, capable of both text and image generation. It is well suited for developers who need tight integration with Google's cloud infrastructure and rapid performance.

Supported: All Google Gemini APIs (2025.02.11)

  • 💬 Generate Content - Text generation, Vision, JSON Mode, Function calling
  • 🏝️ Predict - Image generation (via Gemini)
  • 🖼️ Predict - Image generation (via Imagen3)
  • 📽️ Predict Long Running - Video generation (via Veo)
  • 🪙 Tokens
  • 📁 Files, Caching
  • 🎤 Live API - WebSockets API
  • 🧠 Embeddings
  • 🛠️ Models, Tuning, Permissions
  • 🗂️ Vector Stores, Vector Store Files, Vector Store File Batches

ElevenLabs

A state-of-the-art voice platform specializing in high-quality Text-to-Speech, Speech-to-Text, and voice transformation. Well suited to voice assistants, narration, and real-time character dubbing.

Supported: All ElevenLabs APIs (2025.04.23)

  • 💬 Text to Speech, Stream
  • 💬 Text to Speech (WebSockets)
  • 💬 Multi-Context Text to Speech (WebSockets)
  • 🎙️ Speech to Text
  • 🔉 Sound Effects
  • 🗣️ Voice Changer
  • 🎤 Text to Voice (Voice design)
  • 📼 Dubbing
  • 🗄️ Models, Voices
  • 📚 Voice Library
  • 🧠 Embeddings
  • 🛠️ Audio Isolation
  • 🛠️ Audio Native
  • 🛠️ Forced Alignment
  • 🗂️ Vector Stores, Vector Store Files, Vector Store File Batches

Ollama

Worried about costs? No problem! With Ollama you can run LLMs like LLaMA and Mistral on your own machine for free. Ideal for offline development, edge applications, and private inference without relying on cloud services.

Supported: All Ollama APIs (2025.04.23)

  • 💬 Chat Completion, Stream, Completion
  • 🧠 Embeddings
  • 🛠️ Models, Model Management
  • 🛠️ Version

OpenRouter

A gateway to multiple third-party LLMs, offering 300+ models including Claude, Mistral, Command R, and more, all accessible via a single API. Great for experimenting with various models without switching providers.

Supported: All OpenRouter APIs (2025.04.23)

  • 💬 Chat Completion - Stream, Vision, Structured Outputs, Function calling, Web Search

More API integrations are on the way. Have something you'd like to see next? Let us know via Discord; your feedback shapes the roadmap.


Feature Comparison

| Feature | Free Version | Pro Version |
| --- | --- | --- |
| Supported APIs | OpenAI, Google | OpenAI, Google, ElevenLabs, Ollama, OpenRouter |
| Supported Tasks | Response Generation (Text), Image Generation, Text To Speech, Speech To Text | Response Generation (Text), Image Generation, Text To Speech, Speech To Text, SoundFX Generation, Video Generation, Voice Change, Audio Isolation |
| Number of AI Models | 120+ | 450+ |
| Number of AI Voices | 10+ | 4000+ |
| Unity Components | – | Chatbot, Chatbot (with Assistants API), Realtime Assistant |
| Modular Components (components you can use with 'Unity Components') | – | Image Generator, Text To Speech, Speech To Text, Voice Changer |
| Advanced Integration | – | Chat Session, Assistants API, Realtime API |
| Editor Tools | Model / Voice Library | Model / Voice Library, Editor Chat, Code Generator, Unity Component Generator, Avatar Generator, Icon Generator, Background Generator, Mesh Texture Generator, Speech Generator, SoundFX Generator, Video Generator |


Supported Platforms

| Supported | Platform |
| --- | --- |
| Fully | Windows, OSX, Linux |
| Partially | Unity WebGL |
| Fully | Android, iOS, Windows Phone/Store |
| Fully | PlayStation, Xbox, PS Vita/PSM, Switch |