AI Dev Kit 3.11.0

The Ultimate AI-Suite for Unity

AIDevKit helps beginner developers integrate advanced AI features directly into Unity, simplifying the game development workflow.

With just a few clicks and simple text prompts, you can create code, generate images, produce sound effects, and even synthesize voices. AIDevKit offers broad API integrations, rich editor tools, extensive voice synthesis options, and unique audio generation capabilities.


From Prompt to Output in One Line

AI Dev Kit gives you instant access to text, image, audio, and code generation, with zero boilerplate.

// Write AI-powered NPC backstories
"Describe a stoic robot farmer on Mars."
    .GENResponse()
    .ExecuteAsync();

// Generate stylized images instantly
"A rusty sci-fi door with bullet holes"
    .GENImage()
    .ExecuteAsync();

// Create voiceover from text
"Welcome back, Commander."
    .GENSpeech()
    .SetVoice(ElevenLabsVoice.Rachel)
    .ExecuteAsync();

Just call .GEN*() on your Unity objects and chain your desired behavior. It's fast, readable, and production-ready.
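For readers curious how a prompt string can start a call chain like the ones above, here is a minimal, self-contained sketch of the general C# pattern (an extension method on string that returns a chainable request object). It is not AI Dev Kit's actual implementation: SpeechRequest, PromptExtensions, the string-typed voice, and the placeholder result are all hypothetical stand-ins, and no network call is made.

```csharp
using System;
using System.Threading.Tasks;

// Hypothetical request builder used only for illustration.
// It is NOT an AI Dev Kit type.
public sealed class SpeechRequest
{
    public string Text { get; }
    public string Voice { get; private set; } = "default";

    public SpeechRequest(string text) => Text = text;

    // Returning 'this' is what makes the calls chainable.
    public SpeechRequest SetVoice(string voice)
    {
        Voice = voice;
        return this;
    }

    // Stand-in for the real work: a real implementation would call a
    // provider API here. This sketch just returns a description string.
    public Task<string> ExecuteAsync()
    {
        return Task.FromResult($"[{Voice}] {Text}");
    }
}

public static class PromptExtensions
{
    // Extension method on string, so a string literal can start the chain.
    public static SpeechRequest GENSpeech(this string prompt)
    {
        return new SpeechRequest(prompt);
    }
}

public static class Demo
{
    public static async Task Main()
    {
        // Awaiting the task gives access to the result of the chain.
        string result = await "Welcome back, Commander."
            .GENSpeech()
            .SetVoice("Rachel")
            .ExecuteAsync();

        Console.WriteLine(result); // prints "[Rachel] Welcome back, Commander."
    }
}
```

The point of the sketch is that awaiting the returned task is how calling code would observe the outcome. Check the AI Dev Kit API reference for the actual return types of .ExecuteAsync().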

Works out of the box. Fully extensible. No extra SDKs required.


Supported Providers and APIs

OpenAI

One of the most widely used AI platforms, offering powerful models like GPT-4o. It supports a broad range of APIs for text, audio, image, and assistant functionality, making it ideal for building intelligent, multimodal applications.

Supported
All OpenAI APIs (2025.05.18)

💬 Chat Completions, Streaming, Completions (Legacy)

🏝️ Image Creation, Image Edit, Image Variation

🗣️ Speech, Transcript, Translation

🛡️ Text Moderations, Image Moderations

🧠 Embeddings

🛠️ Models, Fine-Tuning

📦 Batch

📁 Files, Uploads

🧪 Evals

📝 Graders

🗂️ Vector Stores, Vector Store Files, Vector Store File Batches

🤖 Assistants, Threads, Messages, Runs, Run steps, Streaming (Pro-only)

🎤 Realtime, Session tokens, Client events, Server events (Pro-only)

📡 Responses, Responses Streaming (Coming Soon for Pro)

Google Gemini

A cutting-edge multimodal AI service by Google, capable of both text and image generation. It is well-suited for developers who need tight integration with Google's cloud infrastructure and rapid performance.

Supported
All Google Gemini APIs (2025.02.11)

💬 Generate Content - Text generation, Vision, JSON Mode, Function calling

🏝️ Predict - Image generation (via Gemini)

🖼️ Predict - Image generation (via Imagen3)

📽️ Predict Long Running - Video generation (via Veo)

🧠 Embeddings

🛠️ Models, Tuning, Permissions

🪙 Tokens

📁 Files, Caching

🗂️ Vector Stores, Vector Store Files, Vector Store File Batches

🎤 Live API - WebSockets API

ElevenLabs

A state-of-the-art voice platform specializing in high-quality Text-to-Speech, Speech-to-Text, and voice transformation. Perfect for voice assistants, narration, and real-time character dubbing.

Supported
All ElevenLabs APIs (2025.04.23)

💬 Text to Speech, Stream

💬 Text to Speech (WebSockets)

💬 Multi-Context Text to Speech (WebSockets)

🎙️ Speech to Text

🔉 Sound Effects

🗣️ Voice Changer

🧠 Embeddings

🛠️ Audio Isolation

🎤 Text to Voice (Voice design)

📼 Dubbing

🛠️ Audio Native

🛠️ Forced Alignment

🗄️ Models, Voices

📚 Voice Library

🗂️ Vector Stores, Vector Store Files, Vector Store File Batches

Ollama

Worried about costs? No problem! With Ollama you can run LLMs like LLaMA and Mistral on your own machine for free. Ideal for offline development, edge applications, and private inference without relying on cloud services.

Supported
All Ollama APIs (2025.04.23)

💬 Chat Completion, Stream, Completion

🧠 Embeddings

🛠️ Models, Model Management

🛠️ Version

OpenRouter

A gateway to multiple third-party LLMs offering 300+ models including Claude, Mistral, Command R, and more, all accessible via a single API. Great for experimenting with various models without switching providers.

Supported
All OpenRouter APIs (2025.04.23)

💬 Chat Completion - Stream, Vision, Structured Outputs, Function calling, Web Search


Feature Comparison

Supported APIs
  Free Version: OpenAI, Google
  Pro Version: OpenAI, Google, ElevenLabs, Ollama, OpenRouter

Supported Tasks
  Free Version: Response Generation (Text), Image Generation, Text To Speech, Speech To Text
  Pro Version: Response Generation (Text), Image Generation, Text To Speech, Speech To Text, SoundFX Generation, Video Generation, Voice Change, Audio Isolation

Number of AI Models
  Free Version: 120+
  Pro Version: 450+

Number of AI Voices
  Free Version: 10+
  Pro Version: 4000+

Unity Components
  Free Version: -
  Pro Version: Chatbot, Chatbot (with Assistants API), Realtime Assistant

Modular Components (components you can use with 'Unity Components')
  Free Version: -
  Pro Version: Image Generator, Text To Speech, Speech To Text, Voice Changer

Advanced Integration
  Free Version: -
  Pro Version: Chat Session, Assistants API, Realtime API

Editor Tools
  Free Version: Model / Voice Library
  Pro Version: Model / Voice Library, Editor Chat, Code Generator, Unity Component Generator, Avatar Generator, Icon Generator, Background Generator, Mesh Texture Generator, Speech Generator, SoundFX Generator, Video Generator


Supported Platforms

Windows, OSX, Linux: Fully supported

Unity WebGL: Partially supported

Android, iOS, Windows Phone/Store: Fully supported

PlayStation, Xbox, PS Vita/PSM, Switch: Fully supported
