AI Dev Kit 3.11.0

The Ultimate AI-Suite for Unity
AIDevKit empowers beginner developers to effortlessly integrate advanced AI functionalities directly into Unity, dramatically simplifying your game development workflow.
With just a few clicks and simple text prompts, you can create code, generate images, produce sound effects, and even synthesize voices. AIDevKit offers broad API integrations, rich editor tools, extensive voice synthesis options, and unique audio generation capabilities.
From Prompt to Output in One Line
AI Dev Kit gives you instant access to text, image, audio, and code generation β with zero boilerplate.
// Write AI-powered NPC backstories
"Describe a stoic robot farmer on Mars."
.GENResponse()
.ExecuteAsync();
// Generate stylized images instantly
"A rusty sci-fi door with bullet holes"
.GENImage()
.ExecuteAsync();
// Create voiceover from text
"Welcome back, Commander."
.GENSpeech()
.SetVoice(ElevenLabsVoice.Rachel)
.ExecuteAsync();
Just call .GEN*()
on your Unity objects and chain your desired behavior β it's fast, readable, and production-ready.
Works out of the box. Fully extensible. No extra SDKs required.
Supported Providers and APIs
One of the most widely used AI platforms, offering powerful models like GPT-4o. It supports a broad range of APIs for text, audio, image, and assistant functionalitiesβideal for building intelligent, multimodal applications.
π¬ Chat Completions, Streaming, Completions (Legacy)
ποΈ Image Creation, Image Edit, Image Variation
π£οΈ Speech, Transcript, Translation
π‘οΈ Text Moderations, Image Moderations
π§ Embeddings
π οΈ Models, Fine-Tuning
π¦ Batch
π Files, Uploads
π§ͺ Evals
π Graders
ποΈ Vector Stores, Vector Store Files, Vector Store File Batches
π€ Assistants, Threads, Messages, Runs, Run steps, Streaming (Pro-only)
π€ Realtime, Session tokens, Client events, Server events (Pro-only)
π‘ Responses, Responses Streaming (Coming Soon for Pro)
A cutting-edge multimodal AI service by Google, capable of both text and image generation. It is well-suited for developers who need tight integration with Googleβs cloud infrastructure and rapid performance.
π¬ Generate Content - Text generation, Vision, JSON Mode, Function calling
ποΈ Predict - Image generation (via Gemini)
πΌοΈ Predict - Image generation (via Imagen3)
π½οΈ Predict Long Running - Video generation (via Veo)
π§ Embeddings
π οΈ Models, Tuning, Permissions
πͺ Tokens
π Files, Caching
ποΈ Vector Stores, Vector Store Files, Vector Store File Batches
π€ Live API - WebSockets API
A state-of-the-art voice platform specializing in high-quality Text-to-Speech, Speech-to-Text, and voice transformation. Perfect for voice assistants, narration, and real-time character dubbing.
π¬ Text to Speech, Stream
π¬ Text to Speech (WebSockets)
π¬ Multi-Context Text to Speech (WebSockets)
ποΈ Speech to Text
π Sound Effects
π£οΈ Voice Changer
π§ Embeddings
π οΈ Audio Isolation
π€ Text to Voice (Voice design)
πΌ Dubbing
π οΈ Audio Native
π οΈ Forced Alignment
ποΈ Models, Voices
π Voice Library
ποΈ Vector Stores, Vector Store Files, Vector Store File Batches


Worried about costs? No Problem! With Ollama you can run LLMs like LLaMA and Mistral on your own machine for free. Ideal for offline development, edge applications, and private inference without relying on cloud services.
π¬ Chat Completion, Stream, Completion
π§ Embeddings
π οΈ Models, Model Management
π οΈ Version
A gateway to multiple third-party LLMs offering 300+ models including Claude, Mistral, Command R, and moreβall accessible via a single API. Great for experimenting with various models without switching providers.
π¬ Chat Completion - Stream, Vision, Structured Outputs, Function calling, Web Search
More API integrations are on the way: Have something you'd like to see next? Let us know via Discord β your feedback shapes the roadmap.
Feature Comparison
Supported APIs
OpenAI
OpenAI
ElevenLabs,
Ollama
OpenRouter
Supported Tasks
Response Generation (Text)
Image Generation
Text To Speech
Speech To Text
Response Generation (Text)
Image Generation
Text To Speech
Speech To Text
SoundFX Generation
Video Generation
Voice Change
Audio Isolation
Number of AI Models
120+
450+
Number of AI Voices
10+
4000+
Unity Components
β
Chatbot
Chatbot (with Assistants API)
Realtime Assistant
Modular Components (components you can use with 'Unity Components')
β
Image Generator
Text To Speech
Speech To Text
Voice Changer
Advanced Integration
β
Chat Session, Assistants API, Realtime API
Editor Tools
Model / Voice Library
Model / Voice Library
Editor Chat
Code Generator
Unity Component Generator
Avatar Generator
Icon Generator
Background Generator
Mesh Texture Generator
Speech Generator
SoundFX Generator
Video Generator
Supported Platforms
Fully
Windows, OSX, Linux
Partially
Unity WebGL
Fully
Android, iOS, Windows Phone/Store
Fully
PlayStation, Xbox, PS Vita/PSM, Switch
Last updated