CoreAI — LLM agents that play your game

CoreAI is a Unity framework for LLM-powered NPCs and agents that call your game code — function calling, tools, persistent memory, and runtime Lua — running on a local 4 GB model or any OpenAI-compatible API. No cloud keys required, no scripted dialogue trees.

_{Quick Start  •
Features  •
Three ways in  •
Models  •
Architecture  •
Docs  •
Tests}

🎬 Imagine this

A player walks up to a blacksmith NPC and types "Got any fire swords?". The NPC calls your inventory code, finds nothing, then replies in-character: "Fresh out of fire blades, but I can forge one if you bring me a Fire Crystal." The player crafts it — the Programmer agent writes Lua, the CoreMechanic agent rolls stats, and a unique Flame Sword lands in their inventory. All at runtime, streaming token-by-token into a chat bubble, on a local model. That's CoreAI.

Why CoreAI?

LLM-in-a-game demos are everywhere; shipping one is the hard part. CoreAI is the missing production layer between a chat box and your gameplay: the model doesn't just talk — it calls your C# and Lua, and the framework handles the messy reality of small local models (wrong tool casing, split streaming tags, runaway loops, rate limits, context overflow) so your game doesn't break.

🆓 Works out of the box, scales to production — one-liner await CoreAi.AskAsync("…") for your first feature; full AgentBuilder + orchestrator + per-role routing when you need it.
🧩 No forced heavy dependencies — LLM and Lua are optional modules (COREAI_NO_LLM / COREAI_NO_LUA, auto-detected from installed packages). Install only what your game uses.
🛡️ Built for small local models — auto-repairs tool-name casing, retries with feedback, survives streaming <think> splits, caps runaway generation. The full PlayMode suite passes on a local Qwen3.5-4B GGUF.

Is it for you?

You are…	Start with	Time to first result
🟢 New / prototyping	`CoreAI → Setup → Create Chat Demo Scene` → Play, or `await CoreAi.AskAsync("…")`	~5 min
🔵 Building a real game	`AgentBuilder` + tools + `IAiOrchestrationService`, per-role LLM routing	grows with you

🚀 30-second start: install (below) → CoreAI → Setup → Create Chat Demo Scene → press Play → type. Jump to Quick Start.

⚡ At a glance

🧠 Agents that call your code — real function calling with tool retry, auto-repair, and memory
🏠 Local-first — full PlayMode suite passes on a 4 GB GGUF; no data leaves the player's machine
💬 Drop-in chat UI — one menu click creates a working streaming chat scene
🎯 Self-Service Skills — the model sees 2 meta-tools instead of hundreds (~91% token savings)
🌊 Streaming that survives reality — split <think> tags, fragmented tool calls, SSE chunking
🗜️ Long chats that don't explode — token budget, rolling summaries, optional smart compaction
🛡️ Production guardrails — per-tool timeout, runaway cap, loop guard, Lua generation rate limit, rate-limit metrics + F10 token-budget overlay
🔄 Dual-backend fallback — local model first, cloud API as automatic backup (or the reverse)
🧩 Optional modules — no MoonSharp? No LLMUnity? Still compiles; features light up when packages appear
🌍 Unity world control — spawn primitives (cube, sphere, cylinder, capsule, plane, empty), move/rotate/scale objects, and add configured components through the reflection-free component_command tool

⭐ If CoreAI saves you time — star the repo! It's the main way other Unity devs find it.

Why not just call the API yourself?

Raw API calls get you text; CoreAI gives you the production layer around that text. Tool-calling, persistent memory, streaming, Lua sandboxing, retries, and multi-backend routing are already wired into Unity and covered by tests. You still own your game logic, but you do not have to rebuild the fragile glue around local models, OpenAI-compatible APIs, tool repair, and long-running conversations. CoreAI is the shortest path from "chat box" to agents that can safely act inside the game.

Releases: version in core package.json and Unity package.json (same semver). Notes: Unity changelog · Core changelog.

	Section
Game-Creation Benchmark	Local LLM game-building benchmark and model ranking
Three ways to call the LLM	Chat UI · `CoreAi` · agents / orchestrator
What CoreAI can do	Agents, tools, Lua, memory · long-chat budget & optional smart compaction (`v1.5+`)
Integration examples & ideas	Practical game-design patterns using CoreAI tools & memory
Architecture	Two packages, diagram
Quick Start	NuGet, UPM, scene
Documentation	Map of docs
Tests	EditMode & PlayMode
Multiplayer & Singleplayer	Host-authoritative multiplayer, local singleplayer
Changelog	Unity + core release notes (single source of truth)
Author & Community	Author, ecosystem, license, contact, support

Game-Creation Benchmark

CoreAI includes a local game-creation benchmark that measures how well an LLM builds a game by driving real execute_lua and world_command tools. It scores 0-100 across seven scenario groups (world-building, Lua logic, math reasoning, playthroughs, instruction adherence, a free-build castle showcase and a comprehensive integration puzzle), adds 0-10 game-fitness ratings per role, and can run against any OpenAI-compatible endpoint — cloud APIs or a local LMStudio multi-model sweep.

Cloud / frontier models — one full G1-G7 run each (suite v1.6, no token caps):

Full ranking table — cloud models

#	Model	Suite	Pass-rate	P/PA/F	Tools	Intent	Task	Determ	Reason	Instr	Eff	Tool-err	Tokens	Run
1	`GPT-5.5`	97	96%	24/1/0	74.6	100	99.6	100	100	100	4.3	14.3%	13789	`20260703_024825`
2	`glm-5.2`	93.5	77.3%	17/4/1	68.8	100	96.3	100	100	96.3	2	6.5%	80861	`20260702_234351`
3	`GPT-5.3 Codex Spark`	93.2	87.5%	21/2/1	92.6	94.1	91.2	0	96.3	96.3	4.4	7.1%	14009	`20260703_033023`
4	`claude-opus-4.8`	92.9	87.5%	21/3/0	59.3	94.1	99.5	100	100	96.3	3.3	10.4%	25758	`20260702_171849`
5	`claude-haiku-4.5`	92.7	80%	20/4/1	86.8	91	99.6	100	100	83.8	3.5	1.4%	36569	`20260703_010715`
6	`claude-fable-5`	89.5	80%	20/4/1	57.9	88.9	95.6	100	90	96.3	3.3	5.6%	27312	`20260702_191106`
7	`claude-sonnet-5`	88.2	84%	21/1/3	72.8	94.4	92	100	100	78.7	4.2	10.6%	16391	`20260702_195533`

Local models (LMStudio) — same suite on consumer hardware:

Full ranking table — local models

#	Model	Suite	Pass-rate	P/PA/F	Tools	Intent	Task	Determ	Reason	Instr	Eff	Tool-err	Tokens	Run
1	`qwen3.6-27b-heretic-…-imatrix-max`	93.1	87%	20/1/2	98	96.1	96.8	100	100	88	1.3	9.7%	52495	`20260702_062928`
2	`deepreinforce-ai_ornith-1.0-9b`	92.1	68.2%	15/7/0	68.8	93.3	99.5	100	100	83.8	2.6	15.6%	96801	`20260702_093857`
3	`qwen3.5-4b-mtp`	92	84%	21/3/1	93	100	93.5	100	100	91.7	4.7	11.9%	62773	`20260702_205410`
4	`qwen3.6-27b-fable-5-experimental`	87.3	73.7%	14/3/2	85.9	83.3	88.9	100	80	83.8	1.9	32.2%	58974	`20260702_054314`
5	`qwen3.5-2b`	84.8	75%	18/2/4	92.6	100	83.3	100	66.7	91.2	4.4	8.5%	57656	`20260702_225056`
6	`deepreinforce-ai_ornith-1.0-35b`	81.1	65.2%	15/4/4	83.3	87.5	91.3	100	95.8	78.7	3.5	16.8%	70744	`20260702_101519`
7	`qwythos-9b-claude-mythos-5-1m`	81	62.5%	15/5/4	69.4	84.6	91.7	100	88.9	78.7	3.2	18.2%	63570	`20260702_052949`
8	`qwen3.5-0.8b`	57.8	41.7%	10/4/10	94.4	80.1	91.4	100	55.6	38	2.4	1%	43383	`20260702_222625`
9	`lfm2-8b-a1b`	12.4	0%	0/0/25	50.9	2.1	0	0	0	57.8	0	0%	52430	`20260702_051709`

Comparisons are rebuilt from any set of past report JSONs you pick — see custom comparisons in the benchmark guide.

Benchmark docs: full benchmark guide and benchmark design. Want the full detail behind one score — per-scenario checkpoints, every tool call, and the exact system prompt sent to the model? See an example full report (one curated run; every real run writes to a gitignored TestResults/CoreAI/Benchmarks/ and is not committed). Per-model G6 free-build castle screenshots live in the benchmark guide's gallery.

🧭 Three ways in: UI · CoreAi · agents

You are building…	Start here	One line
In-game chat for players	`CoreAI → Setup → Create Chat Demo Scene` + `CoreAiChatPanel`	Play and type
Any script, no DI yet	`using CoreAI;` → `await CoreAi.AskAsync("…")` or `StreamAsync`	COREAI_SINGLETON_API
Full agent + tools + orchestrator	`AgentBuilder` + `IAiOrchestrationService`	AGENT_BUILDER

All three paths share the same CoreAILifetimeScope and LLM backend when the scene is set up once.

✨ What CoreAI Can Do

🏗️ Create AI Agents in 3 Lines

var merchant = new AgentBuilder("Blacksmith")
    .WithSystemPrompt("You are a blacksmith. Sell weapons and remember purchases.")
    .WithTool(new InventoryLlmTool(myInventory))  // Knows their stock
    .WithMemory()                                  // Remembers buyers
    .WithMaxOutputTokens(512)                      // Per-agent reply budget
    .Build();                                      // → AgentConfig (in-memory blueprint)

// Attach that blueprint to the global policy created at startup (CoreAILifetimeScope).
// The orchestrator looks up tools/system prompt by RoleId ("Blacksmith") from this policy.
merchant.ApplyToPolicy(CoreAIAgent.Policy);

// Ask* uses CoreAIAgent.Orchestrator (same startup wiring). Needs Play + CoreAILifetimeScope on scene.
merchant.AskWithCallback("Show me your swords");
merchant.AskWithCallback("Show me your swords", (response) => Debug.Log(response));

Build() — returns AgentConfig (role id, tools, prompts, mode). Still unknown to the runtime until registered.
ApplyToPolicy(CoreAIAgent.Policy) — writes into the live AgentMemoryPolicy so RunTask / tool routing can find this role's tools and merged prompts. Without it, "Blacksmith" is just a string the model never gets the right stack for.
AskWithCallback / AskAsync — thin wrappers over CoreAIAgent.Orchestrator → AiTaskRequest with RoleId from the config. Same idea as resolving IAiOrchestrationService from DI — see COREAI_SINGLETON_API.

3 Agent Modes: 🛒 ToolsAndChat · 🤖 ToolsOnly · 💬 ChatOnly

🎯 Self-Service Skills — Agents Load Tools On Demand

When your agent has dozens of tools across different domains (crafting, combat, trading, quests), sending all of them every request wastes tokens. Skills solve this:

// Define skills — each is a group of tools + instructions
var crafting = new SkillSet("Crafting",
    "Forge weapons and armor from raw materials",
    "1. Call get_recipes to list recipes.\n2. Call craft_item to craft.",
    new DelegateLlmTool("get_recipes", "List recipes", (string type) => ...),
    new DelegateLlmTool("craft_item", "Craft item", (string id) => ...));

var combat = new SkillSet("Combat", "Fight enemies", "Call attack with target.",
    new DelegateLlmTool("attack", "Attack target", (string target) => ...));

// Register skills — model sees only 2 meta-tools, not all skill tools
var gm = new AgentBuilder("GameMaster")
    .WithSystemPrompt("You are a Game Master in a fantasy RPG.")
    .WithSkill(crafting)
    .WithSkill(combat)
    .Build();
gm.ApplyToPolicy(policy);

How it works:

Model sees a lightweight catalog (skill names + descriptions) in the system prompt
Model calls read_skill("Crafting") → gets full instructions + tool schemas
Model calls call_skill_tool("get_recipes", "{}") → proxy routes to real tool
Token overhead: constant (2 meta-tools) regardless of total skill/tool count

💡 50 tools across 10 skills? Without skills: ~4,000 tokens. With skills: ~360 tokens. 91% savings.

Mix direct tools and skills freely: WithTool(memory) + WithSkill(crafting) — memory is always visible, crafting loads on demand.

Docs: AGENT_BUILDER.md §Skills · COREAI_SINGLETON_API

💬 Drop-in Chat UI

Add an NPC chat to any scene in minutes — no custom UI code required:

CoreAI → Setup → Create Chat Demo Scene

This generates Assets/CoreAiUnity/Scenes/CoreAiChatDemo.unity with a pre-wired CoreAiChatPanel (UI Toolkit + UXML/USS, dark theme by default; default floating size ~650×910, flush-right scrollbar, optional long-turn hint under the typing row), CoreAiChatConfig_Demo.asset and a fully configured CoreAILifetimeScope — press Play and chat.

// Same stack as the panel — pick your style:
await foreach (var chunk in CoreAi.StreamAsync("Hello", "SmartChat"))
    Debug.Log(chunk);

// Or explicit service (e.g. from DI in tests):
var service = CoreAiChatService.TryCreateFromScene();
await foreach (var chunk in service.SendMessageStreamingAsync("Hello", "SmartChat"))
    if (!string.IsNullOrEmpty(chunk.Text)) Debug.Log(chunk.Text);

Streaming pipeline: HTTP SSE or LLMUnity callback → stateful ThinkBlockStreamFilter (strips <think> blocks split across chunks) → typing indicator → bubble. Cancellation cancels the in-flight HTTP HttpClient request / enumerator on the MEAI path.

Docs: README_CHAT.md · STREAMING_ARCHITECTURE.md

⏳ Powerful Lua Coroutine Execution

Now CoreAI allows Lua scripts (like dynamically parsed world logic) to execute as asynchronous coroutines inside Unity:

-- Runs securely across multiple frames relying on Unity's Time
local start_time = time_now()
while time_now() - start_time < 2.0 do
    coroutine.yield()
end

Automatically maps APIs like time_delta(), time_scale(), and hooks securely via an internal InstructionLimitDebugger budget that yields processing back to Unity so you can run heavy computations without freezing the main thread.

🔧 AI Calls Tools (Function Calling)

AI doesn't just generate text — it calls code for real actions:

Tool	What it does	Who uses it
🌍 WorldCommandTool	Spawns primitives/prefabs and moves, rotates, scales, parents, or toggles objects in the world	Creator AI
🧱 component_command	Adds/configures curated Unity components such as `Rigidbody`, colliders, and `Light` without reflection	Creator AI
⚡ Action/Event Tool	Calls any C# method or triggers an Event	All Agents
🧠 MemoryTool	Saves/reads memory between sessions	All Agents
📜 LuaTool	Executes Lua scripts	Programmer AI
🎒 InventoryTool	Gets NPC inventory	Merchant AI
⚙️ GameConfigTool	Reads/modifies game configs	Creator AI
🎭 SceneLlmTool	Read and change hierarchy/transform in PlayMode	All Agents
📸 CameraLlmTool	Captures screenshots (Base64 JPEG) for Vision	All Agents

Create your own:

public class WeatherLlmTool : ILlmTool
{
    public string Name => "get_weather";
    public string Description => "Get current weather.";
    public IEnumerable<AIFunction> CreateAIFunctions() 
    {
        yield return AIFunctionFactory.Create(
            async ct => await _provider.GetWeatherAsync(ct), "get_weather", "Get weather.");
    }
}

💡 Design tools for token economy: use short parameter keys (q instead of question_text), concise descriptions, indexes instead of strings, and smart defaults. Details: TOOL_CALL_SPEC.md

🎮 Dynamic Mechanics — AI Changes the Game Live

Player: "Craft a weapon from Iron and Fire Crystal"
  ↓
CoreMechanicAI: "Iron + Fire Crystal → Flame Sword, damage 45"
  ↓
Programmer AI: execute_lua → create_item("Flame Sword", "weapon", 75)
               add_special_effect("fire_damage: 15")
  ↓
✨ Player receives a unique item!

🧪 Build a whole game from mods alone

CoreAI mods aren't limited to tuning numbers — a host can expose its own Lua API so the AI authors brand-new content at runtime. The included Unit Forge demo ships an empty arena: every unit type, every wave, and every behaviour is added by Lua mods written through chat.

-- The AI writes this when you ask for "a knight line vs a goblin swarm"
forge_define("knight", "ally",  60, 8, 1.4, 1.1, "#3aa0ff")
forge_define("goblin", "enemy", 25, 4, 2.0, 1.0, "#3cc452")
for i = 1, 3 do forge_spawn("knight", -5, (i - 2) * 1.5) end
for i = 1, 4 do forge_spawn("goblin",  5, (i - 2) * 1.2) end

hooks_every(3.0, function()         -- endless reinforcements, also written by the AI
  if forge_count("enemy") < 6 then forge_spawn("goblin", 6, math.random(-3, 3)) end
end)

Capability tiers keep it safe: forge_* lives behind the WorldEdit tier, while a separate Full tier (opt-in, public members only by default) can reach arbitrary components via reflection. Toggle the optional Lua/LLM packages from CoreAI → Setup → Modules.

Demo scene	What it shows
🛠️ Unit Forge (`Assets/CoreAI.Demos/ModdableUnits`)	New units & whole game loops created by mods alone
🌊 Wave Auto-Battler (`Assets/CoreAI.Demos/LiveMechanicsMods`)	Number-tuning mods over a live battle
🔓 Full Access (`Assets/CoreAI.Demos/FullAccess`)	Reflection-tier Lua reaching scene objects via `unity_*`

Docs: LUA_GAME_API · OPTIONAL_MODULES

🧠 Memory — AI Remembers Everything

	Memory	ChatHistory
Storage	JSON file on disk	In LLMAgent (RAM)
Duration	Between sessions	Current conversation
For what	Facts, purchases, quests	Conversation context

🗜️ Long conversations — budget, summaries & optional “smart compaction”

When WithChatHistory() fills the model window, CoreAI keeps a fresh tail of messages and folds older turns into ## Conversation Summary in the system prompt (deterministic rollup by default). Newer releases add:

Feature	What you get
Context budget	`IContextBudgetPolicy` / `HistoryTokenBudget` — system + user + tools shrink what fits in history fairly.
Persisted summaries	`InMemoryConversationSummaryStore` (process) or `FileConversationSummaryStore` (disk under Unity’s `persistentDataPath`) — summaries accrue across turns.
LLM-assisted compaction (opt-in)	Extra `CompleteAsync` on role `__CoreAI_ContextCompaction` to rewrite the rolling summary (enable globally on `CoreAISettingsAsset`, then tune per agent).
Per-role defaults	`AgentBuilder` agents default on; built-in `Programmer` defaults off (cheaper truncation for tool-heavy Lua roles). `WithLlmContextCompaction(false)` to opt out.

// Custom agent: long chat + smart rollup when global toggle is enabled
new AgentBuilder("LoreKeeper")
    .WithChatHistory(8192, persistBetweenSessions: true)
    .WithLlmContextCompaction(true) // default anyway; explicit for docs
    .Build()
    .ApplyToPolicy(policy);

// Programmer-style role: deterministic only for this builder agent
new AgentBuilder("ToolsFirst")
    .WithChatHistory(4096)
    .WithLlmContextCompaction(false)
    .Build();

Deep dive: CHANGELOG (Core v1.5.2–1.5.3) · MemorySystem · ARCHITECTURE · COREAI_SETTINGS.

🔄 Tool Call Retry + Self-Healing — AI Learns from Mistakes

Small models (Qwen3.5-2B) sometimes forget the format or case of tool names. CoreAI automatically:

🔧 Repairs tool name casing — TryRepairToolName silently maps MEMORY → memory, Spawn_Quiz → spawn_quiz before execution fails.
♻️ Retries on failure — up to 3 retries with error feedback injected into chat history so the model self-corrects.
🌐 Retries HTTP errors — 429 (Rate Limited) and 5xx responses trigger automatic retry with Retry-After header support or exponential backoff (2s → 4s, configurable in Inspector).
✅ Checks fenced Lua blocks immediately.

Model says: {"name":"MEMORY", ...}
     ↓ TryRepairToolName
Executes: {"name":"memory", ...}  ← silently fixed, no error shown

📏 Recommended Models

Model	Size	Tool Calling	When to use
Qwen3.5-4B	4B	✅ Great	Recommended for local GGUF
Qwen3.5-35B (MoE) API	35B/3A	✅ Excellent	Ideal via API — fast & accurate
Gemma 4 26B (via LM Studio)	26B	✅ Excellent	Great via HTTP API
LM Studio / OpenAI API	Any	✅ Excellent	External models via HTTP — best choice
Qwen3.5-2B	2B	⚠️ Works	Works, but sometimes makes mistakes
Qwen3.5-0.8B	0.8B	⚠️ Basic	Most tests pass, struggles with multi-step

💡 Recommendation: Qwen3.5-4B locally or Qwen3.5-35B (MoE) via API
MoE models (Mixture of Experts) activate only 3B parameters per inference — fast as 4B, accurate as 35B.

🧪 PlayMode Test Results by Model Size

The table below is a snapshot of a recorded PlayMode run against real LLM backends (results vary by model build, quantization, and provider). Reproduce it locally to confirm for your setup:

Test Category	0.8B	2B	4B+
Memory Tool (write/append/clear)	✅ Pass	✅ Pass	✅ Pass
Custom Agents (tool calling)	✅ Pass	✅ Pass	✅ Pass
World Commands (list/play/spawn)	✅ Pass	✅ Pass	✅ Pass
Execute Lua (single tool)	✅ Pass	✅ Pass	✅ Pass
Multi-Agent Workflow (Creator→Mechanic→Programmer)	⚠️ Partial	✅ Pass	✅ Pass
Crafting Memory (multi-step: memory + lua)	⚠️ Partial	⚠️ Mostly	✅ Pass
Chat History (persistent context)	❌ Too small	⚠️ Mostly	✅ Pass
Player Chat (NPC dialogue)	✅ Pass	✅ Pass	✅ Pass

🏆 Qwen3.5-4B passed all tests in this recorded run. This is the recommended minimum for production use.
📊 Qwen3.5-0.8B passes most tests — impressive for its size! Struggles only with complex multi-step tool calling chains.
📈 2B is a solid middle ground — occasional mistakes in multi-step scenarios, but mostly reliable.

💡 Integration Examples & Ideas

How can you use CoreAI in your game? Here are some "Brainrot-free" ideas:

The "Alive" Merchant: Instead of a static list, the blacksmith remembers that you sold him a legendary dragon scale yesterday. He might offer you a "special deal" on a Dragon-slaying sword today.
Autonomous Game Master: Let the AI monitor the player's health and resources. If the player is struggling, the GM might "whisper" a hint or "accidentally" spawn a health potion nearby via WorldCommandTool.
Real-time Lore Narrator: As the player enters a new biome, the AI generates unique lore based on the current weather, time of day, and the player's equipped items.
AI-Driven Procedural Quests: Quests are no longer Kill X Wolves. An AI King asks you to Investigate why the wolves are glowing, and you can actually interview the wolves (who might be under a Lua-driven spell).
Voice/Chat-to-Action: "Make it rain fire!" -> The AI parses the intent, checks the WeatherTool, and executes the corresponding Lua script to trigger a firestorm.

🏛️ Architecture

The repository consists of two packages:

Package	What's inside	Dependencies
com.neoxider.coreai	Portable core — pure C# without Unity	VContainer, MoonSharp
com.neoxider.coreaiunity	Unity layer — DI, LLM, MEAI, MessagePipe, tests	Depends on `coreai`

Architecture diagram

┌─────────────────────────────────────────────────────────────┐
│                      Player / Game                           │
└──────────────────────┬──────────────────────────────────────┘
                       ↓
┌─────────────────────────────────────────────────────────────┐
│                   AiOrchestrator                              │
│  • Priority queue  • JSON strip (defense-in-depth)            │
└──────────────────────┬──────────────────────────────────────┘
                       ↓
┌─────────────────────────────────────────────────────────────┐
│              LoggingLlmClientDecorator                        │
│  • HTTP retry (429/5xx)  • Retry-After  • Exp. backoff        │
└──────────────────────┬──────────────────────────────────────┘
                       ↓
┌─────────────────────────────────────────────────────────────┐
│                     LLM Client (MeaiLlmClient)               │
│  • LLMUnity (local GGUF)  • OpenAI HTTP  • Stub             │
│  • TryExtractToolCallsFromText (JSON-in-text → tool call)    │
└──────────────────────┬──────────────────────────────────────┘
                       ↓
┌─────────────────────────────────────────────────────────────┐
│               SmartToolCallingChatClient                      │
│  • TryRepairToolName (MEMORY → memory)                       │
│  • Duplicate detection  • Consecutive error tracking          │
└──────────────────────┬──────────────────────────────────────┘
                       ↓
┌─────────────────────────────────────────────────────────────┐
│                   AI Agents                                  │
│  🛒 Merchant  📜 Programmer  🎨 Creator  📊 Analyzer        │
│  🗡️ CoreMechanic  💬 SmartChat  + Your custom ones!        │
└──────────────────────┬──────────────────────────────────────┘
                       ↓
┌─────────────────────────────────────────────────────────────┐
│                   Tools (ILlmTool)                           │
│  🧠 Memory  📜 Lua  🎒 Inventory  ⚙️ GameConfig  + Yours!   │
│  🎯 SkillSet → read_skill + call_skill_tool (on-demand)     │
└──────────────────────┬──────────────────────────────────────┘
                       ↓
┌─────────────────────────────────────────────────────────────┐
│                   Game World                                 │
│  • Lua Sandbox (MoonSharp)  • MessagePipe  • DI (VContainer)│
└─────────────────────────────────────────────────────────────┘

🚀 Quick Start

📦 Full step-by-step guide: INSTALL.md — base install, the LLM module (NuGet), and the Lua module, with the optional-module switches. The summary below is the short version.

1. Install the LLM pipeline with NuGetForUnity

CoreAI uses Microsoft.Extensions.AI for the LLM pipeline. The recommended install path is NuGetForUnity: install NuGetForUnity, then install one package:

Microsoft.Extensions.AI

NuGetForUnity resolves the rest of the dependency chain automatically. Do not install the transitive packages by hand when using this path.

Manual DLL alternative

If you cannot use NuGetForUnity, clone this repo and copy the entire Assets/Packages/ folder into your project. The restored DLLs live there; the packages.config / NuGet.config manifests are at Assets/packages.config and Assets/NuGet.config.

NuGet Package	Version	Required by
`Microsoft.Extensions.AI`	10.7.0	CoreAI Core
`Microsoft.Extensions.AI.Abstractions`	10.7.0	CoreAI Core
`Microsoft.Bcl.AsyncInterfaces`	10.0.9	System dependency
`System.Text.Json`	10.0.9	JSON serialization
`System.Text.Encodings.Web`	10.0.9	System dependency
`System.Numerics.Tensors`	10.0.9	System dependency
`Microsoft.Extensions.Logging.Abstractions`	10.0.9	Logging
`Microsoft.Extensions.DependencyInjection.Abstractions`	10.0.9	DI
`System.Diagnostics.DiagnosticSource`	10.0.9	System dependency

Supported backends

Backend	Use it for
LLMUnity	On-device GGUF models inside Unity
LM Studio	Local OpenAI-compatible development server
OpenAI / any OpenAI-compatible API	Hosted provider calls through the client-owned API path
vLLM	Self-hosted OpenAI-compatible inference
Ollama	Local OpenAI-compatible model server
Server-managed proxy	Production proxy where your backend owns credentials, budgets, and policy

2. Add Git dependencies to manifest.json (required)

Unity Package Manager does not transitively pull every Git dependency for you.

Preferred: After CoreAiUnity is in the project, use CoreAI → Setup → Install Git Dependencies. It merges any missing package keys only — pins you manage by hand stay untouched.

Manual alternative: Edit Packages/manifest.json and add these entries under "dependencies":

    "jp.hadashikick.vcontainer": "https://github.com/hadashiA/VContainer.git?path=VContainer/Assets/VContainer#1.17.0",
    "com.cysharp.messagepipe": "https://github.com/Cysharp/MessagePipe.git?path=src/MessagePipe.Unity/Assets/Plugins/MessagePipe",
    "com.cysharp.messagepipe.vcontainer": "https://github.com/Cysharp/MessagePipe.git?path=src/MessagePipe.Unity/Assets/Plugins/MessagePipe.VContainer",
    "com.cysharp.unitask": "https://github.com/Cysharp/UniTask.git?path=src/UniTask/Assets/Plugins/UniTask",

Optional modules (since v3.0.0) — CoreAI compiles without them and enables the features automatically when the package is detected:

    "ai.undream.llm": "https://github.com/undreamai/LLMUnity.git",
    "org.moonsharp.moonsharp": "https://github.com/moonsharp-devs/moonsharp.git?path=/interpreter#upm/beta/v3.0",

Optional package	Unlocks	Skip it if
LLMUnity (`ai.undream.llm`)	Local GGUF models on-device	You only use an OpenAI-compatible HTTP API
MoonSharp (`org.moonsharp.moonsharp`)	Lua sandbox & AI-written gameplay scripts	You don't need Lua scripting

3. Install CoreAI packages (Git URL)

Unity Editor → Window → Package Manager → + → Add package from git URL…

Step 1 — Core engine (pure C#, no UnityEngine):

https://github.com/NeoXider/CoreAI.git?path=Assets/CoreAI

Step 2 — Unity layer (MonoBehaviour, LLM clients, tools):

https://github.com/NeoXider/CoreAI.git?path=Assets/CoreAiUnity

4. Setup scene

Recommended first try (chat UI + scope):

CoreAI → Setup → Create Chat Demo Scene

Bare wiring (lifetime scope + settings assets — no demo UI):

CoreAI → Setup → Create Bare Scene (advanced)

Either path can:

✅ Create CoreAILifetimeScope on the scene
✅ Ensure required assets (CoreAISettings, GameLogSettings, AgentPromptsManifest, etc.)
✅ Assign references on the scope
✅ Optionally add LLM + LLMAgent when the backend is LLMUnity (bare-scene wizard)

5. Configure LLM backend

Open settings: CoreAI → Settings and choose your backend:

Backend	Setup
LLMUnity (local)	Download a GGUF model (e.g. Qwen3.5-4B) via LLMUnity Model Manager
HTTP API (LM Studio, OpenAI)	Set `API Base URL` and `API Key` in Settings
Auto	CoreAI picks the best available backend automatically

6. Create your first agent

var storyteller = new AgentBuilder("Storyteller")
    .WithSystemPrompt("You are a campfire storyteller. Share tales about the world.")
    .WithMemory()
    .WithChatHistory()
    .WithMode(AgentMode.ChatOnly)
    .Build();

📖 Full setup guide with LLM configuration: QUICK_START.md
🏗️ Agent Builder reference + ready recipes: AGENT_BUILDER.md

📚 Documentation

Language: All documentation is maintained in English — in-depth Markdown lives under Assets/CoreAiUnity/Docs/ and Assets/CoreAI/Docs/. Follow the linked guides for detail.

Start from the index and pick the level that matches your goal:

Game-Creation Benchmark: live PlayMode benchmark guide and design for scored model-driven game creation: full benchmark guide, benchmark design, and an example full report.

🧭 Docs/README.md — repository documentation entry point. 🧭 DOCS_INDEX.md — CoreAI Unity map (Beginner → Intermediate → Architecture).

Reset CoreAI file persistence (Editor): CoreAI → Delete All Persistent Saves... (exit Play Mode first) deletes persistentDataPath/CoreAI — agent memory, persisted chat JSON, summaries (desktop), Lua/data-overlay version files. Project assets under Assets/ are untouched. See TROUBLESHOOTING.md.

Getting started

Document	What's inside
🚀 QUICK_START.md	Install → open scene → connect LLM → Play
🚀 QUICK_START_FULL.md	Full 10-min walkthrough: LM Studio → Unity → first command
🎯 COREAI_SINGLETON_API.md	`CoreAi` one-liners — beginners + pros
🏗️ AGENT_BUILDER.md	Build an NPC in 3 lines · modes · ready-made recipes
⚙️ COREAI_SETTINGS.md	Backends, models, timeout, streaming toggle

Chat & streaming

Document	What's inside
💬 README_CHAT.md	Drop-in `CoreAiChatPanel` + demo scene
🌊 STREAMING_ARCHITECTURE.md	SSE / LLMUnity → filters → UI · orchestrator streaming
📊 MEAI_TOKENS_FACT_VS_ESTIMATE.md	Provider `usage` vs pre-flight estimates; SSE `include_usage`; HTTP vs orchestrator timeouts
🔒 LUA_SANDBOX_SECURITY.md	Lua sandbox boundary, removed APIs, execution limits, binding rules

Tools, memory, roles

Document	What's inside
🔧 TOOL_CALL_SPEC.md	Tool-calling specification
🛒 CHAT_TOOL_CALLING.md	Merchant NPC with inventory
🧠 MemorySystem.md	Agent memory (disk + chat history)
🤖 AI_AGENT_ROLES.md	Agent roles & prompts

Architecture

Document	What's inside
🗺️ DEVELOPER_GUIDE.md	Code map, LLM→commands flow, PR checklist
📐 DGF_SPEC.md	Normative spec: DI, threads, authority
🛠️ MEAI_TOOL_CALLING.md	MEAI pipeline: `ILlmTool` → `AIFunction` → `FunctionInvokingChatClient`
🧰 TOOL_CALLING_BEST_PRACTICES.md	Tool schema, idempotency, duplicate calls, SkillSet organization
🔀 LLM_ROUTING.md	Portable routing: modes, policy, usage sinks, timeouts
📑 CoreAI/Docs/README.md	Index of all portable CoreAI markdown guides
📋 CHANGELOG.md · CHANGELOG (Unity)	Version history

🧪 Tests

Unity → Window → General → Test Runner
  ├── EditMode — large fast suite (no real LLM): prompts, streaming, Lua sandbox,
  │              tools, rate limit, CoreAi facade, tool name repair, backoff math,
  │              orchestrator streaming, tool call extraction parity, …
  └── PlayMode — integration tests with a configured HTTP or local GGUF backend
                 ├── FullPipelineResiliencePlayModeTests — streaming/non-streaming
                 │   tool calls with no-JSON-leak guarantees, memory write/read,
                 │   orchestrator merchant + inventory, trace diagnostics
                 ├── ToolNameRepairPlayModeTests — hybrid scripted+real-LLM:
                 │   wrong casing repair, unknown tool self-correction, mixed-case
                 └── StreamingToolCallingPlayModeTests — cancel, state parity

Run EditMode first in CI; PlayMode is optional and needs a backend (env vars for HTTP — see LLMUNITY_SETUP_AND_MODELS). Real-model memory recall may Ignore (not fail) if the local OpenAI-compatible server returns HTTP 5xx — see TROUBLESHOOTING.

🌐 Multiplayer and Singleplayer

Singleplayer: Same pipeline, AI works locally
Multiplayer: AI logic on host, clients receive agreed outcomes

One template — for both solo campaign and coop.

Changelog

Per-release notes, migration hints, and version numbers are maintained only in the changelogs (so this README does not need duplicate edits on every ship):

com.neoxider.coreaiunity CHANGELOG — Unity layer: Editor, chat UI, PlayMode tests, docs.
com.neoxider.coreai CHANGELOG — portable core and release-sync lines.

The version field in each package’s package.json is the authoritative semver for that package.

🤝 Author and Community

Author: Neoxider
Ecosystem: NeoxiderTools
License: LICENSE

Contact: neoxider@gmail.com | GitHub Issues

💖 Support the project

CoreAI is free for non-commercial use and developed in spare time. If it saves you hours of work:

⭐ Star the repo — the cheapest way to help it grow
💖 Sponsor on GitHub — fund development of new features
💼 Need it in a commercial game? A separate commercial license is available — write to neoxider@gmail.com
🛠️ Priority support / custom integration — also via email

🎮 CoreAI — stop writing dialogue trees. Ship agents that think, call your code, and remember — from a local 4B model or a cloud API, your choice.

Name		Name	Last commit message	Last commit date
Latest commit History 447 Commits
.github		.github
Assets		Assets
Docs		Docs
Images		Images
Packages		Packages
ProjectSettings		ProjectSettings
TODO		TODO
hooks		hooks
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
INSTALL.md		INSTALL.md
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md

Folders and files

Latest commit

History

Repository files navigation

CoreAI — LLM agents that play your game

🎬 Imagine this

Why CoreAI?

Is it for you?

⚡ At a glance

Why not just call the API yourself?

Contents

Game-Creation Benchmark

🧭 Three ways in: UI · CoreAi · agents

✨ What CoreAI Can Do

🏗️ Create AI Agents in 3 Lines

🎯 Self-Service Skills — Agents Load Tools On Demand

💬 Drop-in Chat UI

⏳ Powerful Lua Coroutine Execution

🔧 AI Calls Tools (Function Calling)

🎮 Dynamic Mechanics — AI Changes the Game Live

🧪 Build a whole game from mods alone

🧠 Memory — AI Remembers Everything

🗜️ Long conversations — budget, summaries & optional “smart compaction”

🔄 Tool Call Retry + Self-Healing — AI Learns from Mistakes

📏 Recommended Models

🧪 PlayMode Test Results by Model Size

💡 Integration Examples & Ideas

🏛️ Architecture

🚀 Quick Start

1. Install the LLM pipeline with NuGetForUnity

Supported backends

2. Add Git dependencies to manifest.json (required)

3. Install CoreAI packages (Git URL)

4. Setup scene

5. Configure LLM backend

6. Create your first agent

📚 Documentation

Getting started

Chat & streaming

Tools, memory, roles

Architecture

🧪 Tests

🌐 Multiplayer and Singleplayer

Changelog

🤝 Author and Community

💖 Support the project

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages