9 AI models, one gateway

OOretz Factory routes every prompt through a multi-provider gateway. Hot-swap between Claude, GPT, Gemini, and Mistral per-message. Different models excel at different tasks — we route automatically, you can override.

Anthropic · 3 modelsOpenAI · 3 modelsGoogle · 2 modelsMistral · 1 model

How we route

Each request runs through a 3-layer routing decision before hitting a model:

01
Tier default

Free → Haiku, Pro → Sonnet, Enterprise → Opus. Every user gets a sensible starting point.

02
Language specialization

TypeScript prefers Claude. Python prefers GPT. Java/Kotlin prefers Gemini. Rust/Go prefers Mistral. Caller can override.

03
Complexity boost

When the manifest has 10+ entities or complex workflows, we bump to the next tier up. Pro user gets Opus for free on hard prompts.

Anthropic

Claude Haiku 4.5
claude-haiku-4-5
Anthropic

Fast + cheap for simple TypeScript/React

Context
200k
In
$1.00/M
Out
$5.00/M
Best for
Quick web prototypesSmall refinementsReact componentsFree-tier default

Default model for the free tier. Excellent for simple iterations under 30 entities.

Claude Sonnet 4.6
claude-sonnet-4-6
Anthropic

Best balance of quality + cost for complex apps

Context
1M
In
$3.00/M
Out
$15.00/M
Best for
TypeScript + Next.jsC# / WPF desktopMulti-entity CRUDPro-tier default

The workhorse. Most production OOretz Factory builds use this.

Claude Opus 4.6
claude-opus-4-6
Anthropic

Top-end reasoning for complex architecture

Context
1M
In
$15.00/M
Out
$75.00/M
Best for
10+ entity appsComplex business logicArchitecture decisionsEnterprise default

When the prompt has nuanced business rules or multi-step workflows.

OpenAI

GPT-4o mini
gpt-4o-mini
OpenAI

Cheap Python + data work

Context
128k
In
$0.15/M
Out
$0.60/M
Best for
Python scriptsData pipelinesQuick iterationCheapest option

Cheapest model in the gateway. Great for simple Python or data work.

GPT-4o
gpt-4o
OpenAI

Strong Python + general-purpose code

Context
128k
In
$2.50/M
Out
$10.00/M
Best for
Python appsAPI integrationsGeneral full-stack

Solid all-rounder. Often used as the second voice in ensemble voting.

GPT-5
gpt-5
OpenAI

Top-tier reasoning across all surfaces

Context
400k
In
$20.00/M
Out
$60.00/M
Best for
Hardest reasoning tasksComplex PythonMulti-modal

Use when Opus struggles or when you need OpenAI-specific strengths.

Google

Gemini 2.5 Flash
gemini-2-5-flash
Google

Massive context, fast Java/Kotlin

Context
1M
In
$0.30/M
Out
$1.20/M
Best for
Java appsKotlin / AndroidLong contexts

Highest context-to-cost ratio. Good for large existing-codebase tasks.

Gemini 2.5 Pro
gemini-2-5-pro
Google

2M context for huge multi-file work

Context
2M
In
$2.50/M
Out
$10.00/M
Best for
Multi-file refactoringLong-form generationKotlin

2M context window unmatched. Use when 1M is not enough.

Mistral

Mistral Large
mistral-large
Mistral

EU-hosted, strong Rust/Go

Context
128k
In
$2.00/M
Out
$6.00/M
Best for
Rust appsGo servicesEU data residency

EU-hosted option for data residency. Strong on systems languages.

No model lock-in

New models added quarterly. Existing apps re-route automatically when better/cheaper options appear in the same tier. Override per request via API or operator UI.

Start free →
Provider pricing reflects 2026-05 list rates. Updated quarterly.