9 AI models, one gateway

OOretz Factory routes every prompt through a multi-provider gateway. Hot-swap between Claude, GPT, Gemini, and Mistral per-message. Different models excel at different tasks — we route automatically, you can override.

Anthropic · 3 modelsOpenAI · 3 modelsGoogle · 2 modelsMistral · 1 model

How we route

Each request runs through a 3-layer routing decision before hitting a model:

Tier default

Free → Haiku, Pro → Sonnet, Enterprise → Opus. Every user gets a sensible starting point.

Language specialization

TypeScript prefers Claude. Python prefers GPT. Java/Kotlin prefers Gemini. Rust/Go prefers Mistral. Caller can override.

Complexity boost

When the manifest has 10+ entities or complex workflows, we bump to the next tier up. Pro user gets Opus for free on hard prompts.

Anthropic

Claude Haiku 4.5

claude-haiku-4-5

Anthropic

Fast + cheap for simple TypeScript/React

Context

200k

$1.00/M

Out

$5.00/M

Best for

Quick web prototypesSmall refinementsReact componentsFree-tier default

Default model for the free tier. Excellent for simple iterations under 30 entities.

Claude Sonnet 4.6

claude-sonnet-4-6

Anthropic

Best balance of quality + cost for complex apps

Context

$3.00/M

Out

$15.00/M

Best for

TypeScript + Next.jsC# / WPF desktopMulti-entity CRUDPro-tier default

The workhorse. Most production OOretz Factory builds use this.

Claude Opus 4.6

claude-opus-4-6

Anthropic

Top-end reasoning for complex architecture

Context

$15.00/M

Out

$75.00/M

Best for

10+ entity appsComplex business logicArchitecture decisionsEnterprise default

When the prompt has nuanced business rules or multi-step workflows.

OpenAI

GPT-4o mini

gpt-4o-mini

OpenAI

Cheap Python + data work

Context

128k

$0.15/M

Out

$0.60/M

Best for

Python scriptsData pipelinesQuick iterationCheapest option

Cheapest model in the gateway. Great for simple Python or data work.

GPT-4o

gpt-4o

OpenAI

Strong Python + general-purpose code

Context

128k

$2.50/M

Out

$10.00/M

Best for

Python appsAPI integrationsGeneral full-stack

Solid all-rounder. Often used as the second voice in ensemble voting.

GPT-5

gpt-5

OpenAI

Top-tier reasoning across all surfaces

Context

400k

$20.00/M

Out

$60.00/M

Best for

Hardest reasoning tasksComplex PythonMulti-modal

Use when Opus struggles or when you need OpenAI-specific strengths.

Google

Gemini 2.5 Flash

gemini-2-5-flash

Google

Massive context, fast Java/Kotlin

Context

$0.30/M

Out

$1.20/M

Best for

Java appsKotlin / AndroidLong contexts

Highest context-to-cost ratio. Good for large existing-codebase tasks.

Gemini 2.5 Pro

gemini-2-5-pro

Google

2M context for huge multi-file work

Context

$2.50/M

Out

$10.00/M

Best for

Multi-file refactoringLong-form generationKotlin

2M context window unmatched. Use when 1M is not enough.

Mistral

Mistral Large

mistral-large

Mistral

EU-hosted, strong Rust/Go

Context

128k

$2.00/M

Out

$6.00/M

Best for

Rust appsGo servicesEU data residency

EU-hosted option for data residency. Strong on systems languages.

No model lock-in

New models added quarterly. Existing apps re-route automatically when better/cheaper options appear in the same tier. Override per request via API or operator UI.

Start free →