LLM gateway

The LLM gateway routes AI workloads to the right model per capability key. Cartographer planners, VLM grounding, export generators, Guard Debugger, and Trace AI features all call through it — never with raw API keys in browser or worker payloads.

Production: hosted gateway behind https://api.molar.it (dashboard API proxy)

Local dev: http://127.0.0.1:8787

Packages: @molar/llm-core, @molar/llm-gateway

Why a gateway

Concern	How gateway helps
Security	Workers send integration IDs, not plaintext keys (SEC-05)
Routing	Different models per capability (planner vs VLM vs embeddings)
Metering	Debugger and LLM-heavy tools bill `debugger_tokens_*` meters
BYOK	Org-owned provider keys via integration IDs

Capability keys

Key	Typical use
`planner`	Cartographer crawl planning
`grounding_tier0`	DOM / accessibility grounding
`grounding_tier1_vlm`	Vision-language grounding
`export_generator`	Playwright test export
`export_healer`	Heal broken selectors
`passive_ux_vlm`	UX quality signals
`embeddings`	Semantic search / clustering
`stagehand_planner`	Stagehand-style act() planning
`stagehand_vlm`	Stagehand VLM steps

Routing configuration

Org or run settings carry model_routing — integration IDs only:

{
  "model_routing": {
    "planner": {
      "provider_integration_id": "molar-managed",
      "model": "claude-sonnet-4-5-20251022"
    },
    "grounding_tier1_vlm": {
      "provider_integration_id": "byok-openai-prod",
      "model": "gpt-4o"
    }
  }
}

`provider_integration_id`	Meaning
`molar-managed`	Molar-hosted keys; subject to plan debugger budget and spend cap
`byok-*`	Your stored provider credential — see BYOK

OpenAI-compatible API

The gateway exposes an OpenAI-compatible HTTP API for workers:

Cartographer FastAPI / Python workers → MOLAR_LLM_BASE_URL
TypeScript services → @molar/llm-core route resolution

Service-to-service auth uses MOLAR_LLM_SERVICE_KEY (not your org PAT).

Browser auth path

Browser sessions never hit the gateway directly:

User session via Better Auth (HttpOnly cookie)
Next.js dashboard API at /api/cartographer/v1/* mints short-lived internal JWT (aud: cartographer-bff)
FastAPI workers call gateway with service key

See Cartographer auth docs for the full dashboard API flow.

Local development

# Gateway
cd molar-platform/apps/llm-gateway && pnpm dev   # :8787

# Cartographer workers
export MOLAR_LLM_BASE_URL=http://localhost:8787
export MOLAR_LLM_SERVICE_KEY=dev-molar-llm-service-key
export BFF_JWT_SECRET=dev-bff-jwt-secret-min-32-characters-long

A full Cartographer self-host stack (docker compose up in the bundle) starts LiteLLM proxy (:4000), LLM gateway (:8787), and workers together.

Dashboard visibility

Settings and health probes surface gateway status:

GET /api/v1/llm/metrics — latency and error rates (dashboard services health)
Spend against debugger budget on Billing → Usage

Spend caps & errors

Status	Meaning
`402 org LLM spend cap exceeded`	Hosted-model budget exhausted
Fix	Add BYOK integration or raise cap in billing settings

BYOK routes bypass the org hosted spend cap but still count toward your provider bill.

BYOK
Billing at app.molar.it
Agent comms — OTP LLM fallback uses separate OpenAI key
Environment variables

LLM gateway

LLM gateway

Why a gateway

Capability keys

Routing configuration

OpenAI-compatible API

Browser auth path

Local development

Dashboard visibility

Spend caps & errors

Related