Bring Your Own API Key

Warp supports Bring Your Own API Key (BYOK) for users who want to connect Warp’s agents to their own Anthropic, OpenAI, or Google API accounts.

This lets you use your own API keys to access models directly, giving you full control over model selection, billing, and data routing. See Model Choice for a list of supported models.

BYOK gives you more flexibility in model access. Requests routed through your own keys are billed by your provider and do not consume your Warp AI credits.

How BYOK differs from custom inference endpoints and BYOLLM

Warp offers three ways to bring your own AI infrastructure. Use this table to pick the right one, and follow the links for full details.

| Name | Meaning | Plans |
| --- | --- | --- |
| Bring Your Own API Key (BYOK) | Use your own API key for OpenAI, Anthropic, or Google models. Keys are stored locally on your device. | Free and all eligible paid plans |
| Custom inference endpoint | Connect Warp to an OpenAI-compatible endpoint such as OpenRouter, LiteLLM, z.ai, or an internal gateway. | Free and all eligible paid plans |
| Bring Your Own LLM (BYOLLM) | Enterprise-managed inference through your cloud provider (AWS Bedrock today; Azure Foundry and Google Vertex coming soon), with Warp handling routing, orchestration, governance, and observability. | Enterprise only |

See warp.dev/pricing for current plan availability.

Platform credits apply to every cloud agent run on any plan, and to local agent runs on Business and Enterprise when using BYOK, a custom inference endpoint, or BYOLLM. See platform credits for the full breakdown.
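
The platform-credit rule above can be read as a small predicate. The sketch below only restates that one sentence; the function name, plan strings, and source labels are hypothetical and are not Warp identifiers. It returns `True` only for the cases the sentence names, so treat the platform credits page as the authority for anything else.

```python
# Illustrative only: encodes the platform-credit sentence above.
# All names and string values are hypothetical, not part of Warp's product or API.

OWN_INFRA_SOURCES = {"byok", "custom_endpoint", "byollm"}
LOCAL_PLATFORM_CREDIT_PLANS = {"business", "enterprise"}


def platform_credits_apply(run_location: str, plan: str, inference_source: str) -> bool:
    """Return True when the rule stated above says platform credits apply."""
    if run_location == "cloud":
        # Every cloud agent run, on any plan.
        return True
    if run_location == "local":
        # Local runs: only Business and Enterprise, and only when the run
        # uses BYOK, a custom inference endpoint, or BYOLLM.
        return (
            plan.lower() in LOCAL_PLATFORM_CREDIT_PLANS
            and inference_source in OWN_INFRA_SOURCES
        )
    raise ValueError(f"unknown run location: {run_location!r}")
```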

When you add your own model API keys in Warp, those keys are stored locally on your device and are never synced to the cloud.

Warp uses these API keys when routing your agent requests to the model provider you’ve configured.

When a model is selected using your own key:

  • Warp does not consume any of your credits.
  • Costs are billed directly through your model provider account.
  • Warp does not retain or store your API key on any of its servers.

Diagram showing how Warp routes BYOK agent requests directly through your provider API key, bypassing Warp credits.

To enable and configure your API keys:

  1. Open Settings and search for API keys to jump to the BYOK configuration.
  2. Add your API key(s) for Anthropic, OpenAI, or Google.
  3. Once added, you’ll see a key icon next to supported models in the model picker.

Key icon shown next to supported models in the model picker after BYOK API keys are configured.

When you explicitly select a model with a key icon, Warp routes requests through your own API key instead of consuming Warp’s credits.

Warp’s Auto models dynamically route requests across different models based on context and performance. Because this routing logic depends on Warp’s infrastructure, Auto always consumes Warp’s credits, even if you’ve configured your own API keys.

To use your own key, select a specific provider model (for example, Claude Opus 4.7, Claude Sonnet 4.6, GPT-5.5, or Gemini 3.1 Pro) directly from the model picker with a key icon.

When a request is routed through your API key:

  • Inference is billed directly through your provider account rather than drawing from your Warp AI credits.
  • Agent Mode prioritizes BYOK over any available Warp credits.
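
The selection rules above can be summarized in a few lines. This is a rough model of the documented behavior, not Warp's routing code: `resolve_billing`, the return labels, and the model-to-provider mapping are hypothetical names introduced for illustration.

```python
# Illustrative model of the documented selection rules; not Warp's implementation.
# The function name, labels, and example mapping are hypothetical.

def resolve_billing(selected_model: str, model_provider: dict, configured_keys: set) -> str:
    """Return "provider_account" or "warp_credits" for a local agent request."""
    if selected_model.lower().startswith("auto"):
        # Auto routing depends on Warp's infrastructure, so it always uses Warp credits.
        return "warp_credits"
    provider = model_provider.get(selected_model)
    if provider in configured_keys:
        # The model would show a key icon: the request goes through the user's own key.
        return "provider_account"
    return "warp_credits"


# Example mapping using model names mentioned above.
providers = {
    "Claude Sonnet 4.6": "anthropic",
    "GPT-5.5": "openai",
    "Gemini 3.1 Pro": "google",
}
keys = {"anthropic"}  # only an Anthropic key has been added

print(resolve_billing("Claude Sonnet 4.6", providers, keys))  # provider_account
print(resolve_billing("GPT-5.5", providers, keys))            # warp_credits
print(resolve_billing("Auto", providers, keys))               # warp_credits
```

In this model, adding a key never changes what Auto does; only an explicitly selected provider model can be billed to your own account.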

Other AI features in Warp

Some AI-powered features are not affected by BYOK and are included as part of Warp’s paid plans.

| Feature | Uses Warp’s credits | Description |
| --- | --- | --- |
| Active AI Recommendations | No | Always included with Build and higher plans. |
| Codebase Context | Yes | Uses Warp’s AI infrastructure and consumes credits. |
| Cloud Agents | Yes | BYOK keys are stored locally and not available to cloud-hosted runs. |

These features will continue to function normally regardless of whether you’ve configured BYOK.

If Warp detects an issue with your API key, it shows an error message on the affected AI request.

If your key:

  • Is invalid: Warp notifies you and halts the request.
  • Hits usage or rate limits: Warp does not retry using credits unless you have enabled Warp credit fallback.

You can update or replace your keys anytime by opening Settings and searching for API keys.

Failover and fallback:

By default, Warp does not fall back to your credits when a BYOK request fails.

You can choose to enable Warp credit fallback. When enabled, if an agent request fails with your BYOK model (for example, due to an API error or quota limit), Warp will automatically route the request to one of Warp’s provided models. Warp always prioritizes your API keys first and only uses Warp credits when necessary.

Setting to enable Warp credit fallback when a BYOK request fails.
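
The fallback behavior described above can be sketched as a try-first, opt-in-fallback wrapper. This is a minimal illustration, not Warp's code: `ByokError`, `run_agent_request`, and the two callables are hypothetical stand-ins. The sketch also assumes every kind of BYOK failure is treated alike, which the text above does not confirm for invalid keys.

```python
# Illustrative sketch of the documented fallback behavior; not Warp's implementation.
# ByokError and both callables are hypothetical stand-ins.

from typing import Callable


class ByokError(Exception):
    """Hypothetical error for a failed BYOK request (for example, API error or quota limit)."""


def run_agent_request(
    send_with_own_key: Callable[[], str],
    send_with_warp_credits: Callable[[], str],
    credit_fallback_enabled: bool = False,
) -> str:
    try:
        # The user's own key is always tried first.
        return send_with_own_key()
    except ByokError:
        if not credit_fallback_enabled:
            # Default: surface the error and do not spend Warp credits.
            raise
        # Opt-in: route the request to a Warp-provided model instead.
        return send_with_warp_credits()
```

Keeping the fallback off by default means a failing key can never silently spend credits; the error reaches the user instead.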

Warp is SOC 2 compliant and has Zero Data Retention (ZDR) policies with all of its contracted LLM providers. No customer AI data is retained, stored, or used for training by the model providers.

However, when you use your own API key:

  • Data retention policies depend on your provider’s account settings.
  • Warp cannot enforce ZDR for requests sent through your API keys.
  • If your Anthropic, OpenAI, or Google account does not have ZDR enabled, your requests may be retained by the provider according to their terms.

Warp itself never stores your LLM API keys.

BYOK is configured at the user level on every plan, including Business and Enterprise:

  • Each team member adds and manages their own API keys locally on their device.
  • Centrally configured, admin-managed BYOK is not yet available — admins cannot enforce or share API keys across team members from a single place.
  • There is no organization-level Admin Panel for BYOK management today.

If your organization needs centrally managed model routing today, see Bring Your Own LLM for the Enterprise-managed option, or contact sales.

  • Custom inference endpoint — Route Warp through any OpenAI-compatible endpoint, such as OpenRouter, LiteLLM, z.ai, or an internal gateway.
  • Bring Your Own LLM — Enterprise-managed inference through your cloud provider or approved infrastructure.
  • Model Choice — Full list of supported models and model_id values.
  • Credits — How Warp credits work and when they’re consumed.