Agents > Inference & providers

Bring Your Own API Key

Warp lets you bring your own API keys (BYOK) for OpenAI, Anthropic, and Google AI models.

Warp supports Bring Your Own API Key (BYOK) for users who want to connect Warp’s agents to their own Anthropic, OpenAI, or Google API accounts.

This lets you use your own API keys for model access, giving you control over model selection, billing, and data routing. See Model Choice for a list of supported models.

BYOK provides greater flexibility in model access and ensures Warp never consumes your AI credits for requests routed through your own keys.

How BYOK differs from custom inference endpoints and BYOLLM

Warp offers three ways to bring your own AI infrastructure. Use this table to pick the right one, and follow the links for full details.

Name	Meaning	Plans
Bring Your Own API Key (BYOK)	Use your own API key for OpenAI, Anthropic, or Google models. Keys are stored locally on your device.	Free and all eligible paid plans
Custom inference endpoint	Connect Warp to an OpenAI-compatible endpoint such as OpenRouter, LiteLLM, z.ai, or an internal gateway.	Free and all eligible paid plans
Bring Your Own LLM (BYOLLM)	Enterprise-managed inference through your cloud provider (AWS Bedrock today; Azure Foundry and Google Vertex coming soon), with Warp handling routing, orchestration, governance, and observability.	Enterprise only

See Warp pricing for current plan availability.

Platform credits apply to every cloud agent run on any plan, and to local agent runs on Business and Enterprise when using BYOK, a custom inference endpoint, or BYOLLM. See platform credits for the full breakdown.

How BYOK works

When you add your own model API keys in Warp, those keys are stored only on your device (in your OS keychain or equivalent secure storage), never on Warp’s servers. They’re used to make requests to your chosen model provider.

When you send a prompt using a model with the key icon:

Your local Warp client pulls your API key from your device’s secure storage and sends it up to Warp’s backend along with your prompt.
Warp’s agent harness, which runs on Warp’s backend, assembles the full request (system instructions, conversation context, tools) and uses your key in-flight to call your chosen model provider (Anthropic, OpenAI, or Google).
The provider’s response streams back through Warp’s backend to your client.

Your API key passes through Warp’s servers each time you send a request, but Warp never stores it there — it’s used only in-flight to call the provider, then discarded.

When a model is selected using your own key:

Warp does not consume any of your credits.
Costs are billed directly through your model provider account.
Warp does not retain or store your API key on any of its servers.

Diagram showing how Warp authenticates BYOK agent requests with your provider API key, bypassing Warp credits.

Enabling BYOK

To enable and configure your API keys:

Open Settings and search for API keys to jump to the BYOK configuration.
Add your API key(s) for Anthropic, OpenAI, or Google.
Once added, you’ll see a key icon next to supported models in the model picker.

Key icon shown next to supported models in the model picker after BYOK API keys are configured.

When you explicitly select a model with a key icon, Warp routes requests through your own API key instead of consuming Warp’s credits.

BYOK usage and billing behavior

Auto Model

Warp’s Auto models dynamically route requests across different models based on context and performance. Because this routing logic depends on Warp’s infrastructure, Auto always consumes Warp’s credits, even if you’ve configured your own API keys.

To use your own key, select a specific provider model (for example, Claude Opus 4.7, Claude Sonnet 4.6, GPT-5.5, or Gemini 3.1 Pro) directly from the model picker with a key icon.

Credit usage

When you select a model with the key icon in your model picker, Warp routes the request through your API key. In that case:

Inference is billed directly through your provider account rather than drawing from your Warp AI credits.
Agent Mode prioritizes BYOK over any available Warp credits.

Other AI features in Warp

Some AI-powered features are not affected by BYOK and are included as part of Warp’s paid plans.

Feature	Uses Warp’s credits	Description
Active AI Recommendations	No	Always included with Build and higher plans.
Codebase Context	Yes	Uses Warp’s AI infrastructure and consumes credits.
Cloud Agents	Yes	BYOK keys are stored locally and not available to cloud-hosted runs.

These features will continue to function normally regardless of whether you’ve configured BYOK.

Failover and fallback behavior

If Warp detects an issue with your API key, you’ll see a clear error message corresponding with the AI request.

If your key:

Is invalid: Warp notifies you and halts the request.
Hits usage or rate limits: Warp will not retry using credits.

You can update or replace your keys anytime by opening Settings and searching for API keys.

Failover and fallback:

By default, Warp does not fall back to your credits when a BYOK request fails.

You can choose to enable Warp credit fallback. When enabled, if an agent request fails with your BYOK model (for example, due to an API error or quota limit), Warp will automatically route the request to one of Warp’s provided models. Warp always prioritizes your API keys first and only uses Warp credits when necessary.

Setting to enable Warp credit fallback when a BYOK request fails.

Zero Data Retention (ZDR) and BYOK

Warp is SOC 2 compliant and has Zero Data Retention (ZDR) policies with all of its contracted LLM providers. No customer AI data is retained, stored, or used for training by the model providers.

BYOK prompts and responses transit Warp’s backend (see How BYOK works). Warp does not use this content for training; retention and analytics handling follow the same account-level privacy and telemetry settings that apply to Warp-billed traffic.

However, when you use your own API key:

Data retention policies on the provider side depend on your provider’s account settings.
Warp cannot enforce ZDR for requests sent through your API keys.
If your Anthropic, OpenAI, or Google account does not have ZDR enabled, your requests may be retained by the provider according to their terms.

Warp itself never stores your LLM API keys.

BYOK on Business and Enterprise plans

BYOK is configured at the user level on every plan, including Business and Enterprise:

Each team member adds and manages their own API keys locally on their device.
Centrally configured, admin-managed BYOK is not yet available — admins cannot enforce or share API keys across team members from a single place.
There is no organization-level Admin Panel for BYOK management today.

If your organization needs centrally managed model routing today, see Bring Your Own LLM for the Enterprise-managed option, or contact sales.

Custom inference endpoint — Route Warp through any OpenAI-compatible endpoint, such as OpenRouter, LiteLLM, z.ai, or an internal gateway.
Bring Your Own LLM — Enterprise-managed inference through your cloud provider or approved infrastructure.
Model Choice — Full list of supported models and model_id values.
Credits — How Warp credits work and when they’re consumed.