Claude Code on Google Vertex AI
Learn about configuring Claude Code through Google Vertex AI, including setup, IAM configuration, and troubleshooting.
Prerequisites
Before configuring Claude Code with Vertex AI, ensure you have:
- A Google Cloud Platform (GCP) account with billing enabled
- A GCP project with Vertex AI API enabled
- Access to desired Claude models (e.g., Claude Sonnet 4)
- Google Cloud SDK (
gcloud
) installed and configured - Quota allocated in desired GCP region
Vertex AI may not support the Claude Code default models on non-us-east5
regions. Ensure you are using us-east5
and have quota allocated, or switch to supported models.
Setup
1. Enable Vertex AI API
Enable the Vertex AI API in your GCP project:
2. Request model access
Request access to Claude models in Vertex AI:
- Navigate to the Vertex AI Model Garden
- Search for “Claude” models
- Request access to desired Claude models (e.g., Claude Sonnet 4)
- Wait for approval (may take 24-48 hours)
3. Configure GCP credentials
Claude Code uses standard Google Cloud authentication.
For more information, see Google Cloud authentication documentation.
4. Configure Claude Code
Set the following environment variables:
For heightened rate limits and prompt caching enablement, contact Google Cloud support. Once enabled, remove the DISABLE_PROMPT_CACHING
setting.
5. Model configuration
Claude Code uses these default models for Vertex AI:
Model type | Default value |
---|---|
Primary model | claude-sonnet-4@20250514 |
Small/fast model | claude-3-5-haiku@20241022 |
To customize models:
IAM configuration
Grant the required IAM roles for Claude Code.
For details, see Vertex IAM documentation.
We recommend creating a dedicated GCP project for Claude Code to simplify cost tracking and access control.
Troubleshooting
If you encounter quota issues:
- Check current quotas or request quota increase through Cloud Console
If you encounter “model not found” 404 errors:
- Verify you have access to the specified region
- Confirm model is Enabled in Model Garden
If you encounter 429 errors:
- Ensure the primary model and small/fast model are supported in your selected region