Obtaining a List of Claude Models Using the Anthropic API
By using the Anthropic API's /v1/models endpoint, you can programmatically retrieve a list of available Claude models along with their specifications.
By using the Anthropic API's /v1/models endpoint, you can programmatically retrieve a list of available Claude models along with their specifications.
Claude Code is Anthropic's official coding agent, but by setting environment variables you can use other models such as DeepSeek V4 Pro via OpenRouter. This article explains how to configure that.
When using Claude via API, you have more than two options: in addition to calling the Anthropic API directly, you can also use it via AWS Bedrock, Google Vertex AI, or Microsoft Azure (Azure AI Foundry). Base pricing is the same across all routes, but there are differences in batch processing and cloud ecosystem integration.
Unit: USD / 1M tokens (MTok). Information as of March 2026.
| Model | Type | Anthropic API | Bedrock | Vertex AI | Azure |
|---|---|---|---|---|---|
| Claude Opus 4.6 | Input | $5.00 | $5.00 | $5.00 | $5.00 |
| Output | $25.00 | $25.00 | $25.00 | $25.00 | |
| Claude Sonnet 4.6 | Input | $3.00 | $3.00 | $3.00 | $3.00 |
| Output | $15.00 | $15.00 | $15.00 | $15.00 | |
| Claude Haiku 4.5 | Input | $1.00 | $1.00 | $1.00 | $1.00 |
| Output | $5.00 | $5.00 | $5.00 | $5.00 | |
| Claude Sonnet 4.5 | Input | $3.00 | $3.00 | $3.00 | $3.00 |
| Output | $15.00 | $15.00 | $15.00 | $15.00 |
Base pricing is identical across all routes.
Note that Vertex AI regional endpoints carry a 10% surcharge over global endpoint pricing. Bedrock offers Long Context variants as separate SKUs at the same price; on the Anthropic API, Long Context is integrated into the standard models.
Prompt Caching rates are also identical across all routes.
| Model | Cache Type | Anthropic API | Bedrock | Vertex AI | Azure |
|---|---|---|---|---|---|
| Claude Opus 4.6 | 5-min cache write | $6.25 | $6.25 | $6.25 | $6.25 |
| 1-hour cache write | $10.00 | $10.00 | $10.00 | $10.00 | |
| Cache read | $0.50 | $0.50 | $0.50 | $0.50 | |
| Claude Sonnet 4.6 | 5-min cache write | $3.75 | $3.75 | $3.75 | $3.75 |
| 1-hour cache write | $6.00 | $6.00 | $6.00 | $6.00 | |
| Cache read | $0.30 | $0.30 | $0.30 | $0.30 | |
| Claude Haiku 4.5 | 5-min cache write | $1.25 | $1.25 | $1.25 | $1.25 |
| 1-hour cache write | $2.00 | $2.00 | $2.00 | $2.00 | |
| Cache read | $0.10 | $0.10 | $0.10 | $0.10 |
Cache writes come in two TTL tiers: 5-minute (short-term) and 1-hour (long-term). Longer TTL means higher write cost, but for applications with lengthy system prompts that are read repeatedly, the savings on read pricing more than compensate.
Bedrock, Vertex AI, and the Anthropic API all offer an asynchronous batch API at 50% off on-demand pricing. Azure does not explicitly list batch pricing at this time.
| Model | Batch Input | Batch Output |
|---|---|---|
| Claude Opus 4.6 | $2.50 | $12.50 |
| Claude Sonnet 4.6 | $1.50 | $7.50 |
| Claude Haiku 4.5 | $0.50 | $2.50 |
| Claude Sonnet 4.5 | $1.50 | $7.50 |
For large-scale batch workloads (log analysis, embedding generation, etc.), any of these routes can cut costs in half.
| Feature | Anthropic API | Bedrock | Vertex AI | Azure |
|---|---|---|---|---|
| Base pricing | Same | Same | Same | Same |
| Regional surcharge | — | — | +10% (regional) | — |
| Batch processing (50% off) | ○ | ○ | ○ | Not listed |
| Tokyo region | — | ○ | ○ | — |
| IAM / audit log integration | — | AWS | Google Cloud | Azure |
| VPC / PrivateLink | — | ○ | ○ | ○ |
| Billing integration | Anthropic direct | AWS | Google Cloud | Azure |
| New feature rollout speed | Fastest | Delayed | Delayed | Delayed |
New features (such as Extended Thinking) roll out to the Anthropic API first; Vertex AI, Bedrock, and Azure typically follow weeks later.
Adding "skills" to an AI agent lets you extend its capabilities, just like installing a plugin for an app. This article explains how Agent Skills work and what an agent actually does internally when using them.
First, an AI agent is an AI program that receives instructions and autonomously completes tasks.
Unlike a simple AI that just answers questions (like ChatGPT in basic use), an agent can:
Agent Skills is a mechanism for giving agents new abilities and domain knowledge.
Think of it like handing a new employee a work manual. Once the agent reads the manual (the skill), it understands how to approach that task correctly.
Without skills: "Write a blog post" → Agent writes something generic
With skills: "Write a blog post" → Agent follows the manual and produces consistent, quality output
Skills are primarily written as Markdown files (SKILL.md) and can include:
AI agents are extremely capable, but they don't know anything specific about your project.
For example:
Without skills, agents can't know any of this. Skills let agents understand "the right way to do things" before acting.
Let's look at what's happening inside the agent.
Here are the key points:
The agent reads the skill at the start. The skill content is passed as part of the LLM's input (prompt). The LLM reads this and understands "the right approach for this task."
Based on the instructions, the LLM breaks the task into smaller steps: "Read 3 existing posts first," "then decide on a filename," "then write the frontmatter," and so on.
At each step, the agent calls tools as needed — reading files, searching the web, executing code — following the procedure defined by the skill.
Tool results are passed back to the LLM. The LLM looks at the results and decides what to do next, looping until the task is complete.
Skills can be invoked as slash commands (/command-name).
When a command is called, the corresponding Markdown file's content is expanded as a prompt, and the agent begins executing those steps.
The Agent Skills format was developed and open-sourced by Anthropic and is now supported by many tools:
| Tool | Supported |
|---|---|
| Claude Code | ✅ |
| GitHub Copilot | ✅ |
| Cursor | ✅ |
| Gemini CLI | ✅ |
| OpenAI Codex | ✅ |
| VS Code | ✅ |
The biggest advantage is that the same skill can be reused across different tools.
SKILL.md)With skills, you no longer have to explain the same things to your AI every time — agents can perform tasks with consistent quality, exactly the way you want.