⚠️ Disclaimer: All code examples in this post are provided for illustrative and educational purposes only. They are not intended for direct use in production environments without thorough review, testing, and validation against your specific security, compliance, and operational requirements. API parameter names and structures shown may differ from the actual service API—always consult the Amazon Bedrock AgentCore API Reference for the latest specifications. Always conduct your own testing before deploying to production.

AgentCore Gateway Deep Dive: Architecture, Interceptors, and Semantic Discovery

📅 2026-03-01📖 ~30 min readAmazon BedrockAgentCoreMCPAI Agents

Amazon Bedrock AgentCore Gateway is a fully managed service that provides a unified MCP endpoint for AI agents to discover and securely access tools. This deep dive explores its architecture, interceptor system, semantic tool discovery, and enterprise patterns — everything you need to evaluate and adopt Gateway for production agent systems.

📖 Table of Contents

Service Overview & Architecture
MCP Server Hosting on Gateway
Gateway Targets
Semantic Tool Discovery
Agent-to-Agent Communication
Integration with AgentCore Identity
Security & Access Control
Interceptors — Deep Dive
SDK & API Reference
Best Practices & Design Patterns
Real-World Use Cases
Troubleshooting
Quotas, Limits & Pricing
Comparison & Positioning
Key Links & Resources

1. Service Overview & Architecture

What is AgentCore Gateway?

AgentCore Gateway is a fully managed, serverless service within Amazon Bedrock AgentCore that provides a unified MCP (Model Context Protocol) endpoint for AI agents to discover and securely access tools. It transforms existing APIs, Lambda functions, and MCP servers into MCP-compatible tools with built-in authentication, credential management, and semantic search.

Core Value Proposition: Agents need tools. Enterprises have APIs, Lambda functions, and services scattered across their infrastructure with different auth mechanisms. Gateway provides a single, secure endpoint where agents can discover and invoke any of these tools using the standardized MCP protocol — eliminating months of custom integration work.

Source: Amazon Bedrock AgentCore Gateway Developer Guide

Positioning Within AgentCore

AgentCore consists of 9 modular services that can be used together or independently:

Service	Purpose	Relationship to Gateway
Runtime	Hosts agent containers in Firecracker microVMs	Agents running in Runtime connect to Gateway for tools
Gateway	Unified MCP endpoint for tool access	This service
Identity	Workload identities, OAuth, JWT auth	Provides auth tokens Gateway validates
Memory	Short-term events, long-term semantic memory	Independent — agents use both Memory and Gateway
Policy	Cedar policy enforcement	Can enforce policies on Gateway tool calls
Code Interpreter	Sandboxed code execution	Can be exposed as a tool through Gateway
Browser	Headless browser sessions	Can be exposed as a tool through Gateway
Observability	CloudWatch, OpenTelemetry, X-Ray	Monitors Gateway operations
Evaluations	Agent quality assessment	Independent evaluation pipeline

Source: Introducing Amazon Bedrock AgentCore Gateway

Six Core Architectural Functions

Gateway serves six fundamental roles:

1. Security Guard — Manages OAuth authorization; ensures only valid users and agents can access tools. Dual-layer: inbound (who calls the gateway) and outbound (how gateway calls backends).

2. Translation — Converts agent requests from MCP protocol into API requests and Lambda invocations. Eliminates the need for agents to understand each backend’s native protocol.

3. Composition — Combines multiple APIs, Lambda functions, and tools into a single MCP endpoint. Agents see one unified tool catalog.

4. Secure Credential Exchange — Handles credential injection for each tool target. Agents never see or manage backend credentials directly.

5. Semantic Tool Selection — Vector embedding-based search across available tools. Reduces prompt size and latency by up to 3x.

6. Infrastructure Manager — Fully serverless with automatic scaling, built-in observability and auditing.

Source: Transform your MCP architecture: Unite MCP servers through AgentCore Gateway

Architecture Overview

🚀 AgentCore Gateway Architecture

                                    ┌─────────────────────────────┐
                                    │     AgentCore Gateway       │
                                    │                             │
┌──────────┐   MCP/JWT or IAM       │  ┌───────────────────────┐  │    ┌──────────────┐
│  Agent   │ ─────────────────────►│  │  Inbound Auth         │  │───►│ Lambda Fn A  │
│ (Runtime │   tools/list           │  │  (JWT / IAM / None)   │  │    └──────────────┘
│  or      │   tools/call           │  └───────────────────────┘  │
│  External│   search               │  ┌───────────────────────┐  │    ┌──────────────┐
│  Client) │                        │  │  Request Interceptor  │  │───►│ OpenAPI Svc  │
└──────────┘                        │  │  (optional Lambda)    │  │    └──────────────┘
                                    │  └───────────────────────┘  │
                                    │  ┌───────────────────────┐  │    ┌──────────────┐
                                    │  │  Tool Router /        │  │───►│ MCP Server   │
                                    │  │  Semantic Search      │  │    └──────────────┘
                                    │  └───────────────────────┘  │
                                    │  ┌───────────────────────┐  │    ┌──────────────┐
                                    │  │  Outbound Auth        │  │───►│ API Gateway  │
                                    │  │  (IAM/OAuth/API Key)  │  │    └──────────────┘
                                    │  └───────────────────────┘  │
                                    │  ┌───────────────────────┐  │    ┌──────────────┐
                                    │  │  Response Interceptor │  │───►│ Smithy API   │
                                    │  │  (optional Lambda)    │  │    └──────────────┘
                                    │  └───────────────────────┘  │
                                    └─────────────────────────────┘

Supported Protocols: MCP versions 2025-06-18 and 2025-03-26 over Streamable HTTP. A2A (Agent-to-Agent) communication is also supported.

Available in 15 AWS Regions: us-east-1, us-east-2, us-west-2, ap-south-1, ap-southeast-1, ap-southeast-2, ap-northeast-1, ap-northeast-2, ca-central-1, eu-central-1, eu-west-1, eu-west-2, eu-west-3, eu-north-1, sa-east-1.

2. MCP Server Hosting on Gateway

Gateway speaks the MCP protocol natively. When an agent connects, it communicates using standard MCP JSON-RPC operations over HTTP.

Gateway Endpoint Formats

endpoint

https://{gateway-id}.gateway.bedrock-agentcore.{region}.amazonaws.com/mcp
https://bedrock-agentcore-gateway.{region}.amazonaws.com/gateways/{gateway-id}/mcp

Three Core MCP Operations

List Tools (tools/list)

python

import requests

GATEWAY_URL = "https://{gateway-id}.gateway.bedrock-agentcore.{region}.amazonaws.com/mcp"

response = requests.post(GATEWAY_URL, json={
    "jsonrpc": "2.0",
    "method": "tools/list",
    "id": "1"
}, headers={"Authorization": f"Bearer {token}"})

Call Tool (tools/call)

python

response = requests.post(GATEWAY_URL, json={
    "jsonrpc": "2.0",
    "method": "tools/call",
    "id": "2",
    "params": {
        "name": "weather_tool___get_weather",  # {target_name}___{tool_name}
        "arguments": {"city": "Seattle"}
    }
}, headers={"Authorization": f"Bearer {token}"})

Semantic Search (tools/call with built-in search tool)

python

response = requests.post(GATEWAY_URL, json={
    "jsonrpc": "2.0",
    "method": "tools/call",
    "id": "3",
    "params": {
        "name": "x_amz_bedrock_agentcore_search",
        "arguments": {"query": "how to check order status"}
    }
}, headers={"Authorization": f"Bearer {token}"})

Source: Using MCP semantic search with AgentCore Gateway

Tool Naming Convention

Tools are namespaced as {target_name}___{tool_name} (three underscores). For example, a target named weather_tool with tool get_weather becomes weather_tool___get_weather.

Connecting MCP Clients

Gateway works with any MCP-compatible client. Here are common integration patterns:

python — Strands Agents

from strands import Agent
from strands.tools.mcp import MCPClient
from mcp.client.streamable_http import streamablehttp_client

def create_gateway_client(access_token, gateway_url):
    return MCPClient(
        lambda: streamablehttp_client(
            url=gateway_url,
            headers={"Authorization": f"Bearer {access_token}"}
        ),
        prefix="gateway"
    )

agent = Agent(tools=[create_gateway_client(token, gateway_url)])
result = agent("Check the weather in Seattle")

python — LangGraph + MCP Adapters

from langchain_mcp_adapters.client import MultiServerMCPClient
from langgraph.prebuilt import create_react_agent

mcp_client = MultiServerMCPClient({
    "gateway": {
        "transport": "streamable_http",
        "url": gateway_url,
        "headers": {"Authorization": f"Bearer {access_token}"}
    }
})
tools = await mcp_client.get_tools()
graph = create_react_agent(model=bedrock_model, tools=tools)

Source: Amazon Bedrock AgentCore Samples — Tutorial 04

Gateway retrieves tools from MCP server targets in batches of 100, supporting up to 10,000 tools per target. Use the SynchronizeGatewayTargets API to force refresh.

3. Gateway Targets

Target Type	Source	Use Case
Lambda	AWS Lambda function ARN	Custom business logic, database queries, internal APIs
OpenAPI	OpenAPI/Swagger spec	Expose any REST API as MCP tools
Smithy	Smithy model definition	AWS-style API models
MCP Server	External MCP server URL	Connect existing MCP servers
API Gateway	REST API stage ARN	Expose API Gateway endpoints

Pre-built 1-click integrations with automatic OAuth: Salesforce, Slack, Jira, Asana, Zendesk.

Lambda Handler Pattern

When Gateway invokes a Lambda target, it passes the tool name via context.client_context.custom['bedrockAgentCoreToolName'] and arguments directly in the event:

python

def handler(event, context):
    delimiter = "___"
    original_tool_name = context.client_context.custom["bedrockAgentCoreToolName"]
    if delimiter in original_tool_name:
        tool_name = original_tool_name[original_tool_name.index(delimiter) + len(delimiter):]
    else:
        tool_name = original_tool_name

    if tool_name == "get_weather":
        city = event.get("city", "Unknown")
        return {"content": [{"type": "text", "text": f"Weather in {city}: 72F, Sunny"}]}

    return {"error": f"Unknown tool: {tool_name}"}

Source: Amazon Bedrock AgentCore Samples — Tutorial 01

Credential Provider Types

Type	Use Case
GATEWAY_IAM_ROLE	Lambda targets, AWS services (uses gateway’s own execution role)
API_KEY	REST APIs with API key auth (stored in Token Vault)
OAUTH	OAuth2-protected services (client credentials or auth code grant)

Gateway supports 25+ OAuth2 vendors including Google, GitHub, Slack, Salesforce, Microsoft, Atlassian, Okta, Auth0, Cognito, and more.

4. Semantic Tool Discovery

The Tool Overload Problem

As enterprise tool catalogs grow, agents face a fundamental challenge: tool overload. When an agent’s system prompt includes descriptions for hundreds or thousands of tools:

Hallucination: LLMs may hallucinate tool names or parameters when overwhelmed
Wrong tool selection: The model may select a semantically similar but incorrect tool
Prompt bloat: Including all tool descriptions inflates prompt size, increasing cost and latency
Context window limits: Large tool catalogs may exceed the model’s effective context window

Semantic search solves this by letting agents dynamically discover relevant tools at runtime.

Source: Transform your MCP architecture: Unite MCP servers through AgentCore Gateway

The Built-In Search Tool: x_amz_bedrock_agentcore_search

When semantic search is enabled, a built-in tool called x_amz_bedrock_agentcore_search becomes accessible via standard MCP tools/call. This tool:

Takes a natural language query as input
Returns a ranked list of tools matching the query
Uses vector embeddings generated from tool names and descriptions
Is accessible through the same MCP endpoint as all other tools

python

response = requests.post(GATEWAY_URL, json={
    "jsonrpc": "2.0",
    "method": "tools/call",
    "id": "1",
    "params": {
        "name": "x_amz_bedrock_agentcore_search",
        "arguments": {"query": "how to check order status"}
    }
}, headers={"Authorization": f"Bearer {token}"})

Benefits

Reduced prompt size: Instead of listing all 10,000 tools in the system prompt, agents search dynamically
Up to 3x latency improvement: Less context = faster LLM inference
Natural discovery: Agents find tools even without knowing exact names
Reduced hallucination: Agents only see relevant tools, reducing wrong selections
Scalability: Works with thousands of tools without performance degradation

Source: Using MCP semantic search with AgentCore Gateway

5. Agent-to-Agent (A2A) Communication

Gateway supports A2A patterns where one agent’s capabilities are exposed as tools for other agents through MCP. An agent running on AgentCore Runtime can be registered as a Gateway target:

🚀 Agents-as-Tools Pattern

Agent A ──► Gateway ──► Agent B (MCP server on Runtime)
                   ──► Agent C (MCP server on Runtime)

This enables hierarchical multi-agent patterns where a coordinator delegates to specialized sub-agents through Gateway. MCP session headers (mcpSessionId, mcpProtocolVersion) track conversations.

Scenario	Use
Agent needs to call a tool/API	MCP via Gateway
Agent needs to delegate a complex task	A2A via Gateway
Orchestrating multiple specialist agents	A2A with agents-as-tools pattern
Simple request-response to a backend	MCP via Gateway targets

Source: Amazon Bedrock AgentCore Samples — Tutorial 12

6. Integration with AgentCore Identity

AgentCore Identity provides Workload Identities (digital identities for agents), a Token Vault (encrypted storage for credentials), OAuth2 Credential Providers, and API Key Credential Providers.

Inbound Authentication

Authorizer Type	Use Case	Details
CUSTOM_JWT	Production (recommended)	Validates JWT from OIDC providers (Cognito). Supports allowedClients, allowedAudience, allowedScopes, customClaims with EQUALS/CONTAINS/CONTAINS_ANY operators.
AWS_IAM	AWS-internal agents	SigV4 signing. Best for agents with existing IAM roles.
NONE	Testing only	No authentication. Never use in production.

Outbound Authentication

Method	Use Case
IAM Role	Lambda, Smithy, AWS services — uses Gateway’s execution role
API Key	REST APIs — stored in Token Vault, injected in header/query
OAuth 2.0	OAuth-protected services — client credentials or auth code grant

User identity is propagated through Gateway via the X-Amzn-Bedrock-AgentCore-Runtime-User-Id header. Identity has no additional charges when used through Runtime or Gateway.

Source: Amazon Bedrock AgentCore Gateway Developer Guide

7. Security & Access Control

IAM Permissions for Gateway Role

The Gateway execution role (service principal: bedrock-agentcore.amazonaws.com) needs these permissions depending on configuration:

Lambda targets: lambda:InvokeFunction
Runtime targets: bedrock-agentcore:InvokeRuntime, bedrock-agentcore:InvokeRuntimeWithResponseStream
Logging: logs:CreateLogGroup, logs:CreateLogStream, logs:PutLogEvents
Credential providers: bedrock-agentcore:GetOAuth2CredentialProvider, secretsmanager:GetSecretValue, etc.
Semantic search: bedrock:InvokeModel

Policy Engine (Cedar)

Gateway integrates with AgentCore Policy for declarative access control using Cedar. Two modes: LOG_ONLY (audit without blocking) and ENFORCE (block violations).

Multi-Tenant Access Pattern

🚀 Multi-Tenant Tool Access

User A (group: finance)     ─► Gateway ─► Interceptor ─► Finance tools only
User B (group: engineering) ─► Gateway ─► Interceptor ─► Engineering tools only
User C (group: admin)       ─► Gateway ─► Interceptor ─► All tools

Source: Apply fine-grained access control with Bedrock AgentCore Gateway interceptors

8. Interceptors — Deep Dive

Interceptors are Lambda functions that run before (REQUEST) or after (RESPONSE) tool invocations. They enable powerful middleware-like capabilities.

Source: AgentCore Gateway Interceptors

Interceptor Flow

🚀 Interceptor Execution Flow

Agent ─► Inbound Auth ─► REQUEST Interceptor ─► Target ─► RESPONSE Interceptor ─► Agent

Constraints: Maximum 1 interceptor per type (1 REQUEST + 1 RESPONSE = 2 total). Lambda-only.

REQUEST Interceptor Input Payload

json

{
  "interceptorInputVersion": "1.0",
  "mcp": {
    "rawGatewayRequest": {
      "body": "<raw_request_body>"
    },
    "gatewayRequest": {
      "path": "/mcp",
      "httpMethod": "POST",
      "headers": {
        "Authorization": "Bearer <jwt_token>",
        "Content-Type": "application/json"
      },
      "body": {
        "jsonrpc": "2.0",
        "id": 1,
        "method": "tools/list"
      }
    }
  }
}

Source: AgentCore Gateway Interceptor Types

REQUEST Interceptor Output — Key Behavior

🚨 Critical detail: If the REQUEST interceptor output contains a transformedGatewayResponse, Gateway responds immediately without invoking the target (short-circuit). If both REQUEST and RESPONSE interceptors exist and REQUEST short-circuits, the RESPONSE interceptor still runs.

Two possible outcomes:

Pass-through: Return modified gateway request, forwarded to target
Short-circuit: Return transformedGatewayResponse, Gateway responds immediately

RESPONSE Interceptor Input

The RESPONSE interceptor receives both the original request and the target’s response, enabling context-aware decisions:

json

{
  "interceptorInputVersion": "1.0",
  "mcp": {
    "gatewayRequest": {
      "path": "/mcp",
      "httpMethod": "POST",
      "headers": { ... },
      "body": { "jsonrpc": "2.0", "id": 1, "method": "tools/call", "params": { ... } }
    },
    "gatewayResponse": {
      "statusCode": 200,
      "headers": { ... },
      "body": {
        "jsonrpc": "2.0",
        "id": 1,
        "result": { "content": [{"type": "text", "text": "..."}] }
      }
    }
  }
}

Source: AgentCore Gateway Interceptor Types

Real-World Interceptor Use Cases

1. Fine-Grained Access Control (JWT Claims)

The REQUEST interceptor decodes the JWT, extracts user claims, and determines authorization. Unauthorized requests are short-circuited:

python

def handler(event, context):
    headers = event["mcp"]["gatewayRequest"]["headers"]
    body = event["mcp"]["gatewayRequest"]["body"]
    user_groups = extract_groups_from_jwt(headers.get("Authorization", ""))
    tool_name = body.get("params", {}).get("name", "")
    target_name = tool_name.split("___")[0] if "___" in tool_name else tool_name

    if not is_authorized(target_name, user_groups):
        return {"transformedGatewayResponse": {
            "statusCode": 403,
            "body": {"error": "Access denied"}
        }}
    return event["mcp"]["gatewayRequest"]

Source: Apply fine-grained access control with Gateway interceptors

2. Dynamic Tool Filtering

The RESPONSE interceptor filters tools/list results based on user permissions, preventing unauthorized tools from even appearing in the catalog.

3. Schema Translation

REQUEST interceptor transforms MCP requests into downstream-compatible formats. RESPONSE interceptor converts responses back. Useful for legacy API integration.

4. PII/SPI Data Redaction

RESPONSE interceptor inspects tool responses and masks sensitive data (credit cards, SSNs, emails) before returning to the agent, preventing data leaks into agent context.

5. Tenant Isolation for Multi-Tenant SaaS

REQUEST interceptor extracts tenant ID from JWT claims and injects it into downstream requests, ensuring data isolation across tenants.

6. Custom Header Propagation

Interceptor enriches downstream requests with identity headers (X-User-Id, X-User-Groups), correlation IDs for tracing, and tenant-specific configuration.

7. Token Exchange (Act-on-Behalf)

AWS recommends the act-on-behalf pattern over impersonation: the agent uses its own identity but includes user context so downstream services know who the action is for. The REQUEST interceptor handles token exchange.

Sources: AgentCore Samples Tutorials 07-10, 14-15

Tutorial Coverage

#	Tutorial	Pattern
07	Bearer token injection	Auth token propagation
08	Custom header propagation	Identity context management
09	Fine-grained access control	RBAC based on JWT claims
10	Sensitive data masking	PII/SPI redaction
14	Token exchange at request interceptor	Act-on-behalf pattern
15	Prevent SQL injection	Input validation

9. SDK & API Reference

Control Plane vs Data Plane

API	Endpoint	Example Operations
Control Plane	`bedrock-agentcore-control.{region}.amazonaws.com`	CreateGateway, ListGateways, CreateGatewayTarget
Data Plane	`bedrock-agentcore.{region}.amazonaws.com`	GetWorkloadAccessToken, GetResourceOauth2Token

boto3 Example

python

import boto3

control = boto3.client("bedrock-agentcore-control", region_name="us-west-2")

# Create Gateway
response = control.create_gateway(
    name="my_gateway",
    protocolType="MCP",
    roleArn=role_arn,
    authorizerType="CUSTOM_JWT",
    authorizerConfiguration={
        "customJwtAuthorizer": {
            "discoveryUrl": discovery_url,
            "allowedClients": [client_id],
        }
    },
    protocolConfiguration={
        "mcp": {"supportedVersions": ["2025-03-26"], "searchType": "SEMANTIC"}
    }
)

# Create Target
control.create_gateway_target(
    gatewayIdentifier=response["gatewayId"],
    name="weather_tools",
    targetConfiguration={
        "mcp": {
            "lambda": {
                "lambdaArn": lambda_arn,
                "toolSchema": {"inlinePayload": tools_json}
            }
        }
    },
    credentialProviderConfigurations=[{"credentialProviderType": "GATEWAY_IAM_ROLE"}]
)

Source: Amazon Bedrock AgentCore API Reference

Resource naming: Pattern ^[a-zA-Z][a-zA-Z0-9_]{0,47}$. Must start with a letter. No hyphens — use underscores. Maximum 48 characters.

10. Best Practices & Design Patterns

Architecture Patterns

Pattern 1: Hub-and-Spoke (Most Common) — Single Gateway as central hub. All agents connect to one Gateway that routes to multiple targets. Best for: most production scenarios.

Pattern 2: Agent-as-Tool (Hierarchical Multi-Agent) — Coordinator agent delegates to specialist agents registered as Gateway targets. Best for: complex tasks requiring multiple specialized agents.

Pattern 3: Per-Domain Gateways — Separate Gateway per business domain (finance, engineering). Best for: strict domain isolation and regulatory compliance.

Pattern 4: Gateway + Interceptors for Multi-Tenant — Shared Gateway with request interceptor for tenant isolation. Best for: SaaS platforms.

Source: Transform your MCP architecture: Unite MCP servers through AgentCore Gateway

Key Recommendations

Domain-driven target organization: Group tools by business domain, use meaningful names like crm_tools not lambda_1
Never hardcode credentials: Use Token Vault for OAuth tokens and API keys
Enable semantic search to reduce prompt size (up to 3x latency improvement)
Batch tool schemas: Group related tools in a single Lambda target
Use DEBUG exception level during development, remove in production
Return structured errors from Lambda handlers: {"error": "message"}

11. Real-World Use Cases

Official Tutorials (15 Gateway Tutorials)

#	Tutorial	Description
01	Transform Lambda into MCP tools	Convert existing Lambda functions to agent tools
02	Transform APIs into MCP tools	Expose REST APIs via OpenAPI specs
03	Search tools	Semantic tool discovery
04	Integration patterns	Framework integration (Strands, LangGraph)
05	MCP Server as a target	Connect external MCP servers
06	Gateway observability	Monitoring and logging setup
07-10	Interceptor patterns	Auth, headers, RBAC, data masking
11	API Gateway as a target	Route through AWS API Gateway
12	Agents as tools using MCP	Multi-agent composition
13-15	Advanced auth patterns	OAuth code grant, token exchange, SQL injection prevention

Source: Amazon Bedrock AgentCore Samples

Problems Gateway Solves

Tool Sprawl: Dozens of APIs with different auth → unified endpoint
Auth Complexity: Different credentials per backend → Token Vault manages all
Tool Discovery at Scale: 1000+ tools → semantic search finds relevant ones
Protocol Translation: REST/GraphQL/custom protocols → automatic MCP translation
Enterprise Governance: Audit trails, access control, data masking → Gateway + Policy + Interceptors

12. Common Issues & Troubleshooting

Issue	Symptom	Fix
MCP version mismatch	Gateway rejects requests	Use versions 2025-06-18 or 2025-03-26
Auth errors (401/403)	Rejected requests	Check JWT expiry, discovery URL, allowedClients, audience
Missing tools	tools/list returns empty	Call SynchronizeGatewayTargets, wait for READY status
Lambda errors	tools/call fails	Check tool name format (3 underscores), return format, IAM permissions
Name validation	ValidationException	Names must match ^[a-zA-Z][a-zA-Z0-9_]{0,47}$, no hyphens
Semantic search empty	No search results	Enable SEMANTIC searchType, grant bedrock:InvokeModel, sync targets
boto3 unknown service	UnknownServiceError	pip install –upgrade boto3 botocore

Source: Amazon Bedrock AgentCore Gateway Developer Guide

13. Quotas, Limits & Pricing

Resource	Limit
Tools per target (MCP Server)	10,000
Tool retrieval batch size	100
Interceptors per Gateway	2 (max 1 REQUEST + 1 RESPONSE)
Credential providers per target	1 (exactly one required)
Resource name length	48 characters max
Tags per resource	50 max

Pricing: Consumption-based with no upfront commitments. Billed per MCP operation, per semantic search query, and per tool indexed. Identity has no additional charges when used through Gateway.

Source: Amazon Bedrock AgentCore Pricing

14. Comparison & Positioning

Gateway vs Self-Hosted MCP Servers

Aspect	AgentCore Gateway	Self-Hosted MCP
Infrastructure	Fully managed, serverless	You manage servers
Scaling	Automatic	Manual
Auth	Built-in JWT/IAM + Token Vault	You implement
Tool discovery	Semantic search built-in	Not available
Protocol translation	OpenAPI/Lambda → MCP automatic	You implement
Cost	Pay-per-call	Always-on infra costs

When Gateway is Essential

Multiple backend systems with different auth mechanisms
Enterprise governance (audit, access control, data masking)
Large tool catalogs (100+ tools) requiring semantic discovery
Multi-tenant SaaS with per-tenant tool access
OAuth-heavy integrations (Salesforce, Slack, Jira)

When Gateway is Overkill

Single MCP server with a few tools
Prototype/hackathon — just connect MCP client directly
Non-AWS infrastructure
Single-agent, single-tool scenarios

15. Key Links & Resources

Official Documentation

AWS Blog Posts

GitHub Repositories

Authentication Flow

🚀 End-to-End Authentication Flow

Agent ── 1. client_credentials ──► Cognito (IdP)
Agent ◄── 2. access_token ──────── Cognito
Agent ── 3. MCP tools/call ──────► Gateway
                                   4. Validate JWT (via discovery URL)
                                   Gateway ── 5. Get outbound creds ──► Token Vault
                                   Gateway ◄── 6. OAuth/API key ──── Token Vault
                                   Gateway ── 7. Call backend ──────► Target
                                   Gateway ◄── 8. Response ────────── Target
Agent ◄── 9. MCP response ──────── Gateway

This article is based on publicly available AWS documentation, official GitHub repositories, and AWS blog posts. All sources are cited inline. Last updated: 2026-03-01.

📝 Note: This blog post represents my personal views and experiences and does not represent the views of my employer. Any recommendations or architectural patterns discussed are based on publicly available documentation and my own analysis.

AgentCore Gateway Deep Dive: Architecture, Interceptors, and Semantic Discovery

1. Service Overview & Architecture

What is AgentCore Gateway?

Positioning Within AgentCore

Six Core Architectural Functions

Architecture Overview

2. MCP Server Hosting on Gateway

Gateway Endpoint Formats

Three Core MCP Operations

List Tools (tools/list)

Call Tool (tools/call)

Semantic Search (tools/call with built-in search tool)

Tool Naming Convention

Connecting MCP Clients

3. Gateway Targets

Lambda Handler Pattern

Credential Provider Types

4. Semantic Tool Discovery

The Tool Overload Problem

The Built-In Search Tool: x_amz_bedrock_agentcore_search

Benefits

5. Agent-to-Agent (A2A) Communication

6. Integration with AgentCore Identity

Inbound Authentication

Outbound Authentication

7. Security & Access Control

IAM Permissions for Gateway Role

Policy Engine (Cedar)

Multi-Tenant Access Pattern

8. Interceptors — Deep Dive

Interceptor Flow

REQUEST Interceptor Input Payload

REQUEST Interceptor Output — Key Behavior

RESPONSE Interceptor Input

Real-World Interceptor Use Cases

1. Fine-Grained Access Control (JWT Claims)

2. Dynamic Tool Filtering

3. Schema Translation

4. PII/SPI Data Redaction

5. Tenant Isolation for Multi-Tenant SaaS

6. Custom Header Propagation

7. Token Exchange (Act-on-Behalf)

Tutorial Coverage

9. SDK & API Reference

Control Plane vs Data Plane

boto3 Example

10. Best Practices & Design Patterns

Architecture Patterns

Key Recommendations

11. Real-World Use Cases

Official Tutorials (15 Gateway Tutorials)

Problems Gateway Solves

12. Common Issues & Troubleshooting

13. Quotas, Limits & Pricing

14. Comparison & Positioning

Gateway vs Self-Hosted MCP Servers

When Gateway is Essential

When Gateway is Overkill

15. Key Links & Resources

Official Documentation

AWS Blog Posts

GitHub Repositories

Authentication Flow

Related Posts