July 28, 2025: The code and the steps of the demo have been updated to simplify the experience.
In just a few years, foundation models (FMs) have evolved from being used directly to create content based on a user's prompt to now powering AI agents, a new class of software applications that use FMs to reason, plan, act, learn, and adapt in pursuit of user-defined goals with limited human oversight. This new wave of agentic AI is enabled by the emergence of standardized protocols such as Model Context Protocol (MCP) and Agent2Agent (A2A) that simplify how agents connect with other tools and systems.
In fact, building AI agents that can reliably perform complex tasks has become increasingly accessible thanks to open source frameworks like CrewAI, LangGraph, LlamaIndex, and Strands Agents. However, moving from a promising proof of concept to a production-ready agent that can scale to thousands of users presents significant challenges.
Instead of being able to focus on the core capabilities of their agents, developers and AI engineers have had to spend months building foundational infrastructure for session management, identity controls, memory systems, and observability, all while supporting security and compliance.
Today, we're excited to announce the preview of Amazon Bedrock AgentCore, a comprehensive set of enterprise-grade services that help developers quickly and securely deploy and operate AI agents at scale using any framework and model, hosted on Amazon Bedrock or elsewhere.
More specifically, today we're introducing:
- AgentCore Runtime – Provides low-latency serverless environments with session isolation, supports any agent framework including popular open source frameworks, tools, and models, and handles multimodal workloads and long-running agents.
- AgentCore Memory – Manages session and long-term memory, providing relevant context to models while helping agents learn from past interactions.
- AgentCore Observability – Provides step-by-step visualization of agent execution with metadata tagging, custom scoring, trajectory inspection, and troubleshooting/debugging filters.
- AgentCore Identity – Enables AI agents to securely access AWS services and third-party tools and services such as GitHub, Salesforce, and Slack, either on behalf of users or by themselves with pre-authorized user consent.
- AgentCore Gateway – Transforms existing APIs and AWS Lambda functions into agent-ready tools, offering unified access across protocols, including MCP, and runtime discovery.
- AgentCore Browser – Provides managed web browser instances to scale agent web automation workflows.
- AgentCore Code Interpreter – Offers an isolated environment to run code generated by agents.
These services can be used individually and are optimized to work together, so developers don't need to spend time piecing components together. AgentCore can work with open source or custom AI agent frameworks, giving teams the flexibility to keep their preferred tools while gaining enterprise capabilities. To integrate these services into their existing code, developers can use the AgentCore SDK.
You can now also discover, buy, and run pre-built agents and agent tools from AWS Marketplace with AgentCore Runtime. With just a few lines of code, your agents can securely connect to API-based agents and tools from AWS Marketplace through AgentCore Gateway to help you run complex workflows while maintaining compliance and control.
AgentCore eliminates tedious infrastructure work and operational complexity so development teams can bring groundbreaking agentic solutions to market faster.
Let's see how this works in practice. I'll share more info on the services as we use them.
Deploying a production-ready customer support assistant with Amazon Bedrock AgentCore (Preview)
When customers send an email, it takes time to provide a reply. Customer support needs to check the validity of the email, find the actual customer in the customer relationship management (CRM) system, check their orders, and use product-specific knowledge bases to find the information needed to prepare an answer.
An AI agent can simplify this process by connecting to internal systems, retrieving contextual information using semantic data sources, and drafting replies for the support team. For this use case, I built a simple prototype using Strands Agents. For simplicity, and to validate the scenario, the internal tools are simulated using Python functions.
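For illustration, the simulated tools might look like the following sketch. The function names and the canned data here are hypothetical; in the actual prototype each function would be exposed to the model with the Strands Agents `@tool` decorator.

```python
# Hypothetical simulated internal tools for the customer support prototype.
# In the real agent, each function would be decorated with the Strands
# Agents @tool decorator so the model can call it.

def get_customer_id(email_address: str) -> dict:
    """Simulate looking up a customer in the CRM by email address."""
    if email_address == "me@example.net":
        return {"customer_id": 123}
    return {"message": "customer not found"}

def get_orders(customer_id: int) -> list[dict]:
    """Simulate retrieving a customer's orders from the order system."""
    if customer_id == 123:
        return [{
            "order_id": 1234,
            "items": ["smartphone", "smartphone USB charger"],
        }]
    return []

def get_knowledge_base_entry(topic: str) -> str:
    """Simulate a semantic lookup in the product knowledge base."""
    kb = {
        "charger": "The charger accepts 100-240V input, so it can be used worldwide.",
        "cover": "To remove the cover, press gently on the top-left corner.",
    }
    return kb.get(topic, "No entry found.")
```

Because these are plain Python functions returning canned data, the end-to-end scenario can be validated before any real integration work.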
When I talk to developers, they tell me that similar prototypes, covering different use cases, are being built in many companies. When these prototypes are demonstrated to company leadership and confirmed, the development team has to figure out how to get into production and meet the usual requirements for security, performance, availability, and scalability. This is where AgentCore can help.
Step 1 – Deploying to the cloud with AgentCore Runtime
AgentCore Runtime is a new service to securely deploy, run, and scale AI agents, providing isolation so that each user session runs in its own protected environment to help prevent data leakage, a critical requirement for applications handling sensitive data.
To match different security postures, agents can use different network configurations:
- Public – Runs with managed internet access.
- VPC-only (coming soon) – This option will allow access to resources hosted in a customer VPC or connected via AWS PrivateLink endpoints.
To deploy the agent to the cloud and get a secure, serverless endpoint with AgentCore Runtime, I add a few lines of code to the prototype using the AgentCore SDK to:
- Import the AgentCore SDK.
- Create the AgentCore app.
- Specify which function is the entry point to invoke the agent.
Using a different or custom agent framework is just a matter of replacing the agent invocation inside the entry point function.
Here's the code of the prototype. The three lines I added to use AgentCore Runtime are the ones preceded by a comment.
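Reconstructed from the description above, the prototype can be sketched as follows. The `BedrockAgentCoreApp` usage follows the AgentCore SDK; the system prompt and the simulated tool shown here are placeholders.

```python
from strands import Agent, tool

# Import the AgentCore SDK
from bedrock_agentcore.runtime import BedrockAgentCoreApp

# Create the AgentCore app
app = BedrockAgentCoreApp()

@tool
def get_customer_id(email_address: str) -> dict:
    """Simulated CRM lookup used by the prototype (hypothetical tool)."""
    if email_address == "me@example.net":
        return {"customer_id": 123}
    return {"message": "customer not found"}

agent = Agent(
    system_prompt="Draft replies to customer support emails.",  # placeholder
    tools=[get_customer_id],
)

# Specify the entry point function invoking the agent
@app.entrypoint
def invoke(payload: dict) -> dict:
    # The JSON syntax of the invocation is defined here:
    # the user message is expected in the "prompt" field.
    prompt = payload.get("prompt", "")
    result = agent(prompt)
    return {"result": str(result.message)}

if __name__ == "__main__":
    app.run()
```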
The previous code needs the Strands Agents modules installed in the Python environment. To install the dependencies, I create and activate a virtual environment:
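For example, using the built-in `venv` module:

```shell
# Create and activate a virtual environment for the project
python3 -m venv .venv
source .venv/bin/activate
```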
I add the Strands Agents modules, the AgentCore SDK, and the AgentCore starter toolkit to the dependency file (`requirements.txt`):
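Assuming the current PyPI package names, `requirements.txt` would contain:

```text
strands-agents
bedrock-agentcore
bedrock-agentcore-starter-toolkit
```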
I then install all the requirements in the virtual environment:
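With the virtual environment active:

```shell
# Install all project dependencies into the virtual environment
pip install -r requirements.txt
```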
The virtual environment now gives me access to the AgentCore command line interface (CLI) provided by the starter toolkit.
First, I use `agentcore configure --entrypoint my_agent.py` to configure the agent. I press Enter to auto-create the AWS Identity and Access Management (IAM) execution role and the Amazon Elastic Container Registry (Amazon ECR) repository and to confirm the detected dependency file.
In this case, the agent only needs access to Amazon Bedrock to invoke the model. The role can give access to other AWS resources used by an agent, such as an Amazon Simple Storage Service (Amazon S3) bucket or an Amazon DynamoDB table. The ECR repository is used to store the container image created when deploying the agent.
By default, the agent configuration enables observability. To enable trace delivery, I use the AWS Command Line Interface (AWS CLI) to set up Transaction Search in Amazon CloudWatch. This switches all trace ingestion for the entire account into a cost-effective collection mode using the CloudWatch Application Signals pricing plan.
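Transaction Search is configured through AWS X-Ray. The setup might look like the following; the exact commands and the indexing rule payload are assumptions to verify against the CloudWatch documentation.

```shell
# Send trace segments to CloudWatch Logs (assumed command)
aws xray update-trace-segment-destination --destination CloudWatchLogs

# Index a percentage of ingested traces (assumed payload shape)
aws xray update-indexing-rule --name "Default" \
    --rule '{"Probabilistic": {"DesiredSamplingPercentage": 1}}'
```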
I check the result of these commands with:
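For example, assuming the corresponding read commands exist in the X-Ray CLI:

```shell
# Verify the trace destination and the indexing rules
aws xray get-trace-segment-destination
aws xray get-indexing-rules
```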
I launch the agent locally with `agentcore launch --local`. When running locally, I can interact with the agent using `agentcore invoke --local <PAYLOAD>`. The payload is passed to the entry point function. Note that the JSON syntax of the invocations is defined in the entry point function. In this case, I look for `prompt` in the JSON payload, but you can use a different syntax depending on your use case.
When I'm satisfied with local testing, I use `agentcore launch` to deploy to the cloud.
After the deployment is successful and an endpoint has been created, I check the status of the endpoint with `agentcore status` and invoke it with `agentcore invoke <PAYLOAD>`. For example, I pass a customer support request in the invocation:
```shell
agentcore invoke '{"prompt": "From: me@example.net – Hi, I bought a smartphone from your store.
I am traveling to Europe next week, will I be able to use the charger?
Also, I struggle to remove the cover. Thanks, Danilo"}'
```
Step 2 – Enabling memory for context
After an agent has been deployed in the AgentCore Runtime, the context needs to be persisted to be available for a new invocation. I add AgentCore Memory to maintain session context using its short-term memory capabilities.
First, I create a memory client and the memory store for the conversations:
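Following the AgentCore SDK samples, this can be sketched as shown below; the `MemoryClient` interface and the response field names are assumptions to verify against the SDK documentation.

```python
from bedrock_agentcore.memory import MemoryClient

memory_client = MemoryClient(region_name="us-east-1")

# Create a memory store for support conversations.
# With no strategies configured, only short-term memory is available.
memory = memory_client.create_memory_and_wait(
    name="CustomerSupportMemory",
    description="Memory store for customer support conversations",
    strategies=[],
)
memory_id = memory["id"]  # response field name assumed
```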
I can now use `create_event` to store agent interactions in short-term memory:
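A sketch of storing a conversation turn; the argument names follow the AgentCore SDK samples and the identifiers are placeholders.

```python
from bedrock_agentcore.memory import MemoryClient

memory_client = MemoryClient(region_name="us-east-1")

# Identifiers for the memory store, the user, and the session (placeholders).
memory_id, actor_id, session_id = "<MEMORY_ID>", "customer-123", "session-1"

# Store the latest turn of the conversation in short-term memory.
memory_client.create_event(
    memory_id=memory_id,
    actor_id=actor_id,      # identifies the user across sessions
    session_id=session_id,  # identifies the current session
    messages=[
        ("I bought a smartphone from your store.", "USER"),
        ("Happy to help! Could you share your order number?", "ASSISTANT"),
    ],
)
```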
I can load the most recent turns of a conversation from short-term memory using `list_events`:
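Again following the SDK samples (identifiers are placeholders):

```python
from bedrock_agentcore.memory import MemoryClient

memory_client = MemoryClient(region_name="us-east-1")
memory_id, actor_id, session_id = "<MEMORY_ID>", "customer-123", "session-1"

# Load the most recent turns to rebuild the conversation context.
events = memory_client.list_events(
    memory_id=memory_id,
    actor_id=actor_id,
    session_id=session_id,
    max_results=10,
)
```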
With this capability, the agent can maintain context during long sessions. But when a user comes back with a new session, the conversation starts blank. Using long-term memory, the agent can personalize user experiences by retaining insights across multiple interactions.
To extract memories from a conversation, I can use built-in AgentCore Memory policies for user preferences, summarization, and semantic memory (to capture facts) or create custom policies for specialized needs. Data is stored encrypted using a namespace-based storage for data segmentation.
I change the previous code that creates the memory store to include long-term capabilities by passing a semantic memory strategy. Note that an existing memory store can be updated to add strategies. In that case, the new strategies are applied to newer events as they are created.
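The updated store creation might look like this; the strategy payload shape is an assumption to check against the AgentCore Memory documentation.

```python
from bedrock_agentcore.memory import MemoryClient

memory_client = MemoryClient(region_name="us-east-1")

# Same store as before, now with a built-in semantic strategy that
# extracts facts from conversations (payload shape assumed).
memory = memory_client.create_memory_and_wait(
    name="CustomerSupportMemory",
    description="Memory store for customer support conversations",
    strategies=[{
        "semanticMemoryStrategy": {
            "name": "semanticFacts",
            "namespaces": ["/facts/{actorId}"],
        }
    }],
)
```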
After long-term memory has been configured for a memory store, calling `create_event` will automatically apply those strategies to extract information from the conversations. I can then retrieve memories extracted from the conversation using a semantic query:
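A sketch of the semantic retrieval, with placeholder identifiers and a `retrieve_memories` signature assumed from the SDK samples:

```python
from bedrock_agentcore.memory import MemoryClient

memory_client = MemoryClient(region_name="us-east-1")
memory_id, actor_id = "<MEMORY_ID>", "customer-123"

# Retrieve facts extracted from past conversations with a semantic query.
memories = memory_client.retrieve_memories(
    memory_id=memory_id,
    namespace=f"/facts/{actor_id}",
    query="smartphone charger and cover issues",
)
```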
In this way, I can quickly improve the user experience so that the agent remembers customer preferences and facts that fall outside the scope of the CRM, and can use this information to improve its replies.
Step 3 – Adding identity and access controls
Without proper identity controls, the agent accesses internal tools with the same access level regardless of who is using it. To follow security requirements, I integrate AgentCore Identity so that the agent can use access controls scoped to the user's or agent's identity context.
I set up an identity client and create a workload identity, a unique identifier that represents the agent within the AgentCore Identity system:
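Following the AgentCore SDK samples (module path and method names are assumptions to verify against the SDK documentation):

```python
from bedrock_agentcore.services.identity import IdentityClient

identity_client = IdentityClient("us-east-1")

# A workload identity is a unique identifier representing the agent
# within the AgentCore Identity system.
workload_identity = identity_client.create_workload_identity(
    name="customer-support-agent",
)
```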
Then, I configure the credential providers, for example:
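The provider configuration might look like the following sketch; the method names, vendor identifiers, and field shapes are assumptions, and the credentials are placeholders.

```python
from bedrock_agentcore.services.identity import IdentityClient

identity_client = IdentityClient("us-east-1")

# An API-key provider for an internal tool (field names assumed).
api_key_provider = identity_client.create_api_key_credential_provider({
    "name": "support-api-key-provider",
    "apiKey": "<API_KEY>",
})

# An OAuth2 provider for a third-party service such as GitHub
# (vendor name and config shape assumed).
oauth2_provider = identity_client.create_oauth2_credential_provider({
    "name": "github-provider",
    "credentialProviderVendor": "GithubOauth2",
    "oauth2ProviderConfigInput": {
        "githubOauth2ProviderConfig": {
            "clientId": "<CLIENT_ID>",
            "clientSecret": "<CLIENT_SECRET>",
        }
    },
})
```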
I can then add the `@requires_access_token` Python decorator (passing the provider name, the scope, and so on) to the functions that need an access token to perform their activities.
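For example, a decorated function might look like this sketch; the provider name and scope are hypothetical, and the decorator parameters are assumptions to verify against the SDK documentation.

```python
from bedrock_agentcore.identity.auth import requires_access_token

@requires_access_token(
    provider_name="github-provider",  # credential provider configured earlier
    scopes=["repo"],                  # hypothetical scope for this tool
    auth_flow="USER_FEDERATION",      # act on behalf of the end user
)
async def file_github_issue(*, access_token: str, title: str) -> None:
    # The decorator injects a valid access token for the user,
    # triggering the consent flow the first time.
    ...
```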
Using this approach, the agent can verify identity through the company's existing identity infrastructure, operate as a distinct, authenticated identity, act with scoped permissions, and integrate across multiple identity providers (such as Amazon Cognito, Okta, or Microsoft Entra ID) and service boundaries, including AWS and third-party tools and services (such as Slack, GitHub, and Salesforce).
To offer robust and secure access controls while streamlining end-user and agent builder experiences, AgentCore Identity implements a secure token vault that stores users’ tokens and allows agents to retrieve them securely.
For OAuth 2.0 compatible tools and services, when a user first grants consent for an agent to act on their behalf, AgentCore Identity collects and stores the user’s tokens issued by the tool in its vault, along with securely storing the agent’s OAuth client credentials. Agents, operating with their own distinct identity and when invoked by the user, can then access these tokens as needed, reducing the need for frequent user consent.
When the user token expires, AgentCore Identity triggers a new authorization prompt to the user for the agent to obtain updated user tokens. For tools that use API keys, AgentCore Identity also stores these keys securely and gives agents controlled access to retrieve them when needed. This secure storage streamlines the user experience while maintaining robust access controls, enabling agents to operate effectively across various tools and services.
Step 4 – Expanding agent capabilities with AgentCore Gateway
Until now, all the internal tools have been simulated in the code. Many agent frameworks, including Strands Agents, natively support MCP to connect to remote tools. To have access to internal systems (such as CRM and order management) via an MCP interface, I use AgentCore Gateway.
With AgentCore Gateway, the agent can access AWS services using Smithy models, Lambda functions, and internal APIs and third-party providers using OpenAPI specifications. It employs a dual authentication model to provide secure access control for both incoming requests and outbound connections to target resources. Lambda functions can be used to integrate external systems, particularly applications that lack standard APIs or require multiple steps to retrieve information.
AgentCore Gateway facilitates cross-cutting features that most customers would otherwise need to build themselves, including authentication, authorization, throttling, custom request/response transformation (to match underlying API formats), multitenancy, and tool selection.
The tool selection feature helps find the most relevant tools for a specific agent's task. AgentCore Gateway brings a uniform MCP interface across all these tools, using AgentCore Identity to provide an OAuth interface for tools that do not support OAuth out of the box, such as AWS services.
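On the agent side, the gateway is just an MCP server. With Strands Agents and the MCP Python SDK, connecting might look like this sketch; the gateway URL is a placeholder, and the access token would come from AgentCore Identity.

```python
from mcp.client.streamable_http import streamablehttp_client
from strands import Agent
from strands.tools.mcp import MCPClient

gateway_url = "https://<gateway-id>.gateway.bedrock-agentcore.us-east-1.amazonaws.com/mcp"  # placeholder
access_token = "<ACCESS_TOKEN>"  # obtained via AgentCore Identity

# Connect to the gateway's MCP endpoint over streamable HTTP.
mcp_client = MCPClient(lambda: streamablehttp_client(
    gateway_url,
    headers={"Authorization": f"Bearer {access_token}"},
))

with mcp_client:
    # The CRM and order management tools exposed by the gateway
    # become regular tools for the agent.
    agent = Agent(tools=mcp_client.list_tools_sync())
```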
Step 5 – Adding capabilities with AgentCore Code Interpreter and Browser tools
To answer customer requests, the customer support agent needs to perform calculations. To simplify that, I use the AgentCore SDK to add access to the AgentCore Code Interpreter.
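Wrapped as a Strands tool, this might look like the following sketch; the `code_session` helper follows the AgentCore SDK samples, and the response shape is an assumption.

```python
import json

from strands import tool
from bedrock_agentcore.tools.code_interpreter_client import code_session

@tool
def execute_python(code: str) -> str:
    """Run agent-generated Python code in an isolated sandbox."""
    with code_session("us-east-1") as code_client:
        response = code_client.invoke("executeCode", {
            "language": "python",
            "code": code,
        })
        # The response streams events; return the first result (shape assumed).
        for event in response["stream"]:
            return json.dumps(event["result"])
    return ""
```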
Similarly, some of the integrations required by the agent don't offer a programmatic API but need to be accessed through a web interface. I give the agent access to the AgentCore Browser to let it navigate those websites autonomously.
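Starting a managed browser session might look like this sketch; the `browser_session` helper and the `generate_ws_headers` method are assumptions based on the AgentCore SDK samples.

```python
from bedrock_agentcore.tools.browser_client import browser_session

# Start a managed browser session and get the connection details that a
# browser automation library (for example, Playwright over CDP) can use.
with browser_session("us-east-1") as browser_client:
    ws_url, headers = browser_client.generate_ws_headers()
```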
Step 6 – Gaining visibility with observability
Now that the agent is in production, I need visibility into its activities and performance. AgentCore provides enhanced observability to help developers effectively debug, audit, and monitor their agent performance in production. It comes with built-in dashboards to track essential operational metrics such as session count, latency, duration, token usage, error rates, and component-level latency and error breakdowns. AgentCore also gives visibility into an agent's behavior by capturing and visualizing both end-to-end traces and "spans" that capture each step of the agent workflow, including tool invocations and memory operations.
The built-in dashboards offered by this service help reveal performance bottlenecks and identify why certain interactions might fail, enabling continuous improvement and reducing the mean time to detect (MTTD) and mean time to repair (MTTR) in case of issues.
AgentCore supports OpenTelemetry to help integrate agent telemetry data with existing observability platforms, including CloudWatch, Datadog, LangSmith, and Langfuse. I just need to enable observability in the agent configuration and launch it again to start sending telemetry data to CloudWatch. Check that the IAM role used by the agent has the necessary permissions to do so.
Step 7 – Conclusion
Through this journey, we transformed a local prototype into a production-ready system. Using AgentCore's modular approach, we implemented enterprise requirements incrementally, from basic deployment to sophisticated memory, identity management, and tool integration, all while maintaining the existing agent code.
Things to know
Amazon Bedrock AgentCore is available in preview in US East (N. Virginia), US West (Oregon), Asia Pacific (Sydney), and Europe (Frankfurt). You can start using AgentCore services through the AWS Management Console, the AWS Command Line Interface (AWS CLI), the AWS SDKs, or via the AgentCore SDK.
You can try AgentCore services at no charge until September 16, 2025. Standard AWS pricing applies to any additional AWS services used with AgentCore (for example, CloudWatch pricing applies to AgentCore Observability). Starting September 17, 2025, AWS will bill you for AgentCore service usage based on this page.
Whether you’re building customer support agents, workflow automation, or innovative AI-powered experiences, AgentCore provides the foundation you need to move from prototype to production with confidence.
To learn more and start deploying production-ready agents, visit the AgentCore documentation. For code examples and integration guides, check out the AgentCore samples GitHub repo.
Join the AgentCore Preview Discord server to provide feedback and discuss use cases. We’d like to hear from you!
— Danilo