Generative AI is a new trend, but implementing it in every use case and context where it could be beneficial can be challenging.
Recently, we published a blog about Amazon Bedrock, which includes an overview of the builder tools this service provides. One of these tools is Bedrock Agents, which allows you to automate the use of Foundation Models (FMs) in AWS and connect them to other services. Put simply, you set them up once, and they do the work for you.
In this blog post, we'll explore what Bedrock Agents are, how they work, real-world examples of their use, their main features, and new capabilities for governing agents at scale.
Amazon Bedrock Agents offer the ability to use foundation models (FMs) to automatically interpret user input, initiate API calls, retrieve needed information, and call additional APIs as required.
Agents essentially act as orchestrators between FMs, data sources, software applications, and user interactions.
Let's dive a little deeper into what you'll need to select or configure to use Amazon Bedrock Agents. Below, you'll find a simple diagram illustrating how it works.

Agents for Amazon Bedrock leverage foundation models from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon. You select the foundation model that is already pre-trained, and you can fine-tune it with your own data.
The agent invokes the model and leverages it to interpret users' input and generate responses based on the provided data.

This simply instructs the Agent what to do; it is basically what you would normally instruct a person to do.

If the Agent should be responsible for reserving a table in a restaurant, you can write something like this:
"Start each interaction by greeting the user and asking for the reservation details. Gather information on the date, time, number of guests, and any seating preferences. Confirm all details back to the customer before finalizing. If the requested time isn't available, offer alternative times close to the original request. Close the conversation by thanking the customer. If any issues arise, apologize and guide the user on what to do next."
An action group is something that is called by the Agent to perform the selected task. You define the action that will be done when the Agent will be called, which might be information retrieval, processing, or performing any other task. This can be performed by AWS Lambda function or an API schema.

The typical process, for example, for a chatbot app in a restaurant and how it leverages the action group is:
This provides the possibility of using an Agent memory. Before this was introduced, Agents couldn't remember what was happening in between each interaction, so one user could see the same information over and over. With this function, you can store the interactions separated by every user. Currently, only some FMs are supported.
Enhanced Memory with Episodic Learning: AgentCore Memory now includes episodic functionality that helps agents learn from past experiences and adapt solutions across similar situations. For example, a travel booking agent learns from your booking patterns (such as needing flexible return options for work trips with client meetings) and applies these learnings to future bookings. This structured episode capture of context, reasoning, actions, and outcomes, combined with automatic reflection analysis, enables agents to recognize and adapt to individual needs without requiring exhaustive custom instructions.
Every Bedrock Agent has to use an FM, instructions, and action group. The task for the Bedrock Agent can be based on an API call from your app, but it can also be connected to an event in your AWS environment (for example, a new file upload to S3).

So, let's dive into the Bedrock Agent example. If you are more interested in exact examples from AWS, visit this Git repo: Bedrock Agents Examples.
Example: A Bedrock Agent analyses past patient cases based on previous doctors' notes, prescriptions, and treatments. This can be done by creating a knowledge base. For a new case, it reviews similar past cases and highlights what could be done differently to improve the treatment.
Actions: The Agent searches the knowledge base for relevant past cases, finds patterns or better approaches, and summarizes these for the doctor. If the Agent identifies a treatment alternative, it suggests it directly, helping doctors make more informed decisions.
Enterprise-Grade Agent Governance: Bedrock Agents now include AgentCore Policy and Evaluations capabilities that enable organizations to govern agent behavior at scale. With Policy, security and compliance teams can define natural language rules that intercept agent tool calls in real time, ensuring agents access only authorized data and services. AgentCore Evaluations continuously monitors agent quality across 13 built-in dimensions (correctness, helpfulness, tool selection, safety, and more) with automated alerts when performance drops.
If you're still deciding whether Bedrock or a more conversational tool like ChatGPT is a better fit for your needs, we've outlined the key differences between them in our Amazon Bedrock vs. ChatGPT comparison.
Security: Built on AWS's secure infrastructure, Bedrock Agents inherit robust security measures, including data encryption and compliance with industry regulations. Any provided data are saved in the selected region and are not shared with third parties. AgentCore Policy now adds real-time governance layer that intercepts agent tool calls before execution.
Simple setup: No complex AI expertise needed. Bedrock Agents eliminate the need for deep machine learning expertise, making it accessible for businesses without AI-specialized teams. Natural language policy definition and built-in evaluators reduce operational burden.
AWS integration: Bedrock Agents can seamlessly integrate with existing AWS infrastructure, enabling businesses to enhance their current applications.
Production readiness: AgentCore transforms agents from prototypes to production-grade systems with memory, evaluations, policies, and observability built in.
As usual, AWS secures your data as much as possible, but you are partly responsible for setting it up.
Here are just two points that should help you understand how the security of Bedrock Agents works:
Your provided data is not used to improve or enhance the base FMs. Your data is not shared with any model providers, and you set who has access to the data through IAM policies and properly setting up the API security. AgentCore Policy adds an additional governance layer where teams can define which tools and data agents can access using natural language.
As a regional service, Amazon Bedrock and its Agents offer possibilities for storing data where you need it. The data that you store, including prompts, attachments that you use for prompts, AI responses and customized models, remain in the selected region and in your AWS account. AgentCore Runtime provides complete session isolation for multi-tenant environments.
The best option is to run through a simple Bedrock Agent workshop to learn the basics of how they work and try to connect what you do with your possible use case.
AgentCore Features for Production Deployment: When deploying agents to production, leverage AgentCore's new capabilities: use Policy to define agent boundaries in natural language, implement AgentCore Evaluations with built-in quality metrics to catch issues before customer impact, enable AgentCore Memory episodic functionality so agents learn from experience, and use bidirectional streaming in AgentCore Runtime for natural voice conversations. These capabilities work with any framework and model you choose.
Bedrock Agents allow you to use generative AI in AWS environments without the need to learn code or develop a deep understanding of their setup. Because they are directly connected to other AWS services, they can play a big role for those who already store data in AWS without the need to set outside connections that could bring issues with latency.
With AgentCore, Bedrock Agents have evolved from a prototype tool to an enterprise-grade agentic AI platform. Natural language policy definition, continuous quality evaluations, episodic memory learning, and bidirectional voice streaming enable organizations to confidently deploy agents at scale. Combined with framework flexibility and framework-agnostic infrastructure, Bedrock Agents position organizations to extract real business value from autonomous AI systems with built-in governance and observability.
An AWS Solutions Architect with over 5 years of experience in designing, assessing, and optimizing AWS cloud architectures. At Stormit, he supports customers across the full cloud lifecycle — from pre-sales consulting and solution design to AWS funding programs such as AWS Activate, Proof of Concept (PoC), and the Migration Acceleration Program (MAP).