Configure and verify a distributed training cluster with AWS Deep Learning Containers on Amazon EKS

Favorite Training state-of-the-art large language models (LLMs) demands massive, distributed compute infrastructure. Meta’s Llama 3, for instance, ran on 16,000 NVIDIA H100 GPUs for over 30.84 million GPU hours. Amazon Elastic Kubernetes Service (Amazon EKS) is a managed service that simplifies the deployment, management, and scaling of Kubernetes clusters that

Read More
Shared by AWS Machine Learning October 16, 2025

Building smarter AI agents: AgentCore long-term memory deep dive

Favorite Building AI agents that remember user interactions requires more than just storing raw conversations. While Amazon Bedrock AgentCore short-term memory captures immediate context, the real challenge lies in transforming these interactions into persistent, actionable knowledge that spans across sessions. This is the information that transforms fleeting interactions into meaningful,

Read More
Shared by AWS Machine Learning October 16, 2025

How Amazon Bedrock Custom Model Import streamlined LLM deployment for Salesforce

Favorite This post is cowritten by Salesforce’s AI Platform team members Srikanta Prasad, Utkarsh Arora, Raghav Tanaji, Nitin Surya, Gokulakrishnan Gopalakrishnan, and Akhilesh Deepak Gotmare. Salesforce’s Artificial Intelligence (AI) platform team runs customized large language models (LLMs)—fine-tuned versions of Llama, Qwen, and Mistral—for agentic AI applications like Agentforce. Deploying these

Read More
Shared by AWS Machine Learning October 15, 2025

Build a device management agent with Amazon Bedrock AgentCore

Favorite The proliferation of Internet of Things (IoT) devices has transformed how we interact with our environments, from homes to industrial settings. However, as the number of connected devices grows, so does the complexity of managing them. Traditional device management interfaces often require navigating through multiple applications, each with its

Read More
Shared by AWS Machine Learning October 15, 2025

Connect Amazon Quick Suite to enterprise apps and agents with MCP

Favorite Organizations need solutions for people and AI agents to securely collaborate through a single interface to the organization’s data and take actions across enterprise applications to improve productivity. The ability of an AI agent to securely and seamlessly connect with organizational knowledge bases, enterprise applications, and other AI agents

Read More
Shared by AWS Machine Learning October 14, 2025

Medical reports analysis dashboard using Amazon Bedrock, LangChain, and Streamlit

Favorite In healthcare, the ability to quickly analyze and interpret medical reports is crucial for both healthcare providers and patients. While medical reports contain valuable information, they often remain underutilized due to their complex nature and the time-intensive process of analysis. This complexity manifests in several ways: the interpretation of

Read More
Shared by AWS Machine Learning October 14, 2025

Transforming the physical world with AI: the next frontier in intelligent automation 

Favorite The convergence of artificial intelligence with physical systems marks a pivotal moment in technological evolution. Physical AI, where algorithms transcend digital boundaries to perceive, understand, and manipulate the tangible world, will fundamentally transform how enterprises operate across industries. These intelligent systems bridge the gap between digital intelligence and physical

Read More
Shared by AWS Machine Learning October 14, 2025