Come Partner with Us

Customize agent workflows with advanced orchestration techniques using Strands Agents

Favorite Large Language Model (LLM) agents have revolutionized how we approach complex, multi-step tasks by combining the reasoning capabilities of foundation models with specialized tools and domain expertise. While single-agent systems using frameworks like ReAct work well for straightforward tasks, real-world challenges often require multiple specialized agents working in coordination.

Read More
Shared by AWS Machine Learning December 15, 2025

Adaptive infrastructure for foundation model training with elastic training on SageMaker HyperPod

Favorite Modern AI infrastructure serves multiple concurrent workloads on the same cluster, from foundation model (FM) pre-training and fine-tuning to production inference and evaluation. In this shared environment, the demands for AI accelerators fluctuates continuously as inference workloads scale with traffic patterns, and experiments complete and release resources. Despite this

Read More
Shared by AWS Machine Learning December 15, 2025

Checkpointless training on Amazon SageMaker HyperPod: Production-scale training with faster fault recovery

Favorite Foundation model training has reached an inflection point where traditional checkpoint-based recovery methods are becoming a bottleneck to efficiency and cost-effectiveness. As models grow to trillions of parameters and training clusters expand to thousands of AI accelerators, even minor disruptions can result in significant costs and delays. In this

Read More
Shared by AWS Machine Learning December 15, 2025

Building a voice-driven AWS assistant with Amazon Nova Sonic

Favorite As cloud infrastructure becomes increasingly complex, the need for intuitive and efficient management interfaces has never been greater. Traditional command-line interfaces (CLI) and web consoles, while powerful, can create barriers to quick decision-making and operational efficiency. What if you could speak to your AWS infrastructure and get immediate, intelligent

Read More
Shared by AWS Machine Learning December 12, 2025

Celebrating Generosity and Growth in the OSI Community

Favorite Members Newsletter – December 2025 Dear OSI supporters, As we reach the final weeks of the year, I find myself reflecting on a season that invites both gratitude and giving, two values that feel especially resonant for our community. Serving as Interim Executive Director these past months has only

Read More
Shared by voicesofopensource December 12, 2025

Amazon Bedrock AgentCore Observability with Langfuse

Favorite The rise of artificial intelligence (AI) agents marks a change in software development and how applications make decisions and interact with users. While traditional systems follow predictable paths, AI agents engage in complex reasoning that remains hidden from view. This invisibility creates a challenge for organizations: how can they

Read More
Shared by AWS Machine Learning December 11, 2025

Scaling MLflow for enterprise AI: What’s New in SageMaker AI with MLflow

Favorite Today we’re announcing Amazon SageMaker AI with MLflow, now including a serverless capability that dynamically manages infrastructure provisioning, scaling, and operations for artificial intelligence and machine learning (AI/ML) development tasks. It scales resources up during intensive experimentation and down to zero when not in use, reducing operational overhead. It

Read More
Shared by AWS Machine Learning December 11, 2025