AWS AI League: Model customization and agentic showdown

Building intelligent agents to handle complex, real-world tasks can be daunting. Additionally, rather than relying solely on large, pre-trained foundation models, organizations often need to fine-tune and customize smaller, more specialized models to outperform them for their specific use cases. The AWS AI League provides an innovative program to help

Shared by AWS Machine Learning December 23, 2025

Exploring the zero operator access design of Mantle

At Amazon, our culture, built on honest and transparent discussion of our growth opportunities, enables us to focus on investing and innovating to continually raise the standard on our ability to deliver value for our customers. Earlier this month, we had the opportunity to share an example of this

Shared by AWS Machine Learning December 23, 2025

Build a multimodal generative AI assistant for root cause diagnosis in predictive maintenance using Amazon Bedrock

Predictive maintenance is a strategy that uses data from equipment sensors and advanced analytics to predict when a machine is likely to fail, ensuring maintenance can be performed proactively to prevent breakdowns. This enables industries to reduce unexpected failures, improve operational efficiency, and extend the lifespan of critical equipment. It is applicable

Shared by AWS Machine Learning December 22, 2025

Enhance document analytics with Strands AI Agents for the GenAI IDP Accelerator

Extracting structured information from unstructured data is a critical first step to unlocking business value. Our Generative AI Intelligent Document Processing (GenAI IDP) Accelerator has been at the forefront of this transformation, already having processed tens of millions of documents for hundreds of customers. Although organizations can use intelligent

Shared by AWS Machine Learning December 22, 2025

Deploy Mistral AI’s Voxtral on Amazon SageMaker AI

Mistral AI’s Voxtral models combine text and audio processing capabilities in a single framework. The Voxtral family includes two distinct variants designed for different use cases and resource requirements. The Voxtral-Mini-3B-2507 is a compact 3-billion-parameter model optimized for efficient audio transcription and basic multimodal understanding, making it ideal for

Shared by AWS Machine Learning December 22, 2025

Move Beyond Chain-of-Thought with Chain-of-Draft on Amazon Bedrock

As organizations scale their generative AI implementations, the critical challenge of balancing quality, cost, and latency becomes increasingly complex. With inference costs dominating 70–90% of large language model (LLM) operational expenses, and verbose prompting strategies inflating token volume by 3–5x, organizations are actively seeking more efficient approaches to model

Shared by AWS Machine Learning December 22, 2025

Introducing SOCI indexing for Amazon SageMaker Studio: Faster container startup times for AI/ML workloads

Today, we are excited to introduce a new feature for SageMaker Studio: SOCI (Seekable Open Container Initiative) indexing. SOCI supports lazy loading of container images, where only the necessary parts of an image are downloaded initially rather than the entire container. SageMaker Studio serves as a web Integrated Development Environment (IDE)

Shared by AWS Machine Learning December 19, 2025

Bi-directional streaming for real-time agent interactions now available in Amazon Bedrock AgentCore Runtime

Building natural voice conversations with AI agents requires complex infrastructure and lots of code from engineering teams. Text-based agent interactions follow a turn-based pattern: a user sends a complete request, waits for the agent to process it, and receives a full response before continuing. Bi-directional streaming removes this constraint

Shared by AWS Machine Learning December 18, 2025

Build and deploy scalable AI agents with NVIDIA NeMo, Amazon Bedrock AgentCore, and Strands Agents

This post is co-written with Ranjit Rajan, Abdullahi Olaoye, and Abhishek Sawarkar from NVIDIA. AI’s next frontier isn’t merely smarter chat-based assistants; it’s autonomous agents that reason, plan, and execute across entire systems. But to accomplish this, enterprise developers need to move from prototypes to production-ready AI agents that

Shared by AWS Machine Learning December 18, 2025

Track machine learning experiments with MLflow on Amazon SageMaker using Snowflake integration

A user can conduct machine learning (ML) data experiments in data environments, such as Snowflake, using the Snowpark library. However, tracking these experiments across diverse environments can be challenging due to the difficulty in maintaining a central repository to monitor experiment metadata, parameters, hyperparameters, models, results, and other pertinent

Shared by AWS Machine Learning December 17, 2025