Unlock retail intelligence by transforming data into actionable insights using generative AI with Amazon Q Business

Favorite Businesses often face challenges in managing and deriving value from their data. According to McKinsey, 78% of organizations now use AI in at least one business function (as of 2024), showing the growing importance of AI solutions in business. Additionally, 21% of organizations using generative AI have fundamentally redesigned

Read More
Shared by AWS Machine Learning July 10, 2025

AWS AI infrastructure with NVIDIA Blackwell: Two powerful compute solutions for the next frontier of AI

Favorite Imagine a system that can explore multiple approaches to complex problems, drawing on its understanding of vast amounts of data, from scientific datasets to source code to business documents, and reasoning through the possibilities in real time. This lightning-fast reasoning isn’t waiting on the horizon. It’s happening today in

Read More
Shared by AWS Machine Learning July 10, 2025

Build real-time conversational AI experiences using Amazon Nova Sonic and LiveKit

Favorite The rapid growth of generative AI technology has been a catalyst for business productivity growth, creating new opportunities for greater efficiency, enhanced customer service experiences, and more successful customer outcomes. Today’s generative AI advances are helping existing technologies achieve their long-promised potential. For example, voice-first applications have been gaining

Read More
Shared by AWS Machine Learning July 10, 2025

Build an MCP application with Mistral models on AWS

Favorite This post is cowritten with Siddhant Waghjale and Samuel Barry from Mistral AI. Model Context Protocol (MCP) is a standard that has been gaining significant traction in recent months. At a high level, it consists of a standardized interface designed to streamline and enhance how AI models interact with

Read More
Shared by AWS Machine Learning July 10, 2025

Use K8sGPT and Amazon Bedrock for simplified Kubernetes cluster maintenance

Favorite As Kubernetes clusters grow in complexity, managing them efficiently becomes increasingly challenging. Troubleshooting modern Kubernetes environments requires deep expertise across multiple domains—networking, storage, security, and the expanding ecosystem of CNCF plugins. With Kubernetes now hosting mission-critical workloads, rapid issue resolution has become paramount to maintaining business continuity. Integrating advanced

Read More
Shared by AWS Machine Learning July 10, 2025

Amazon SageMaker HyperPod launches model deployments to accelerate the generative AI model development lifecycle

Favorite Today, we’re excited to announce that Amazon SageMaker HyperPod now supports deploying foundation models (FMs) from Amazon SageMaker JumpStart, as well as custom or fine-tuned models from Amazon S3 or Amazon FSx. With this launch, you can train, fine-tune, and deploy models on the same HyperPod compute resources, maximizing

Read More
Shared by AWS Machine Learning July 10, 2025

Accelerating generative AI development with fully managed MLflow 3.0 on Amazon SageMaker AI

Favorite Amazon SageMaker now offers fully managed support for MLflow 3.0 that streamlines AI experimentation and accelerates your generative AI journey from idea to production. This release transforms managed MLflow from experiment tracking to providing end-to-end observability, reducing time-to-market for generative AI development. As customers across industries accelerate their generative

Read More
Shared by AWS Machine Learning July 10, 2025

Accelerate foundation model development with one-click observability in Amazon SageMaker HyperPod

Favorite Amazon SageMaker HyperPod now provides a comprehensive, out-of-the-box dashboard that delivers insights into foundation model (FM) development tasks and cluster resources. This unified observability solution automatically publishes key metrics to Amazon Managed Service for Prometheus and visualizes them in Amazon Managed Grafana dashboards, optimized specifically for FM development with

Read More
Shared by AWS Machine Learning July 10, 2025