Amazon SageMaker HyperPod launches model deployments to accelerate the generative AI model development lifecycle

Favorite Today, we’re excited to announce that Amazon SageMaker HyperPod now supports deploying foundation models (FMs) from Amazon SageMaker JumpStart, as well as custom or fine-tuned models from Amazon S3 or Amazon FSx. With this launch, you can train, fine-tune, and deploy models on the same HyperPod compute resources, maximizing

Read More
Shared by AWS Machine Learning July 10, 2025

Accelerating generative AI development with fully managed MLflow 3.0 on Amazon SageMaker AI

Favorite Amazon SageMaker now offers fully managed support for MLflow 3.0 that streamlines AI experimentation and accelerates your generative AI journey from idea to production. This release transforms managed MLflow from experiment tracking to providing end-to-end observability, reducing time-to-market for generative AI development. As customers across industries accelerate their generative

Read More
Shared by AWS Machine Learning July 10, 2025

Accelerate foundation model development with one-click observability in Amazon SageMaker HyperPod

Favorite Amazon SageMaker HyperPod now provides a comprehensive, out-of-the-box dashboard that delivers insights into foundation model (FM) development tasks and cluster resources. This unified observability solution automatically publishes key metrics to Amazon Managed Service for Prometheus and visualizes them in Amazon Managed Grafana dashboards, optimized specifically for FM development with

Read More
Shared by AWS Machine Learning July 10, 2025

Accelerate AI development with Amazon Bedrock API keys

Favorite Today, we’re excited to announce a significant improvement to the developer experience of Amazon Bedrock: API keys. API keys provide quick access to the Amazon Bedrock APIs, streamlining the authentication process so that developers can focus on building rather than configuration. CamelAI is an open-source, modular framework for building

Read More
Shared by AWS Machine Learning July 9, 2025

Scale generative AI use cases, Part 1: Multi-tenant hub and spoke architecture using AWS Transit Gateway

Favorite Generative AI continues to reshape how businesses approach innovation and problem-solving. Customers are moving from experimentation to scaling generative AI use cases across their organizations, with more businesses fully integrating these technologies into their core processes. This evolution spans across lines of business (LOBs), teams, and software as a

Read More
Shared by AWS Machine Learning July 9, 2025

Improve conversational AI response times for enterprise applications with the Amazon Bedrock streaming API and AWS AppSync

Favorite Many enterprises are using large language models (LLMs) in Amazon Bedrock to gain insights from their internal data sources. Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI,

Read More
Shared by AWS Machine Learning July 9, 2025

Configure fine-grained access to Amazon Bedrock models using Amazon SageMaker Unified Studio

Favorite Enterprises adopting advanced AI solutions recognize that robust security and precise access control are essential for protecting valuable data, maintaining compliance, and preserving user trust. As organizations expand AI usage across teams and applications, they require granular permissions to safeguard sensitive information and manage who can access powerful models.

Read More
Shared by AWS Machine Learning July 9, 2025

Query Amazon Aurora PostgreSQL using Amazon Bedrock Knowledge Bases structured data

Favorite Amazon Bedrock Knowledge Bases offers a fully managed Retrieval Augmented Generation (RAG) feature that connects large language models (LLMs) to internal data sources. This feature enhances foundation model (FM) outputs with contextual information from private data, making responses more relevant and accurate. At AWS re:Invent 2024, we announced Amazon

Read More
Shared by AWS Machine Learning July 9, 2025