Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2

Favorite This post is the second part of the DeepSeek series focusing on model customization with Amazon SageMaker HyperPod recipes (or recipes for brevity). In Part 1, we demonstrated the performance and ease of fine-tuning DeepSeek-R1 distilled models using these recipes. In this post, we use the recipes to fine-tune

Read More
Shared by AWS Machine Learning May 15, 2025

Cost-effective AI image generation with PixArt-Σ inference on AWS Trainium and AWS Inferentia

Favorite PixArt-Sigma is a diffusion transformer model that is capable of image generation at 4k resolution. This model shows significant improvements over previous generation PixArt models like Pixart-Alpha and other diffusion models through dataset and architectural improvements. AWS Trainium and AWS Inferentia are purpose-built AI chips to accelerate machine learning

Read More
Shared by AWS Machine Learning May 15, 2025

How Hexagon built an AI assistant using AWS generative AI services

Favorite This post was co-written with Julio P. Roque Hexagon ALI. Recognizing the transformative benefits of generative AI for enterprises, we at Hexagon’s Asset Lifecycle Intelligence division sought to enhance how users interact with our Enterprise Asset Management (EAM) products. Understanding these advantages, we partnered with AWS to embark on

Read More
Shared by AWS Machine Learning May 14, 2025

Build scalable containerized RAG based generative AI applications in AWS using Amazon EKS with Amazon Bedrock

Favorite Generative artificial intelligence (AI) applications are commonly built using a technique called Retrieval Augmented Generation (RAG) that provides foundation models (FMs) access to additional data they didn’t have during training. This data is used to enrich the generative AI prompt to deliver more context-specific and accurate responses without continuously

Read More
Shared by AWS Machine Learning May 14, 2025

Securing Amazon Bedrock Agents: A guide to safeguarding against indirect prompt injections

Favorite Generative AI tools have transformed how we work, create, and process information. At Amazon Web Services (AWS), security is our top priority. Therefore, Amazon Bedrock provides comprehensive security controls and best practices to help protect your applications and data. In this post, we explore the security measures and practical

Read More
Shared by AWS Machine Learning May 14, 2025

Build an intelligent community agent to revolutionize IT support with Amazon Q Business

Favorite In the era of AI and machine learning (ML), there is a growing emphasis on enhancing security— especially in IT contexts. In this post, we demonstrate how your organization can reduce the end-to-end burden of resolving regular challenges experienced by your IT support teams—from understanding errors and reviewing diagnoses,

Read More
Shared by AWS Machine Learning May 13, 2025

Elevate marketing intelligence with Amazon Bedrock and LLMs for content creation, sentiment analysis, and campaign performance evaluation

Favorite In the media and entertainment industry, understanding and predicting the effectiveness of marketing campaigns is crucial for success. Marketing campaigns are the driving force behind successful businesses, playing a pivotal role in attracting new customers, retaining existing ones, and ultimately boosting revenue. However, launching a campaign isn’t enough; to

Read More
Shared by AWS Machine Learning May 10, 2025

How Deutsche Bahn redefines forecasting using Chronos models – Now available on Amazon Bedrock Marketplace

Favorite This post is co-written with Kilian Zimmerer and Daniel Ringler from Deutsche Bahn. Every day, Deutsche Bahn (DB) moves over 6.6 million passengers across Germany, requiring precise time series forecasting for a wide range of purposes. However, building accurate forecasting models traditionally required significant expertise and weeks of development

Read More
Shared by AWS Machine Learning May 8, 2025

Use custom metrics to evaluate your generative AI application with Amazon Bedrock

Favorite With Amazon Bedrock Evaluations, you can evaluate foundation models (FMs) and Retrieval Augmented Generation (RAG) systems, whether hosted on Amazon Bedrock or another model or RAG system hosted elsewhere, including Amazon Bedrock Knowledge Bases or multi-cloud and on-premises deployments. We recently announced the general availability of the large language

Read More
Shared by AWS Machine Learning May 7, 2025

Get faster and actionable AWS Trusted Advisor insights to make data-driven decisions using Amazon Q Business

Favorite Our customers’ key strategic objectives are cost savings and building secure and resilient infrastructure. At AWS, we’re dedicated to helping you meet these critical goals with our unparalleled expertise and industry-leading tools. One of the most valuable resources we offer is the AWS Trusted Advisor detailed report, which provides

Read More
Shared by AWS Machine Learning May 3, 2025