Introducing Amazon EKS support in Amazon SageMaker HyperPod

Favorite We are thrilled to introduce Amazon Elastic Kubernetes Service (Amazon EKS) support in Amazon SageMaker HyperPod, a purpose-built infrastructure engineered with resilience at its core. This capability allows for the seamless addition of SageMaker HyperPod managed compute to EKS clusters, using automated node and job resiliency features for foundation

Read More
Shared by AWS Machine Learning September 12, 2024

Exploring data using AI chat at Domo with Amazon Bedrock

Favorite This post is co-written with Joe Clark from Domo. Data insights are crucial for businesses to enable data-driven decisions, identify trends, and optimize operations. Traditionally, gaining these insights required skilled analysts using specialized tools, which can make the process slow and less accessible. Generative artificial intelligence (AI) has revolutionized

Read More
Shared by AWS Machine Learning September 10, 2024

Amazon EC2 P5e instances are generally available

Favorite State-of-the-art generative AI models and high performance computing (HPC) applications are driving the need for unprecedented levels of compute. Customers are pushing the boundaries of these technologies to bring higher fidelity products and experiences to market across industries. The size of large language models (LLMs), as measured by the

Read More
Shared by AWS Machine Learning September 10, 2024

Align Meta Llama 3 to human preferences with DPO, Amazon SageMaker Studio, and Amazon SageMaker Ground Truth

Favorite Large language models (LLMs) have remarkable capabilities. Nevertheless, using them in customer-facing applications often requires tailoring their responses to align with your organization’s values and brand identity. In this post, we demonstrate how to use direct preference optimization (DPO), a technique that allows you to fine-tune an LLM with

Read More
Shared by AWS Machine Learning September 10, 2024

Ground truth curation and metric interpretation best practices for evaluating generative AI question answering using FMEval

Favorite Generative artificial intelligence (AI) applications powered by large language models (LLMs) are rapidly gaining traction for question answering use cases. From internal knowledge bases for customer support to external conversational AI assistants, these applications use LLMs to provide human-like responses to natural language queries. However, building and deploying such

Read More
Shared by AWS Machine Learning September 7, 2024

Fine-tune Llama 3 for text generation on Amazon SageMaker JumpStart

Favorite Generative artificial intelligence (AI) models have become increasingly popular and powerful, enabling a wide range of applications such as text generation, summarization, question answering, and code generation. However, despite their impressive capabilities, these models often struggle with domain-specific tasks or use cases due to their general training data. To

Read More
Shared by AWS Machine Learning September 7, 2024

Evaluating prompts at scale with Prompt Management and Prompt Flows for Amazon Bedrock

Favorite As generative artificial intelligence (AI) continues to revolutionize every industry, the importance of effective prompt optimization through prompt engineering techniques has become key to efficiently balancing the quality of outputs, response time, and costs. Prompt engineering refers to the practice of crafting and optimizing inputs to the models by

Read More
Shared by AWS Machine Learning September 6, 2024