Align Meta Llama 3 to human preferences with DPO, Amazon SageMaker Studio, and Amazon SageMaker Ground Truth

Favorite Large language models (LLMs) have remarkable capabilities. Nevertheless, using them in customer-facing applications often requires tailoring their responses to align with your organization’s values and brand identity. In this post, we demonstrate how to use direct preference optimization (DPO), a technique that allows you to fine-tune an LLM with

Read More
Shared by AWS Machine Learning September 10, 2024

Ground truth curation and metric interpretation best practices for evaluating generative AI question answering using FMEval

Favorite Generative artificial intelligence (AI) applications powered by large language models (LLMs) are rapidly gaining traction for question answering use cases. From internal knowledge bases for customer support to external conversational AI assistants, these applications use LLMs to provide human-like responses to natural language queries. However, building and deploying such

Read More
Shared by AWS Machine Learning September 7, 2024

Fine-tune Llama 3 for text generation on Amazon SageMaker JumpStart

Favorite Generative artificial intelligence (AI) models have become increasingly popular and powerful, enabling a wide range of applications such as text generation, summarization, question answering, and code generation. However, despite their impressive capabilities, these models often struggle with domain-specific tasks or use cases due to their general training data. To

Read More
Shared by AWS Machine Learning September 7, 2024

Evaluating prompts at scale with Prompt Management and Prompt Flows for Amazon Bedrock

Favorite As generative artificial intelligence (AI) continues to revolutionize every industry, the importance of effective prompt optimization through prompt engineering techniques has become key to efficiently balancing the quality of outputs, response time, and costs. Prompt engineering refers to the practice of crafting and optimizing inputs to the models by

Read More
Shared by AWS Machine Learning September 6, 2024