Implement secure API access to your Amazon Q Business applications with IAM federation user access management

Favorite Amazon Q Business is a conversational assistant powered by generative AI that enhances workforce productivity by answering questions and completing tasks based on information in your enterprise systems, which each user is authorized to access. AWS recommends using AWS IAM Identity Center when you have a large number of

Read More
Shared by AWS Machine Learning November 23, 2024

Amazon Bedrock Flows is now generally available with enhanced safety and traceability

Favorite Today, we are excited to announce the general availability of Amazon Bedrock Flows (previously known as Prompt Flows). With Bedrock Flows, you can quickly build and execute complex generative AI workflows without writing code. Key benefits include: Simplified generative AI workflow development with an intuitive visual interface. Seamless integration

Read More
Shared by AWS Machine Learning November 23, 2024

Governing the ML lifecycle at scale, Part 3: Setting up data governance at scale

Favorite This post is part of an ongoing series about governing the machine learning (ML) lifecycle at scale. To view this series from the beginning, start with Part 1. This post dives deep into how to set up data governance at scale using Amazon DataZone for the data mesh. The

Read More
Shared by AWS Machine Learning November 23, 2024

Improve factual consistency with LLM Debates

Favorite In this post, we demonstrate the potential of large language model (LLM) debates using a supervised dataset with ground truth. In this LLM debate, we have two debater LLMs, each one taking one side of an argument and defending it based on the previous arguments for N(=3) rounds. The

Read More
Shared by AWS Machine Learning November 23, 2024

Build generative AI applications on Amazon Bedrock with the AWS SDK for Python (Boto3)

Favorite Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies like AI21 Labs, Anthropic, Cohere, Meta, Mistral AI, Stability AI, and Amazon through a single API, along with a broad set of capabilities to build generative AI applications with

Read More
Shared by AWS Machine Learning November 23, 2024

Orchestrate generative AI workflows with Amazon Bedrock and AWS Step Functions

Favorite Companies across all industries are harnessing the power of generative AI to address various use cases. Cloud providers have recognized the need to offer model inference through an API call, significantly streamlining the implementation of AI within applications. Although a single API call can address simple use cases, more

Read More
Shared by AWS Machine Learning November 23, 2024

Amazon SageMaker Inference now supports G6e instances

Favorite As the demand for generative AI continues to grow, developers and enterprises seek more flexible, cost-effective, and powerful accelerators to meet their needs. Today, we are thrilled to announce the availability of G6e instances powered by NVIDIA’s L40S Tensor Core GPUs on Amazon SageMaker. You will have the option to

Read More
Shared by AWS Machine Learning November 23, 2024

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

Favorite Companies across various scales and industries are using large language models (LLMs) to develop generative AI applications that provide innovative experiences for customers and employees. However, building or fine-tuning these pre-trained LLMs on extensive datasets demands substantial computational resources and engineering effort. With the increase in sizes of these

Read More
Shared by AWS Machine Learning November 23, 2024

Fine-tune large language models with Amazon SageMaker Autopilot

Favorite Fine-tuning foundation models (FMs) is a process that involves exposing a pre-trained FM to task-specific data and fine-tuning its parameters. It can then develop a deeper understanding and produce more accurate and relevant outputs for that particular domain. In this post, we show how to use an Amazon SageMaker

Read More
Shared by AWS Machine Learning November 22, 2024

Revolutionizing knowledge management: VW’s AI prototype journey with AWS

Favorite Today, we’re excited to share the journey of the VW—an innovator in the automotive industry and Europe’s largest car maker—to enhance knowledge management by using generative AI, Amazon Bedrock, and Amazon Kendra to devise a solution based on Retrieval Augmented Generation (RAG) that makes internal information more easily accessible

Read More
Shared by AWS Machine Learning November 22, 2024