Build RAG-based generative AI applications in AWS using Amazon FSx for NetApp ONTAP with Amazon Bedrock

Favorite The post is co-written with Michael Shaul and Sasha Korman from NetApp. Generative artificial intelligence (AI) applications are commonly built using a technique called Retrieval Augmented Generation (RAG) that provides foundation models (FMs) access to additional data they didn’t have during training. This data is used to enrich the

Read More
Shared by AWS Machine Learning September 18, 2024

Improve RAG performance using Cohere Rerank

Favorite This post is co-written with Pradeep Prabhakaran from Cohere. Retrieval Augmented Generation (RAG) is a powerful technique that can help enterprises develop generative artificial intelligence (AI) apps that integrate real-time data and enable rich, interactive conversations using proprietary data. RAG allows these AI applications to tap into external, reliable

Read More
Shared by AWS Machine Learning September 17, 2024

Build ultra-low latency multimodal generative AI applications using sticky session routing in Amazon SageMaker

Favorite Amazon SageMaker is a fully managed machine learning (ML) service. With SageMaker, data scientists and developers can quickly and confidently build, train, and deploy ML models into a production-ready hosted environment. SageMaker provides a broad selection of ML infrastructure and model deployment options to help meet your ML inference

Read More
Shared by AWS Machine Learning September 14, 2024

Streamline workflow orchestration of a system of enterprise APIs using chaining with Amazon Bedrock Agents

Favorite Intricate workflows that require dynamic and complex API orchestration can often be complex to manage. In industries like insurance, where unpredictable scenarios are the norm, traditional automation falls short, leading to inefficiencies and missed opportunities. With the power of intelligent agents, you can simplify these challenges. In this post,

Read More
Shared by AWS Machine Learning September 14, 2024

Scaling Thomson Reuters’ language model research with Amazon SageMaker HyperPod

Favorite Thomson Reuters, a global content and technology-driven company, has been using artificial intelligence and machine learning (AI/ML) in its professional information products for decades. The introduction of generative AI provides another opportunity for Thomson Reuters to work with customers and advance how they do their work, helping professionals draw

Read More
Shared by AWS Machine Learning September 13, 2024