Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

Favorite This post has been co-written with Artem Sysuev, Danny Portman, Matúš Chládek, and Saurabh Gupta from Zeta Global. Zeta Global is a leading data-driven, cloud-based marketing technology company that empowers enterprises to acquire, grow and retain customers. The company’s Zeta Marketing Platform (ZMP) is the largest omnichannel marketing platform

Read More
Shared by AWS Machine Learning September 19, 2024

Reinvent personalization with generative AI on Amazon Bedrock using task decomposition for agentic workflows

Favorite Personalization has become a cornerstone of delivering tangible benefits to businesses and their customers. Generative AI and large language models (LLMs) offer new possibilities, although some businesses might hesitate due to concerns about consistency and adherence to company guidelines. This post presents an automated personalization solution that balances the

Read More
Shared by AWS Machine Learning September 19, 2024

Support for AWS DeepComposer ending soon

Favorite AWS DeepComposer was first introduced during AWS re:Invent 2019 as a fun way for developers to compose music by using generative AI. AWS DeepComposer was the world’s first machine learning (ML)-enabled keyboard for developers to get hands-on—literally—with a musical keyboard and the latest ML techniques to compose their own

Read More
Shared by AWS Machine Learning September 18, 2024

Build RAG-based generative AI applications in AWS using Amazon FSx for NetApp ONTAP with Amazon Bedrock

Favorite The post is co-written with Michael Shaul and Sasha Korman from NetApp. Generative artificial intelligence (AI) applications are commonly built using a technique called Retrieval Augmented Generation (RAG) that provides foundation models (FMs) access to additional data they didn’t have during training. This data is used to enrich the

Read More
Shared by AWS Machine Learning September 18, 2024

Improve RAG performance using Cohere Rerank

Favorite This post is co-written with Pradeep Prabhakaran from Cohere. Retrieval Augmented Generation (RAG) is a powerful technique that can help enterprises develop generative artificial intelligence (AI) apps that integrate real-time data and enable rich, interactive conversations using proprietary data. RAG allows these AI applications to tap into external, reliable

Read More
Shared by AWS Machine Learning September 17, 2024

Build ultra-low latency multimodal generative AI applications using sticky session routing in Amazon SageMaker

Favorite Amazon SageMaker is a fully managed machine learning (ML) service. With SageMaker, data scientists and developers can quickly and confidently build, train, and deploy ML models into a production-ready hosted environment. SageMaker provides a broad selection of ML infrastructure and model deployment options to help meet your ML inference

Read More
Shared by AWS Machine Learning September 14, 2024