Techniques and approaches for monitoring large language models on AWS

Favorite Large Language Models (LLMs) have revolutionized the field of natural language processing (NLP), improving tasks such as language translation, text summarization, and sentiment analysis. However, as these models continue to grow in size and complexity, monitoring their performance and behavior has become increasingly challenging. Monitoring the performance and behavior

Read More
Shared by AWS Machine Learning February 26, 2024

Streamline diarization using AI as an assistive technology: ZOO Digital’s story

Favorite ZOO Digital provides end-to-end localization and media services to adapt original TV and movie content to different languages, regions, and cultures. It makes globalization easier for the world’s best content creators. Trusted by the biggest names in entertainment, ZOO Digital delivers high-quality localization and media services at scale, including

Read More
Shared by AWS Machine Learning February 20, 2024

Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints

Favorite Amazon SageMaker multi-model endpoints (MMEs) are a fully managed capability of SageMaker inference that allows you to deploy thousands of models on a single endpoint. Previously, MMEs pre-determinedly allocated CPU computing power to models statically regardless the model traffic load, using Multi Model Server (MMS) as its model server.

Read More
Shared by AWS Machine Learning February 19, 2024

Build generative AI chatbots using prompt engineering with Amazon Redshift and Amazon Bedrock

Favorite With the advent of generative AI solutions, organizations are finding different ways to apply these technologies to gain edge over their competitors. Intelligent applications, powered by advanced foundation models (FMs) trained on huge datasets, can now understand natural language, interpret meaning and intent, and generate contextually relevant and human-like

Read More
Shared by AWS Machine Learning February 14, 2024