Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints

Favorite Amazon SageMaker multi-model endpoints (MMEs) are a fully managed capability of SageMaker inference that allows you to deploy thousands of models on a single endpoint. Previously, MMEs pre-determinedly allocated CPU computing power to models statically regardless the model traffic load, using Multi Model Server (MMS) as its model server.

Read More
Shared by AWS Machine Learning February 19, 2024

What is a long context window?

Favorite Gemini 1.5 Pro brings big improvements to speed and efficiency, but one of its innovations is its long context window, which measures how many tokens that the model can … View Original Source (blog.google/technology/ai/) Here.

How AI can strengthen digital security

Favorite We’re launching the AI Cyber Defense Initiative to help transform cybersecurity and use AI to reverse the dynamic known as the “Defender’s Dilemma” View Original Source (blog.google/technology/ai/) Here.

Our next-generation model: Gemini 1.5

Favorite Gemini 1.5 delivers dramatically enhanced performance, with a breakthrough in long-context understanding across modalities. View Original Source (blog.google/technology/ai/) Here.

Build generative AI chatbots using prompt engineering with Amazon Redshift and Amazon Bedrock

Favorite With the advent of generative AI solutions, organizations are finding different ways to apply these technologies to gain edge over their competitors. Intelligent applications, powered by advanced foundation models (FMs) trained on huge datasets, can now understand natural language, interpret meaning and intent, and generate contextually relevant and human-like

Read More
Shared by AWS Machine Learning February 14, 2024

Enhance Amazon Connect and Lex with generative AI capabilities

Favorite Effective self-service options are becoming increasingly critical for contact centers, but implementing them well presents unique challenges. Amazon Lex provides your Amazon Connect contact center with chatbot functionalities such as automatic speech recognition (ASR) and natural language understanding (NLU) capabilities through voice and text channels. The bot takes natural

Read More
Shared by AWS Machine Learning February 14, 2024