Reduce conversational AI response time through inference at the edge with AWS Local Zones

Favorite Recent advances in generative AI have led to the proliferation of new generation of conversational AI assistants powered by foundation models (FMs). These latency-sensitive applications enable real-time text and voice interactions, responding naturally to human conversations. Their applications span a variety of sectors, including customer service, healthcare, education, personal

Read More
Shared by AWS Machine Learning March 3, 2025

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Favorite Increasingly, organizations across industries are turning to generative AI foundation models (FMs) to enhance their applications. To achieve optimal performance for specific use cases, customers are adopting and adapting these FMs to their unique domain requirements. This need for customization has become even more pronounced with the emergence of

Read More
Shared by AWS Machine Learning March 3, 2025

The end of an era: the final AWS DeepRacer League Championship at re:Invent 2024

Favorite AWS DeepRacer League 2024 Championship finalists at re:Invent 2024 The AWS DeepRacer League is the world’s first global autonomous racing league powered by machine learning (ML). Over the past 6 years, a diverse community of over 560,000 builders from more than 150 countries worldwide have participated in the League

Read More
Shared by AWS Machine Learning February 28, 2025

Optimizing AI implementation costs with Automat-it

Favorite This post was written by Claudiu Bota, Oleg Yurchenko, and Vladyslav Melnyk of AWS Partner Automat-it. As organizations adopt AI and machine learning (ML), they’re using these technologies to improve processes and enhance products. AI use cases include video analytics, market predictions, fraud detection, and natural language processing, all

Read More
Shared by AWS Machine Learning February 28, 2025

Level up your problem-solving and strategic thinking skills with Amazon Bedrock

Favorite Organizations across many industries are harnessing the power of foundation models (FMs) and large language models (LLMs) to build generative AI applications to deliver new customer experiences, boost employee productivity, and drive innovation. Amazon Bedrock, a fully managed service that offers a choice of high-performing FMs from leading AI

Read More
Shared by AWS Machine Learning February 28, 2025

Streamline work insights with the Amazon Q Business connector for Smartsheet

Favorite Amazon Q Business is a fully managed, generative AI–powered assistant that empowers enterprises to unlock the full potential of their data and organizational knowledge. With Amazon Q Business, you can quickly access answers to questions, generate summaries and content, and complete tasks by using the expertise and information stored

Read More
Shared by AWS Machine Learning February 28, 2025

AWS DeepRacer: Closing time at AWS re:Invent 2024 –How did that physical racing go?

Favorite Having spent the last years studying the art of AWS DeepRacer in the physical world, the author went to AWS re:Invent 2024. How did it go? In AWS DeepRacer: How to master physical racing?, I wrote in detail about some aspects relevant to racing AWS DeepRacer in the physical

Read More
Shared by AWS Machine Learning February 27, 2025

Evaluate healthcare generative AI applications using LLM-as-a-judge on AWS

Favorite In our previous blog posts, we explored various techniques such as fine-tuning large language models (LLMs), prompt engineering, and Retrieval Augmented Generation (RAG) using Amazon Bedrock to generate impressions from the findings section in radiology reports using generative AI. Part 1 focused on model fine-tuning. Part 2 introduced RAG,

Read More
Shared by AWS Machine Learning February 27, 2025

ByteDance processes billions of daily videos using their multimodal video understanding models on AWS Inferentia2

Favorite This is a guest post authored by the team at ByteDance. ByteDance is a technology company that operates a range of content platforms to inform, educate, entertain, and inspire people across languages, cultures, and geographies. Users trust and enjoy our content platforms because of the rich, intuitive, and safe

Read More
Shared by AWS Machine Learning February 26, 2025

How to configure cross-account model deployment using Amazon Bedrock Custom Model Import

Favorite In enterprise environments, organizations often divide their AI operations into two specialized teams: an AI research team and a model hosting team. The research team is dedicated to developing and enhancing AI models using model training and fine-tuning techniques. Meanwhile, a separate hosting team is responsible for deploying these

Read More
Shared by AWS Machine Learning February 26, 2025