Reduce conversational AI response time through inference at the edge with AWS Local Zones

Recent advances in generative AI have led to the proliferation of a new generation of conversational AI assistants powered by foundation models (FMs). These latency-sensitive applications enable real-time text and voice interactions, responding naturally to human conversations. Their applications span a variety of sectors, including customer service, healthcare, education, personal

Shared by AWS Machine Learning March 3, 2025

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1

Increasingly, organizations across industries are turning to generative AI foundation models (FMs) to enhance their applications. To achieve optimal performance for specific use cases, customers are adopting and adapting these FMs to their unique domain requirements. This need for customization has become even more pronounced with the emergence of

Shared by AWS Machine Learning March 3, 2025

Optimizing AI implementation costs with Automat-it

This post was written by Claudiu Bota, Oleg Yurchenko, and Vladyslav Melnyk of AWS Partner Automat-it. As organizations adopt AI and machine learning (ML), they’re using these technologies to improve processes and enhance products. AI use cases include video analytics, market predictions, fraud detection, and natural language processing, all

Shared by AWS Machine Learning February 28, 2025