Amazon SageMaker AI introduces EAGLE based adaptive speculative decoding to accelerate generative AI inference

Generative AI models continue to expand in scale and capability, increasing the demand for faster and more efficient inference. Applications need low latency and consistent performance without compromising output quality. Amazon SageMaker AI introduces new enhancements to its inference optimization toolkit that bring EAGLE based adaptive speculative decoding to

Shared by AWS Machine Learning November 26, 2025

Enhanced performance for Amazon Bedrock Custom Model Import

You can now achieve significant performance improvements when using Amazon Bedrock Custom Model Import, with reduced end-to-end latency, faster time-to-first-token, and improved throughput through advanced PyTorch compilation and CUDA graph optimizations. With Amazon Bedrock Custom Model Import you can bring your own foundation models to Amazon Bedrock for

Shared by AWS Machine Learning November 26, 2025

Beyond the technology: Workforce changes for AI

Workplaces are increasingly integrating AI tools into daily operations, with AI assistants supporting teams, predictive analytics informing strategies, and automation streamlining workflows. AI has moved from experimental technology to standard business practice, changing how work gets done. Organizations need to understand what AI can do and how it affects

Shared by AWS Machine Learning November 26, 2025

Building AI-Powered Voice Applications: Amazon Nova Sonic Telephony Integration Guide

Organizations are increasingly seeking to enhance customer experiences through natural, responsive voice interactions across their telephony systems. Amazon Nova Sonic addresses this need as a speech-to-speech generative AI model that delivers real-time voice conversations with low latency and natural turn-taking. It understands speech across different accents and speaking styles,

Shared by AWS Machine Learning November 26, 2025