Building custom model provider for Strands Agents with LLMs hosted on SageMaker AI endpoints

Favorite Organizations increasingly deploy custom large language models (LLMs) on Amazon SageMaker AI real-time endpoints using their preferred serving frameworks—such as SGLang, vLLM, or TorchServe—to help gain greater control over their deployments, optimize costs, and align with compliance requirements. However, this flexibility introduces a critical technical challenge: response format incompatibility

Read More
Shared by AWS Machine Learning March 6, 2026

Drive organizational growth with Amazon Lex multi-developer CI/CD pipeline

Favorite As your conversational AI initiatives evolve, developing Amazon Lex assistants becomes increasingly complex. Multiple developers working on the same shared Lex instance leads to configuration conflicts, overwritten changes, and slower iteration cycles. Scaling Amazon Lex development requires isolated environments, version control, and automated deployment pipelines. By adopting well-structured continuous

Read More
Shared by AWS Machine Learning March 6, 2026

How Ricoh built a scalable intelligent document processing solution on AWS

Favorite This post is cowritten by Jeremy Jacobson and Rado Fulek from Ricoh. This post demonstrates how enterprises can overcome document processing scaling limits by combining generative AI, serverless architecture, and standardized frameworks. Ricoh engineered a repeatable, reusable framework using the AWS GenAI Intelligent Document Processing (IDP) Accelerator. This framework

Read More
Shared by AWS Machine Learning March 5, 2026

Embed Amazon Quick Suite chat agents in enterprise applications

Favorite Organizations can face two critical challenges with conversational AI. First, users need answers where they work—in their CRM, support console, or analytics portal—not in separate tools. Second, implementing a secure embedded chat in their applications can require weeks of development to build authentication, token validation, domain security, and global

Read More
Shared by AWS Machine Learning March 5, 2026

How Tines enhances security analysis with Amazon Quick Suite

Favorite Organizations face challenges in quickly detecting and responding to user account security events, such as repeated login attempts from unusual locations. Although security data exists across multiple applications, manually correlating information and making corrective actions often delays effective response. With Amazon Quick Suite and Tines, you can automate the

Read More
Shared by AWS Machine Learning March 4, 2026