Large model inference container – latest capabilities and performance enhancements
Modern large language model (LLM) deployments face an escalating cost and performance challenge driven by token count growth. Token count, which is directly related to word count, image size, and other input factors, determines both computational requirements and costs. Longer contexts translate to higher expenses per inference request.
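The cost relationship described above can be sketched with simple arithmetic. The function and per-1K-token prices below are hypothetical placeholders for illustration, not actual rates of any provider:

```python
# Hedged sketch: how per-request cost scales with token count.
# price_in_per_1k / price_out_per_1k are hypothetical example rates.

def request_cost(input_tokens: int, output_tokens: int,
                 price_in_per_1k: float = 0.003,
                 price_out_per_1k: float = 0.015) -> float:
    """Estimate the cost of one inference request from its token counts."""
    return (input_tokens / 1000) * price_in_per_1k \
         + (output_tokens / 1000) * price_out_per_1k

# With the same output length, growing the input context from 1K to 8K
# tokens multiplies the input-side cost by 8.
short = request_cost(1_000, 200)
long_ctx = request_cost(8_000, 200)
print(f"short context: ${short:.4f}, long context: ${long_ctx:.4f}")
```

The same linearity holds for compute: attention prefill work grows with context length, which is why longer contexts raise both latency and dollar cost per request.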
Shared by AWS Machine Learning, February 27, 2026