Simulate realistic users to evaluate multi-turn AI agents in Strands Evals

Favorite Evaluating single-turn agent interactions follows a pattern that most teams understand well. You provide an input, collect the output, and judge the result. Frameworks like Strands Evaluation SDK make this process systematic through evaluators that assess helpfulness, faithfulness, and tool usage. In a previous blog post, we covered how

Read More
Shared by AWS Machine Learning April 2, 2026

New ways to balance cost and reliability in the Gemini API

Favorite Google is introducing two new inference tiers to the Gemini API, Flex and Priority, to balance cost and latency. View Original Source (blog.google/technology/ai/) Here.

Create, edit and share videos at no cost in Google Vids

Favorite New AI capabilities are coming to Google Vids, powered by Lyria 3 and Veo 3.1, like high-quality video generation at no cost and more. View Original Source (blog.google/technology/ai/) Here.

Build reliable AI agents with Amazon Bedrock AgentCore Evaluations

Favorite Your AI agent worked in the demo, impressed stakeholders, handled test scenarios, and seemed ready for production. Then you deployed it, and the picture changed. Real users experienced wrong tool calls, inconsistent responses, and failure modes nobody anticipated during testing. The result is a gap between expected agent behavior

Read More
Shared by AWS Machine Learning April 1, 2026

Automating competitive price intelligence with Amazon Nova Act

Favorite Monitoring competitor prices is essential for ecommerce teams to maintain a market edge. However, many teams remain trapped in manual tracking, wasting hours daily checking individual websites. This inefficient approach delays decision-making, raises operational costs, and risks human errors that result in missed revenue and lost opportunities. Amazon Nova

Read More
Shared by AWS Machine Learning April 1, 2026

We’re creating a new satellite imagery map to help protect Brazil’s forests.

Favorite Google partnered with the Brazilian government on a satellite imagery map to help protect the country’s forests. View Original Source (blog.google/technology/ai/) Here.

The latest AI news we announced in March 2026

Favorite Here are Google’s latest AI updates from March 2026 View Original Source (blog.google/technology/ai/) Here.

Can your governance keep pace with your AI ambitions? AI risk intelligence in the agentic era

Favorite DevOps used to be predictable: same input, same output, binary success, static dependencies, concrete metrics. You could control what you could predict, measure what was concrete, and secure what followed known patterns. Then agentic AI arrived, and everything changed. Agents operate non-deterministically; they don’t follow fixed patterns. Ask the

Read More
Shared by AWS Machine Learning March 31, 2026

AWS launches frontier agents for security testing and cloud operations

Favorite I’m excited to announce that AWS Security Agent on-demand penetration testing and AWS DevOps Agent are now generally available, representing a new class of AI capabilities we announced at re:Invent called frontier agents. These autonomous systems work independently to achieve goals, scale massively to tackle concurrent tasks, and run

Read More
Shared by AWS Machine Learning March 31, 2026

Accelerating software delivery with agentic QA automation using Amazon Nova Act

Favorite Quality assurance (QA) automation is critical for modern software delivery. It catches regressions before production, validates user journeys at scale, and enables confident feature releases. But traditional QA automation solutions are brittle and demand specialized programming knowledge, decelerating software delivery. Automation frameworks rely on implementation details including UI selectors,

Read More
Shared by AWS Machine Learning March 31, 2026