Reduce ML training costs with Amazon SageMaker HyperPod

Favorite Training a frontier model is highly compute-intensive, requiring a distributed system of hundreds, or thousands, of accelerated instances running for several weeks or months to complete a single job. For example, pre-training the Llama 3 70B model with 15 trillion training tokens took 6.5 million H100 GPU hours. On

Read More
Shared by AWS Machine Learning April 11, 2025

Multi-LLM routing strategies for generative AI applications on AWS

Favorite Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements. The multi-LLM approach enables organizations to effectively choose the right model

Read More
Shared by AWS Machine Learning April 10, 2025

Multi-LLM routing strategies for generative AI applications on AWS

Favorite Organizations are increasingly using multiple large language models (LLMs) when building generative AI applications. Although an individual LLM can be highly capable, it might not optimally address a wide range of use cases or meet diverse performance requirements. The multi-LLM approach enables organizations to effectively choose the right model

Read More
Shared by AWS Machine Learning April 10, 2025

Boost team productivity with Amazon Q Business Insights

Favorite Employee productivity is a critical factor in maintaining a competitive advantage. Amazon Q Business offers a unique opportunity to enhance workforce efficiency by providing AI-powered assistance that can significantly reduce the time spent searching for information, generating content, and completing routine tasks. Amazon Q Business is a fully managed,

Read More
Shared by AWS Machine Learning April 10, 2025

Boost team productivity with Amazon Q Business Insights

Favorite Employee productivity is a critical factor in maintaining a competitive advantage. Amazon Q Business offers a unique opportunity to enhance workforce efficiency by providing AI-powered assistance that can significantly reduce the time spent searching for information, generating content, and completing routine tasks. Amazon Q Business is a fully managed,

Read More
Shared by AWS Machine Learning April 10, 2025

Implement human-in-the-loop confirmation with Amazon Bedrock Agents

Favorite Agents are revolutionizing how businesses automate complex workflows and decision-making processes. Amazon Bedrock Agents helps you accelerate generative AI application development by orchestrating multi-step tasks. Agents use the reasoning capability of foundation models (FMs) to break down user-requested tasks into multiple steps. In addition, they use the developer-provided instruction

Read More
Shared by AWS Machine Learning April 10, 2025

Implement human-in-the-loop confirmation with Amazon Bedrock Agents

Favorite Agents are revolutionizing how businesses automate complex workflows and decision-making processes. Amazon Bedrock Agents helps you accelerate generative AI application development by orchestrating multi-step tasks. Agents use the reasoning capability of foundation models (FMs) to break down user-requested tasks into multiple steps. In addition, they use the developer-provided instruction

Read More
Shared by AWS Machine Learning April 10, 2025

Advanced tracing and evaluation of generative AI agents using LangChain and Amazon SageMaker AI MLFlow

Favorite Developing generative AI agents that can tackle real-world tasks is complex, and building production-grade agentic applications requires integrating agents with additional tools such as user interfaces, evaluation frameworks, and continuous improvement mechanisms. Developers often find themselves grappling with unpredictable behaviors, intricate workflows, and a web of complex interactions. The

Read More
Shared by AWS Machine Learning April 8, 2025