Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters

Favorite Implementing hardware resiliency in your training infrastructure is crucial to mitigating risks and enabling uninterrupted model training. By implementing features such as proactive health monitoring and automated recovery mechanisms, organizations can create a fault-tolerant environment capable of handling hardware failures or other issues without compromising the integrity of the

Read More
Shared by AWS Machine Learning July 25, 2024

Evaluate conversational AI agents with Amazon Bedrock

Favorite As conversational artificial intelligence (AI) agents gain traction across industries, providing reliability and consistency is crucial for delivering seamless and trustworthy user experiences. However, the dynamic and conversational nature of these interactions makes traditional testing and evaluation methods challenging. Conversational AI agents also encompass multiple layers, from Retrieval Augmented

Read More
Shared by AWS Machine Learning July 25, 2024

Find answers accurately and quickly using Amazon Q Business with the SharePoint Online connector

Favorite Amazon Q Business is a fully managed, generative artificial intelligence (AI)-powered assistant that helps enterprises unlock the value of their data and knowledge. With Amazon Q, you can quickly find answers to questions, generate summaries and content, and complete tasks by using the information and expertise stored across your

Read More
Shared by AWS Machine Learning July 25, 2024

Amazon SageMaker inference launches faster auto scaling for generative AI models

Favorite Today, we are excited to announce a new capability in Amazon SageMaker inference that can help you reduce the time it takes for your generative artificial intelligence (AI) models to scale automatically. You can now use sub-minute metrics and significantly reduce overall scaling latency for generative AI models. With

Read More
Shared by AWS Machine Learning July 25, 2024

Boosting Salesforce Einstein’s code generating model performance with Amazon SageMaker

Favorite This post is a joint collaboration between Salesforce and AWS and is being cross-published on both the Salesforce Engineering Blog and the AWS Machine Learning Blog. Salesforce, Inc. is an American cloud-based software company headquartered in San Francisco, California. It provides customer relationship management (CRM) software and applications focused

Read More
Shared by AWS Machine Learning July 24, 2024

Discover insights from Amazon S3 with Amazon Q S3 connector

Favorite Amazon Q is a fully managed, generative artificial intelligence (AI) powered assistant that you can configure to answer questions, provide summaries, generate content, gain insights, and complete tasks based on data in your enterprise. The enterprise data required for these generative-AI powered assistants can reside in varied repositories across

Read More
Shared by AWS Machine Learning July 24, 2024

LLM experimentation at scale using Amazon SageMaker Pipelines and MLflow

Favorite Large language models (LLMs) have achieved remarkable success in various natural language processing (NLP) tasks, but they may not always generalize well to specific domains or tasks. You may need to customize an LLM to adapt to your unique use case, improving its performance on your specific dataset or

Read More
Shared by AWS Machine Learning July 24, 2024

Mistral Large 2 is now available in Amazon Bedrock

Favorite Mistral AI’s Mistral Large 2 (24.07) foundation model (FM) is now generally available in Amazon Bedrock. Mistral Large 2 is the newest version of Mistral Large, and according to Mistral AI offers significant improvements across multilingual capabilities, math, reasoning, coding, and much more. In this post, we discuss the

Read More
Shared by AWS Machine Learning July 24, 2024

Llama 3.1 models are now available in Amazon SageMaker JumpStart

Favorite Today, we are excited to announce that the state-of-the-art Llama 3.1 collection of multilingual large language models (LLMs), which includes pre-trained and instruction tuned generative AI models in 8B, 70B, and 405B sizes, is available through Amazon SageMaker JumpStart to deploy for inference. Llama is a publicly accessible LLM

Read More
Shared by AWS Machine Learning July 23, 2024

Use Llama 3.1 405B to generate synthetic data for fine-tuning tasks

Favorite Today, we are excited to announce the availability of the Llama 3.1 405B model on Amazon SageMaker JumpStart, and Amazon Bedrock in preview. The Llama 3.1 models are a collection of state-of-the-art pre-trained and instruct fine-tuned generative artificial intelligence (AI) models in 8B, 70B, and 405B sizes. Amazon SageMaker

Read More
Shared by AWS Machine Learning July 23, 2024

1 2 3 … 211 »