Amazon SageMaker Inference now supports G6e instances

Favorite As the demand for generative AI continues to grow, developers and enterprises seek more flexible, cost-effective, and powerful accelerators to meet their needs. Today, we are thrilled to announce the availability of G6e instances powered by NVIDIA’s L40S Tensor Core GPUs on Amazon SageMaker. You will have the option to

Read More
Shared by AWS Machine Learning November 23, 2024

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

Favorite Companies across various scales and industries are using large language models (LLMs) to develop generative AI applications that provide innovative experiences for customers and employees. However, building or fine-tuning these pre-trained LLMs on extensive datasets demands substantial computational resources and engineering effort. With the increase in sizes of these

Read More
Shared by AWS Machine Learning November 23, 2024

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

Favorite As generative AI models advance in creating multimedia content, the difference between good and great output often lies in the details that only human feedback can capture. Audio and video segmentation provides a structured way to gather this detailed feedback, allowing models to learn through reinforcement learning from human

Read More
Shared by AWS Machine Learning November 22, 2024

Embedding secure generative AI in mission-critical public safety applications

Favorite This post is co-written with Lawrence Zorio III from Mark43. Public safety organizations face the challenge of accessing and analyzing vast amounts of data quickly while maintaining strict security protocols. First responders need immediate access to relevant data across multiple systems, while command staff require rapid insights for operational decisions.

Read More
Shared by AWS Machine Learning November 21, 2024