Improve factual consistency with LLM Debates

Favorite In this post, we demonstrate the potential of large language model (LLM) debates using a supervised dataset with ground truth. In this LLM debate, we have two debater LLMs, each one taking one side of an argument and defending it based on the previous arguments for N(=3) rounds. The

Read More
Shared by AWS Machine Learning November 23, 2024

Amazon SageMaker Inference now supports G6e instances

Favorite As the demand for generative AI continues to grow, developers and enterprises seek more flexible, cost-effective, and powerful accelerators to meet their needs. Today, we are thrilled to announce the availability of G6e instances powered by NVIDIA’s L40S Tensor Core GPUs on Amazon SageMaker. You will have the option to

Read More
Shared by AWS Machine Learning November 23, 2024

Accelerating Mixtral MoE fine-tuning on Amazon SageMaker with QLoRA

Favorite Companies across various scales and industries are using large language models (LLMs) to develop generative AI applications that provide innovative experiences for customers and employees. However, building or fine-tuning these pre-trained LLMs on extensive datasets demands substantial computational resources and engineering effort. With the increase in sizes of these

Read More
Shared by AWS Machine Learning November 23, 2024

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker

Favorite As generative AI models advance in creating multimedia content, the difference between good and great output often lies in the details that only human feedback can capture. Audio and video segmentation provides a structured way to gather this detailed feedback, allowing models to learn through reinforcement learning from human

Read More
Shared by AWS Machine Learning November 22, 2024