AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency

The size of machine learning (ML) models, including large language models (LLMs) and foundation models (FMs), is growing fast year over year, and these models need faster and more powerful accelerators, especially for generative AI. AWS Inferentia2 was designed from the ground up to deliver higher performance while lowering the cost of LLMs

Shared by AWS Machine Learning June 14, 2023

Reinventing the data experience: Use generative AI and modern data architecture to unlock insights

Implementing a modern data architecture provides a scalable method to integrate data from disparate sources. By organizing data by business domain instead of by infrastructure, each domain can choose tools that suit its needs. Organizations can maximize the value of their modern data architecture with generative AI solutions while innovating

Shared by AWS Machine Learning June 14, 2023

Fine-tune GPT-J using an Amazon SageMaker Hugging Face estimator and the model parallel library

GPT-J is an open-source 6-billion-parameter model released by EleutherAI. The model is trained on the Pile and can perform a range of language processing tasks. It supports a wide variety of use cases, including text classification, token classification, text generation, question answering, entity extraction, summarization, sentiment analysis,

Shared by AWS Machine Learning June 13, 2023

Build custom chatbot applications using OpenChatkit models on Amazon SageMaker

Open-source large language models (LLMs) have become popular, giving researchers, developers, and organizations access to these models to foster innovation and experimentation. This encourages the open-source community to collaborate on the development and improvement of LLMs. Open-source LLMs provide transparency into the model architecture, training process, and training

Shared by AWS Machine Learning June 13, 2023