Open source observability for AWS Inferentia nodes within Amazon EKS clusters

Favorite Recent developments in machine learning (ML) have led to increasingly large models, some of which require hundreds of billions of parameters. Although they are more powerful, training and inference on those models require significant computational resources. Despite the availability of advanced distributed training libraries, it’s common for training and

Read More
Shared by AWS Machine Learning April 18, 2024

Uncover hidden connections in unstructured financial data with Amazon Bedrock and Amazon Neptune

Favorite In asset management, portfolio managers need to closely monitor companies in their investment universe to identify risks and opportunities, and guide investment decisions. Tracking direct events like earnings reports or credit downgrades is straightforward—you can set up alerts to notify managers of news containing company names. However, detecting second

Read More
Shared by AWS Machine Learning April 18, 2024

A secure approach to generative AI with AWS

Favorite Generative artificial intelligence (AI) is transforming the customer experience in industries across the globe. Customers are building generative AI applications using large language models (LLMs) and other foundation models (FMs), which enhance customer experiences, transform operations, improve employee productivity, and create new revenue channels. FMs and the applications built

Read More
Shared by AWS Machine Learning April 17, 2024

Manage your Amazon Lex bot via AWS CloudFormation templates

Favorite Amazon Lex is a fully managed artificial intelligence (AI) service with advanced natural language models to design, build, test, and deploy conversational interfaces in applications. It employs advanced deep learning technologies to understand user input, enabling developers to create chatbots, virtual assistants, and other applications that can interact with

Read More
Shared by AWS Machine Learning April 17, 2024

Distributed training and efficient scaling with the Amazon SageMaker Model Parallel and Data Parallel Libraries

Favorite There has been tremendous progress in the field of distributed deep learning for large language models (LLMs), especially after the release of ChatGPT in December 2022. LLMs continue to grow in size with billions or even trillions of parameters, and they often won’t fit into a single accelerator device

Read More
Shared by AWS Machine Learning April 17, 2024

Explore data with ease: Use SQL and Text-to-SQL in Amazon SageMaker Studio JupyterLab notebooks

Favorite Amazon SageMaker Studio provides a fully managed solution for data scientists to interactively build, train, and deploy machine learning (ML) models. In the process of working on their ML tasks, data scientists typically start their workflow by discovering relevant data sources and connecting to them. They then use SQL

Read More
Shared by AWS Machine Learning April 17, 2024

AWS at NVIDIA GTC 2024: Accelerate innovation with generative AI on AWS

Favorite AWS was delighted to present to and connect with over 18,000 in-person and 267,000 virtual attendees at NVIDIA GTC, a global artificial intelligence (AI) conference that took place March 2024 in San Jose, California, returning to a hybrid, in-person experience for the first time since 2019. AWS has had

Read More
Shared by AWS Machine Learning April 12, 2024

Cost-effective document classification using the Amazon Titan Multimodal Embeddings Model

Favorite Organizations across industries want to categorize and extract insights from high volumes of documents of different formats. Manually processing these documents to classify and extract information remains expensive, error prone, and difficult to scale. Advances in generative artificial intelligence (AI) have given rise to intelligent document processing (IDP) solutions

Read More
Shared by AWS Machine Learning April 12, 2024

Build an active learning pipeline for automatic annotation of images with AWS services

Favorite This blog post is co-written with Caroline Chung from Veoneer. Veoneer is a global automotive electronics company and a world leader in automotive electronic safety systems. They offer best-in-class restraint control systems and have delivered over 1 billion electronic control units and crash sensors to car manufacturers globally. The

Read More
Shared by AWS Machine Learning April 11, 2024

Knowledge Bases for Amazon Bedrock now supports custom prompts for the RetrieveAndGenerate API and configuration of the maximum number of retrieved results

Favorite With Knowledge Bases for Amazon Bedrock, you can securely connect foundation models (FMs) in Amazon Bedrock to your company data for Retrieval Augmented Generation (RAG). Access to additional data helps the model generate more relevant, context-specific, and accurate responses without retraining the FMs. In this post, we discuss two

Read More
Shared by AWS Machine Learning April 10, 2024