Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances

When deploying deep learning models at scale, it is crucial to effectively utilize the underlying hardware to maximize performance and cost benefits. For production workloads requiring high throughput and low latency, the selection of the Amazon Elastic Compute Cloud (EC2) instance, model serving stack, and deployment architecture is very important.
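As a rough illustration of the serving stack the post refers to, the sketch below compiles a PyTorch model with the AWS Neuron SDK (torch_neuronx, the compilation path used on Inf2 instances) and exposes it through a FastAPI endpoint. The model choice (a torchvision ResNet-50), input shape, request schema, and endpoint path are illustrative assumptions, not details taken from the article.

```python
# Minimal sketch: compile a PyTorch model for a NeuronCore and serve it with FastAPI.
# Assumes an Inf2 instance with the Neuron SDK (torch-neuronx) installed.
import torch
import torch_neuronx
import torchvision.models as models
from fastapi import FastAPI
from pydantic import BaseModel

# Compile once at startup; the traced artifact executes on a NeuronCore.
model = models.resnet50(weights=None).eval()          # illustrative model only
example_input = torch.rand(1, 3, 224, 224)
neuron_model = torch_neuronx.trace(model, example_input)

app = FastAPI()

class PredictRequest(BaseModel):
    # Flattened image tensor of shape (1, 3, 224, 224); illustrative schema.
    data: list[float]

@app.post("/predict")
def predict(req: PredictRequest):
    x = torch.tensor(req.data).reshape(1, 3, 224, 224)
    with torch.no_grad():
        out = neuron_model(x)
    return {"top_class": int(out.argmax(dim=1))}

# Run with: uvicorn app:app --host 0.0.0.0 --port 8000
```

In line with the post's theme of maximizing accelerator utilization, one common pattern is to run one serving process (or uvicorn worker) per NeuronCore so that all cores on the Inf1 or Inf2 instance stay busy; the exact process-to-core mapping here is an assumption, not a prescription from the article.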

Shared by AWS Machine Learning on July 25, 2023