Efficiently train models with large sequence lengths using Amazon SageMaker model parallel

Large language models (LLMs) have witnessed an unprecedented surge in popularity, with customers increasingly using publicly available models such as Llama, Stable Diffusion, and Mistral. Across diverse industries—including healthcare, finance, and marketing—organizations are now engaged in pre-training and fine-tuning these increasingly larger LLMs, which often boast billions of parameters …

Shared by AWS Machine Learning, November 28, 2024
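As a companion to the excerpt above: the core idea behind training with very long sequences is to shard the sequence dimension across devices (often called context parallelism). The sketch below shows only that sharding step, with invented helper names; it is not SageMaker model parallel's actual API.

```python
import math

def shard_sequence(tokens: list, world_size: int) -> list[list]:
    """Split a long token sequence into contiguous chunks, one per rank.

    In context parallelism each device holds and processes one chunk of
    the sequence dimension; attention across chunks is then handled by
    collective communication (not shown here).
    """
    per = math.ceil(len(tokens) / world_size)  # chunk size per rank
    return [tokens[i * per:(i + 1) * per] for i in range(world_size)]

chunks = shard_sequence(list(range(10)), world_size=4)
# per = ceil(10/4) = 3, so chunk sizes are 3, 3, 3, 1
```

Contiguous chunks keep positional ordering intact, which is why real implementations pair this sharding with communication steps that exchange key/value activations between ranks.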

Embodied AI Chess with Amazon Bedrock

Generative AI continues to transform numerous industries and activities, with one such application being the enhancement of chess, a traditional human game, with sophisticated AI and large language models (LLMs). Using the Custom Model Import feature in Amazon Bedrock, you can now create engaging matches between foundation models (FMs) …

Shared by AWS Machine Learning, November 28, 2024

Search enterprise data assets using LLMs backed by knowledge graphs

Enterprises face challenges in accessing data assets scattered across various sources because of the increasing complexity of managing vast amounts of data. Traditional search methods often fail to provide comprehensive and contextual results, particularly for unstructured data or complex queries. Search solutions in modern big data management must …

Shared by AWS Machine Learning, November 28, 2024
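To illustrate the idea behind knowledge-graph-backed search: a graph links entities to one another and to data assets, so a query about one entity can surface assets reachable through related entities. The entities, edges, and asset names below are invented for illustration; the post's actual architecture (LLM entity extraction, graph store, ranking) is not shown.

```python
from collections import deque

# Hypothetical enterprise knowledge graph: entity -> related entities/assets
graph = {
    "customer": ["orders", "crm_db"],
    "orders": ["sales_warehouse"],
}
assets = {"crm_db", "sales_warehouse"}  # nodes that are searchable data assets

def search(entity: str, max_hops: int = 2) -> set[str]:
    """Breadth-first expansion from the query entity, collecting any
    data assets reachable within max_hops edges."""
    found, seen, frontier = set(), {entity}, deque([(entity, 0)])
    while frontier:
        node, hops = frontier.popleft()
        if node in assets:
            found.add(node)
        if hops < max_hops:
            for nbr in graph.get(node, []):
                if nbr not in seen:
                    seen.add(nbr)
                    frontier.append((nbr, hops + 1))
    return found

print(search("customer"))  # both assets are reachable within 2 hops
```

An LLM's role in such a system is typically to map a natural-language query to the starting entities, after which graph traversal supplies the contextual results that keyword search misses.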

Connect SharePoint Online to Amazon Q Business using OAuth 2.0 ROPC flow authentication

Enterprises face significant challenges accessing and utilizing the vast amounts of information scattered across an organization's various systems. What if you could simply ask a question and get instant, accurate answers from your company's entire knowledge base, while accounting for an individual user's data access levels? Amazon Q Business is …

Shared by AWS Machine Learning, November 27, 2024
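For context on the authentication flow named above: OAuth 2.0 ROPC ("Resource Owner Password Credentials", RFC 6749 section 4.3) exchanges a user's credentials directly for an access token via a form-encoded POST to the identity provider's token endpoint. The sketch below only builds that request for a Microsoft Entra ID tenant; the tenant, client ID, and credentials are placeholders, and the actual Amazon Q Business connector configuration is not shown.

```python
from urllib.parse import urlencode

def build_ropc_token_request(tenant_id: str, client_id: str,
                             username: str, password: str,
                             scope: str = "https://graph.microsoft.com/.default"):
    """Return the token endpoint URL and form body for an ROPC request."""
    token_url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    form = {
        "grant_type": "password",  # the defining field of the ROPC flow
        "client_id": client_id,
        "username": username,
        "password": password,
        "scope": scope,
    }
    return token_url, urlencode(form)

url, body = build_ropc_token_request("contoso-tenant", "app-client-id",
                                     "user@contoso.com", "s3cret")
print("grant_type=password" in body)  # True
```

Because ROPC handles raw user credentials, it is generally reserved for trusted, non-interactive integrations like this connector scenario, where browser-based flows are impractical.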

Build a read-through semantic cache with Amazon OpenSearch Serverless and Amazon Bedrock

In the field of generative AI, latency and cost pose significant challenges. The commonly used large language models (LLMs) often process text sequentially, predicting one token at a time in an autoregressive manner. This approach can introduce delays, resulting in less-than-ideal user experiences. Additionally, the growing demand for AI-powered …

Shared by AWS Machine Learning, November 27, 2024
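To make the read-through pattern in the title concrete: a semantic cache embeds each incoming query, looks for a sufficiently similar past query, and only falls through to the LLM on a miss, caching the new answer. The sketch below uses a toy bag-of-words embedding and an in-memory list; the actual post uses Amazon Bedrock embeddings and Amazon OpenSearch Serverless vector search, and all names here are invented.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy bag-of-words embedding; a real system would call an
    embedding model (for example, via Amazon Bedrock) instead."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class ReadThroughSemanticCache:
    """Serve similar queries from cache; read through to the model on a miss."""

    def __init__(self, model, threshold: float = 0.8):
        self.model = model          # callable: query -> answer
        self.threshold = threshold  # minimum similarity to count as a hit
        self.entries = []           # list of (embedding, answer) pairs

    def query(self, text: str) -> tuple[str, bool]:
        q = embed(text)
        best = max(self.entries, key=lambda e: cosine(q, e[0]), default=None)
        if best and cosine(q, best[0]) >= self.threshold:
            return best[1], True    # cache hit: no LLM call, no token latency
        answer = self.model(text)   # cache miss: read through to the model
        self.entries.append((q, answer))
        return answer, False

cache = ReadThroughSemanticCache(model=lambda q: f"answer({q})")
print(cache.query("what is the capital of france"))    # miss: calls the model
print(cache.query("what is the capital of france ?"))  # near-duplicate: hit
```

The similarity threshold is the key tuning knob: too low and unrelated queries return stale answers, too high and paraphrases miss the cache, forfeiting the latency and cost savings.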