Efficient continual pre-training LLMs for financial domains

Favorite Large language models (LLMs) are generally trained on large publicly available datasets that are domain agnostic. For example, Meta’s Llama models are trained on datasets such as CommonCrawl, C4, Wikipedia, and ArXiv. These datasets encompass a broad range of topics and domains. Although the resulting models yield amazingly good

Read More
Shared by AWS Machine Learning March 29, 2024

Advanced RAG patterns on Amazon SageMaker

Favorite Today, customers of all industries—whether it’s financial services, healthcare and life sciences, travel and hospitality, media and entertainment, telecommunications, software as a service (SaaS), and even proprietary model providers—are using large language models (LLMs) to build applications like question and answering (QnA) chatbots, search engines, and knowledge bases. These

Read More
Shared by AWS Machine Learning March 29, 2024

Achieve DevOps maturity with BMC AMI zAdviser Enterprise and Amazon Bedrock

Favorite In software engineering, there is a direct correlation between team performance and building robust, stable applications. The data community aims to adopt the rigorous engineering principles commonly used in software development into their own practices, which includes systematic approaches to design, development, testing, and maintenance. This requires carefully combining

Read More
Shared by AWS Machine Learning March 28, 2024

Build a receipt and invoice processing pipeline with Amazon Textract

Favorite In today’s business landscape, organizations are constantly seeking ways to optimize their financial processes, enhance efficiency, and drive cost savings. One area that holds significant potential for improvement is accounts payable. On a high level, the accounts payable process includes receiving and scanning invoices, extraction of the relevant data

Read More
Shared by AWS Machine Learning March 27, 2024

Best practices for building secure applications with Amazon Transcribe

Favorite Amazon Transcribe is an AWS service that allows customers to convert speech to text in either batch or streaming mode. It uses machine learning–powered automatic speech recognition (ASR), automatic language identification, and post-processing technologies. Amazon Transcribe can be used for transcription of customer care calls, multiparty conference calls, and

Read More
Shared by AWS Machine Learning March 26, 2024

Enhance performance of generative language models with self-consistency prompting on Amazon Bedrock

Favorite Generative language models have proven remarkably skillful at solving logical and analytical natural language processing (NLP) tasks. Furthermore, the use of prompt engineering can notably enhance their performance. For example, chain-of-thought (CoT) is known to improve a model’s capacity for complex multi-step problems. To additionally boost accuracy on tasks

Read More
Shared by AWS Machine Learning March 19, 2024

Unlock the potential of generative AI in industrial operations

Favorite In the evolving landscape of manufacturing, the transformative power of AI and machine learning (ML) is evident, driving a digital revolution that streamlines operations and boosts productivity. However, this progress introduces unique challenges for enterprises navigating data-driven solutions. Industrial facilities grapple with vast volumes of unstructured data, sourced from

Read More
Shared by AWS Machine Learning March 19, 2024

Fine-tune Code Llama on Amazon SageMaker JumpStart

Favorite Today, we are excited to announce the capability to fine-tune Code Llama models by Meta using Amazon SageMaker JumpStart. The Code Llama family of large language models (LLMs) is a collection of pre-trained and fine-tuned code generation models ranging in scale from 7 billion to 70 billion parameters. Fine-tuned

Read More
Shared by AWS Machine Learning March 18, 2024