In the first post of this series, we introduced a comprehensive evaluation framework for Amazon Q Business, a fully managed Retrieval Augmented Generation (RAG) solution that uses your company’s proprietary data without the complexity of managing large language models (LLMs). The first post focused on selecting appropriate use cases,
Shared by AWS Machine Learning April 23, 2025
Today, we’re excited to announce the launch of Amazon SageMaker Large Model Inference (LMI) container v15, powered by vLLM 0.8.4 with support for the vLLM V1 engine. This version now supports the latest open-source models, such as Meta’s Llama 4 models Scout and Maverick, Google’s Gemma 3, Alibaba’s Qwen,
Shared by AWS Machine Learning April 23, 2025
Maintainer Month returns this May, and the Open Source Initiative (OSI) is proud to join GitHub and a global community of contributors in honoring the individuals who steward and sustain Open Source projects. In 2025, Maintainer Month enters its fourth year with a clear and urgent theme: Securing Open
Shared by voicesofopensource April 22, 2025
Large language models (LLMs) have become integral to numerous applications across industries, ranging from enhanced customer interactions to automated business processes. Deploying these models in real-world scenarios presents significant challenges, particularly in ensuring accuracy, fairness, relevance, and mitigating hallucinations. Thorough evaluation of the performance and outputs of these models
Shared by AWS Machine Learning April 22, 2025
This post is co-written with Vikram Gundeti and Nate Folkert from Foursquare. Personalization is key to creating memorable experiences. Whether it’s recommending the perfect movie or suggesting a new restaurant, tailoring suggestions to individual preferences can make all the difference. But when it comes to food and activities, there’s
Shared by AWS Machine Learning April 22, 2025
Yuewen Group is a global leader in online literature and IP operations. Through its overseas platform WebNovel, it has attracted about 260 million users in over 200 countries and regions, promoting Chinese web literature globally. The company also adapts quality web novels into films, animations for international markets, expanding
Shared by AWS Machine Learning April 22, 2025
Today, we’re opening applications for our second Google for Startups AI Academy: American Infrastructure cohort. This six-month program provides tailored technical suppor… View Original Source (blog.google/technology/ai/) Here.
Retrieval Augmented Generation (RAG) enhances AI responses by combining the generative AI model’s capabilities with information from external data sources, rather than relying solely on the model’s built-in knowledge. In this post, we showcase the custom data connector capability in Amazon Bedrock Knowledge Bases that makes it straightforward to
Shared by AWS Machine Learning April 19, 2025
AI agents are revolutionizing how businesses enhance their operational capabilities and enterprise applications. By enabling natural language interactions, these agents provide customers with a streamlined, personalized experience. Amazon Bedrock Agents uses the capabilities of foundation models (FMs), combining them with APIs and data to process user requests, gather information,
Shared by AWS Machine Learning April 19, 2025
This post is a joint collaboration between Salesforce and AWS and is being cross-published on both the Salesforce Engineering Blog and the AWS Machine Learning Blog. The Salesforce AI Model Serving team is working to push the boundaries of natural language processing and AI capabilities for enterprise applications. Their
Shared by AWS Machine Learning April 18, 2025