Host concurrent LLMs with LoRAX

Favorite Businesses are increasingly seeking domain-adapted and specialized foundation models (FMs) to meet specific needs in areas such as document summarization, industry-specific adaptations, and technical code generation and advisory. The increased usage of generative AI models has offered tailored experiences with minimal technical expertise, and organizations are increasingly using these

Read More
Shared by AWS Machine Learning April 17, 2025

Automate Amazon EKS troubleshooting using an Amazon Bedrock agentic workflow

Favorite As organizations scale their Amazon Elastic Kubernetes Service (Amazon EKS) deployments, platform administrators face increasing challenges in efficiently managing multi-tenant clusters. Tasks such as investigating pod failures, addressing resource constraints, and resolving misconfiguration can consume significant time and effort. Instead of spending valuable engineering hours manually parsing logs, tracking

Read More
Shared by AWS Machine Learning April 17, 2025

Elevate business productivity with Amazon Q and Amazon Connect

Favorite Modern banking faces dual challenges: delivering rapid loan processing while maintaining robust security against sophisticated fraud. Amazon Q Business provides AI-driven analysis of regulatory requirements and lending patterns. Additionally, you can now report fraud from the same interface with a custom plugin capability that can integrate with Amazon Connect.

Read More
Shared by AWS Machine Learning April 16, 2025

Optimizing Mixtral 8x7B on Amazon SageMaker with AWS Inferentia2

Favorite Organizations are constantly seeking ways to harness the power of advanced large language models (LLMs) to enable a wide range of applications such as text generation, summarizationquestion answering, and many others. As these models grow more powerful and capable, deploying them in production environments while optimizing performance and cost-efficiency

Read More
Shared by AWS Machine Learning April 16, 2025

Build multi-agent systems with LangGraph and Amazon Bedrock

Favorite Large language models (LLMs) have raised the bar for human-computer interaction where the expectation from users is that they can communicate with their applications through natural language. Beyond simple language understanding, real-world applications require managing complex workflows, connecting to external data, and coordinating multiple AI capabilities. Imagine scheduling a

Read More
Shared by AWS Machine Learning April 15, 2025

Racing beyond DeepRacer: Debut of the AWS LLM League

Favorite The AWS DeepRacer League is the world’s first autonomous racing league, open to anyone. Announced at re:Invent 2018, it puts machine learning in the hands of every developer through the fun and excitement of developing and racing self-driving remote control cars. Through the past 7 years, over 560 thousand

Read More
Shared by AWS Machine Learning April 12, 2025

Building an AIOps chatbot with Amazon Q Business custom plugins

Favorite Many organizations rely on multiple third-party applications and services for different aspects of their operations, such as scheduling, HR management, financial data, customer relationship management (CRM) systems, and more. However, these systems often exist in silos, requiring users to manually navigate different interfaces, switch between environments, and perform repetitive

Read More
Shared by AWS Machine Learning April 12, 2025