Host concurrent LLMs with LoRAX

Businesses are increasingly seeking domain-adapted and specialized foundation models (FMs) to meet specific needs in areas such as document summarization, industry-specific adaptations, and technical code generation and advisory. The increased usage of generative AI models has offered tailored experiences with minimal technical expertise, and organizations are increasingly using these

Shared by AWS Machine Learning April 17, 2025

Automate Amazon EKS troubleshooting using an Amazon Bedrock agentic workflow

As organizations scale their Amazon Elastic Kubernetes Service (Amazon EKS) deployments, platform administrators face increasing challenges in efficiently managing multi-tenant clusters. Tasks such as investigating pod failures, addressing resource constraints, and resolving misconfiguration can consume significant time and effort. Instead of spending valuable engineering hours manually parsing logs, tracking

Shared by AWS Machine Learning April 17, 2025

Elevate business productivity with Amazon Q and Amazon Connect

Modern banking faces dual challenges: delivering rapid loan processing while maintaining robust security against sophisticated fraud. Amazon Q Business provides AI-driven analysis of regulatory requirements and lending patterns. Additionally, you can now report fraud from the same interface with a custom plugin capability that can integrate with Amazon Connect.

Shared by AWS Machine Learning April 16, 2025

Optimizing Mixtral 8x7B on Amazon SageMaker with AWS Inferentia2

Organizations are constantly seeking ways to harness the power of advanced large language models (LLMs) to enable a wide range of applications such as text generation, summarization, question answering, and many others. As these models grow more powerful and capable, deploying them in production environments while optimizing performance and cost-efficiency

Shared by AWS Machine Learning April 16, 2025