AWS Unveils Sweeping Updates to SageMaker, Amazon Q, and Bedrock, Plus New Large Language Models

Reese Morgan

Reese Morgan

December 04, 2024 · 4 min read
AWS Unveils Sweeping Updates to SageMaker, Amazon Q, and Bedrock, Plus New Large Language Models

AWS is making a significant push to integrate generative AI tools into every aspect of application development, as evident from the slew of updates and announcements made by CEO Matt Garman at the annual re:Invent conference. The company is leaving no stone unturned to make its offerings more attractive to developers, with a focus on simplifying AI and ML workflows.

One of the major announcements was the launch of SageMaker Unified Studio, a new service that combines SQL analytics, data processing, AI development, data streaming, business intelligence, and search analytics. This consolidated platform is currently in preview and aims to provide a unified experience for data analysts and data scientists. Additionally, SageMaker Lakehouse, an Apache Iceberg-compatible lakehouse, has been made generally available.

Amazon Q, the company's answer to Microsoft's GPT-driven Copilot generative AI assistant, has received significant updates. The new capabilities of Q Developer include automating code reviews, unit tests, and generating documentation, all of which are designed to ease developers' workloads and help them finish their development tasks faster. Furthermore, AWS has unveiled several code translation capabilities for Q in preview, including the ability to modernize .Net apps from Windows to Linux, mainframe code modernization, and the ability to help migrate VMware workloads.

Another key area of focus for AWS is its proprietary platform for building generative AI models and applications, Amazon Bedrock. The company has announced Amazon Bedrock Model Distillation, a managed service currently in preview, which is designed to help enterprises bring down their cost of running large language models (LLMs). Model Distillation is the process of inferring specialized knowledge from a larger LLM into a smaller LLM for a specific use case, making it cheaper and faster to run. The service works by generating responses from teacher models and fine-tunes a student model, with the added benefit of proprietary data synthesis.

In addition to Model Distillation, AWS has also added Automated Reasoning Checks to Amazon Bedrock Guardrails, a capability currently in preview that uses mathematical, logic-based algorithmic verification and reasoning processes to verify the information generated by a model. This feature is designed to prevent factual errors from hallucinations. Furthermore, the company has added support for multi-agent collaboration support inside Amazon Bedrock Agents, currently in preview.

AWS has also released a new range of large language models, dubbed Nova, which the company claims are either at par or better than rival models, especially in terms of cost. The Nova family of models includes Micro, Lite, Pro, and Premier, with all models being generally available except Premier, which is expected to be made generally available by March. The company has also announced plans to release two new models in the coming year, Nova Speech to Speech and Nova Any to Any.

In addition to the software updates, AWS has also showcased its new chip, Trainium2, designed to boost support for generative AI workloads. Trainium2-powered EC2 instances have been generally available, offering four times faster performance, four times the memory bandwidth, and three times more memory capability than its previous generation powered by Trainium1.

The implications of these updates are significant, as they demonstrate AWS's commitment to making AI and ML more accessible and easier to use for developers. With the company's focus on simplifying workflows and reducing costs, it's likely that we'll see increased adoption of generative AI tools across various industries. As the AI landscape continues to evolve, it will be interesting to see how AWS's new offerings shape the market and influence the development of new applications and services.

Similiar Posts

Copyright © 2024 Starfolk. All rights reserved.