November 25, 2024
May 7, 2024
November 25, 2024
May 7, 2024
Optimize compute, storage and data
Choose copilot or autopilot execution
Continuously improve with reinforcement learning
In environments where applications are not suitable for microservices architectures, rightsizing and in particular vertical scaling becomes a critical strategy to achieve cost-effective operations while meeting performance requirements. This approach involves choosing the right virtual machine type based on the CPU and memory resources required. Rightsizing is a known best practice for AWS EC2 VM cost optimization but is hard to implement in practice.
While AWS utilization data has not been disclosed publicly, analysis of the most recently released Azure VM usage dataset (available on Microsoft’s Github here) shows that VM users had an an average utilization of just 8.2%, with 72% of users having an average utilization of below 20% (see distribution below). A common pattern uncovered in the dataset was the selection of a small number of powerful, but oversized VMs.
Source: “Using Virtual Machine Size Recommendation Algorithms to Reduce Cloud Cost”, March 2023
Below is an example from a Sedai AWS EC2 customer environment of an underutilized instance with averaging just a few percent of CPU being used across a one month period:
Effective vertical scaling is also important as developers often default to overprovisioning AWS EC2 VM resources, opting for a simpler and quicker setup rather than conducting extensive testing across multiple instance types. This approach, while expedient, typically results in selecting VM configurations that exceed the application's actual requirements, leading to increased costs. The reluctance to engage in detailed testing stems from the time and complexity involved in evaluating each instance type's performance under different workloads. Consequently, developers lean towards a 'better safe than sorry' strategy. Although reducing the risk of underperformance this inefficiently raises cloud costs.
Many Virtual Machine workloads have bursty traffic patterns, especially for small databases, development environments, and low-traffic websites. For example, in the case below of a dev/test workload CPU utilization stays around 5% but surges a few times to the 30-40% range.
Given warm up periods may range from a few minutes to a few hours, horizontal scaling may not be viable. Finding suitable burstable instance types may be the preferred approach. In the absence of that, low average utilization will be achieved.
A high proportion of applications running on AWS EC2 run directly on virtual machines. These applications have not been replatformed to a microservice architecture such as Kubernetes (including AWS EC2 Kubernetes Service (AKS)), or serverless frameworks (AWS EC2 Functions). One key reason is that many of these applications do not benefit significantly from the horizontal scaling capabilities offered by microservices architectures. They may have architectural or design constraints that make such a transition complex or suboptimal, limiting their ability to efficiently use newer computing paradigms.
Vertical scaling is also complicated by the many types of VMs available. There are currently709 types of AWS EC2 Virtual Machine options offering varying Compute, Memory and other characteristics.
Asking an engineer to make the optimal choice across potentially hundreds or thousands of services can be challenging, especially if new code updates change the service's characteristics and then require a new determination of the right instance type.
Vertical scaling is particularly advantageous for applications that require high-performance levels from single instances or have dependencies that complicate distribution across multiple servers. By optimizing the configuration of AWS EC2 VMs to align closely with actual workload requirements, organizations can ensure that their applications perform optimally without incurring unnecessary costs from overprovisioning. This method allows for more precise control over resource allocation, leading to enhanced performance and reduced expenditures.
Sedai’s Automated Optimization utilizes advanced AI technology to deeply comprehend AWS EC2 VM configurations and their impact on application cost and performance. This results in AWS EC2 VMs that are optimally sized and configured to meet the specific needs of applications without incurring unnecessary costs or performance issues. Key benefits include:
Sedai’s Automated Optimization uses advanced AI that not only deeply understands AWS EC2 VM configurations and how they are impacting application cost and performance. This results in VMs that are optimally sized and configured to meet the specific needs of their applications without any excess cost or underperformance.
Our AI-driven platform continuously analyzes your AWS EC2 VMs to detect inefficiencies. It then autonomously implements optimizations, adjusting resources in real-time without requiring manual intervention.
The Sedai platform operates on a simple yet effective process: Discover, Recommend, Validate, Execute, and Track:
These capabilities form part of Sedai’s overall AWS EC2 VM optimization approach which can be seen below:
Some of the key elements above are:
Early adopters have seen significant improvements in both performance and cost efficiency. For instance, a technology company has identified over $75k of annual savings, a 34% saving, in its dev / test environments through rightsizing using Sedai’s optimization:
To gain insights on the state of your VM fleet you can scan it at a glance to see where applications are over or under provisioned as well as optimized based on Sedai’s findings. The example below only 12% of the apps have been optimized to date (shown as green).
The service is available now, with flexible pricing based on the scale of your AWS EC2 VM deployment. Request a demo to see how Sedai can help you rightsize your AWS EC2 VMs.
May 7, 2024
November 25, 2024
In environments where applications are not suitable for microservices architectures, rightsizing and in particular vertical scaling becomes a critical strategy to achieve cost-effective operations while meeting performance requirements. This approach involves choosing the right virtual machine type based on the CPU and memory resources required. Rightsizing is a known best practice for AWS EC2 VM cost optimization but is hard to implement in practice.
While AWS utilization data has not been disclosed publicly, analysis of the most recently released Azure VM usage dataset (available on Microsoft’s Github here) shows that VM users had an an average utilization of just 8.2%, with 72% of users having an average utilization of below 20% (see distribution below). A common pattern uncovered in the dataset was the selection of a small number of powerful, but oversized VMs.
Source: “Using Virtual Machine Size Recommendation Algorithms to Reduce Cloud Cost”, March 2023
Below is an example from a Sedai AWS EC2 customer environment of an underutilized instance with averaging just a few percent of CPU being used across a one month period:
Effective vertical scaling is also important as developers often default to overprovisioning AWS EC2 VM resources, opting for a simpler and quicker setup rather than conducting extensive testing across multiple instance types. This approach, while expedient, typically results in selecting VM configurations that exceed the application's actual requirements, leading to increased costs. The reluctance to engage in detailed testing stems from the time and complexity involved in evaluating each instance type's performance under different workloads. Consequently, developers lean towards a 'better safe than sorry' strategy. Although reducing the risk of underperformance this inefficiently raises cloud costs.
Many Virtual Machine workloads have bursty traffic patterns, especially for small databases, development environments, and low-traffic websites. For example, in the case below of a dev/test workload CPU utilization stays around 5% but surges a few times to the 30-40% range.
Given warm up periods may range from a few minutes to a few hours, horizontal scaling may not be viable. Finding suitable burstable instance types may be the preferred approach. In the absence of that, low average utilization will be achieved.
A high proportion of applications running on AWS EC2 run directly on virtual machines. These applications have not been replatformed to a microservice architecture such as Kubernetes (including AWS EC2 Kubernetes Service (AKS)), or serverless frameworks (AWS EC2 Functions). One key reason is that many of these applications do not benefit significantly from the horizontal scaling capabilities offered by microservices architectures. They may have architectural or design constraints that make such a transition complex or suboptimal, limiting their ability to efficiently use newer computing paradigms.
Vertical scaling is also complicated by the many types of VMs available. There are currently709 types of AWS EC2 Virtual Machine options offering varying Compute, Memory and other characteristics.
Asking an engineer to make the optimal choice across potentially hundreds or thousands of services can be challenging, especially if new code updates change the service's characteristics and then require a new determination of the right instance type.
Vertical scaling is particularly advantageous for applications that require high-performance levels from single instances or have dependencies that complicate distribution across multiple servers. By optimizing the configuration of AWS EC2 VMs to align closely with actual workload requirements, organizations can ensure that their applications perform optimally without incurring unnecessary costs from overprovisioning. This method allows for more precise control over resource allocation, leading to enhanced performance and reduced expenditures.
Sedai’s Automated Optimization utilizes advanced AI technology to deeply comprehend AWS EC2 VM configurations and their impact on application cost and performance. This results in AWS EC2 VMs that are optimally sized and configured to meet the specific needs of applications without incurring unnecessary costs or performance issues. Key benefits include:
Sedai’s Automated Optimization uses advanced AI that not only deeply understands AWS EC2 VM configurations and how they are impacting application cost and performance. This results in VMs that are optimally sized and configured to meet the specific needs of their applications without any excess cost or underperformance.
Our AI-driven platform continuously analyzes your AWS EC2 VMs to detect inefficiencies. It then autonomously implements optimizations, adjusting resources in real-time without requiring manual intervention.
The Sedai platform operates on a simple yet effective process: Discover, Recommend, Validate, Execute, and Track:
These capabilities form part of Sedai’s overall AWS EC2 VM optimization approach which can be seen below:
Some of the key elements above are:
Early adopters have seen significant improvements in both performance and cost efficiency. For instance, a technology company has identified over $75k of annual savings, a 34% saving, in its dev / test environments through rightsizing using Sedai’s optimization:
To gain insights on the state of your VM fleet you can scan it at a glance to see where applications are over or under provisioned as well as optimized based on Sedai’s findings. The example below only 12% of the apps have been optimized to date (shown as green).
The service is available now, with flexible pricing based on the scale of your AWS EC2 VM deployment. Request a demo to see how Sedai can help you rightsize your AWS EC2 VMs.