Smart scaling
Smart scaling uses machine learning to provide predictions based on historical workload patterns. By analyzing the data on the Administering > Workload patterns page, you can learn how to optimize the usage of your system resources.
You can obtain more information about the following system resources:
- Historical and Predicted NZP (Netezza Performance Metric)
- Historical or Suggested Contour Profile
- Confidence score
Seed models
Seed models are pretrained models that are uploaded on your temporary workspace. A prediction API uses these models for the duration of a training grace period (7 days or more), after the system is deployed, starting from after day 1. At this stage, the confidence score is low because the models do not reflect your workloads yet.
After the training grace period is over, the model is retrained with the datasets that were generated from the collected workload data. The prediction API runs inference based on the pretrained models. Then, the confidence score is updated based on the accuracy of the prediction, and system scheduler triggers the training model task daily for 2 weeks, and then on a weekly basis.
Graph elements
Performance property
Use the Performance property dropdown to navigate between different charts. The default Workload Utilization graph visualizes multiple data points that are related to your system resources.
When you are looking at the graph overall, use the legend to identify the contour profile.
Time interval
Use the Time interval dropdown to select different time intervals.
By default, the current date stays the same. When you select between the daily, weekly, or monthly view, the date is aggregated over time accordingly.
A line in the middle of the graph shows the cut off between historical and predicted or suggested data.
Predicted data is only displayed when the system has been collecting historical data for at least 7 days. Until then, the graph displays a data point for each day. After 7 days, a prediction for an extra two weeks' worth of information appears.
Historical and Predicted NZP
The purple and pink lines represent Historical NZP and Predicted NZP.
NZP (Netezza Performance Metric) is a measure of system utilization.
High level NZP indicates elevated levels of utilization. Low NZP indicates decreased resource usage.
The historical and predicted NZP lines are measured on the scale
to the left of the chart between 0% and 100%.
Historical and Suggested contour profile
Historical Contour Profile and Suggested Contour Profile are visualized by the blue bars.
The contours that are displayed to the left of the current date line are historical contours. They represent the profiles that are used on each time interval.
The bars that are displayed on the right side of the current date line are suggested
contours for each time interval.
Contour profiles are suggested based on the predicted NZP during that time interval. If NZP exceeds 80%, it is suggested that contour profile is scaled up the next closest contour to increase compute and improve performance for heavy workload periods. When the algorithm predicts NZP at or below 20% for a time interval, the graph suggests to scale down to a lower profile.
Confidence score
Confidence score reflects the accuracy of the predictions that are provided based on your system historical workload patterns. By analyzing the confidence score, you can choose how to use your system resources.
Smart scaling example
-
Go to Administering > Workload patterns.
-
Review the smart scaling utilization graph.
In this example, as a database administrator, you want to to get a deeper understanding of your system utilization over time.In this scenario, it turns out that every Sunday, your system utilization drops below average as compared to other days of the week. Basing on this information, you can update your scaling schedule to match anticipated peaks and dips in workloads.
-
Configure the scheduler to scale down the instance for every Sunday as described in Editing scheduled scaling.
-
Configure the scheduler to scale up back starting every Monday as described in Editing scheduled scaling.
As a result, you will see performance improvement and reduction in resource waste and costs in the future.