Train your solution by using historical data to predict numeric outputs, such as a temperature or a stock price. For example, you can use regression to estimate the time it takes to resolve an incident or a case.

Before you begin

Important: Support for new regression solutions is deprecated in the Yokohama release. You will still be able to edit and train any existing solutions, but you won't be able to create new ones.

Role required: ml_admin or admin

About this task

Regression solutions enable you to predict a point estimate and prediction interval. The resulting model delivers the following statistics:
  • Mean Absolute Error (MAE), which measures the mean deviation of a predicted value from the actual value. This metric is useful as it's easy to understand as its scale is the same as that of its target. However, MAE is unbounded, making it difficult to compare across models.
  • Symmetric Mean Absolute Percentage Error (SMAPE) is a percentage value of the deviation from the predicted to the actual. SMAPE is a bounded version of MAE except that it has a value range between 0 and 100. The lower the SMAPE value, the better the model accuracy.
  • Range Accuracy is the percentage of actual values between a predicted range. In other words, it's the range between the upper and lower bounds of the prediction. For example, if four out of five actuals lie within the predicted range, the range accuracy is 80%.
  • Average Interval Width is the difference between the upper and lower bounds of the prediction. This metric explains how informative the interval is. The smaller the average width, the better the model

When making predictions, regression also enables you to specify a confidence level for the prediction interval (range).

In this example procedure, you create and train a regression solution definition to predict the amount of time it takes to restore a cloud database.

Procedure

  1. Navigate to All > Predictive Intelligence > Regression > Solution Definitions.
  2. On the Regression Definitions list, click New.
  3. On the Regression Definition form, configure these fields per the following guidance.
  4. Click the appropriate context menu option or button for your solution definition.
    OptionDescription
    Save or Save & Train Save your solution definition record so you can return to it later, or save and submit it for training.
    Submit or Submit & Train Create your solution definition record and submit it, or submit and train it.
  5. If you submitted the solution for training, click OK on the Training Activation window to confirm.

    The system schedules the solution for training with the nearest training service. The system sends you a notification when the training completes, including any errors that may have occurred in the training. Any other users can subscribe to the Predictive Intelligence Notifications category. When training completes, the system uploads the solution as an Attachment record.

What to do next

In this example scenario, you created an ML solution from your solution definition. The Solution Statistics, Test Solution, and Solution Definition tabs appear in the Related Links section of your ML solution.

On the Solution Statistics tab, review the Point Estimate and Range (prediction interval) statistics generated by your solution.

The prediction statistics for the solution you created and trained.

On the Test Solutions tab of your solution, you can test the prediction output for the records you used as input to the prediction by entering values for the input fields, such as the Source datacenter, Target datacenter, and Database size. You can also use the default prediction confidence level of 95, or enter a different level between 0 and 100. Using 95 as the value means that the system is 95% confident that the actual prediction falls within the prediction interval. Click the Run Test button to find the prediction output.

The values you need to enter in order to run a prediction output test.

After you run the test, the prediction output statistics appear. The Point Estimate on the screen is a single value at one point in time. For example, the database restore takes 134.47 seconds to complete. The Lower and Upper bounds on the screen signify a range accuracy value. For example, the database restore takes from 84.53 to 185.41 seconds to complete.

The test output values for the Point Estimate and Range Accuracy predictions.