Demand model metrics
Demand model metrics highlight specific results from running the model.

The Model Summary table provides information on the fitted model. Note that some fields are not included for Life Cycle models.
Many of these metrics are data dependent, and their values alone cannot always be used as the sole basis for comparison. For example, an Out of Sample RMSE of 400 on one time series and 10,000 on another does not necessarily indicate which forecast is more accurate - it depends on the mean demand of each series.
Similarly, MAPE (Mean Absolute Percentage Error) indicates an In Sample or Out of Sample percentage error. A 10% MAPE on one time series and a 20% MAPE on another does not necessarily mean the first forecast is more accurate - the first time series may simply exhibit more regular patterns and be easier to forecast.
In general, the following guidelines apply:
- Lower values of these metrics indicate higher accuracy.
- There should not be a big difference between the In Sample and Out of Sample values of any of these metrics. If the Out of Sample RMSE is more than twice the In Sample RMSE, the model is likely overfitting. Try increasing the demand history or decreasing the number of causals used.
- Run the Naïve Forecasting or Simple Moving Average algorithms. Use their In Sample and Out of Sample errors as a baseline, and compare that baseline with the accuracy of more sophisticated (Machine Learning) algorithms to determine the level of improvement, as sketched below.
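As a rough illustration of this baseline check (the demand values, split point, and names below are invented, and this is not the application's internal code), the following sketch computes In Sample and Out of Sample RMSE for a naive last-value forecast and applies the overfitting rule of thumb from the guidelines above; the same comparison applies to a simple moving average baseline.

```python
# Illustrative baseline check: naive last-value forecast on a toy demand series.
import numpy as np

demand = np.array([120, 135, 150, 160, 155, 170, 180, 175, 190, 200, 210, 205], dtype=float)
k = 9                                        # first k points form the train (in-sample) part
train, test = demand[:k], demand[k:]

naive_in = np.r_[train[0], train[:-1]]       # in-sample: each point predicted by the previous one
naive_out = np.full(test.shape, train[-1])   # out-of-sample: repeat the last training value

def rmse(actual, forecast):
    return np.sqrt(np.mean((actual - forecast) ** 2))

in_rmse, out_rmse = rmse(train, naive_in), rmse(test, naive_out)
print(f"In Sample RMSE    : {in_rmse:.2f}")
print(f"Out of Sample RMSE: {out_rmse:.2f}")

# Rule of thumb from the guidelines above: a large gap suggests overfitting.
if out_rmse > 2 * in_rmse:
    print("Out of Sample RMSE > 2 x In Sample RMSE: possible overfitting")
```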
Time series for which the demand profile was generated.
Name assigned by the user for the demand model. The default is Automatic.
Indicates the time at which a demand model was fit for a particular time series.
Indicates the frequency level (time bucket) of the demand time series at which a model was fitted.
Number of data points in the historical data allocated for initial training using the slice ratio.
Start date detected for each SKU.
End date detected for each SKU.
Number of data points in the historical data used as test data points.
Number of future time periods for which forecasts are generated.
Key Error Metric Selected / Key Error Metric Applied
Key Error Metric Selected is the user-entered value of the error metric, while Key Error Metric Applied is the error metric actually used by the algorithm. In most cases, the values of these two fields are the same. However, when error metrics such as MPE, MAPE, or WMAPE go to infinity, the algorithm uses the default metric RMSE as the key error metric. This generally occurs when the time series includes one or more zero-demand points.
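A small illustration (toy numbers only, not product code) of why percentage-based metrics become unusable when a period has zero demand, prompting the fallback to RMSE:

```python
# Illustrative only: a single zero-demand period drives MAPE to infinity.
import numpy as np

actual   = np.array([100.0, 0.0, 120.0])   # the second period has zero demand
forecast = np.array([ 90.0, 10.0, 130.0])

with np.errstate(divide="ignore"):
    ape = np.abs((actual - forecast) / actual) * 100
print(ape)              # the zero-demand period yields an infinite percentage error
print(np.mean(ape))     # inf -> MAPE cannot serve as the key error metric

# A scale-based metric such as RMSE remains finite, which is why the algorithm
# falls back to it as the key error metric.
print(np.sqrt(np.mean((actual - forecast) ** 2)))   # 10.0
```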
Lead Time
The actual number of time bucket periods used to calculate lead time for out-of-sample MAPE.
Final algorithm chosen; one of the several algorithms supported by Demand Guru. For the intermittent algorithm, this field includes both the name of the algorithm and the algorithm method; for example, intermittent-auto or intermittent-croston.
For Life Cycle models, the value here is NA.
Name of the hyperparameters for the algorithm, if any, as a string. For example, Cost is a hyperparameter in the SVM Linear algorithm, with a value that can be set by a user.
For Life Cycle models, the value here is NA.
Optimal values of the hyperparameters for any algorithm.
For Life Cycle models, the value here is NA.
Let d_i represent the ith true demand point in a time series and f_i denote the corresponding forecast. Then e_i = d_i – f_i is called the forecast error.
The mean of the forecast errors over the train part of the time series is the In Sample Mean Error (InSampleME).
If there are n data points in a time series and the first k data points are in the train part, then InSampleME = (e_1 + e_2 + … + e_k) / k.
Train Root Mean Squared Error.
This value represents the square root of the mean of the squared errors taken over the train part of the time series.
This value represents the mean of absolute differences between true demand points and forecasts over the train part of the time series.
Training Mean Percentage Error.
This value represents the mean of percentage error between true demand points and forecasts over the train part of the time series.
Training Mean Absolute Percentage Error.
This value represents the mean of the absolute values of percentage error between true demand points and forecasts over the train part of the time series.
Training Weighted Mean Absolute Percentage Error.
This value represents the weighted mean of absolute values of percentage error between true demand points and forecasts over the train part of the time series.
For two periods P1 and P2, let the actual demand be D1 and D2, respectively, and let the corresponding forecasts be F1 and F2. If the absolute difference between demand and forecast is the same for both periods (that is, |D1 – F1| = |D2 – F2|), then the period with the higher demand contributes more towards the WMAPE value. If D1 > D2, then period P1 carries more weight than period P2, due to the weighted average in the WMAPE formula.
Note that this field is not displayed for Life Cycle models.
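For reference, here is a minimal sketch of these in-sample metrics on toy numbers (illustrative only; the same formulas apply to the Out of Sample metrics below, computed over the test part of the series instead):

```python
# Illustrative formulas for the in-sample error metrics described above.
import numpy as np

d = np.array([100.0, 400.0, 250.0, 300.0])   # true demand over the train part
f = np.array([110.0, 390.0, 260.0, 280.0])   # corresponding fitted values
e = d - f                                    # forecast errors e_i = d_i - f_i

in_sample_me   = np.mean(e)                           # Mean Error
in_sample_rmse = np.sqrt(np.mean(e ** 2))             # Root Mean Squared Error
in_sample_mae  = np.mean(np.abs(e))                   # Mean Absolute Error
in_sample_mpe  = np.mean(e / d) * 100                 # Mean Percentage Error
in_sample_mape = np.mean(np.abs(e) / d) * 100         # Mean Absolute Percentage Error

# WMAPE weights each period's absolute percentage error by that period's share of
# total demand, which algebraically reduces to sum(|e|) / sum(d).
weights = d / d.sum()
in_sample_wmape = np.sum(weights * np.abs(e) / d) * 100
assert np.isclose(in_sample_wmape, np.sum(np.abs(e)) / np.sum(d) * 100)

print(in_sample_me, in_sample_rmse, in_sample_mae,
      in_sample_mpe, in_sample_mape, in_sample_wmape)
```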
If there are n data points in a time series and the first k data points are in the train part, then points k+1 through n form the test part. The mean of the forecast errors calculated over the test part of the time series is the Out of Sample Mean Error.
This value represents the square root of the mean of the squared errors taken over the test part of the time series.
This value represents the mean of absolute differences between true demand points and forecasts over the test part of the time series.
This value represents the mean of percentage error between true demand points and forecasts over the test part of the time series.
Test Mean Absolute Percentage Error.
This value represents the mean of absolute values of percentage error between true demand points and forecasts over the test part of the time series.
Test Weighted Mean Absolute Percentage Error.
This value represents the mean of absolute values of percentage error between true demand points and forecasts over the test part of the time series, weighted by the actual demand in each period.
Note that this field is not displayed for Life Cycle models.
In forecasting, the residuals form a time series obtained by subtracting the fitted values produced by the forecasting algorithm from the actual demand. ResidualMean is the average of the residuals and represents the bias in the model; an ideal model has a ResidualMean as close to zero as possible.
For Life Cycle models, this field is blank.
ResidualVariance is the variance of the residual time series about its mean. When comparing different forecast algorithms, the variance of the residuals can be a helpful complement to the In Sample and Out of Sample errors when choosing a model.
For Life Cycle models, this field is blank.
Residual Median is the 50th percentile of the residual time series. In an ideal model, the mean and the median of the residuals are as close to each other as possible.
For Life Cycle models, this field is blank.
Measures the symmetry of the residuals about their mean. An ideal model has a residual skewness close to zero.
For Life Cycle models, this field is blank.
In addition to low bias, a good forecast model should have residuals that are uncorrelated (independent). The LjungBox P value tests for serial correlation in the residual values. It is a hypothesis test in which the null hypothesis is that the residuals are independently distributed, while the alternative hypothesis is that they are serially correlated.
Inference from the statistical test that classifies the residuals of a fitted model as correlated or uncorrelated at a given significance level. The default significance level is 0.05.
For Life Cycle models, this field is blank.
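The residual diagnostics above can be reproduced with standard tools. A minimal sketch, using simulated residuals in place of a model's actual-minus-fitted series (this is not the application's internal code):

```python
# Illustrative residual diagnostics on a stand-in residual series.
import numpy as np
import pandas as pd
from scipy import stats
from statsmodels.stats.diagnostic import acorr_ljungbox

rng = np.random.default_rng(0)
residuals = pd.Series(rng.normal(0.0, 5.0, 60))    # stand-in for actual - fitted

print("ResidualMean    :", residuals.mean())       # model bias; ideally close to zero
print("ResidualVariance:", residuals.var())
print("ResidualMedian  :", residuals.median())     # ideally close to the mean
print("ResidualSkewness:", stats.skew(residuals))  # symmetry; ideally close to zero

# Ljung-Box test: null hypothesis = residuals are independently distributed.
# Recent statsmodels versions return a DataFrame with an 'lb_pvalue' column.
lb = acorr_ljungbox(residuals, lags=[10])
p_value = float(lb["lb_pvalue"].iloc[0])
verdict = "Correlated" if p_value < 0.05 else "Uncorrelated"   # default significance 0.05
print("LjungBox P value:", p_value, "->", verdict, "residuals")
```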
R Version
The version of open source R on which the corresponding Coupa Demand Analytics release is based.

The Causal Summary table highlights in detail the various causal variables that were utilized in fitting a machine learning model. Any causal lags detected are also included as separate rows.
Time series for which the demand profile is generated.
User-assigned name for the demand model. The default name is Automatic.
Indicates the time at which a demand model was fitted for a given time series.
Name of the causal variable. Causal lags, if enabled, are displayed here if it is determined that the lag more closely correlates with the actual causal, and are listed with a causal type of AlgorithmGeneratedCausal.
- User Defined – Causal Variable provided by the user in fitting a model.
- Promotion – Causals designated by the user as promotions.
- Trend Cloud – Causal variables pulled using the Trend Cloud Import action, including, but not limited to, economic indices and weather-related information.
- Algorithm Generated Causal – Causal variables automatically extracted from the time series. Patterns inherently present in the time series, such as seasonality, trend, lagged demand, and properties extracted based on calendar date, constitute such generated causals.
A measure indicating the relative strength of a particular causal variable in predicting the demand. A value of 100 is assigned for the strongest causal variable, while the least important causal variable is assigned a value of 1.
Classification of the causal variable depending on its strength. If a causal lag has been detected, Excluded is displayed here for the original causal.
- Insignificant – Scores between 0 and 15.
- Low Importance – Scores between 15 and 30.
- Medium Importance – Scores between 30 and 70.
- Higher Importance – Scores between 70 and 85.
- Highest Importance – Scores between 85 and 100.
Indicates whether a particular causal variable has been selected for fitting a model. If a causal lag has been detected, No is displayed here for the original causal.
Highlights the reason for dropping a causal variable. If a causal lag has been detected, an appropriate reason for the exclusion is displayed here for the original causal.
- Not Applicable – the causal variable is not dropped in the model.
- Multi Collinearity – the causal variable is better explained by another causal variable. Thus, the redundant causal variable is removed to improve efficiency, minimizing the chance of overfitting the demand.
- Lack of Correlation – the absolute value of correlation of the causal variable with the demand is extremely low. Removing such features improves efficiency and decreases the chance of over-fitting the demand.
- Recursive Feature Elimination – the causal variable is eliminated in the recursive feature elimination phase.
- Forecast Values Unavailable – forecast values of the causal variable are unavailable, so the user-defined causal variable is ignored.
- Dropping Trend – the fitted trend is dropped because it has a high correlation with at least one user-defined causal variable.
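For context, the first two elimination reasons can be illustrated with a simplified screening pass. This is only a sketch under assumed thresholds and invented column names; it is not the application's actual feature-selection logic (which, as listed above, also includes recursive feature elimination):

```python
# Simplified causal screening: drop low-correlation causals, then near-collinear ones.
import numpy as np
import pandas as pd

def screen_causals(demand: pd.Series, causals: pd.DataFrame,
                   min_abs_corr: float = 0.15, collinear_cutoff: float = 0.95):
    kept, reasons = [], {}
    for name in causals.columns:
        if abs(demand.corr(causals[name])) < min_abs_corr:
            reasons[name] = "Lack of Correlation"
        elif any(abs(causals[name].corr(causals[other])) > collinear_cutoff
                 for other in kept):
            reasons[name] = "Multi Collinearity"
        else:
            kept.append(name)
            reasons[name] = "Not Applicable"
    return kept, reasons

rng = np.random.default_rng(1)
n = 500
demand = pd.Series(rng.normal(1000.0, 100.0, n))
causals = pd.DataFrame({"temperature_c": 0.8 * demand + rng.normal(0.0, 20.0, n)})
causals["temperature_f"] = causals["temperature_c"] * 9 / 5 + 32   # redundant duplicate
causals["unrelated_noise"] = rng.normal(0.0, 1.0, n)               # no link to demand

kept, reasons = screen_causals(demand, causals)
print(kept)      # expected: ['temperature_c']
print(reasons)   # temperature_f -> Multi Collinearity; unrelated_noise -> Lack of Correlation
```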
A number between -1 and 1 that provides the correlation between the causal time series and the demand time series.
Classification of the causal variable based on its numeric correlation with the demand time series. Positive correlation is indicated by the suffix Positive Correlation and negative correlation is indicated by the suffix Negative Correlation. The prefix falls under one of five categories.
- Insignificant – absolute correlation with demand is between 0 and 0.15.
- Weak – absolute correlation with demand is between 0.15 and 0.30.
- Moderate – absolute correlation with demand is between 0.30 and 0.70.
- Strong – absolute correlation with demand is between 0.70 and 0.85.
- Very Strong – absolute correlation with demand is between 0.85 and 1.
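The correlation classification above can be reproduced directly from the two series. A minimal sketch (the series and names are invented; only the documented thresholds are taken from this table):

```python
# Map a causal's correlation with demand to the documented strength categories.
import numpy as np

def correlation_strength(demand: np.ndarray, causal: np.ndarray) -> str:
    r = np.corrcoef(demand, causal)[0, 1]
    a = abs(r)
    if a < 0.15:
        prefix = "Insignificant"
    elif a < 0.30:
        prefix = "Weak"
    elif a < 0.70:
        prefix = "Moderate"
    elif a < 0.85:
        prefix = "Strong"
    else:
        prefix = "Very Strong"
    suffix = "Positive Correlation" if r >= 0 else "Negative Correlation"
    return f"{prefix} {suffix} (r = {r:.2f})"

rng = np.random.default_rng(7)
demand = rng.normal(1000.0, 100.0, 52)
price = -0.8 * demand + rng.normal(0.0, 40.0, 52)    # hypothetical causal variable
print(correlation_strength(demand, price))
```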
Measures the degree of randomness in a causal time series and indicates how easy that series is to forecast. A lower index indicates that the series can be forecast well even with simple algorithms, while a higher index indicates that the time series is difficult to forecast.
Classification of Forecastability index based on its value.
- Highly Forecastable – Forecastability index value is between 0 and 0.4.
- Moderately Forecastable – Forecastability index value is between 0.4 and 1.0.
- Somewhat Forecastable – Forecastability index value is between 1.0 and 2.0.
- Less Forecastable – Forecastability index value is between 2.0 and 2.6.
- Random - Forecastability index value is greater than 2.6.

The Life Cycle Summary table lists the performance measures resulting from life cycle modeling.
Time series for which the demand profile is generated.
User-assigned name for the demand model. The default name is Automatic.
Indicates the time at which a demand model was fitted for a given time series.
The default name Auto is displayed when Tournament Life Cycle is selected as the algorithm. If a specific algorithm is chosen instead of the tournament algorithm, User-defined is displayed here.
Life Cycle Model
The Life Cycle algorithm ultimately used for the demand model.
This column shows the stage of the product life. Depending on the model selected and the corresponding model fit, the product can be in one of the following stages: Launch, Growth, Maturity, Decline, or Obsolescence.
This column shows the expected life of the product.
This column shows the total cumulative demand of a product at its maximum life.
This column shows the point in the product's life at which the product reaches its maximum growth rate. After this point, the growth rate of the product declines.
This column shows the total cumulative demand the product has reached at the point of its maximum growth rate.
Model ID
An alphanumeric string used internally to identify the model.

The Life Cycle Details table consists of seven rows, one for the entire life of the product, plus one for each of the different life cycle stages:
- ProductLife
- Launch
- Growth
- Maturity
- HalfLife
- Decline
- Obsolescence
This table lists the start and end dates of each stage, along with its duration, based on the input data.
Time series for which the demand profile is generated.
User-assigned name for the demand model. The default name is Automatic.
Indicates the time at which a demand model was fitted for a given time series.
The stage of the life cycle, with Product Life representing the entire life cycle.
The date and time on which the stage began.
The date and time on which the stage ended.
The time length of the stage, expressed in units as selected for the Time Bucket on the Definition tab of the workbench.
An alphanumeric string used internally to identify the model.

The Life Cycle Parameters table displays life cycle model parameter estimates. Names, estimates, and standard error values of the underlying life cycle model parameters are reported in this table.
Time series for which the demand profile is generated.
User-assigned name for the demand model. The default name is Automatic.
Indicates the time at which a demand model was fitted for a given time series.
The name of the underlying life cycle model.
The name of the parameter of the underlying life cycle model.
The mean value of the parameter.
The standard error value of the parameter.
An alphanumeric string used internally to identify the model.
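The estimates and standard errors in this table come from fitting the underlying life cycle model to the cumulative demand history. As an illustration only (the functional forms and parameter names used by the application are not documented here), the sketch below fits a simple logistic cumulative-demand curve with SciPy and reports parameter estimates, standard errors, and the cumulative demand at the point of maximum growth rate, mirroring the Life Cycle Summary fields described earlier:

```python
# Illustrative life cycle parameter estimation with a logistic cumulative-demand curve.
import numpy as np
from scipy.optimize import curve_fit

def logistic(t, M, k, t0):
    # M = saturation (total cumulative demand), k = growth rate, t0 = time of max growth
    return M / (1.0 + np.exp(-k * (t - t0)))

t = np.arange(1, 37, dtype=float)                    # 36 monthly time buckets
rng = np.random.default_rng(42)
cum_demand = logistic(t, 5000.0, 0.35, 18.0) + rng.normal(0.0, 50.0, t.size)

popt, pcov = curve_fit(logistic, t, cum_demand, p0=[cum_demand.max(), 0.1, t.mean()])
std_err = np.sqrt(np.diag(pcov))                     # standard errors of the estimates

for name, est, se in zip(["M (saturation)", "k (growth rate)", "t0 (time of max growth)"],
                         popt, std_err):
    print(f"{name:24s} estimate = {est:10.3f}   std. error = {se:.3f}")

# For a logistic curve, cumulative demand at the point of maximum growth rate is M / 2.
print("Cumulative demand at maximum growth rate ~", popt[0] / 2)
```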

This output provides details related to the selected algorithm, presented in text format. The following sections are included:
- Time Series - Standard information related to the time series.
- User Input - Standard information related to user inputs.
- Demand Data - Standard information related to demand data.
- Causal Data, Causal Elimination - These two sections present the same information for all models.
- Tournament Algorithm Details - This section appears only if the user selects the Tournament Life Cycle model. It indicates the fitting quality of each enumerated life cycle model, expressed as RMSE, AIC, or AICc depending on the user selection; AICc is the default. The section also indicates which model is selected, along with its fitting quality (a brief AICc sketch follows this list).
- Fitted Model Overview - This table is displayed in text format and provides summary results related to the life cycle analysis of the underlying time series.
- Automatic Forecast Errors - This table is displayed in text format and provides the fitting quality of each enumerated algorithm. It appears only if the Tournament Life Cycle algorithm is selected.
- Life Cycle Parameters Estimate - This table is displayed in text format and provides names, estimates and standard error values of the underlying life cycle model parameters.
- Error Metrics - In sample and out of sample error metrics are reported for the underlying life cycle model.
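For the Tournament Life Cycle model, each candidate is scored on the selected fitting-quality criterion and the best score wins. As a rough illustration only (the candidate names and numbers below are hypothetical, and the application's exact criterion computation is not documented here), a common least-squares AICc comparison looks like this:

```python
# Illustrative tournament-style model selection by AICc.
import math

def aicc(rss: float, n: int, k: int) -> float:
    # For a least-squares fit: AIC ~ n*ln(RSS/n) + 2k; AICc adds a small-sample correction.
    aic = n * math.log(rss / n) + 2 * k
    return aic + (2 * k * (k + 1)) / (n - k - 1)

# Hypothetical candidates: (model name, residual sum of squares, parameter count)
candidates = [("Logistic", 1.8e5, 3), ("Gompertz", 1.7e5, 3), ("Bass", 1.6e5, 4)]
n = 36                                   # number of fitted data points

scored = [(name, aicc(rss, n, k)) for name, rss, k in candidates]
for name, score in scored:
    print(f"{name:10s} AICc = {score:.2f}")
print("Selected model:", min(scored, key=lambda item: item[1])[0])
```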
Last modified: Thursday December 19, 2024