Performance metrics¶

class pystran.Evaluation(observed, modelled)[source]¶

Class for deriving different evaluation criteria.

References

[E1]	(1, 2, 3) Gupta H.V., Sorooshian S., Yapo P.O.(1998), Toward improved calibration of hydrologic models: Multiple and noncommensurable measures of information, Water Resources Research,pp 751-763

[E2]	H. Hauduc, M. B. Neumann, D. Muschalla, V. Gamerith, S. Gillot and P.A. Vanrolleghem (2011), Towards quantitative quality criteria to evaluate simulation results in wastewater treatment – A critical review. Proceedings 8th symposium on systems analysis and integrated assessment (Watermatex 2011)

AME()[source]¶

Absolute Maximum Error

Notes

The absolute maximum error indicates the maximum error of the model [E1]. This criterion is very sensitive to outliers

range: [0, inf]
optimum: 0
category: Absolute criteria

APBIAS()[source]¶

Absolute Percent Bias

Notes

Useful in combi with PBIAS, eg if PBIAS small and APB very large one could conclude that volumes are ok, but timing is missing (continuous gap)

range: [0, inf]
optimum: 0
category: Total Relative error criteria

BIAS(optim=False)[source]¶

Bias E[obs-mod]

Parameters:

optim : bool

Notes

range: [-inf, inf]
optimum: 0
category: Total Relative error criteria

CrBal(optim=False)[source]¶

Balance Criterion

Parameters:

optim : bool

Notes

[E9] use the balance criterion to measure the ability of the model to reproduce the same cumulative as observed. The difference between the inversed fractions penalises larger differences between observed and modelled cumulative values.

range: [-inf, 1]
optimum: 1
category: Total Relative error criteria

HighFDCE()[source]¶

Flow Duration Curve based high flow criterion

Notes

Uses the upper part (lowest percentiles) of the Flow Duration Curve to focus on high flow regimes. Always use in combination with a second criterion to make sure the timing of the model is also satifying. Used in [E15].

IA(optim=False)[source]¶

Index of agreement

Parameters:

optim : bool

Notes

Index of agreement is te ratio of the sum of squared errors (SSE) and the largest potential error with respect to the mean of the observed values, [E4]. This is sensitive to the model mean and to the peak values, and is insensitive to low magnitude values.

range: [0, 1]
optimum: 1
category: comparison with reference model

LowFDCE()[source]¶

Flow Duration Curve based low flow criterion

Notes

Uses the lower part (highest percentiles) of the Flow Duration Curve to focus on low flow regimes. Always use in combination with a second criterion to make sure the timing of the model is also satifying. Used in [E15].

MAE()[source]¶

Mean Absolute Error

Notes

The mean absolute error indicates the average magnitude of the model error (accuracy) [E4]. Taking the absolute value avoids error compensation, but does not indicate the direction of the deviation.

range: [0, inf]
optimum: 0
category: Absolute criteria

References

[E4]	(1, 2, 3) Willmott C.J., Ackleson S.G., Davis R.E., Feddema J.J., Klink K.M., Legates D.R., O’Donnell J. and Rowe C.M. (1985) Statistics for the evaluation and comparison of models. Journal of Geophysical Research, 90(C5), 8995-9005.

MAPE()[source]¶

Mean Absolute Percent Error

Notes

The mean absolute percent error used by [E3] is close to MARE. However, the errors are relative to the predicted values instead of the observed values. Consequently, the under-predicted values are penalised (for a similar error). This is an interesting criterion for situations in which one wants to determine a risk to reach concentration limits.

range: [0, inf]
optimum: 0
category: Relative criteria

MARE()[source]¶

Mean Absolute Relative Error

Notes

The mean absolute relative error is similar to the Mean Relative Error, but avoids the compensation of errors [E7].

range: [0, inf]
optimum: 0
category: Relative criteria

References

[E7]	Petersen B., Gernaey K., Henze M. and Vanrolleghem P.A. (2002) Evaluation of an ASM1 model calibration procedure on a municipal-industrial wastewater treatment plant. Journal of Hydroinformatics, 4(1), 15-38.

ME(optim=False)[source]¶

Mean Error

Parameters:

optim : bool

Notes

The mean of residuals allows highlighting the existence of systematic bias, i.e. characteristic of a model leading to systematic over- or under-prediction [E1]. However, with this criterion errors can compensate each other, so no information on the magnitude of the errors is obtained.

range: [-inf, inf]
optimum: 0
category: Absolute criteria

References

[E3]	(1, 2, 3) Power M. (1993) The predictive validation of ecological and environmental models. Ecological Modelling, 68(1-2), 33-50.

MPE(optim=False)[source]¶

Mean Percent Error

Parameters:

optim : bool

See also

MRE

Notes

The mean percent error [E3] provide the average relative model error. However, negative and positive errors can compensate for each other.

range: [-inf, inf]
optimum: 0
category: Relative criteria

MRE(optim=False)[source]¶

Mean Relative Error

Parameters:

optim : bool

Notes

The mean relative error [E5] provide the average relative model error. However, negative and positive errors can compensate for each other.

range: [-inf, inf]
optimum: 0
category: Relative criteria

MSDE()[source]¶

Mean Square Derivative Error

Notes

The mean square derivative error is the square of the differences of predicted and observed variations between two time steps [E5]. This criterion penalizes noisy time series and series with timing error; it thus allows evaluating the peak’s timing.

range: [0, inf]
optimum: 0
category: Absolute criteria

MSE()[source]¶

Mean Squared Error

Notes

The mean square error avoids error compensations and emphasises high errors [E4].

range: [0, inf]
optimum: 0
category: Absolute criteria

MSLE()[source]¶

Mean Squared Logarithm Error

Notes

The mean square logarithm error is the sum of the squares of the differences of the natural logarithm of the predicted and observed value [E5]. It emphasises low magnitude errors.

range: [0, inf]
optimum: 0
category: Absolute criteria

References

[E5]	(1, 2, 3, 4, 5, 6, 7) Dawson C.W., Abrahart R.J. and See L.M. (2010) HydroTest: Further development of a web resource for the standardised assessment of hydrological models. Environmental Modelling and Software, 25(11), 1481-1482.

MSRE()[source]¶

Mean Square Relative Error

Notes

The mean square relative error avoids compensation of errors and emphasises larger relative errors [E5].

range: [0, inf]
optimum: 0
category: Relative criteria

MSSoE()[source]¶

Mean Sqaured sorted Errors

Notes

The mean square error of sorted errors is calculated based on sorted observed and predicted data (van Griensven and Bauwens, 2003). Observations and predictions are sorted independently one from the other. The sorted series are then compared (comparison of their cumulative distributions) and it is evaluated whether the model reproduces the same distribution as the observed data.

The time of occurrence of a given value of the variable is not accounted for in the MSSoE method.

range: [0, inf]
optimum: 0
category: Absolute criteria, timestep independent

References

[E6]	van Griensven A. and Bauwens W. (2003) Multiobjective autocalibration for semidistributed water quality models. Water Resources Research, 39(12), SWC91-SWC99.

MeAPE()[source]¶

Median Absolute Percent Error

Notes

Median of the absolute relative error expressed in percentage [E5]. This criterion is less affected by outliers and the errors distribution form as the MARE criterion.

range: [0, inf]
optimum: 0
category: Relative criteria

NSC()[source]¶

Number of Sign Changes of the residuals

Notes

The number of sign changes,[E1]_, counts the number of times the residual (Oi-Pi) sign change. The minimum value is zero and the maximum n. A value close to zero indicates a systematic error (overestimating or under-estimating model) but a more consistent model. A value close to n indicates a random error.

range: [0, nsize]
optimum: /
category: Absolute criteria

NSE(optim=False)[source]¶

Nash-Sutcliffe Efficiency criterion

Parameters:

optim : bool

Notes

Widely used criterion in hydrology, values ranging from -infty -> 1 A zero value means the model is not better than the ‘no knowledge’ model, which is characterised by the mean of the observations. Sensitive to extreme values.

range: [-inf, 1]
optimum: 1
category: comparison with reference model

NSE_BIAS()[source]¶

Combination of Nash Sutcliff and BIAS

Notes

The criterium is gaining importance by the combined effect and is proposed in [E16]. Here an adaptation is implemented by taking the absolute value of the bias, to make the function symmetrical around the optimal value.

References

[E16]

Viney, N.R., J. Perraud, J. Vaze F.H.S. Chiew, D.A. Post and A. Yang (2009b). The usefulness of bias constraints in model calibration for regionalisation to ungauged catchments. Proceedings, MODSIM 200

NSE_FDChigh(w1=1.0, w2=1.0)[source]¶

Nash Sutcliff (mod) + high Flow; zelfde gewichtsfactor: als fout groter, ook beide groter!

Parameters:

w1 : float (0-1)

w2 : float (0-1)

NSE_FDClow(w1=1.0, w2=1.0)[source]¶

Nash Sutcliffe (mod) + low Flow; zelfde gewichtsfactor, als fout groter, ook beide groter!

Parameters:

w1 : float (0-1)

w2 : float (0-1)

NSE_boxcox(optim=False, llambda=0.25)[source]¶

Nash-Sutcliffe Efficiency criterion with boxcox transformed values

Parameters:

optim : bool

Notes

Widely used criterion in hydrology, values ranging from -infty -> 1 A zero value means the model is not better than the ‘no knowledge’ model, which is characterised by the mean of the observations.

Model residuals typically increase with higher flowvalues. This means that themodel residual variance or standard deviation typically increases with increasing flow. It also means that the higher flow values receive more weight in the goodness-of-fit statistics, [E10].

range: [-inf, 1]
optimum: 1
category: comparison with reference model

References

[E10]

Willems, P. A Time Series Tool to Support the Multi-criteria Performance Evaluation of Rainfall-runoff Models. Environmental Modelling & Software 24, no. 3 (March 2009): 311–321. http://linkinghub.elsevier.com/retrieve/pii/S1364815208001606.

NSE_log(optim=False)[source]¶

Nash-Sutcliffe Efficiency criterion with logarithmic values

Parameters:

optim : bool

Notes

Widely used criterion in hydrology, values ranging from -infty -> 1 A zero value means the model is not better than the ‘no knowledge’ model, which is characterised by the mean of the observations. The log values of the observed and measured values are used to give more emphasis to the lower values

range: [-inf, 1]
optimum: 1
category: comparison with reference model

NSE_sqrt(optim=False)[source]¶

Nash-Sutcliffe Efficiency criterion with root values

Parameters:

optim : bool

Notes

Widely used criterion in hydrology, values ranging from -infty -> 1 A zero value means the model is not better than the ‘no knowledge’ model, which is characterised by the mean of the observations. The root values of the observed and measured values are used to give more emphasis to the lower values

range: [-inf, 1]
optimum: 1
category: comparison with reference model

PBIAS(optim=False)[source]¶

Percent Bias

Parameters:

optim : bool

Notes

The percent bias [E5] and relative volume error are the sum of errors related to the sum of observed values, expressed as relative value or in percentage. This criterion measures an overall adequacy, but the errors can be compensated.

(Also known as DEVRV, the Deviation of runoff volumes, From Statistical evaluation of WATFLOOD, Angela MacLean, University of Waterloo)

range: [-inf, inf]
optimum: 0
category: Total Relative error criteria

PDIFF(optim=False)[source]¶

Peak Difference

Parameters:

optim : bool

Notes

This criterion evaluate how well the highest modelled value matches the highest observed value in percent. However, it does not take into account whether the max(Oi) and max(Pi) occur at the same time-step i.

Consequently, in case of multiple events on the same time-series, first the single events must be extracted from the whole time series to have less chance to mix up with peaks from another event.

range: [-inf, inf]
optimum: 0
category: single event

PEP(optim=False)[source]¶

Percent Error In Peak

Parameters:

optim : bool

Notes

This criterion evaluate how well the highest modelled value matches the highest observed value in percent. However, it does not take into account whether the max(Oi) and max(Pi) occur at the same time-step i.

Consequently, in case of multiple events on the same time-series, first the single events must be extracted from the whole time series to have less chance to mix up with peaks from another event.

range: [-inf, inf]
optimum: 0
category: single event

PI(optim=False)[source]¶

Coefficient of Persistance

Parameters:

optim : bool

Notes

The coefficient of persistence is close tot the NSE criterion, but the simplistic model used is th elast observed value instead of the mean of observed values, [E12].

range: [0, 1]
optimum: 1
category: comparison with reference model

R4MS4E()[source]¶

Root 4 Mean Square 4 Error

Performance metrics¶

Related Topics

This Page