The menu validation statistics for more information.
allows to evaluate the performance of merged data performed with CDT, satellite rainfall estimates products and the output of climate models, and to evaluate the skill of a specific methods or parameters used for the bias adjustment or the merging. Here <name of the climate variable > is the precipitation data or other climate data such as the temperature, relative humidity, radiation, pressure or wind data. See the table of the computedThe gridded data is first extracted at the station locations using
before the computation of the validation statistics.
The tab Input allows to specify
the input station data used to validate and the extracted gridded data
at the station locations to be evaluated, and the folder to save the
output.
Select the temporal resolution of the data. Available temporal resolution are: daily, pentad, dekadal and monthly data.
Select the file containing the station data (in CDT station data format) used to validate from the drop down list if it is already loaded, or open it through .
Select the file containing the station data (in CDT station data format) to be evaluated from the drop down list if it is already loaded, or open it through .
Enter the full path to the folder to save the output, or use the browse button .
The tab Validation allows to
specify the season and the period over which the validation will be
performed, and the way the validation statistics will be computed.
Select the months or the season you want to include in the validation process. For example, if your input data is a daily rainfall and you want to perform the validation with the seasonal rainfall total of the rainy season.
Provide the start and end year of the period over which the validation will be performed.
If you want to aggregate the data before performing the validation, check the box Aggregate data before validation , it will activate the button allowing you to select the function to use to aggregate the data and to set the minimum number of observations to compute one output time step. See Aggregation parameters to set the parameters in the dialog box.
Select the method to transform the data before computing the validation statistics. Available methods are: All Data, Spatial Average and Each Station.
Suppose you have the CDT station format data to be used to validate (station-observation) and to be validated (gridded-extracted) as follow
station-observation | gridded-extracted |
---|---|
All Data: the validation statistics are
computed from two series obtained by
concatenating the data of all stations from station-observation
and gridded-extracted respectively. The data of the stations
become
station-observation | gridded-extracted |
---|---|
Spatial Average: the validation statistics are computed from two series obtained by spatially averaging station-observation and gridded-extracted respectively. The data would be as follow
station-observation | gridded-extracted |
---|---|
Each Station: the validation statistics are computed for each pair of series constituted by the station from station-observation and gridded-extracted respectively.
To create the contingency table to be used to compute the categorical statistics, click on
button. It displays a dialog box allowing you to select the operator to be applied and the the threshold to be used.In case of precipitation data, click on
button to set the parameters to use to compute the volumetric statistics. It displays a dialog box allowing you to specify the threshold to be used.If you want to use a specific threshold above which the statistics
will be computed, check the box
User specified values .
If you selected All Data or
Spatial Average as method used to transform
the data, enter the value of the threshold to be used.
In case of Each Station, if you want to use a unique threshold for all stations, check the box Use a unique threshold value then enter the value of the threshold to be used.
If you want to use different threshold for each station, uncheck the box Use a unique threshold value and select or open the file containing the threshold values.
The threshold data must be in CDT station format, the number and order of the stations have to be the same as the station observation used to validate (station-observation). It may be a percentile data already computed or any threshold values you want. Below is an example of the threshold data related to the example provided above.
If you want to use a thresholds using a percentile computed from the
input data (observed or estimated data),
uncheck the box
User specified values , then
select the source of data you want to compute the percentile:
Observed Data or Estimated Data, and enter the
percentile to be calculated.
In case of Spatial Average and Each Station, specify the base period over which the percentile will be calculated by clicking on the button .
Click the button
to compute the validations statistics.
A folder named VALIDATION_<name of the file
containing the station data used to validate> is created under
the folder you provided to save the output. It contains the file
VALIDATION_DATA_OUT.rds and two folders
OBS_GRD_DATA and STATISTICS_DATA
All_Data_Statistics_<temporal resolution>_<moths or season>_<period>.csv: contains the statistics computed using All Data method.
Spatial_Average_Statistics_<temporal resolution>_<moths or season>_<period>.csv: contains the statistics computed using Spatial Average method.
Stations_Statistics_<temporal resolution>_<moths or season>_<period>.csv: contains the statistics computed using Each Station method.
Where <temporal resolution> is one of daily, pentad, dekadal, monthly or seasonal; <moths or season> the months or season used to compute the statistics; <period> the period over which the statistics were computed.
The tab Plot allows to display a table and map of the validation statistics, a scatter plot and a cumulative distribution function (CDF) curves.
If the validation statistics have been computed with All Data and Spatial Average, you can display the table of statistics, a scatter plot and CDF curve.
In case of Spatial Average, you also can display a chart of the observed and the estimated data by selecting Line chart.
Click on the button here, and for Cumulative Distribution Function and Line chart options dialog box are here.
to change the range of the x and y axis, to add labels and legends, to change the title, and to change the type and color of the plot. The options dialog box for Scatter Plot can be found
If the validation statistics have been computed with Each
Station, you can display the table of statistics for each
station; the map of all statistics; and a scatter plot, CDF curve and
line for each station.
The tab Layers allows to add a
shapefile and a gridded elevation data over the map.