(T. Leblanc, J. Bandoro, G. Payen, F. Gabarrot, and A. vanGijsel)
1. Purpose
Currently, NDACC data originators do not use a consistent and homogeneous definition of vertical resolution. The purpose of producing tools for the standardization of vertical resolution is to insure that the Vertical Resolution values reported in the data files archived at the NDACC Data Handling Facility consistently use the same definition from one NDACC lidar data file to another.
The Tools currently developped
3. Tools Validation
After being written, the tools needed a thorough validation to insure the consistency within all existing NDACC data. This validation was made using simulated lidar signals. Using a "reverse-type" model, synthetic lidar signals were created from known ozone and temperature profiles. The signals were simulated using various noise configurations, and with or without a number of common corrections usually applied to real lidar data. This flexibility insured to verify the full self-consistency between the reverse and forward models before the standardization tools themselves were tested. The results below therefore cover several steps of the validation, ranging from the self-consistency of the simulated and retrieved profiles (section 3.3) to the full and final validation of the tools (sections 3.4 and 3.5). Note that the validation results below are not meant to validate the ozone and temperature algoithms, but to validate the implementaiton of the Standadization Tools into these algorithms. That means focus must be made on the degree of agreement between the experimental and theoretical transfer functions (DF defintion), and the degree of agreemeent between the experimental and theroetical impulse response widths (IR definition). The main validation results and conclusions are summarized before all the plots are shown (top of sections 3.4 and 3.5). They apply similarly to all NDACC products listed in section 3.1. A few exceptions are covered individually below each relevant plot.
3.1 Contributing Investigators
The validation results are presented below by alphabetical order of the instrument abbreviations, ozone first, temperature second. The following investigators participated to the validation work, covering eight existing or future NDACC products:
- Thierry Leblanc, for stratospheric ozone and temperature at Mauna Loa, Hawaii (MLO)
- Justin Bandoro, for temperature at Univ. of Western Ontario (PCT for Rayleigh and PCW for Raman)
- Guillaume Payen, for stratospheric ozone at Observatoire de Haute Provence (SHO)
- Anne vanGijsel, for stratospheric ozone and temperature at Lauder (SWL)
- Franck Gabarrot, for tropospheric ozone at Reunion Island (TRO)
3.2 Summary of the Simulations (by date) and their Purpose
Each date corresponds to a specific simulation configuration. The choice of a specific date has no geophysical meaning. A large range of dates was chosen only to avoid confusion between the simulation configurations. As a rule-of-thumb (01/14 excluded), even dates apply to the validation of the tools for ozone retrievals, while odd dates apply to the validation of the tools for temperature retrievals.
Simulated date: 2012/01/14
A near-real experiment. The simulated signals are representative of a real measurement, i.e., they contain background noise, saturation, partial overlap, they are absorbed by ozone and attenuated by molecular extinction. The purpose of these datasets was to insure that simulated signals could be ingested similarly to real signals without an unexpected crash of the data processing softwares. These signals were used for both ozone and temperature retrievals. This simulation run was a pre-requisite to all validation tests posted thereafter, but is not part of the Tools' validation in itself, and therefore not shown below.
Simulated date: 2012/01/18
Disturbance-free signals with the purpose of testing the consistency of the ozone forward and reverse models. The simulated signals
contain no correction susceptible to introduce biases between the original and retrieved ozone profiles. They are therefore random noise-free, saturation-free, there is complete overlap, and no correction is necessary for molecular extinction. The ozone absorption cross-sections were taken as known constants to avoid possible discrepancies associated with their temperature dependence. This simulation run was a pre-requisite (for ozone) to all validation tests posted thereafter.
Simulated date: 2012/01/19
Disturbance-free signals with the purpose of testing the consistency of the temperature forward and reverse models. Like 01/18, the simulated signals
contain no correction susceptible to introduce biases between the original and retrieved profiles. They are therefore random noise-free, saturation-free, there is complete overlap, and no correction is necessary for molecular extinction or ozone absorption. This simulation run was a pre-requisite (for temperature) to all validation tests posted thereafter.
Simulated date: 2012/01/20
Signals specifically dedicated to the validation of the Vertical Resolution Tool NDACC_ResolDF for ozone. The signals are just like those of 01/18, except that random (white) noise was added to the ozone profiles in order to quantify the effect of filtering and report this effect following the NDACC-Standardized DF cutoff frequency definition.
Simulated date: 2012/01/21
Signals specifically dedicated to the validation of the Vertical Resolution Tool NDACC_ResolDF for temperature. The signals are just like those of 01/19, except that random (white) noise was added to the temperature profiles in order to quantify the effect of filtering and report this effect following the NDACC-Standardized DF cutoff frequency definition.
Simulated dates: March and April 2013, even dates
Same as 2012/01/20. These are randomized repeats of 01/20, which are then averaged in order to minimize the noise associated with the experimental determination of the transfer function (TF) for ozone retrievals. The TFs are calculated for each of the 30 experiments, then averaged together. The resulting averaged TF is compared to the theoretical TF. Though a single experiment like 01/20 is sufficient when dealing with filters with a small number of coefficients, the averaging process is necessary to deal with filters having a large number of coefficients with respect to the total sampling window considered (for example 17 coefficients out of a 200-pts total window, or 133 coefficients out of 860-pts total window).
Simulated dates: March and April 2013, odd dates
Same as 2012/01/21. These are randomized repeats of 01/21, which are then averaged in order to minimize the noise associated with the experimental determination of the transfer function (TF) for temperature retrievals. The TFs are calculated for each of the 30 experiments, then averaged together. The resulting averaged TF is compared to the theoretical TF. Though a single experiment like 01/21 is sufficient when dealing with filters with a small number of coefficients, the averaging process is necessary to deal with filters having a large number of coefficients with respect to the total sampling window considered (for example 17 coefficients out of 200-pts total window, or 133 coefficients out of 860-pts total window).
Simulated date: 2012/01/22
Signals specifically dedicated to the validation of the Vertical Resolution Tool NDACC_ResolIR for ozone. The signals are just like those of 01/18, except that a Dirac Delta Function was added to the simulated ozone profile in order to quantify the effect of filtering and report this effect following the NDACC-Standardized FWHM Impulse Response definition.
Simulated date: 2012/01/23
Signals specifically dedicated to the validation of the Vertical Resolution Tool NDACC_ResolIR for temperature. The signals are just like those of 01/19, except that a Dirac Delta Function was added to the simulated temperature profile in order to quantify the effect of filtering and report this effect following the NDACC-Standardized FWHM Impulse Response definition
3.3 Checking the Consistency of the Reverse and Forward Models (01/18 and 01/19)
The work described in this section is only a pre-requisite to the validation results presented in section 3.4. If you are only interested in the validation of the vertical resolution standardization tools, you can skip this section and go to section 3.4 now.
The purpose of this check is to insure that, in the absence of noise, the retrieved profiles are fully consistent with the simulated profiles. If they were not, systematic differences between retrieved and simulated profiles would introduce an undesired "noise" component susceptible to bias the validation of the vertical resolution standardization tools. All eight plots in this section show basically the same behavior: For ozone, the retrieved and simulated profiles agree well, with only residual numerical noise owed to
SHO Ozone
Numerical noise above 30 km is higher for SHO above (and TRO below) than for the other ozone products (MLO and SWL) because of the number of significant digits used in the raw data format. This numerical noise is centred to zero and is not an issue for the validation of the ISSI tools. In the other simulation experiments presented below, the simulated signals contain photon-counting (random) noise and the data processing algorithms use filtering, so this numerical discretization problem does not exist (as can be seen in sections 3.4 and 3.5).
TRO Tropospheric Ozone
Numerical noise above 5 km is higher for TRO above (and SHO) than for the other ozone products (MLO and SWL) because of the number of significant digits used in the raw data format. This numerical noise is centred to zero and is not an issue for the validation of the ISSI tools. In the other simulation experiments presented below, the simulated signals contain photon-counting (random) noise and the data processing algorithms use filtering, so this numerical discretization problem does not exist (as can be seen in sections 3.4 and 3.5).
3.4 Validation of NDACC_ResolDF: Consistency of the Actual (observed) and Theoretical (NDACC_ResolDF) Transfer Functions (simulation dates: 30 days from March to April 2012)
One interesting advantage of the new standardization tool is the abiltiy to be used multiple times within the same data processing chain, if necessary. When smoothing/filtering occurs at more than one occasion through the data processing chain, the subroutine NDACC_ResolDF can be inserted each time such a occurence is found. The output TF of the first call is used in input of the second call, and so on until all smoothing occurrences have been covered. The final TF is the product of the TF calculated separately for each smoothing filter applied.
The consistency of the observed and theoretical TFs observed in all the plots below validates the NDACC_ResolDF tool. The output value of the tool is referred to as "dz_cutoff" in the plots below. It is the reciprocal of the cutoff wavenumber (wavenumber where the TF equals 0.5). dz_cutoff can then be multiplied by the sampling width (bin size) to obtain the NDACC-Standardized Vertical Resolution (follwoing the DF definition). The tool can now be used for all NDACC Temperature and Ozone PIs/instruments to report Vertical Resolution based on the new standardized definition (defined uniquely from the number and value of the filter coefficients used). The new tool can also serve as a conversion device between the vertical resolution reported until today by the NDACC PIs and the vertical resolution meant to be reported from now on following the ISSI Team recommendations.
MLO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 7-pts
.
MLO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 13-pts
SHO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 7-pts
SHO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 15-pts
SHO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 33-pts
SWL Ozone, Least-Square Polynomial Degree 1, Derivative filter, 5-pts
SWL Ozone, Least-Square Polynomial Degree 1, Derivative filter, 17-pts
TRO Tropospheric ozone, Least-Square Polynomial Degree 2, Derivative filter, 15-pts
TRO Tropospheric ozone, Least-Square Polynomial Degree 2, Derivative filter, 67-pts
MLO Temperature, Hann window, 9-pts
MLO Temperature, Hann window, 17-pts
PCL Rayleigh Temperature, "Smoothing by 3s and 5s" (7-pts)
PCL Raman Temperature, "Smoothing by 3s and 5s" (7-pts)
SWL Temperature, 2 filters: First, Least-Square Polynomial Degree 1, 3-pts, then Least-Square Polynomial Degree 1, 5-pts
The case of SWL-temperature (above and below)) is still under investigation. The PI has reported the use of two successive filters, namely, a Least Square polynomial fit of degree 1 over 3 points, followed by a Least-Square polynomial fit of degree 1 over 5 points for the case presented above (or 17 points in the case presented 2 plots below). However the current comparisons with the theoretical solution seems to point towards no effect of the 3-point smoothing filter. Much better agremeent with the theoretical solution is found if we assume non-existence of the first filter (plot below) than if we assume the existence of the first filter (plot above).
SWL Temperature, Least-Square Polynomial Degree 1, 5-pt
SWL Temperature, Least-Square Polynomial Degree 1, 17-pts
Though we are not showing the case of 2 successive, 3-point, then 17-point filters, the same conclusion as that for the 5-point filter seems to hold for the 17-point filter. The difference between the 2-filter case and 1-filter case is not as obvious as for the 5-point filter because the width of the second filter is much larger than that of the first filter, therefore shadowing the effect of the first, narrow width filter. For this reason we are only showing here the case of one 17-point filter.
3.5 Validation of NDACC_ResolIR: Consistency of the Actual (observed) and Theoretical (NDACC_ResolIR) FWHM Impulse Responses (simulation dates 01/22 and 01/23)
As for NDACC_ResolDF, the subroutine NDACC_ResolIR can be inserted and called multiple times in the data processing chain. The output response of the first call is used in input of the second call instead of the initial impulse. The final FWHM is the width of the final response obtained after the last call.
The excellent agreement observed on most of the plots below validate the NDACC_ResolIR tool. The output value of the tool is referred to as "dz_fwhm" in the plots below. It can then be multiplied by the sampling width (bin size) to obtain the NDACC-Standardized Vertical Resolution (following the IR definition). Though a few issues need to be resolved for a couple of cases presented below, the tool can now be used safely for all NDACC Temperature and Ozone PIs/instruments to report Vertical Resolution based on the new standardized definition (defined uniquely from the theoretical width of the impulse response). The new tool can also serve as a conversion device between the vertical resolution reported until today by the NDACC PIs and the vertical resolution meant to be reported from now on following the ISSI Team recommendations.
MLO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 13-pts
MLO Temperature, Hann Window, 9-pts
PCL Rayleigh Temperature, "smoothing by 3s and 5s" (7 pts)
Reaching full agreement between the expected and observed IR solutions for the Purple Crow Lidar case (Rayleigh "PCT" above and Raman "PCW" below) is problematic because the initial 24-m resolution data are binned together at a resolution of about 1000-m before being smoothed. The smoothing process is applied to this 1000-m resolution data. The aymmetric effect (with respect to the altitude of the impulse) of density integration over these thicker layers increases the differences between the simulated and retrieved temperatures above and below the impulse, and therefore tends to "stretch" the shape of the response farther away from the ideal, symmetrical shape. Expectedly, the observed Impulse Response does not fully match the theoretical Impulse Response. The difference is about one bin for a window width of 5 bins (20%). Possible actions to mitigate this problem are currently under consideration. One likely action will be to submit the PCL temperature product to the database using raw data binned to vertical resolutions similar to the other NDACC lidar temperature products presented here (e.g., 300 m for MLO and SWL).
PCL Raman Temperature, "smoothing by 3s and 5s" (7 pts)
SHO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 7-pts
SHO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 7-pts
SHO Ozone, Least-Square Polynomial Degree 2, Derivative filter, 7-pts
SWL Ozone, Least-Square Polynomial Degree 1, Derivative filter, 5-pts
SWL Ozone, Least-Square Polynomial Degree 1, Derivative filter, 17-pts
SWL Temperature, 2 filters: First a Least-Square Polynomial Degree 1, 3-pts, then a Least-Square Polynomial Degree 1, 5-pts
The case of SWL-temperature (above and below)) is still under investigation. The PI has reported the use of two successive filters, namely, a Least Square polynomial fit of degree 1 over 3 points, followed by a Least-Square polynomial fit of degree 1 over 5 points for the case presented here (or 17 points in the case presented 2 plots below). However the current comparisons with the theoretical solution seems to point towards no effect of the 3-point smoothing filter. Much better agremeent with the theoretical solution is found if we assume no first filter (plot below) than if we assume the existence of the first filter (plot above).
SWL Temperature, Least-Square Polynomial Degree 1, 5-pts
SWL Temperature, Least-Square Polynomial Degree 1, 17-pts
Though we are not showing the case of 2 successive, 3-point, then 17-point filters, the same conclusion as that for the 5-point filter seems to hold for the 17-point filter. The difference between the 2-filter case and 1-filter case is not as obvious as for the 5-point filter because the width of the second filter is much larger than that of the first filter, therefore shadowing the effect of the first, narrow width filter. For this reason we are only showing here the case of one 17-point filter.
TRO Tropospheric Ozone, Least-Square Polynomial Degree 2, Derivative filter, 15-pts
TRO Tropospheric Ozone, Least-Square Polynomial Degree 2, Derivative filter, 33-pts
3.6 What's next with NDACC_ResolDF and NDACC_ResolIR ?
Technically, both subroutines now can be "permanently" inserted in the NDACC PIs' data processing softwares. The first use of these routines will consist of establishing a detailed map of the existing individual products' vertical resolution and their NDACC-standardized equivalent. Once the mapping is in place, the existing Ames files will be replaced by new ones that will contain the mapped values of vertical resolution following the NDACC-standardized definition. The second use of the routines consists of producing the values of vertical resolution directly based on the NDACC-standardizes defintion, i.e., without going through a prior conversion process.
3.7 The Cherry on the Cake: Can this work be used for other NDACC lidar products ?