the Creative Commons Attribution 4.0 License.

the Creative Commons Attribution 4.0 License.

# A numerical method to improve the spatial interpolation of water vapor from numerical weather models: a case study in South and Central America

### Laura I. Fernández

### Amalia M. Meza

### M. Paula Natali

### Clara E. Bianchi

Commonly, numerical weather model (NWM) users can get the vertically integrated water vapor (IWV) value at a given location from the values at nearby grid points. In this study we used a validated and freely available global navigation satellite system (GNSS) IWV data set to analyze the very well-known effect of height differences. To this end, we studied the behavior of 67 GNSS stations in Central and South America with the prerequisite that they have a minimum of 5 years of data during the period from 2007 to 2013. The values of IWV from GNSS were compared with the respective values from ERA-Interim and MERRA-2 from the same period.
Firstly, the total set of stations was compared in order to detect cases in which the geopotential difference between GNSS and NWM required correction. An additive integral correction to the IWV values from ERA-Interim was then proposed. For the calculation of this correction, the multilevel values of specific humidity and temperature given at 37 pressure levels by ERA-Interim were used. The performance of the numerical integration method was tested by accurately reproducing the IWV values at every individual grid point surrounding each of the GNSS sites under study.
Finally, considering the IWV_{GNSS} values as a reference, the improvement introduced to the IWV_{ERA-Interim} values after correction was analyzed. In general, the corrections were always recommended, but they are
not advisable in marine coastal areas or on islands as at least two grid points of the model are usually in the water. In such cases, the additive correction could overestimate the IWV.

Water vapor is an abundant natural greenhouse gas in the atmosphere. The knowledge of its variability in time and space is very important with respect to understanding the global climate system (Dessler et al., 2008). Most of the regional comparisons of IWV from the global navigation satellite system (GNSS) are aimed at validating the technique via comparison with radiosonde and microwave radiometers where available. An example of this is the work of Van Malderen et al. (2014), who compared IWV GPS (Global Positioning System) information with IWV derived from ground-based sun photometers, radiosondes and satellite-based values from GOME, SCIAMACHY, GOME-2 and AIRS instruments at 28 sites in the Northern Hemisphere. Because their comparison is oriented toward a climatology application, they deal with long-term time series (>10 years). The authors asseverate that the mean biases of the GPS with the different instruments only vary between −0.3 and 0.5 kg m^{−2}, but there are large standard deviations especially for the satellite instruments.

However, some other comparisons examine the IWV_{GNSS} values with respect to the respective estimates from numerical weather models (NWMs). If focusing on the application of the current state-of-the-art ERA-Interim reanalysis from the European Centre for Medium-Range Weather Forecasts (ECMWF), both on a local and global scale, some recent papers deserve to be mentioned. For example, Heise et al. (2009) used ground pressure data from ECMWF to calculate IWV from 5 min zenith total delay (ZTD) at stations where meteorological data are not available. The authors validated their results using stations with local measurements of pressure and temperature. They also compared IWV from GPS with respect to IWV from ERA-Interim on a global scale. The authors found that IWV from GPS and ECMWF show good agreement at most stations on the global scale, except in mountain regions. Moreover, they addressed the fact that temporal station pressure interpolation may result in up to 0.5 kg m^{−2} IWV uncertainty if a local weather event occurred. According to the authors, this phenomenon is particularly noticeable in the tropics and is due to the fact that the ECMWF analysis does not adequately represent the local situation if faced with an increase in the diurnal cycle of the surface atmospheric pressure.

Buehler et al. (2012) compared IWV values over Kiruna in the north of Sweden from five different techniques (radiosondes, GPS, a ground-based Fourier transform infrared (FTIR) spectrometer, a ground-based microwave radiometer and a satellite-based microwave radiometer) with IWV from ERA-Interim reanalysis. The processed GPS data set covered a 10-year period from November 1996 to November 2006. The authors found good overall agreement between IWV from GPS and from ERA-Interim, with mean differences of 0.29 kg m^{−2} and a standard deviation 1.25 kg m^{−2}. They also point out that ERA-Interim is drier than the GPS at small IWV values and slightly moister at high IWV values (above 15 kg m^{−2}).
The authors also considered altitude limits when comparing measurements from different data sets. They proposed an empirical solution by computing linear regression slopes as a function of height and corrected all measurements to a common reference altitude of 430 m. Thus, they established a relative bias of −3.5 % per 100 m that introduced absolute errors below 0.2 kg m^{−2}. Nevertheless, they asseverated that the good performance of the method depends on location and probably on season.

Ning et al. (2013) evaluated IWV from GPS in comparison with IWV from ERA-Interim and IWV from the regional Rossby Centre atmospheric (RCA) climate model at 99 European sites for a 14-year period. Because RCA is not an assimilation model, the standard deviation of RCA − GPS was 3 times larger than ERA-GPS. The IWV difference for individual sites varies from −0.21 to 1.12 kg m^{−2} for ERA-GPS and the corresponding standard deviation is 0.35 kg m^{−2}.
The authors investigated the influence of the differences between NWM and GPS in the vertical and horizontal positions. In particular, they studied subsets of stations with absolute height difference values smaller than 100 m. Consequently, they did not consider if there were over- or underestimations by the models. Thus, they found monthly mean IWV difference values smaller than 0.5 kg m^{−2}. Moreover, the authors also highlighted that the models overestimated IWV for sites near the sea.

Bordi et al. (2014) studied global trend patterns of the yearly mean of IWV from the 20th century atmosphere model (ERA-20CM) and ERA-Interim, which are both from ECMWF. The authors highlighted a regional dipole pattern of interannual climate variability over South America from ERA-Interim data. According to this study, the Andean Amazon Basin and northeast Brazil are characterized by rising and decreasing water content associated with water vapor convergence (divergence) and upward (downward) mass fluxes, respectively. In addition, the authors also compared IWV from ERA-Interim with the values estimated at two GPS stations in Bogotá and Brasilia. Such comparison on a monthly timescale uncovered a systematic bias attributed to a lack of coincidence in the elevation of the GPS stations and the model grid points.

Mengistu Tsidu et al. (2015) presented a comparison between IWV from a Fourier transform infrared spectrometer (FTIR, at Addis Ababa), GPS, radiosondes and ERA-Interim over Ethiopia for the period from 2007 to 2011. The study is focused on the characterization of the different error sources affecting the data time series. In particular, from the study of diurnal and seasonal variabilities, the authors addressed differences in the magnitude and sign of the IWV bias between ERA-Interim and GPS. They linked this effect to the sensitivity of the convection model with respect to the topography.

Wang et al. (2015) performed a 12-year comparison of IWV from three third-generation atmospheric reanalysis models including ERA-Interim, MERRA and the Climate Forecast System Reanalysis (CFSR) on a global scale. IWV values from the reanalysis models were also compared with radiosonde observations on land and remote sensing systems (RSS) on satellites over oceans. The authors asseverated that the main discrepancies between the three data sets are in central Africa, northern South America and in highland areas.

In this paper, we investigated the differences between IWV from GNSS using data products from Bianchi et al. (2016a) and IWV values given by ERA-Interim and MERRA-2. The comparison was performed by considering the geopotential differences (Δ*z*) between each GNSS station and the corresponding values assigned by the models.
We proposed an additive numerical correction to the IWV from the NWM, and the strategy was tested for the ERA-Interim reanalysis model.
Section 2 describes the different sets of data used in this study. In the following, we give an explanation of the methodology and present the results obtained after applying the proposed correction to the IWV values from ERA-Interim.

## 2.1 IWV from GNSS

GNSS data are the main source of information for the spatial and temporal distribution of water vapor. Thus, the main variable considered is the IWV estimated from the delay caused by the troposphere to the GNSS radio signals during its travel from the satellite to the ground receiver. The total delay projected onto the zenith direction (ZTD) is usually split into two contributions: the hydrostatic delay (ZHD – zenith hydrostatic delay), depending merely on the atmospheric pressure, and the zenith wet delay (ZWD), depending mainly on the humidity. The IWV_{GNSS} can be obtained from the ZWD by multiplying it by a function of the mean temperature of the atmosphere.

The reference database of IWV_{GNSS} used in this study comes from the geodetic processing of over 136 tracking stations on the American continent, located from southern California to Antarctica (see Fig. 1), during a 7-year period from January 2007 to December 2013 (Bianchi et al., 2016b). Specifically, the data series of IWV_{GNSS} used here is restricted to the 67 stations with IWV time series spanning more than 5 years (see Fig. 1 and Table 1). More details regarding the steps, models and conventions followed by the geodetic processing to obtain the IWV_{GNSS} values are given in Bianchi et al. (2016a).

## 2.2 IWV from NWM

The values of columnar integrated content of water vapor (IWV) as reanalysis products from ERA-Interim (Dee et al., 2011) and MERRA-2 (GMAO, 2015; Bosilovich et al., 2015; Gelaro et al., 2017) were evaluated in this study. The horizontal resolutions are 0.25^{∘} × 0.25^{∘} for ERA-Interim and 0.625^{∘} × 0.50^{∘} for MERRA-2. Because ERA-Interim data are given four times a day, in order to perform the comparison (even if MERRA-2 gives hourly data), we collected IWV data from MERRA-2 every 6 h at 00:00, 06:00, 12:00 and 18:00 UT (universal time). Thus, in order to be able to carry out the comparison, MERRA-2 was only partially evaluated.

ERA-Interim is the global atmospheric reanalysis produced by the European Centre for Medium-Range Weather Forecasts (ECMWF). It covers the period from 1979 to present and supersedes the ERA-40 reanalysis. ERA-Interim addresses some of the difficulties of ERA-40 with respect to data assimilation mainly regarding the representation of the hydrological cycle, the quality of the stratospheric circulation and the consistency in terms of reanalyzed geophysical fields (Dee et al., 2011).

MERRA-2 is the successor of the Modern-Era Retrospective Analysis for Research and Applications (MERRA) from NASA's Global Modeling and Assimilation Office (Rienecker et al., 2011). MERRA-2 is improved because it contains fewer trends and jumps linked to changes in the observing systems than MERRA. Additionally, MERRA-2 assimilates observations not available to MERRA and reduces bias and imbalances in the water cycle (Gelaro et al., 2017). Moreover, the longitudinal resolution of MERRA-2 data is changed from 0.667^{∘} in MERRA to 0.625^{∘}, whereas the latitudinal resolution remains unchanged (0.5^{∘}) (Bosilovich et al., 2015).

For this application, we used two different kind of data sets: 2-D values of the IWV from both reanalysis models along with the correspondent geopotential invariant; and three 3-D data sets from ERA-Interim, including the air temperature (*T*), the specific humidity (*q*) and the geopotential (*z*). These variables are given at 37 levels of atmospheric pressure from 1 to 1000 hPa.

We will use the terms *z*_{i}, *T*_{i}, *p*_{i} and *q*_{i} to refer to the value of the abovementioned variables at a given level “*i*” and at a given instant. This set of data will be used for the calculation of the integral correction that will be developed in the following section.

As previously mentioned, even when both reanalysis models give grid values of the vertical integral of the water vapor, the solution provided by each model is linked to its respective geopotential surface invariant. Nevertheless, elevation differences between the geopotential from each model grid and that computed from the GNSS height must be addressed.

In order to compute the geopotential of the GNSS stations (*z*_{GNSS}) we followed the van Dam et al. (2010) algorithm. Because the geodetic coordinates ($\mathit{\varphi},\phantom{\rule{0.25em}{0ex}}\mathit{\lambda},\phantom{\rule{0.25em}{0ex}}h$) of the GNSS site are known, we obtained the orthometric height (*H*) at each GNSS station by correcting the ellipsoidal height using the EGM08 model (Pavlis et al., 2012).
Thus, the geopotential is (van Dam et al., 2010)

where the radius of the ellipsoid at geodetic latitude *ϕ* is

Here *a* = 6 378 137 m and *b* = 6 356 752.3142 m are the semi-major and semi-minor axis of the WGS84 ellipsoid, respectively (Hofmann-Wellenhof and Moritz, 2006). Moreover, the value of the gravity on the ellipsoid at geodetic latitude *ϕ* can be written as follows (van Dam et al., 2010):

where *e*^{2}=0.00669437999014 is the first eccentricity squared of the WGS84 ellipsoid, *g*_{E}=9.7803253359 m s^{−2} is the normal gravity at the Equator (Hofmann-Wellenhof and Moritz, 2006) and *k*_{s}=0.001931853 (van Dam et al., 2010).

For a given GNSS station, the respective geopotential from each of the two reanalysis models resulted from a bilinear interpolation of the respective invariant static geopotential at the four grid points around the GNSS site, referred to as ${z}_{\mathrm{NWM}}^{k}$ ($k=\mathrm{1},\mathrm{2},\mathrm{3},\mathrm{4}$).
Because the points of the NWM grid surrounding the GNSS station have different geopotential values and those values are in turn different from *z*_{GNSS}, we propose to correct on each of these four points.
Thus, if Δ*z*^{k} refers to the difference between *z*_{GNSS} and ${z}_{\mathrm{NWM}}^{k}$,

where NWM corresponds to ERA-Interim or MERRA-2.

We will then propose a correction procedure that, by compensating for Δ*z*^{k} in each of the four grid points, corrects the values of IWV_{NWM}. After “moving” the grid points to *z*_{GNSS}, a bilinear interpolation is performed to obtained the corrected value of IWV at the location of the GNSS site.
In brief, this procedure is equivalent to lifting (or dropping, as appropriate) each of the grid points in order to create a plane at *z*_{GNSS}.

Prior to the correction, we analyze the performance of ERA-Interim and MERRA-2 with respect to GNSS.
Thus, although IWV_{GNSS} is produced every 30 min and IWV_{MERRA-2} is available hourly, we only consider the epochs when ERA-Interim data are available to perform the comparison at 00:00, 06:00, 12:00 and 18:00 UT.
Table 2 shows the mean values of IWV from GNSS (${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}$) during the period from 2007 to 2013 and its standard deviation for all of the stations.
We assume that the static geopotential from the NWM at each GNSS site (*z*_{NWM}) is obtained from a bilinear interpolation of the static geopotential at the four grid points surrounding it (${z}_{\mathrm{NWM}}^{k}$).
Thus, Table 2 shows the geopotential difference ($\mathrm{\Delta}z={z}_{\mathrm{GNSS}}-{z}_{\mathrm{NWM}}$) in addition to the respective differences of the mean values ($\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}={\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}-{\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{NWM}}$), where NWM indicates ERA-Interim and MERRA-2. All the averages are computed over the period from 2007 to 2013.

In general, regarding Table 2, we can observe that the best agreements between the average IWV values from GNSS and the corresponding average from the models are where the Δ*z* are small (e.g., CONZ, VITH, SMAR, LPGS, MAPA and SCUB, among others). In other words, the NWMs generally represent the IWV values very well ($\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}<\mathrm{1.5}$ kg m^{−2}) if |Δ*z*| is small. That means, the geopotential difference is in the order of 500 m^{2} s^{−2} at most.

Conversely, the difference of the model representation of the IWV with respect to GNSS increases as the height differences (Δ*z*) become larger, and this is true for all values of ${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}$. The SANT (${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}\sim \mathrm{12}$ kg m^{−2}, |Δ*z*|>10000 m^{2} s^{−2}) and CUCU (${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}\sim \mathrm{43}$ kg m^{−2}, |Δ*z*|∼8000 m^{2} s^{−2}) cases are good examples of this.

However, other than the abovementioned cases, which can be considered to be critical, the differences are also important at sites with moderate |Δ*z*| (larger that 500 m^{2} s^{−2}) and ${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}>\mathrm{20}$ kg^{−2} (e.g., CEFE, BRAZ, RIOD and GUAT).

Note that some MERRA-2 difference values could be a little larger than ERA-Interim values, which would be expected due to the coarser grid. However, this is not a general rule and some stations are in fact better represented by MERRA-2 with |Δ*z*| and $\left|\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}\right|$ values smaller than ERA-Interim even if they are located in highlands (see SANT and COPO).

Figure 2a and b show the mean IWV values from GNSS (${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}$) as a function of geopotential differences (Δ*z*). Results for MERRA-2 are shown in Fig. 2a and Fig. 2b shows the results for ERA-Interim.

It is assumed that these different values of $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$ are due to the geopotential difference. Therefore, they commonly carry the inverse sign to the Δ*z*, as expected. Effectively, $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$ is nothing but the difference between the mean values from GNSS and NWM; thus, a negative value for negative Δ*z* indicates an overestimation by the reanalysis model and vise versa: the underestimation by the model is shown using red dots where Δ*z* is positive (see Fig. 2).

However the Fig. 2a and b show that there are some cases where $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$ has the same sign as Δ*z*, evidencing that the reanalysis models can overvalue or undervalue $\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$; this over- or undervaluing should not be due to Δ*z*. This effect can be seen within a rectangle with a $\mathrm{\Delta}z=\pm \mathrm{2000}$ m^{2} s^{−2} and $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}=\pm \mathrm{2}$ kg m^{2}. Such a cloud of points represents about 21 % of the total number of stations for ERA-Interim and approximately 27 % of the total number of stations for MERRA-2.
Figure 2c and d show the spatial distribution of $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$ for MERRA-2 (panel c) and ERA-Interim (panel d).
The red dots indicate positive differences greater than 2.5 kg m^{2}, and the intermediate differences between 0 and 2.5 kg m^{2} are shown using a red gradient. Similarly, the blue dots show negative differences less than −2.5 kg m^{2}, and the differences between −2.5 and 0 kg m^{2} are shown using a blue gradient.

If we take the RNNA station in both maps as a reference (see station number 51 in Fig. 1), and advance towards the south along the Atlantic coast, the behavior of both models is similar.
Both reanalyses are dryer than GNSS, and this same effect is seen in the southern mountainous areas. However, moving along the Atlantic coast from RNNA to the north and up to the Amazon River, we see different behavior in the reanalyses: while ERA-Interim continues underestimating IWV_{GNSS}, MERRA-2 is shown to be wetter than GNSS. The overall agreement between GNSS and MERRA-2 is $-\mathrm{0.39}\pm \mathrm{2.77}$,
and the overall agreement between GNSS and ERA-Interim is 0.13±2.52. These values are the result of an average over all of the $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$ differences.

These findings show that MERRA-2 resulted in wetter conditions than GNSS, whereas ERA-Interim is slightly dryer than GNSS in Central and South America. This is in agreement with the findings of Buehler et al. (2012), who reported that the mean value of the differences between IWV from GPS and ERA is 0.28±1.25 for a high-latitude location in Sweden, and also revealed an underestimation of the reanalysis model.

Finally, the correlation coefficients between ${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}$ values and the respective values for both NWMs are higher than 0.95 for most of the GNSS stations (not shown).

## Computation of the integral correction

In the following, we proceed to calculate a correction in order to provide a better estimation of the IWV_{NWM} at the GNSS site. We start by correcting each of the grid points around the station prior to applying a bilinear interpolation. The correction will only be computed for one of the two reanalysis models tested.
We have chosen ERA-Interim over MERRA 2 for the calculation and testing of these corrections, not only because ERA-Interim has a thinner grid, but also due to the results of Zhu et al. (2014). Effectively, Zhu et al. (2014) compared several reanalysis projects with independent sounding observations recorded in the eastern Himalayas during June 2010. From the reanalysis models examined, ERA-Interim and MERRA (the predecessor of MERRA 2) were included. The authors analyzed temperature, specific humidity, *u*-wind and *v*-wind between 100 and 650 hPa. They found that ERA-Interim showed the best performance for all variables including specific humidity, which is the key variable with respect to producing the integrated water vapor.

Thus, we used air temperature (*T*_{i}) and specific humidity (*q*_{i}) at 37 atmospheric pressure levels from ERA-Interim data to compute the proposed correction.

Recall that the index *k* refers to the grid point surrounding the GNSS site, whereas the index *i* refers to the atmospheric pressure level.
As previously mentioned, the GNSS geopotential (*z*_{GNSS}) is set as a reference, and the value of the geopotential from ERA-Interim (${z}_{\text{ERA-Interim}}^{k}$) at each of the four grid points surrounding the GNSS site are generally not the same and could differ by up to 2 orders of magnitude.
Commonly, neither *z*_{GNSS} nor the geopotential at any of the four grid points matches the geopotential of the nearby pressure level. Therefore, the values of all parameters in the adjacent levels must be used to interpolate (or extrapolate) pressure, temperature and specific humidity in the unknown geopotential (*z*_{GNSS} and ${z}_{\text{ERA-Interim}}^{k}$).

Thus, the expression of the pressure at an unknown geopotential *z*_{j}, where *j* can be any of the unknowns, with respect to a given reference data level (*z*_{0}) at *i*=0 is as follows (van Dam et al., 2010):

where *T*_{0} and *p*_{0} refer to the temperature and pressure values at a reference level *z*_{0}, respectively, *R*=287.04 J kg^{−1} K is the gas constant, *λ*=0.006499 K m^{−1} is the lapse rate of the temperature, and *δ**z* is the geopotential difference between *z*_{i} and the reference level *z*_{0}.
Notice that *δ**z* is different concept than Δ*z*, where the Δ*z* refers to the difference between *z*_{GNSS} and *z*_{NWM}.
The numerator of Eq. (5) is the temperature estimated at the desired geopotential *z*_{j}, assuming that the temperature decreases with altitude according to *λ*.
This expression is used to compute *p* in both *z*_{GNSS} and in the four grid points of the model (${z}_{\text{ERA-Interim}}^{k}$).
Finally, the specific humidity (*q*) is also estimated at the desired *z*_{j} using linear interpolation (extrapolation) from data at the adjacent layers.

When *p*,*T* and *q* are known at each geopotential, *z*_{GNSS} and the four grid points of ${z}_{\text{ERA-Interim}}^{k}$, we can estimate the necessary corrections for the grid points. Such additive corrections to the IWV values at the grid points are equivalent to move the static geopotential of the grid to the *z*_{GNSS}. The corrected IWV_{NWM} is then obtained at the GNSS site by a bilinear interpolation of the four corrected values.

Each value of IWV provided by ERA-Interim is the result of the numerical integration of the expression (Berrisford et al., 2011)

where *g*_{0} is the standard acceleration of the gravity at mean sea level, *q*(*p*) is the specific humidity of the air at the pressure level *p* and the integral is calculated from the first level (*p*_{1}) up to the model surface level (*p*_{s}), i.e., up to the static geopotential that corresponds to the station.

Thus, the proposed correction can generally be written as

where the NWM is ERA-Interim, *A* corresponds to the highest *z* (*z*_{GNSS} or ${z}_{\mathrm{NWM}}^{k}$) and *B* corresponds to the lowest *z*. Note that the values of *q* and *p* in Eq. (7) can be computed as previously explained (Eq. 5).
Thus, *q*_{j} and *p*_{j} are *q* and *p* at *z*_{j} and *q*_{j+1} and *p*_{j+1} are *q* and *p* at *z*_{j+1}, respectively. The values of *p* grow downwards resulting in *p*_{1}=1 hPa and *p*_{37}=1000 hPa.
Assuming that the integral of the water vapor is computed from the top down, if the height of a given point from a model is located lower than the position of the receiver, the model integrates a larger column of water vapor and vise versa if the geopotential value from model is larger than the geopotential of the GNSS receiver.
Hence, this quantity has to be additive if ${z}_{\mathrm{GNSS}}<{z}_{\mathrm{NWM}}^{k}$ or subtractive if the inverse is true and the sign is determined by *n* (see Eq. 7).

In a given instant, we know the geopotential of the GNSS station and the static geopotential assigned by the NWM to the four grid points surrounding it (*z*_{GNSS} and ${z}_{\mathrm{NWM}}^{k}$, $k=\mathrm{1},\mathrm{2},\mathrm{3},\mathrm{4}$). We also know the geopotential at 37 pressure levels (*z*_{i}) from 1 to 1000 hPa, as well as specific humidity (*q*) and temperature (*T*) at these levels. We should consider that the pressure value of each level is constant at any time, but this is not necessarily the case for geopotential height.

Figure 3 illustrates the application of the correction to an example.
We take just one of the four grid points and assume that both unknowns (${z}_{\mathrm{NWM}}^{k}$ and *z*_{GNSS}) are located between levels 27 (750 hPa) and 28 (775 hPa). Thus, we could use the available data at levels 27 and 28 along with Eq. (5) as well as the abovementioned considerations to estimate *p*,*t* and *q* at ${z}_{\mathrm{NWM}}^{k}$ and *z*_{GNSS}. Finally, ΔIWV is computed by means of Eq. (7) for this example.

Before analyzing the results of the correction process explained in the previous section, we present a validation of the numerical integration method used.
To this end, we calculate the values of IWV_{ERA-Interim} at each grid point using the numerical integral of Eq. (6). The integration limits range from 1 hPa to the static geopotential value assigned by the model to the point (${z}_{\text{ERA-Interim}}^{i}$).
Table 3 shows the results obtained using this procedure. In each grid point the mean value of the differences (IWV_{ERA-Interim} data – IWV_{ERA-Interim} calculated) is presented. Standard deviations are also shown. It can be seen that the resulting values are generally very close to zero.

In order to evaluate the improvements introduced by the correction, Fig. 4 shows the $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$ as a function of Δ*z* after applying the proposed integral correction to ERA-Interim data.
At first glance we can see that, regardless of whether Δ*z* is positive or negative, the differences ($\left|\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}\right|$) decrease to 2 kg m^{−2} for all of the stations except in three cases where they barely exceed the abovementioned value.
Moreover, the correlation between $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$ and the geopotential difference decreases to 0.13, as expected.

In addition, if we focus on the plot area of Fig. 4 that is limited for a $\mathrm{\Delta}z=\pm \mathrm{1500}$ m^{2} s^{−2} and $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}=\mathrm{1.5}$ kg m^{−2} (zoom not shown), we can see that most of the stations (94 %) are included. This shows that the proposed correction decreases $\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$, even for low stations (small *z*) that generally have the smallest values of Δ*z*.

The performance of the proposed correction can also be seen in Fig. 5. Where Fig. 5a, c and e show stations with positive Δ*z*, it means that GNSS station is higher than the location assigned by ERA-Interim. Accordingly, the model integrates a thicker layer of atmosphere and, thus, IWV_{ERA-Interim} values are larger than those from IWV_{GNSS}. The opposite (Δ*z* is negative) is represented by the sites in Fig. 5b, d and f. Moreover, the differences in Δ*z* are presented increasing from top to bottom in each column in the figure.

We can see that the most important corrections are at BOGT in Bogotá, Colombia, and SANT in Santiago de Chile, Chile. In these examples, the differences (IWV_{GNSS}−IWV_{ERA-Interim}), which can reach up to 7 kg m^{−2}, are significantly reduced.

However the application of this correction, in some cases, should be precautionary. Effectively, sometimes different shortcomings of the model overlap with the height problem; therefore, the proposed correction will not work. For example, in the case of coastal and/or insular stations where two or more grid points are in the ocean: in all of these cases the value of IWV calculated from the bilinear interpolation will be overvalued. Looking at stations near the seashore in more detail (e.g., PARC in Punta Arenas, Chile), where two of the four grid points are in the ocean (see Fig. 6), $\mathrm{\Delta}z=-\mathrm{1271.86}$ at PARC indicates that the geopotential from ERA-Interim is larger than the GNSS geopotential; thus, the proposed correction will be additive. In addition to this result, the IWV_{ERA-Interim} values are overestimated due to the application of a bilinear interpolation that uses data points located in the ocean. In conclusion, the (IWV_{ERA-Interim} + correction) value overestimates IWV_{GNSS}. Thus, this is an example where applying the suggested correction may worsen the results. The same situation is noted at RIO2 at the Argentinean Atlantic coast.

The effect of different heights when comparing results from several data sources not only affects the determination of IWV but also impacts other parameters. For instance, Gao et al. (2012) studied the height corrections for the ERA-Interim 2 m temperature data in the central Alps, and they also found large biases that must be corrected in mountainous areas. Some other authors also studied the tropospheric refraction effects on space geodetic techniques by considering this effect. For example, Teke et al. (2013) performed an inter-technique comparison of ZTD in the framework of four continuous very long baseline interferometry (VLBI) campaigns, including NWM and taking the effect of the height differences into account.

NWM users commonly utilize the IWV values on a grid and use them to calculate the IWV value at a desired location by way of an interpolation method.
In this work, taking the values of IWV_{GNSS} as reference, we show that there are cases where the IWV values obtained from a NWM have differences of several kilograms per square meter and that these discrepancies are mainly due to the difference in geopotentials.

We analyzed the discrepancies between the vertically integrated water vapor values provided by two reanalysis models (ERA-Interim and MERRA-2) with respect to the IWV_{GNSS} values taken as a reference on the southern and central American continent for the period from 2007 to 2013.
The results of this comparison allow us to establish that MERRA-2 results in wetter conditions than GNSS, whereas ERA-Interim is slightly dryer.
In addition, when geopotential differences are moderate or large (|Δ*z*|>500 m^{2} s^{−2}) and ${\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}}_{\mathrm{GNSS}}>\mathrm{20}$ kg m^{−2}, the discrepancies ($\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}$) are greater than 2 kg m^{−2} at about 22 % of the stations for both models.

Several authors have reported problems related to the elevation correction for data from the reanalysis models. The artificial bias in the IWV introduced by this altitude difference has previously been reported by Bock et al. (2007), Heise et al. (2009), Van Malderen et al. (2014), Bordi et al. (2014) and Bianchi et al. (2016a).

Heise et al. (2009) derived IWV from global GNSS ZTD at almost 300 sites using ground pressure and temperature values from ECMWF and then compared IWV from GNSS and ECMWF. Similar to our work, they found large discrepancies in mountain regions due to the difference in altitudes that caused errors in the estimations of meteorological values.
Moreover, the analysis performed in our work is also in agreement with Bordi et al. (2014), who compared GNSS and ERA-Interim IWV values on a monthly timescale for the period from 2002 to 2012. In this case, the authors found significant biases of 6.4 kg m^{−2} in BOGT and 2.5 kg m^{−2} in BRAZ, and they related them to the different elevations between the correspondent GNSS site and the grid points of the model. Thus, the values from Bordi et al. (2014) are comparable with the correspondent $\left|\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}\right|$ values estimated in our study (6.87 kg m^{−2} in BOGT and 2.03 kg m^{−2} in BRAZ, see Table 2).

In this work, we proposed an integral correction that compensates IWV for the effect of the geopotential difference between GNSS and the interpolated grid points in the reanalysis model. The results were tested with the respective values from ERA-Interim.
The correction is computed as the numerical integration of the specific humidity where the integral limit is a pressure difference at Δ*z* (see Eq. 7). Considering that 67 % of the stations have $\left|\mathrm{\Delta}\stackrel{\mathrm{\u203e}}{\mathrm{IWV}}\right|<\mathrm{1.5}$ kg m^{−2} for ERA-Interim prior to the correction, the application of the numerical correction improves the results (the percentage of stations below 1.5 kg m^{−2} increased to 94 %).

Nevertheless, the application of this correction is not advisable at coastal or insular stations in South and Central America due to the fact that the overvaluation of the model near the coast overlaps with the height problem. These results are in agreement with Ning et al. (2013), who also compared IWV from GNSS with values from two NWMs (ERA-Interim and the Rossby Centre atmospheric climate model – RCA) in Europe for 14 years. The authors also found that models give IWV values larger than GNSS at the seaside or along coasts where the tile of the model includes more than 60 % water.

For this reason, the corrections we propose are always recommended, but they are not advisable at in coastal areas or on islands as at least two grid points of the model are usually in the water.

Data are available at https://doi.org/10.1594/PANGAEA.858234 (Bianchi et al., 2016b).

LIF led the study and contributed to data collection, analysis and interpretation of the results. AMM and MPN cowrote the paper; they also contributed to the statistical analysis and the interpretation of the results. CEB contributed to data collection. All authors read and approved the final paper.

The authors declare that they have no conflict of interest.

We would like to thank the two anonymous reviewers for their valuable comments that highly improved this paper. We would also like to thank the people, organizations and agencies responsible for collecting, computing, maintaining and openly providing the observations and the products employed in this work, including the European Centre for Medium-Range Weather Forecasts (ECMWF) that provided the ERA-Interim reanalysis data (https://apps.ecmwf.int/datasets/, last access: 24 May 2019) and the Global Modeling and Assimilation Office (GMAO) from the National Aeronautics and Space Administration (NASA, USA) that provided the MERRA-2 data (https://gmao.gsfc.nasa.gov/reanalysis/MERRA-2/, last access: 27 October 2016).

This research was supported by the National Scientific and Technical Council of Argentina (CONICET; grant no. PIP 112-201201-00292) and La Plata National University (UNLP; project no. 11G/142).

This paper was edited by Vassiliki Kotroni and reviewed by three anonymous referees.

Berrisford, P., Kållberg, P., Kobayashi, S., Dee, D., Uppala, S., Simmons, A. J., Poli, P., and Sato, H.: Atmospheric conservation properties in ERA-Interim, Q. J. Roy. Meteor. Soc., 137, 1381–1399, https://doi.org/10.1002/qj.864, 2011. a

Bianchi, C. E., Mendoza, L. P. O., Fernández, L. I., Natali, M. P., Meza, A. M., and Moirano, J. F.: Multi-year GNSS monitoring of atmospheric IWV over Central and South America for climate studies, Ann. Geophys., 34, 623–639, https://doi.org/10.5194/angeo-34-623-2016, 2016a. a, b, c

Bianchi, C. E., Mendoza, L. P. O., Fernández, L., Natali, M. P., Meza, A., and Moirano, J.: Time series of atmospheric water vapour and troposphere zenith total delay, over Central and South America, from a homogeneous GNSS reprocessing (MAGGIA ZTD & IWV Solution 1), PANGAEA, https://doi.org/10.1594/PANGAEA.858234, 2016b. a, b

Bock, O., Bouin, M.-N., Walpersd orf, A., Lafore, J. P., Janicot, S., Guichard, F., and Agusti-Panareda, A.: Comparison of ground-based GPS precipitable water vapour to independent observations and NWP model reanalyses over Africa, Q. J. Roy. Meteor. Soc., 133, 2011–2027, https://doi.org/10.1002/qj.185, 2007. a

Bordi, I., Bonis, R. D., Fraedrich, K., and Sutera, A.: Interannual variability patterns of the world's total column water content: Amazon River basin, Theor. Appl. Climatol., 122, 441–455, https://doi.org/10.1007/s00704-014-1304-y, 2014. a, b, c, d

Bosilovich, M. G., Lucchesi, R., and Suarez, M.: MERRA-2: File Specification, available at: https://ntrs.nasa.gov/search.jsp?R=20150019760 (last access: 27 October 2016), 2015. a, b

Buehler, S. A., Östman, S., Melsheimer, C., Holl, G., Eliasson, S., John, V. O., Blumenstock, T., Hase, F., Elgered, G., Raffalski, U., Nasuno, T., Satoh, M., Milz, M., and Mendrok, J.: A multi-instrument comparison of integrated water vapour measurements at a high latitude site, Atmos. Chem. Phys., 12, 10925–10943, https://doi.org/10.5194/acp-12-10925-2012, 2012. a, b

Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A. C. M., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler, M., Matricardi, M., McNally, A. P., Monge-Sanz, B. M., Morcrette, J.-J., Park, B.-K., Peubey, C., de Rosnay, P., Tavolato, C., and Thépaut, J.-N., and Vitart, F.: The ERA-Interim reanalysis: configuration and performance of the data assimilation system, Q. J. Roy. Meteor. Soc., 137, 553–597, https://doi.org/10.1002/qj.828, 2011. a, b

Dessler, A. E., Zhang, Z., and Yang, P.: Water-vapor climate feedback inferred from climate fluctuations, 2003–2008, Geophys. Res. Lett., 35, L20704, https://doi.org/10.1029/2008gl035333, 2008. a

Gao, L., Bernhardt, M., and Schulz, K.: Elevation correction of ERA-Interim temperature data in complex terrain, Hydrol. Earth Syst. Sci., 16, 4661–4673, https://doi.org/10.5194/hess-16-4661-2012, 2012. a

Gelaro, R., McCarty, W., Suárez, M. J., Todling, R., Molod, A., Takacs, L., Randles, C. A., Darmenov, A., Bosilovich, M. G., Reichle, R., Wargan, K., Coy, L., Cullather, R., Draper, C., Akella, S., Buchard, V., Conaty, A., da Silva, A. M., Gu, W., Kim, G.-K., Koster, R., Lucchesi, R., Merkova, D., Nielsen, J. E., Partyka, G., Pawson, S., Putman, W., Rienecker, M., Schubert, S. D., Sienkiewicz, M., Zhao, B.: The modern-era retrospective analysis for research and applications, version 2 (MERRA-2), J. Climate, 30, 5419–5454, 2017. a, b

GMAO: Global Modeling and Assimilation Office, MERRA-2inst1_2d_int_Nx:2d, 1-Hourly, Instantaneous, Single-Level, Assimilation, Vertically Integrated Diagnostics V5.12.4. Greenbelt, MD, USA. Goddard Earth Sciences Data and Information Services Center (GES DISC), https://doi.org/10.5067/G0U6NGQ3BLE0, 2015. a

Heise, S., Dick, G., Gendt, G., Schmidt, T., and Wickert, J.: Integrated water vapor from IGS ground-based GPS observations: initial results from a global 5-min data set, Ann. Geophys., 27, 2851–2859, https://doi.org/10.5194/angeo-27-2851-2009, 2009. a, b, c

Hofmann-Wellenhof, B. and Moritz, H.: Physical geodesy, Springer Science & Business Media, Vienna, Austria, 2006. a, b

Mengistu Tsidu, G., Blumenstock, T., and Hase, F.: Observations of precipitable water vapour over complex topography of Ethiopia from ground-based GPS, FTIR, radiosonde and ERA-Interim reanalysis, Atmos. Meas. Tech., 8, 3277–3295, https://doi.org/10.5194/amt-8-3277-2015, 2015. a

Ning, T., Elgered, G., Willén, U., and Johansson, J. M.: Evaluation of the atmospheric water vapor content in a regional climate model using ground-based GPS measurements, J. Geophys. Res.-Atmos., 118, 329–339, https://doi.org/10.1029/2012jd018053, 2013. a, b

Pavlis, N. K., Holmes, S. A., Kenyon, S. C., and Factor, J. K.: The development and evaluation of the Earth Gravitational Model 2008 (EGM2008), J. Geophys. Res.-Sol. Ea., 117, B04406, https://doi.org/10.1029/2011JB008916, 2012. a

Rienecker, M. M., Suarez, M. J., Gelaro, R., Todling, R., Bacmeister, J., Liu, E., Bosilovich, M. G., Schubert, S. D., Takacs, L., Kim, G.-K., Bloom, S., Chen, J., Collins, D., Conaty, A., da Silva, A., Gu, W., Joiner, J., Koster, R. D., Lucchesi, R., Molod, A., Owens, T., Pawson, S., Pegion, P., Redder, C. R., Reichle, R., Robertson, F. R., Ruddick, A. G., Sienkiewicz, M., and Woollen, J.: MERRA: NASA's Modern-Era Retrospective Analysis for Research and Applications, J. Climate, 24, 3624–3648, https://doi.org/10.1175/jcli-d-11-00015.1, 2011. a

Teke, K., Nilsson, T., Böhm, J., Hobiger, T., Steigenberger, P., García-Espada, S., Haas, R., and Willis, P.: Troposphere delays from space geodetic techniques, water vapor radiometers, and numerical weather models over a series of continuous VLBI campaigns, J. Geodesy, 87, 981–1001, https://doi.org/10.1007/s00190-013-0662-z, 2013. a

van Dam, T., Altamimi, Z., Collilieux, X., and Ray, J.: Topographically induced height errors in predicted atmospheric loading effects, J. Geophys. Res., 115, B07415, https://doi.org/10.1029/2009jb006810, 2010. a, b, c, d, e

Van Malderen, R., Brenot, H., Pottiaux, E., Beirle, S., Hermans, C., De Mazière, M., Wagner, T., De Backer, H., and Bruyninx, C.: A multi-site intercomparison of integrated water vapour observations for climate change analysis, Atmos. Meas. Tech., 7, 2487–2512, https://doi.org/10.5194/amt-7-2487-2014, 2014. a, b

Wang, Y., Zhang, Y., Fu, Y., Li, R., and Yang, Y.: A climatological comparison of column-integrated water vapor for the third-generation reanalysis datasets, Science China Earth Sciences, 59, 296–306, https://doi.org/10.1007/s11430-015-5183-6, 2015. a

Zhu, J.-H., Ma, S.-P., Zou, H., Zhou, L.-B., and Li, P.: Evaluation of reanalysis products with in situ GPS sounding observations in the Eastern Himalayas, Atmos. Ocean. Sci. Lett., 7, 17–22, https://doi.org/10.3878/j.issn.1674-2834.13.0050, 2014. a, b