Impact of Argo Observation on the Regional Ocean Reanalysis of China Coastal Waters and Adjacent Seas : A Twin-Experiment Study

A regional ocean reanalysis system of China coastal waters and adjacent seas, called CORA (China ocean reanalysis), has been recently developed at the National Marine Data and Information Service (NMDIS). In this study, based on CORA, the impact of Argo profiles on the regional reanalysis is evaluated using a twin-experiment approach. It is found that, by assimilating Argo observations, the reanalysis quality is much improved: the root mean square (RMS) error of temperature and salinity can be further reduced by about 10% and the RMS error of current can be further reduced by 18%, compared to the case only assimilating conventional in situ temperature and salinity observations. Consistent with the unique feature of Argo observations, the temperature is improved in all levels and the largest improvement of salinity happens in the deep ocean. Argo profile data have a significant impact on the regional ocean reanalysis through improvements of both hydrographic and dynamic fields.


Introduction
With the gradual increasing of global ocean Argo (Array for Real-Time Geostrophic Oceanography) profile [1], a lot of researches on Argo application have been carried out by scientists and many useful results have been achieved.Argo data especially have been widely used in operational oceanography, such as ocean data assimilation, forecast, and reanalysis [2][3][4][5].
Therefore, it is also necessary to evaluate the impact of Argo on ocean data assimilation.Many research works have addressed this issue.Studies related to this topic can be roughly divided into two classes.The first class is known as Observing System Experiments (OSEs).In these experiments, an ocean analysis, in which all available data are assimilated, serves as a reference.And permutations of combinations of the available observation systems are used in an analysis, in which one observational system is excluded from the analysis, so providing an estimate of the impact of the omitted observational system by comparing the analysis with the reference.In this field lots of works have been done to evaluate the Argo impact.For example, Vidard et al. [6] assessed the relative importance of the tropical in situ mooring arrays (TAO, TRITON, and PIRATA), XBTs, and the developing Argo float network on global ocean analysis and seasonal forecasts, showing a major role for the moorings and in some ways a smaller contribution from Argo than might have been expected.Balmaseda et al. [7] assessed the impact of Argo on ECMWF operational ocean analyses and evaluated the information content of Argo temperature and salinity data, gauged in terms of influence on the ocean state and the skill of seasonal forecasts.Oke and Schiller [8] assessed the relative importance of Argo temperature and salinity profiles, sea-surface temperature, and altimetric sea-level anomalies for constraining upper-ocean  and  properties and mesoscale variability of sea-level anomalies in an eddy resolving ocean reanalysis in the Australian region.Yan et al. [9] assessed the impacts of XBT, TAO, altimetry, and Argo observations on the tropical Pacific ocean data assimilation.Huang et al. [10] investigated the impact of Argo salinity profiles on the NCEP Global Ocean Data Assimilation System (GODAS) in the tropical Indian Ocean.
The second class is known as Observing System Simulation Experiments (OSSEs).The OSSE approach was first adopted by the meteorological community not only to assess the impact of observations but also to optimize the design of observing systems and observing networks in order to improve numerical weather predictions.Oceanographic applications to Argo impact assessment and Argo profile sampling strategy optimization have been reported by several literatures.For example, Schiller et al. [11] performed OSSE using OGCM output to assess sampling strategies for the Argo array in the Indian Ocean.Griffa et al. [12] and Raicich [13] quantitatively investigated the impact of assimilating temperature and salinity profiles from Argo floats in the Mediterranean Sea using the OSSE approach.Zhang et al. [14] show the potential of Argo profiles to initialize the Atlantic meridional overturning circulation (MOC) which is an important source of multidecadal climate variability and trends.Dunstone and Smith [15] investigated the impact of assimilating different amounts of atmosphere and ocean data, including Argo profile, on decadal climate prediction skill through a set of idealized model experiments and found that the upper 2000 m temperature and salinity observations currently provided by the Argo array of floats are potentially well suited to initializing decadal climate predictions.
However, most of the above literatures concentrate on the impact of Argo profiles on data assimilation for global ocean rather than for the marginal seas, such as the coastal waters of China and adjacent seas.
A regional ocean reanalysis experiment which is focused on the coastal waters of China and adjacent seas has already been started a few years ago by National Marine Data and Information Service (NMDIS).The present ocean reanalysis system (China ocean reanalysis, CORA) for this area is achieved, and a dataset package of sea surface height (SSH), three-dimensional (3D) temperature, salinity, and currents in this area (http://www.cmoc-china.cn/) is developed [5,16].
In this paper, based on the CORA system, data assimilation identical twin-experiment approach used in OSSE technique is employed to evaluate the impact of Argo data on temperature, salinity, and current reanalysis results of China coastal waters and adjacent seas.The data assimilation identical twin-experiment approach is one of the useful methods to evaluate the impact of the ocean observation system [14,17].But here we do not touch Argo observing systems optimal design which will be our future goal.Ocean dynamic model, ocean data assimilation scheme used in CORA, and twin-experiment configuration are described in the following section.Section 3 mainly discusses the impact of Argo on the ocean reanalysis of this area and conclusions are given in Section 4.

Twin Experiment of Reanalysis
The OSSE technique consists of identical twin experiments [13], in which data extracted from a reference model run is assimilated into other runs of the same model with different initial conditions.The convergence of the other runs towards the first one is measured to quantify the data assimilation effectiveness in driving the model with "wrong" initial conditions towards the reference, and the comparison between analyses with and without Argo profiles data assimilated can provide an estimate of impact of Argo profiles.All runs are driven by the same external forcing.

Ocean Dynamic Model.
The NMDIS-developed parallel version of Princeton Ocean Model with generalized coordinate system (POMgcs) [18,19] serves as the ocean dynamical model in CORA.The original version of POMgcs, known as POM (Princeton Ocean Model), has been widely used [20,21].Following [22,23], the wave breaking parameterization of Mellor and Blumberg [24] is introduced into POMgcs to deepen the mixed temperature surface layer.The tidal generating potential and tidal open boundary condition of eight major tidal constituents ( 2 ,  2 ,  2 ,  2 ,  1 ,  1 ,  1 , and  1 ) are added to this model to simulate tidal effect.
The study area extends from 10 ∘ S to 52 ∘ N in latitude and from 99 ∘ E to 150 ∘ E in longitude.The model grid spacing is varied from 1/2 ∘ to 1/8 ∘ and the area with the highest horizontal resolution 1/8 ∘ extends from 19 ∘ N to 33 ∘ N in latitude and from 117 ∘ E to 130 ∘ E in longitude.Due to the huge range and the complicated bathymetry of the study area, the hybrid coordinate of POMgcs is in use, in conformity with the study of Mellor et al. [18].The -level vertical grid is used in the area with local depth less than 200 m, that is, the continental shelf, to form a terrain-following vertical grid.While in the area with local depth greater than 200 m, that is, slope area and deep sea area, in order to simulate satisfactorily the upper mixed layer and thermocline, most of the vertical levels at the upper and middle ocean are set to -level, and at the level depth greater than 0.9 proportion of the local depth only a few vertical levels near seafloor are set to terrain-following -level to represent the seafloor preferably to avoid the stair-step effect.Coarse vertical level near seafloor can make the hydrostatic consistency criteria satisfied.There are 33 vertical levels with a maximum depth of 4500 m.The vertical -level grid is listed in Table 1.
The twin experiment of reanalysis spanned four years from January 2005 to December 2008.SODA (Simple Ocean Data Assimilation) monthly-mean product of each year serves as the open boundary condition for sea surface height (SSH), temperature, salinity, and current.Wind field uses Quick Scatterometer (QuikSCAT) wind field of daily Level 3 data with the horizontal resolution being 1/4 ∘ .And heat flux field comes from the National Centers of Environment Prediction (NCEP) reanalysis version 1 product.

Ocean Data Assimilation
Scheme.The ocean data assimilation scheme used in CORA is a sequential threedimensional variational (3D-Var) analysis scheme designed to assimilate temperature and salinity using a multigrid framework [5,16].This sequential 3D-Var analysis scheme can extract long and short wavelength information in turn quickly from observations system and provide objective and accurate analysis.The basic idea of this data assimilation scheme can be referred to in [25][26][27][28][29][30].In this study, 9 level grids are applied from 2 × 2 × 2 (only one big cell containing the whole study domain) to 257 × 257 × 33 (the horizontal interval is about 0.2 ∘ × 0.24 ∘ , and the vertical resolution is the same as that of the model).The smooth penalty parameter, weighing how strong the smoothing term will be, is set to 0.005.Square of the simulated observational error, which will be discussed in the next section, is used to specify the diagonal element of observational error covariance matrix, while the off-diagonal element is set to zero.Fifty iterations are employed to every level grid's optimization.The basic idea proposed by Troccoli et al. [31] is employed to make salinity adjustment during temperature assimilation.In this scheme, the - relation is basically conserved during the temperature data assimilation; and salinity measurements are assimilated to adjust the - relation only if such measurements are available.Following Troccoli et al. [31], a latitudinal filter has been applied to the salinity increments so that the whole salinity increment is applied only within 30 ∘ of the equator.Outside this region, the weight given to the salinity adjustment diminishes linearly to zero at latitudes poleward of north edge of model domain.This is done to avoid implementing the salinity correction scheme in areas where the stratification is weak and () persistence is more appropriate.
The assimilation is performed every day and a seven-day time window is used to incorporate as many observations as possible.

True Fields and "Observation" Construction.
With the open boundary conditions and meteorological driving force described above and with CORA reanalysis of January 1, 2005 serving as the initial field, 4-year integration is carried out from January 2005 to December 2008 by using the NMDIS-developed parallel version of POMgcs.Then the 4year simulation results serve as the true fields for comparing the following twin-experiment reanalysis results to evaluate the impact of Argo on this regional reanalysis.
Following the method in [14], "observations" needed in reanalysis twin-experiment are constructed by interpolating the above true fields to the temporal and spatial information of real temperature or salinity observation networks, including Nansen bottle, conductivity-temperature-depth (CTD), and various bathythermograph (BT) and Argo profiles.It should be noted that CTD and Argo profiles may have both temperature and salinity observations, while the others profiles may only have temperature observations.And compared to CTD profiles, Argo profiles may be distributed relatively homogeneously.A Gaussian white noise with the mean and standard deviation being 0.0 ∘ C (0.0 psu) and 1.0 ∘ C (0.2 psu), respectively, is added to temperature (salinity) "observation" as random error simulation.The imposed white noise attempts to account for random measurement errors of the observing system.For simplicity, "observations" which have the temporal and spatial information of Nansen bottle, CTD, and variety of BT are called "conventional observations, " and those with Argo temporal and spatial information are called "Argo observations." Distributions of "observations" used in reanalysis twin experiment from January 2005 to December 2008 are shown in Figure 1.It is found that dense "observations" in this study domain are mainly distributed around Japan, including Japan Sea and Kuroshio to the south of Japan.In other areas, except for some moored profiles which have many data, "observations" are scarcely distributed.In China coastal waters especially, including Bohai Sea (BS), Yellow Sea (YS), East China Sea (ECS), and South China Sea (SCS), most areas are measured only once per year."Argo observations" are homogeneously distributed in the Pacific Ocean to the east of Ryukyu Island.Along with current, several Argos drifted to the west of Ryukyu Island and in 2008 several Argos drifted in the northern SCS.

Twin-Experiment Configuration.
Three twin experiments are carried out to study the impact of Argo on temperature, salinity, and current reanalysis results of the China coastal waters and adjacent seas.They all use the setup ocean model described in Section 2.1 and data assimilation scheme described in Section 2.2 and are subject to the same meteorological driving force and open boundary condition as those used in construction of true field and all run for the period from January 2005 to December 2008.However, the initial condition is different from that of true field.Observational climatology of temperature and salinity field in August serves as the initial condition for the three twin experiments and the initial sea surface height and current field are obtained by one-year diagnostic spin-up using the same model.No "observation" is assimilated in Experiment 1; that is, Experiment 1 is a free run, while Experiment 2 only assimilates "conventional observations" and Experiment 3 assimilates all the "observations" including "conventional observations" and "Argo observations." The impact of satellite SST and SSHA on reanalysis is beyond the scope of this study, so no experiment assimilates satellite observations.By comparing the twin-experiment daily results with the true fields, root mean square error (RMSE) for 4-year      4(a)).By assimilating  "conventional observations, " salinity RMSE of Experiment 2 is obviously lower than that of Experiment 1 by about 25%.By further assimilating "Argo observations, " the salinity RMSE of Experiment 3 is lower than that of Experiment 2 by about 10%; that is, the salinity RMSE of Experiment 3 is lowest among all the three twin experiments.It is found that the improved area of Experiment 2 compared to 1 includes Japan Sea, Kuroshio to the south of Japan, and area to the east of Taiwan where many "observations" can support the analysis (Figure 4(d)).

Impact of Argo on Ocean Reanalysis
Besides the above improved area, the improved area of Experiment 3 compared to 1 is further enlarged (Figure 4(e)).It can be seen from Figure 4(f), as a necessary kind of supplement to "conventional observations, " salinity data assimilation of "Argo observation" can significantly improve the salinity analysis to the east of Luzon Strait and the western part of Japan Sea.From the vertical structure of salinity RMS error of these three twin experiments (Figure 5(a)) and for the similar reason as that for temperature, the maximum RMSE is at surface, but another obvious extreme exists between 500 m and 1000 m in Experiment 1. Different from that of temperature, although salinities in all levels are improved not only in Experiment 3 (Figure 5(c)) but also in Experiment 2 (Figure 5(b)), the maximum improvement concentrates in the middle level between 500 m and 1000 m (Figure 5(d)).
Large RMSE in upper 300 m ocean mainly comes from the continental shelf (Figure 4(a)), where almost no salinity observation exists.Therefore, only a little improvement is found in the upper 300 m ocean in both Experiments 2 and 3, and Argo does not provide additional useful information to improve the salinity reanalysis on continental shelf.However, in the interior ocean, especially in the area with depth deeper than 1000 m, Argo can provide additional useful salinity observations, which can improve salinity reanalysis significantly.

Impact of Argo on Current
Reanalysis.Since this study assimilates temperature and salinity "observations" rather than velocity "observations, " true velocity field can be added in the verification of reanalysis as an independent element (Figure 6).For both U component and V component, 2.0      improved the Kuroshio transport of East China Sea to the west of Ryukyu Island.Figure 9 gives the time series of the differences of the Kuroshio transport at PN section of East China Sea between these three twin experiments and the truth.The PN line is an important section which is about from (124.5 ∘ E, 30 ∘ N) to (128.23 ∘ E, 27.45 ∘ N), and oceanographic data obtained along this section are the most significant in the investigation of Kuroshio in East China Sea [32].Diagnosed from temperature and salinity observation at PN section, the average of observed PN relative geostrophic volume transport is about 25.8 Sv, with a mean seasonal maximum of 27.0 Sv in summer and minimum of 23.9 Sv in autumn [32].In our true field, the averaged PN transport is about 26.

Conclusion
In this paper, based on the CORA system, three twin experiments are used to evaluate the impact of Argo data on temperature, salinity, and current reanalysis results.The main conclusions can be drawn as follows.
(1) As a necessary kind of supplement to conventional temperature and salinity observations, Argo data  assimilation can further improve the reanalysis accuracy of temperature and salinity which have been improved by assimilating only conventional observations and further enlarge the improved area.With Argo profiles assimilated, the RMSEs of temperature and salinity reanalysis can be further reduced by about 10% in our study domain.Consistent with the unique feature of Argo observations, the temperature is improved in all levels and the largest improvement of salinity happens in the deep ocean.
(2) Velocity field can be determined by density field in large scale to the first order under geostrophic or quasigeostrophic approximation; therefore, as long as temperature and salinity can be improved by data assimilation, density structure and the corresponding velocity field can be improved.Argo profiles are widely distributed in World Ocean, so the ocean circulation field can be significantly improved by assimilating Argo.With Argo profiles assimilated, the RMS errors of current reanalysis can be further reduced by about 18% in our study domain.

Figure 7 :
Figure 7: Similar to Figure 4 but for U component.

Figure 8 :Figure 9 :
Figure 8: Similar to Figure 4 but for V component.

Table 1 :
The vertical -level grid list.
(1)ce no "observation" is assimilated in Experiment 1 and the initial field (climatological temperature in August) has big difference compared with the true field (CORA temperature of January 2005), the mean RMSE of temperature over whole domain is about 3.676 ∘ C using(1).The largest RMSEs appear in China coastal water, Thailand Bay, and Karimata Strait where few "observations" can support the analysis.Luzon Strait, area to the east of Taiwan, Japan Sea, and Equator area also have large RMSEs.
(2).Impact of Argo on Temperature Reanalysis.Figures2(a), 2(b), and 2(c) show the horizontal distribution of vertical averaged temperature RMSEs of the three twin experiments, respectively, using(2).RMSEs between Experiments 1 and 2, and Figure2(e) shows that between Experiments 1 and 3. Therefore Figures2(d) and 2(e) represent the improvement of Experiments 2 and 3 relative to 1, respectively.Figure 2(f) shows the difference of temperature RMSEs between Experiments 2 and 3, which represents the improvement of Experiment 3 relative to 2.Positive value means RMSE decreasing and reanalysis improving, and, on the contrary, negative value means reanalysis degenerating.It is found that the improved area of Experiment 2 compared to 1 includes Japan Sea, Kuroshio area, area to the east of Taiwan, north part of SCS, and Equator area where many "conventional observations" can support the analysis.In addition to the above improved area, the improved area of Experiment 3 compared to 1 salinity over whole domain is about 0.485 psu.Similar to that of temperature analysis, the largest RMSEs of salinity appear in China coastal water, Thailand Bay, and Karimata Strait where few "observations" can support the analysis.Luzon Strait, area to the east of Taiwan, Japan Sea, and Equator area also have large RMSEs (Figure 5Sv, and in 2007 the maximum is 29.7 Sv occurring in summer and the minimum is 23.3 Sv occurring in autumn, which is basically consistent with the observation.Due to the bad initial condition which may significantly deteriorate the Kuroshio analysis and simulation, the RMSE of Kuroshio transport in Experiment 1 is 15.1 Sv, while that in Experiment 2 with conventional observations data assimilated reduced to 4.4 Sv and that in Experiment 3 with all observations data assimilated further reduced to 2.9 Sv.Compared with Experiment 2, further assimilation of Argo profile may improve the reanalysis accuracy of Kuroshio transport by about 1.5 Sv and speed up the current simulation toward the true field.The reason may be that since Argo data assimilation can improve the current field of Pacific Ocean to the east of Ryukyu Island which is the open boundary condition of marginal sea current field, the improvement in Pacific Ocean circulation may successively improve the marginal sea current field including the western boundary current such as Kuroshio due to the continuity of fluid.Argo profiles are widely distributed in World Ocean, so the ocean circulation field can be significantly improved by assimilating Argo.