Interactive comment on “ Water stress induced breakdown of carbon-water relations : indicators from diurnal FLUXNET patterns

The analysis presented here is very broad (i.e. results are synthesized across many sites, and often many PFTs). Cross-site syntheses like this are invaluable for understanding broad patterns in vegetation functioning; however, the objectives of this study would benefit from a more in-depth analysis of the results at a least few sites, preferably sites where there exist independent estimates of plant water relations during pe-


Introduction
Processes such as photosynthesis and transpiration are so intimately linked that knowledge and assumptions about one process are needed to accurately understand the other.Unfortunately, the relationship between carbon and water cycles is not fully understood (Tang et al., 2014), passing the biases and uncertainties caused by an incomplete carbon-water framework back onto flux estimates specifically and global water and carbon cycle interactions and dynamics in general (Keenan et al., 2013;Schlesinger and Jasechko, 2014;Ito and Inatomi, 2012).One source of uncertainty that is increasingly being identified is the diverse responses of plants to water limitation (Zhou et al., 2013;Dietze et al., 2014;Rogers et al., 2017), which hampers the understanding and predictability of water and carbon cycles during drought.Here we outline potential causes of uncertainty in carbon-water dynamics in an effort to outline data-derived inductors based on current theory.
Classically, vegetation water and carbon fluxes are linked by stomates, where an open stomate allows CO 2 to enter the leaf and, consequentially, water is lost.Most theoretical frameworks make some form of assumption that carbon assimilation (A) and water losses (T ) are both contingent primarily on leaf stomatal conductance (g s ).This assumed relationship allows us to pass between the realms of carbon and water, based on the assumption that at any given time both A and T are proportional to the stomatal conductance multiplied by the difference in internal and external CO 2 and water vapor concentrations.More specifically, where c and v are the differences in inner and outer stomatal cavity concentrations of CO 2 and water vapor, respectively.These diffusion equations lead to the relatively consistent carbon : water ratio, generally expressed as a water use efficiency (WUE = A/T ).At the ecosystem level where direct measurements of A and T are not available, WUE is simply calculated as the ratio of gross primary productivity (GPP) to total evapotranspiration (ET) (Kuglitsch et al., 2008).These carbon-water links are fundamental to understanding how stomata are regulated and underlie key functioning in mechanistic plant and ecosystem models.One such set of models are those based on optimality theory which posit that plants tend to optimize carbon gains to water losses, such as the models described by Katul et al. (2010) and Katul et al. (2009).These concepts from Katul, which carry the assumptions of RuBisCO (light) limitation, were built upon by Zhou et al. (2014) and Zhou et al. (2015) to give the equation where the √ VPD accounts for the stomatal response to vapor pressure deficit (VPD), assuming the stomatal response optimizes carbon gain to water losses.Accounting for the VPD response allows for a more stable metric of WUE that is temporally more stable and physiologically more meaningful, such as when comparing the diurnal cycles of carbon and water.As ET is the sum of both T and non-biological evaporation (e.g.soil and intercepted evaporation), often periods during and shortly after rain events are excluded from WUE estimates to minimize the influence of non-plant evaporation.Ultimately, calculations of WUE provide a simple summary of the cost in water per carbon gain and become an indicator of how plants have and will adapt to the physical limitations of their changing environments (Keenan et al., 2013;Tang et al., 2014).
Though assuming a rigid carbon-water relationship works well in conditions when ecosystems are moderately wet, conditions associated with the majority of carbon and water fluxes, an inflexible carbon-water assumption is unsatisfactory in that these assumptions may break down as plants shift from light to water limitations.Indeed, in a review of leaflevel stomatal conductance models, Damour et al. (2010) concluded that the majority of stomatal models fail to adequately capture the effects of drought.This failure to capture the effects of drought is not only disconcerting as waterlimited conditions are when ecosystems are most at risk, but an incomplete framework tends to propagate errors and uncertainties from models into estimates of the water and carbon cycles.For instance, in outlining a road map for an improved modeling of photosynthesis, Rogers et al. (2017) noted as key recommendations both improving information about water-carbon relations (in the form of the stomatal slope parameter g 1 ) as well as improving the understanding of the response of carbon assimilation to drought.Similarly, in an analysis of parameter uncertainties for a terrestrial biosphere model, Dietze et al. (2014) found that two of the top five parameters contributing to the predictive uncertainty of net primary productivity were associated with plant water regulation.This uncertainty is reflected in the stomatal conductance parameterization exercise from Knauer et al. (2015), where the authors were able to improve model performance in predicting eddy covariance (EC)-measured GPP and ET by including atmospheric effects (in the form of VPD) on stomatal conductance but concluded that further improvement required a global understanding of water limitation response variation across plant functional traits and growing conditions, which is currently unavailable.
Two ideas to account for the errors in carbon-water assumptions under dry conditions have begun to emerge: that hydraulic limitations in transporting water from root to leaf change stomatal responses and thus limit transpiration under high demand and that changes in the intra-leaf processes of carbon transport and fixation under drought conditions result in non-stomatal limitations that impact carbon assimilation independently of water fluxes (Novick et al., 2016).
As soil water potentials in the root zone become increasingly negative, the long-term plant strategy may turn from optimizing carbon fixation to preventing damage to hydraulic architecture (Tyree and Sperry, 1988).As such, stomata and transpiration are likely to increasingly respond not just to atmospheric conditions but also to soil moisture.Under this hydraulic limitation framework, a plant will be reacting to the inability to transport water, even though the key control mechanism for a plant is via the stomata, possibly expressed as an increase in sensitivity.Such assumptions are consistent with the mechanisms encoded in some land surface and ecosystem models, which account for water limitations by scaling the water-to-carbon ratio in relation to available soil moisture.Though this method should link the leaf physiology to the soil and thus capture some hydraulic limitation, it has been criticized for not capturing the variety of drought responses found in different plant species and ecosystems (De Kauwe et al., 2015).This diversity in plant responses has been pointed to as a key point of uncertainty in earth system models (Dietze et al., 2014).
Though ecosystem water and carbon fluxes are predominantly controlled by stomates, non-stomatal or bio-/photo-chemical inhibitions to carbon assimilation are worth considering as they have the capacity to decouple the watercarbon exchange.This decoupling could include conditions where the stomates are transpiring water but intra-leaf factors are slowing carbon fixation, changing the intrinsic water use efficiency directly.Intra-leaf factors could include effects such as the production of reactive oxygen species (Lawlor and Tezara, 2008); environmental limitations on the photosynthetic pathways, such as leaf temperature (Medlyn et al., 2002); or declines in mesophyll conductance (Flexas et al., 2012).Non-stomatal limitations have been observed at the ecosystem scale (Reichstein et al., 2002;Migliavacca et al., 2009), though the exact mechanism is difficult to elucidate (Reichstein, 2003).These effects likely vary between species, as well as with the rate of the onset of drought, access to water, and other environmental conditions.

Objectives
There seems to be a collective conclusion that the breakdown of carbon-water assumptions needs to be better characterized in general and specifically for the implementation in modeling frameworks (Egea et al., 2011;Flexas et al., 2012;Zhou et al., 2013;Manzoni, 2014;De Kauwe et al., 2015).
Though the problem is becoming clear, the way forward is hampered by an uncertainty in how to capture the large variety in the response to drought across climates, strategies, and species.In this sense, the use of EC-measured diurnal patterns of carbon, water, and energy fluxes to derive clues on ecosystem drought responses at a daily resolution could prove valuable both as a means to identify potential periods of ecosystem stress, inform machine learning algorithms on ecophysiological conditions not found in environmental variables, as well as benchmarking a model's ability to capture sub-daily dynamics.To this end, we propose two data-driven indicators of water stress, the Diurnal Water-Carbon Index (DWCI) and the relative diurnal centroid in latent energy (LE) (C * ET ).Both metrics are derived directly from the EC data and based on expected physiological responses to hydraulic and non-stomatal limitations.Using these data-driven indicators we then characterize the distribution of these limitations across a global spread of climate and vegetation types.Finally, we explore the ability of these indicators to detect the disagreements between modeled and observed water use efficiency and explore how these biases may be attributed to hydraulic and non-stomatal limitations.
2 Methods and materials

Data
Carbon, water, and all fluxes measured with EC, as well as meteorological data, were obtained from the 2007 FLUXNET La Thuile Synthesis Dataset (NOA, 2007).Half-hourly latent heat and net ecosystem exchange (NEE) fluxes were collected and processed using standard quality-assurance-quality-control (QA/QC) procedures (Papale et al., 2006), gap-filling, and partitioning algorithms (Reichstein et al., 2005).From the database, half-hourly GPP and ET data (derived from latent heat flux measurements) were downloaded and used for the following analysis.An interactive map of sites used can be found in File S1 in the Supplement.
In order to provide a consistent measure of ecosystem dryness that can be utilized across sites, the ratio of water evaporated to potential water evaporated was calculated as evaporative fraction (EF), or the fraction of actual ET to Potential ET (PET).PET was calculated as the daily fraction between the measured ET and estimated ET via a Priestley-Taylor model (Priestley and Taylor, 1972) using site measured net radiation (R n ) and air temperature (T air ).The slope (alpha parameter) was fit for each site year using 95th quantile regression (Koenker and Bassett Jr., 1978) instead of using the original 1.26 value derived for a "well watered crop" (Priestley and Taylor, 1972).
In order to obtain high-quality data and minimize the influence of abiotic evaporation (hereafter just evaporation), all data was filtered with the aim to include only non-gapfilled data in the growing season with dry surface conditions.The growing season was defined as all days where GPP > 1 gC m −2 d −1 and daily mean air temperature > 5 • C.These threshold were shown to give a good response in the proposed metrics while minimizing variability due to low diurnal signals, a sensitivity analysis of which can be found in the Supplement Fig. S2.In an effort to minimize contributions of evaporation, the conservative soil wetness index (CSWI) was employed which was designed to estimate whether the ecosystem is likely to have "dry" surfaces and therefore ET is likely to be dominated by transpiration.This approach requires a certain amount of evaporation to occur after a rain event before the surface is considered to be dry and can be contrasted to the method of removing a set time period after rain employed in previous studies (Medlyn et al., 2017;Beer et al., 2009;Keenan et al., 2013).CSWI is calculating by first quantifying the storage at time t (S t ) as where ET t and P t are the ET and precipitation at time step t, respectively, and S t is effectively capped at a maximum storage value of S o , which was set to 5 mm.Furthermore, to make the metric conservative with regard to assumed water inputs, any precipitation event will refill the storage from 0 mm: which has the effect of requiring all precipitation up to 5 mm to be evaporated from the system before negative storage can occur.Any gaps in the precipitation data were assumed to be a precipitation event of 5 mm in order to prevent any unmeasured precipitation from biasing the results by inadvertently including rainy days.The code and a further outline of the algorithm can be found in File S3 as well as in Nelson (2017).
Evaporation was assumed to be negligible when CSWI < 0. This method was used over the more standard method of removing 1-5 days after a rain event, as it does not make the assumption that the surface will dry in a fixed amount of time, instead relying on a minimum amount of ET.As a comparison, the median time period for the CSWI to go from fully wet (CSWI = 5) to dry (CSWI ≤ 0) was 3.5 days across all sites in summer, where summer was defined as the period when daily potential radiation was above median daily potential radiation for each site.
The data filtering as outlined in this section was designed to isolate periods firmly in the growing season when plants are active and the signal of ET is most likely to be dominated by plant controls.

Relative diurnal centroid (C * ET )
As soils dry, it becomes more difficult to transport stem and root zone moisture to the leaf, potentially causing hydraulic limitations for the plant to transport water.This shift was seen in eddy covariance data in a study by Wilson et al. (2003), who examined the shift of latent compared to sensible heat, which suggested that a shift in water fluxes towards dawn can be indicative of afternoon stomatal closure.Shifts were further explored in a modeling study by Matheny et al. (2014), which found that the morning shift was not well captured by models and attributed the errors to inadequate hydraulic limitations in the models.The daily cycle of wetting and drying acts as a capacitor in the hydraulic circuit, allowing water stores to be more easily transported in the morning and de-pleted in the afternoon.As bulk soil moisture declines, this effect may be strong enough to shift the diurnal cycle of ET significantly toward the morning.Quantifying diurnal shifts in EC data using the diurnal centroid was first explored by Wilson et al. (2003) and defined as the flux weighted mean hour or where t is a regular, sub-daily time interval (here t measures as decimal hour at half-hourly time step).The resulting C flux is the weighted mean hour of the diurnal cycle of that particular flux for that particular day.For example, if a calculated C ET for a given day (using measurements of decimal hour) equals 12.25, this would entail that the weighted mean for that day is 15 min past noon.Figure 1 shows an example of the shifts in the monthly average cycle from a wet month to a dry month.In order to isolate a shift, we then had to control for variations in global radiation (R g ) and fluctuations due both to clouds and to differences in the timing of solar noon.Therefore, the difference between the diurnal centroids of ET (C ET ) and R g (C R g ) was calculated as giving C * ET as the diurnal centroid of ET relative to R g .The resulting values of C * ET are not tied to the carbon cycle, which can be affected by non-stomatal limitations and generally shows a more prominent midday depression.The annotated code for the CSWI calculation can be found in File S4 as well as in Nelson (2017).Though a diurnal centroid can be calculated for any diurnal cycle, basing a metric on the morning shift of ET relative to R g has the advantage of targeting the non-atmospheric drivers of the water flux, of which there are few ancillary variables.

Diurnal Water-Carbon Index (DWCI)
If transpiration and carbon assimilation are predominantly controlled by stomatal conductance, it follows that their diurnal cycles should be largely in sync.In other words, regardless of a plant's maximum T or A, if the stomates start to close, both rates should decrease by a similar percentage.On the other hand, non-stomatal limitations that inhibit carbon assimilation independently of water have the capability to alter the diurnal cycle on just one flux, causing them to decouple.In an effort to quantify the degree of carbon-water coupling for an individual day, we examined the relationship of GPP and ET, where This relationship incorporates the assumption that, at least over short timescales, the amount of carbon that enters the leaf is proportional to the amount of water that leaves and also incorporates the nonlinear response of stomates to VPD (Katul et al., 2010(Katul et al., , 2009;;Zhou et al., 2014).This model, though simple, has been shown to work well across a variety of EC sites (Zhou et al., 2015).Figure 2a and b show a comparison between the daily cycles in a wet and a dry month.By calculating a daily correlation between the normalized daily cycles of ET and GPP • √ VPD, we come to a correlation coefficient for each day (see Fig. 2c, d).For well-watered days in the growing season, the two signals tend to be well correlated (ρ > 0.9), but they tend to be less correlated in periods of stress, a comparison of which can be seen in Fig. 2c, d.
As it is, this daily correlation coefficient is dependent on the signal strength, or magnitude, of the flux.Low correlation values could just as easily be from carbon-water decoupling as from a low signal-to-noise ratio.Therefore, to produce a more robust metric and account for these statistical decreases in correlation, we turned the daily correlation coefficient into an index based on its rank in a distribution of correlation coefficients from artificial datasets.These artificial datasets are constructed using the diurnal signal from potential radiation, with Gaussian noise (N (0, σ )) added according to the standard deviation random uncertainty of the ET and NEE fluxes or and Uncertainties of the NEE and ET fluxes were estimated from the gap-filling procedure of Reichstein et al. (2005), with the uncertainty equal to the standard deviation of flux measurements within a time window and similar meteorological conditions.As GPP is calculated from gap-filled values of NEE, the uncertainty from NEE was used for GPP.Furthermore, the correlation structure between the noises in LE and NEE was preserved in the artificial dataset.
In essence, by using the underlying signal from potential radiation, both the artificial ET and GPP • √ VPD are perfectly correlated when no noise is added.Adding noise then isolates the decoupling effect of the signal-to-noise ratio.An artificial correlation coefficient can then be calculated from the two artificial datasets in the same manner as from the real dataset, and this experiment is repeated 100 times for each day, giving a daily distribution of artificial correlation coefficients.The rank of the real correlation coefficient in the distribution from the artificial set gives a probability that the carbon and water signals are actually coupled.The resulting index has a range of 0-100, with 100 indicating that the real correlation coefficient was greater than the entire artificial set, and therefore it is very likely that carbon and water are coupled.From this index we can now quantify if the water and carbon signals are coupled for any given day, and therefore shed light on whether the two fluxes are only controlled by the opening and closing of stomates.Annotated code for this calculation can be found in File S5 as well as in Nelson (2017).

Models and parameter estimation
In order to benchmark whether these metrics are capturing information that is possibly not being captured in modern model frameworks, three simple models were used to estimate WUE (GPP/ET) for each day at each site and compared to actual flux data.The purpose of the exercise was to evaluate if bias in the model predictions was associated with decoupled or morning-shifted days, thus indicating that the metrics correspond to information that the models are unable to capture.Here we utilize three models to provide a spectrum of theoretical to empirical basis.The Katul-Zhou model, as defined and used in the calculation of the DWCI, is based in stomatal optimization theory (Katul et al., 2010(Katul et al., , 2009;;Zhou et al., 2015), which makes the assumption that the WUE is constant if corrected by the effect of VPD, using an inverse square root as the assumed relationship.Though the constant nature of the underlying water use efficiency (uWUE, following convention in Zhou et al., 2015) may not be correct, with the optimal carbon cost of water changing over days or weeks (Manzoni et al., 2013;Palmroth et al., 2013), a yearly parameter of uWUE was estimated which is consistent with other modeling exercises (Zhou et al., 2016).A revision of this model which is one step away from a theoretical basis is the Boese model by Boese et al. (2017), where an additional radiation term was added such that www.biogeosciences.net/15/2433/2018/Biogeosciences, 15, 2433-2447, 2018 where i and r are parameters fit to each site year.This relationship with R g was shown to have a better predictive performance for EC data from 115 sites (Boese et al., 2017).The interpretation of this extra radiation term is not clear and is difficult to reconcile with the current understanding of physiology.It is possible the term could be related to biophysical effects, e.g.VPD at leaf surface vs. the measured ambient VPD.Nevertheless, the Boese model is an empirical and ecosystem-scale model that complements the theoretical and originally leaf-level model from Katul-Zhou.
Parameters of these models were estimated for each site year.The Boese model parameters were fit using trimmed least squares regression (TLS), which minimizes the 90th percentile of sum of squared errors (SSE) to prevent the influence of large outliers (Rousseeuw, 1983;Reth et al., 2005).As the error in both ET and GPP are assumed to be of a similar magnitude, the i parameter in the Katul-Zhou model was calculated using geometric mean regression, where the final slope was calculated as the geometric mean of the parameters from Both the Katul-Zhou and Boese models are theoretically based and, implemented here, have the underlying assumptions of RuBisCO-limited conditions and a constant carbon cost of water throughout the season which may not reflect reality.Therefore a fully empirical and highly nonlinear model can give insight into how much information is actually stored in the data while minimizing any assumptions.As a fully empirical model, a random forest regression (RandomForestRegressor from Pedregosa et al., 2011based on Breiman, 2001) was fit to half-hourly ET data for each site using R g , VPD, T air , GPP, and year as input parameters.Values were estimated using 50 trees, with predictions made using out-of-bag estimates to prevent over-fitted model predictions.

Results
As a case study, C * ET and DWCI time courses for six sites from Europe are shown in Fig. 3, with an emphasis on 2003 when the continent was struck by a heat wave that was shown to effect both the carbon and water cycles (Ciais et al., 2005;  Reichstein et al., 2007;Granier et al., 2007).For DWCI, forest sites showed high water-carbon coupling throughout the growing season, with the exception of Puéchabon (FR-Pue), which showed a regular seasonal cycle of decoupling.The grassland site (HU-Bg) showed a higher variability in DWCI compared to the forest sites (all others).All sites showed either a decrease in median DWCI or an increase in variability during 2003, generally in July or August, particularly at Hainich (DE-Hai), Bugacpuszta (HU-Bug), and El Saler (ES-ES1).This increase in decoupling during 2003 is consistent with the hypothesis of non-stomatal limitations being expressed in hot, dry conditions which can affect carbon fixing mechanisms.Median diurnal centroid values across all years varied in absolute magnitude but were generally near or above zero, i.e. the water cycle showed no shift or an afternoon shift.One exception would be the Mediterranean oak forest of Puéchabon, which shows a slight seasonal cycle of morning shifts going from a slight afternoon shift to a slight morning shift during June, July, and August.During drought years, sites that showed distinctive morning shifts were Puéchabon (FR-Pue), Sorø (DK-Sor), and Loobos (NL-Loo).The framework that morning shifts are associated with water stress from soil moisture depletion would be supported by the increase in morning shifts during 2003, though factors such as species composition and access to soil water would play a significant role and could account for the differences among sites.All sites had significantly different (p < 0.05, Wilcoxon rank-sum test) DWCI values between 2003 and all other years except Puéchabon, whereas with C * ET only Puéchabon, Sorø, and Loobos showed significant differences.

Distribution of data-driven indicators by vegetation type and climate
The frequency of low values of diurnal centroid and DWCI across climate groups and plant functional types is shown in Fig. 4. The thresholds designating decoupling and morning shifts were 25 and −0.25 for DWCI and C * ET , respectively.These thresholds were chosen to highlight frequency differences between sites and were shown to have large metric responses under dry conditions while having low frequencies under wetter conditions (see sensitivity analysis in Fig. S2).Furthermore, these thresholds result in a similar median frequency of uncoupled and morning-shifted days between all site years being 8.7 and 9.4 % of days, respectively.The similarity in median frequencies across site years allowed for easier intercomparison between the two metrics.The frequency of decoupling and morning shifts using these thresholds for each site can be found in the map found in File S1.Though there is a fairly large variance across climate groups and plant functional types, low values of both DWCI and C * ET occur at higher frequencies in savanna ecosystems and dry or Mediterranean climates.Conversely, lower frequencies of both metrics are seen in tropical, boreal, and temperatecontinental climates.Strikingly, the arid and semiarid climate group seems to be associated with the majority of low DWCI occurrences, with a median frequency of about 20 % of days being uncoupled between site years.Overall, frequencies were highly variable within plant functional types.high frequencies from savannas and grasslands.The differing responses between tree-and grass-dominated ecosystems can be further seen in Fig. 5, where savanna and grassland ecosystems show a distinct decrease in DWCI under conditions of low EF, in contrast to the forested sites which show a higher degree of carbon-water coupling, though still a slight decrease.Forested ecosystems show a higher degree of morning shift under low EF conditions when compared to grasslands, with savannas being somewhere between the two.
The response of both variables to drought stress is further observed in Fig. 6, where low mean values of both DWCI and C * ET are associated with conditions of high net radiation and low latent energy, indicative of drought.As this figure includes all days from all sites which meet the filtering outlined in the "Data" subsection of the "Methods and materials" section, i.e. dry periods in the growing season, these fig-ures exhibit the universality of the metrics across climates, ecosystems, and time periods.This pattern is much cleaner with the diurnal centroid than with DWCI, though mean values are generally above 50 for most bins, indicating that most days are well coupled.Low values of both indicators are also seen under conditions with low R n and high latent energy (as seen by the dark streak at the top edge in Fig. 6c, e), which is generally not associated with drought stress.Further analysis showed that these points are also associated with energy balance over-closure, where the sum of latent and sensible heat is greater than net radiation (ET + H > R n , see Fig. S2), and therefore they likely represent a data problem rather than a physiological response.Removing all days where the energy balance is over-closed did not alter the patterns associated with drought.Apart from the response to periods of high LE and low R n , the metrics showed a diverging response when looking at EF (ET / PET which is similar to LE / R n ) and ET showing a much stronger response to EF (Fig. 6a,  d).This difference in response would indicate that DWCI is more responsive to atmospheric demand (estimated via VPD) and C * ET is more responsive to water limitations.Both DWCI and C * ET also show a trend with low GPP, although in the case of the diurnal centroid the effect is limited to both low GPP and ET (Fig. 6c, g).

Difference between modeled and actual WUE
Figure 7 shows the difference between expected and observed WUE from the Katul-Zhou, Boese, and random forest (RF) models, with respect to conditions of drought as characterized by low evaporative fraction (EF < 0.2), carbonwater (C-W) decoupling (DWCI < 25), and morning shifts (C * ET < −0.25).This exercise was designed to test whether the metrics were associated with bias in the models, indicating that the metrics are able to capture information that the models are not (as further outlined in the "Methods and materials" subsection entitled "Models and parameter estimation").For all models, the dry days show the largest average shift between expected and observed WUE, followed by morning-shifted days.Uncoupled days show the smallest shifts for all models, with an overestimation of WUE for the Katul-Zhou and Boese models and no significant shift of WUE with the random forest model.As all models were calibrated within a site year, the over-or underestimation of WUE indicate an inability of the model to capture a change in the system.Cases of mean misestimation tended to be influenced by long tails in the distribution, with median differences being less exaggerated.However, these long tails are indicative of major model error in periods where the ecosystem is likely under stress conditions.

Looking beyond sums and means
The proposed metrics -DWCI and C * ET -depart from more traditional methods to summarize from sub-daily to daily timescales such as sums and means.This departure is advantageous in that it extracts added information that may otherwise have been ignored by turning the focus from signal amplitude to the signal shape.However, these new metrics also come with their own set of caveats, most notably issues with data quality confounding interpretability.Both metrics are susceptible to noise, as one or two errant points within a day can be reflected as a decrease in correlation or a shift in diurnal centroid.This is evident from the existence of very afternoon-shifted C * ET , sometimes by more than an hour, which the authors have no proposed explanation for other than noise in the data.However, attributing highly afternoon-shifted points to poor data requires further investigation.Note here that the "resting" C * ET seems to be slightly afternoon-shifted, which could be caused by real physiological factors such as differences in the incoming shortwave (SW) radiation (R g ) used in the calculation and net radiation (R n ), higher atmospherics demand (VPD) in the afternoon driving higher ET, or increased convection throughout the day resulting in higher transport of water away from the canopy and is likely a combination of all three.Differences in resting C * ET between sites could also be from instrumental ET is restricted to both low GPP and ET, indicating that water and carbon can decouple over a wider range of water stress.This also holds when points with energy balance over-closed are excluded (data not shown).causes such as radiometric sensors which are not adequately leveled or dirty, though the consistent, slight afternoon shifts would suggest this is a real response.Despite the possible shortcomings, both metrics show a definite response to drought conditions across the broad array of sites, climates, and ecosystems contained in FLUXNET (see Fig. 6) and give valuable insight into the underlying physiology.Given the broad nature of the analysis here, the metrics and hypothesis presented would benefit from site-specific validations such as looking to see if the morning shifts and decoupling are indeed associated with lower soil moisture levels, leaf water potentials, and/or decreases in sap flux.Sap flux in particular could give some interesting insights, as the diurnal patters in sap flux velocity will also have an offset to incoming radiation related to tree capacitance; therefore, relating sap flow diurnal centroids to the ET diurnal centroid could give some information on changes in plant water recharge.Furthermore, the diurnal centroid base metrics complement the hysteresis quantification methods such as those employed by Zhou et al. (2014) and Matheny et al. (2014), with the advantage of C * ET being compensated for by cloudy conditions and possibly comparatively less influence of noise, though an intercomparison would be useful to explore the strengths and weaknesses of the different approaches.By providing both the equations and related code of the metrics, we hope the metrics will be used by the community for both validation and to further ecophysiological understanding.

Trees, grass, and drought stress
By comparing climate groups and plant functional types (PFTs) with the frequent occurrence of low DWCI and C * ET from Fig. 4, we can note two striking differences: evergreen broad-and needleleaf forests show high variability in morning-shifted days but not uncoupled days, whereas grasslands show significantly high uncoupled but not morningshifted days.The pattern is further seen in Fig. 5, which shows the distinct divergent responses of decoupling and morning shifts between tree-and grass-dominated systems.This disparity may indicate an interaction of C * ET not only with drought but hydraulic sensitivity.The association of morning shifts with hydraulic sensitivity is further strengthened by Fig. 6a, d, where C * ET shows a much stronger response to EF rather than VPD, indicating that morning shifts of ET are not simply due to stomatal closure due to VPD but are in fact a response to drought conditions.The shorter hydraulic system of grasses may not necessitate stomatal closure under high demands (Holloway-Phillips and Brodribb, 2011), thus causing less frequent phase shifts even under drought conditions.In contrast, tree ecosystems may only exhibit higher hydraulic stresses, associated with both dry- ness and a more sensitive hydraulic strategy.Temperatecontinental and tropical climates all showed a low frequency of morning-shifted days, even though they are occupied by large trees with cavitation-susceptible vascular systems (Konings and Gentine, 2016), suggesting that these ecosystems show limited drought stress even with the hydraulic susceptibility.Similarly, the high degree of variability in morning-shifted frequency between site years in subtropical/Mediterranean and evergreen broad-and needleleaf forests could either indicate variation in the response in hydraulic stress between sites or that hydraulic stress is only expressed some years, leading to high and low frequencies within the same site.
In this way, it seems that though C * ET is less noisy as a drought indicator (see Fig. 6), it may only be of use in tree systems that are more prone to hydraulic stress.However, this does put the metric in a rather unique position in that it could be used as a global-scale hydraulic indicator, having potential application in exploring ecosystem-level isohydricity (Martínez-Vilalta and Garcia-Forner, 2016) or the degree to which continuing to extract water risks vascular system damage.Isohydricity is intrinsically a concept that relates to an individual plant, as dynamics of rooting depth, hydraulic conductances, and sensitivities to VPD can vary within individuals of the same species at the same location.However, these factors are all interrelated, as hydraulic and stomatal conductances drive transpiration dynamics which control the rate of depletion of root zone water which can then feed back to stomatal sensitivity, such as via the plant hormone abscisic acid (ABA) signaling (Wilkinson and Davies, 2002).As such, current estimates of isohydricity require plant-level measurements, which are currently restrained to the individual scale, i.e. from actual leaf measurements (Martínez-Vilalta et al., 2014), or to a global scale, but only 0.5 • resolution estimates from radar (Konings and Gentine, 2016).This limitation of large and small scales leaves a knowledge gap the size of an eddy covariance footprint, hindering the study of ecosystem response to drought.However, under the assumption that the morning shifts seen under low evaporative fraction are related to increased stomatal sensitivity in www.biogeosciences.net/15/2433/2018/Biogeosciences, 15, 2433-2447, 2018 response to root zone moisture depletion, it may be possible to compare the onset and speed with which the diurnal centroid shifts toward the mornings as ecosystems dry.In this way, one could infer the ecosystem response to soil moisture, without explicitly knowing the soil moisture.The resulting relationship could prove useful as a data-derived ecosystem functional property, giving direct information on variations in water limitation response.

C-W decoupling and energy balance closure
In addition to error from single data points, both metrics, but especially the DWCI, show some relationship with energy balance over-closure.Energy balance mismatch is a common phenomenon in EC measurements, with under-closure (ET + H < R n ) being a more common concern (Leuning et al., 2012;Wilson et al., 2002).Issues with energy balance closure can be, among other causes, attributed to advection, where energy, water, and carbon are transported in and out of the tower footprint, complicating an absolute accounting of these quantities (Wilson et al., 2003;Barr et al., 2006;Brötz et al., 2014).The apparent association of DWCI and overclosure could be due to the transfer of moist air from the surrounding landscape, causing the DWCI to be more contingent on the mixing of source air and less on plant controls.In this scheme, the over-closure seen in Fig. 6 could be caused by the mixing of outside moist air into the drier air from the EC site, causing an increase in latent energy.However, the infiltrating air sources could also have similar or drier moisture levels which would not necessarily be seen as over-closure.In this scenario, this infiltrating air could contain varying carbon and water concentrations, again causing a carbon-water decoupling but one that would not be associated with over-closure.If this effect has no diurnal pattern and thus does not generally influence the mean diurnal centroid in ET, it could explain why the patterns with dryness are much clearer with C * ET compared to DWCI.This would have the implication that DWCI is then a mixture of advection and non-stomatal signals, complicating the biological interpretability.However, the association with dryness in both metrics gives credence that they do indeed reflect some physiology, if we assume energy balance closure (EBC) should not be influenced by dryness level.Furthermore, if potential stress conditions are removed, the DWCI could be useful as a metric of advection in the system, even when the energy balance is relatively well closed.

WUE shifts associated with metrics and not captured by models
Figure 7 demonstrates the strong tendency of the models to underestimation WUE in dry conditions.This is true even for the fully nonlinear and empirical random forest model, indicating that the model underperformance is not necessarily due to an incomplete model framework but due to a lack of information to constrain the problem.Given the association of both metrics with drought (Fig. 6), one could expect that the models would underestimate WUE in uncoupled and morning-shifted days.Though this is the case with morning-shifted days, decoupling shows no underestimations of WUE and even shows a mean overestimation in the case of the Katul-Zhou and Boese models.Given the limitations outlined in the previous sections, one could blame noise for the lack of the WUE shift, but this cannot be reconciled with the higher frequency of decoupling during dry days, which should bias the WUE estimates.Furthermore, the more empirical random forest model reduces the prediction variability, leaving a slight WUE underestimation and indicating that some of the overestimation from the Katul-Zhou and Boese models may be tied to limitations of the underlying assumptions; yet the distribution from the RF model still lacks the long tails of underestimation characteristic of the dry points.Extending these findings to the underlying hypotheses of the metrics, namely hydraulic and non-stomatal limitations, we could conclude that the hydraulic controls do impose a greater water use advantage than non-stomatal limitations.In other words, the findings suggest that days with water-carbon decoupling, and possibly non-stomatal limitations, do not improve WUE, whereas hydraulic responses can improve WUE.As WUE is a ratio, this does not shed any light on the change in productivity, as low values of WUE may indicate that a plant is still productive but at a higher water cost.However, solid conclusions would require further analysis with some site-specific measurements of actual plant function.
Though the models used here are relatively simple and lack the complexities and feedbacks found in more vigorous ecosystem models, Matheny et al. (2014) also demonstrated the fundamental inability of nine different land surface models with four different stomatal conductance schemes to capture diurnal variability which the authors attributed to inadequate representation of how water gets from the soil to the leaf.Given the demonstrated phenomenon of morning shifts and decoupling across sites under dry conditions, the metrics here provide a benchmarking tool for mechanistic models to test their ability to replicate these patterns, suggesting that the models are capable of expressing hydraulic and non-stomatal limitations.Furthermore, in the case of machine learning approaches, the metrics may provide a useful input parameter which summarizes these diurnal effects, as is evidenced by the difference in response the bias in RF modeled WUE; i.e. while both metrics are associated with low EF, RF WUE was underestimated with morning-shifted days but not decoupled days, implying that two different strategies are being captured by the metrics.As such, by demonstrating the utility of the metrics and providing code and explanations for calculation, we hope they will become useful to the community at large.

Conclusions
Both the DWCI and the C * ET demonstrate an ability to show consistent patterns with drought across a broad array of sites, climates, and ecosystems, with the added advantage of being tied to theoretical underpinnings.In particular, the demonstrated patterns give novel information about carbon-water relations and hydrological dynamics that are not currently present at the ecosystem scale across a database as large as FLUXNET.These metrics and their underlying theory provide a data-derived example differentiating the hydrological response of tree and grass plant functional types as well as constituting evidence for the presence and absence of a WUE advantage from hydraulic and stomatal limitations, respectively.Going forward, these metrics can be used as a tool to further understand the diversity of ecosystem drought responses.
Code and data availability.Code used to calculate the metrics described here can be found both in the Supplement as well as in Nelson (2017).Data used in this analysis can be found at http: //fluxnet.fluxdata.org/(NOA, 2015).
Competing interests.The authors declare that they have no conflict of interest.

Figure 1 .
Figure 1.One-month average cycle (solid lines) and accompanying diurnal centroid (vertical dashed lines) of incoming shortwave radiation (R g ), evapotranspiration (ET), and gross primary productivity (GPP) at the Puéchabon, France (FR-Pue), site during 2003.May is relatively wet (32 mm rainfall, a) and July is relatively dry (0 mm rainfall, b).While ET and R g correspond well in the wet month, the dry month shows a distinct phase shift in both GPP and ET fluxes towards the morning, as well as a midday depression in GPP.

Figure 2 .
Figure 2. Theoretical overview of Diurnal Water-Carbon Index.Panels (a) and (b): 1-month average diurnal cycle of incoming shortwave radiation (R g ), evapotranspiration (ET), vapor pressure deficit (VPD), gross primary productivity (GPP), and GPP•VPD −0.5 at the Puéchabon Forest, France (FR-Pue), site during 2003.Discrepancies between GPP • VPD −0.5 and ET increase from the relatively wet May (32 mm rainfall, a, c) to the relatively dry July (0 mm rainfall, b, d).Panels (c) and (d): these discrepancies are reflected in the daily correlation values between GPP • VPD −0.5 and ET, giving an indication of the appropriateness of the uWUE model for each day, as well as the degree of coupling between water and carbon signals.

Figure 3 .
Figure 3. Monthly median Diurnal Water-Carbon Index (DWCI, lower panels) and diurnal centroids (C * ET , upper panels) for six sites in Europe.Data from all years available (black) is compared to 2003 (red) during which a drought event resulted in high temperatures and low precipitation throughout the summer.Note that DWCI of 0-100 indicates lowest-highest probability of diurnal carbon-water coupling and that C * ET of −1 to 1 indicates 1-hour morning-shifted to 1-hour afternoon-shifted ET.Vertical bars represent interquartile range.Sites from four plant functional types: evergreen broadleaf (EBF), deciduous broadleaf (DBF) and evergreen needleleaf (ENF) forests, as well as grasslands (GRA).Ecosystems show tendencies of morning shifts (e.g.DK-Sor and NL-Loo) and carbon-water decoupling (e.g.ES-ES1 and HU-Bug) during the drought year.

Figure 4 .
Figure 4.The frequency of morning-shifted diurnal centroids (C * ET < −0.25 h, a, b) and low Diurnal Water-Carbon Index (DWCI < 25, c, d) for 690 FLUXNET site years/192 sites, grouped by climate group (a, c) and plant functional type (b, d).Group labels on x axis indicate the number of site years/sites (n = site-years/sites) for each category.The dashed line is the median for all site years.Color shade indicates level of significance, with light colors and dark colors having p values < 0.10 and < 0.05, respectively (Wilcoxon-Mann-Whitney two-sample rank-sum test); red and blue colors indicate distributions higher and lower, respectively, compared to data from all sites excluding the group.Only site years with at least 20 data points and groups with more than 5 site years were included.

Figure 5 .
Figure 5. Median Diurnal Water-Carbon Index (DWCI, a) and diurnal centroid (C * ET , b) of plant functional types binned by evaporative fraction (EF, low values indicate dry conditions).Note that DWCI of 0-100 indicates lowest-highest probability of diurnal carbon-water coupling and that C * ET of −1 to 1 indicates 1-hour morning-shifted to 1-hour afternoon-shifted ET.Evergreen needleleaf (ENF), deciduous broadleaf (DBF), and evergreen broadleaf (EBF) forests show increased morning shifts (low C * ET ) with decreasing EF when compared to grassland (GRA) sites, which tended to have decreased carbon-water decoupling (low DWCI) with decreasing EF.Savanna ecosystems (SAV) show a high degree of decoupling and intermediate levels of morning shifts.Vertical bars represent the interquartile range.

Figure 6 .
Figure 6.Mean DWCI (a, b, c) and C * ET (d, e, f) with respect to evaporative fraction (EF) by vapor pressure deficit (VPD) (a, d), latent energy (LE) by R n (b, e) and LE by GPP (c, g).Note that DWCI of 0-100 indicates lowest-highest probability of diurnal carbon-water coupling and that C * ET of −1 to 1 indicates 1-hour morning-shifted to 1-hour afternoon-shifted ET.Points with high R n and low LE are associated with both low DWCI and C * ET , indicating that both metrics are related to water limitations.Though both metrics are associated with low EF, DWCI shows a much higher response to atmospheric demand as measured by VPD, with C * ET showing a very limited response.Both metrics, and DWCI in particular, show low values with high ET and low R n , though these points are also associated with over-closed energy balances (LE + H > R n ).Both metrics are associated with low GPP, but the C *ET is restricted to both low GPP and ET, indicating that water and carbon can decouple over a wider range of water stress.This also holds when points with energy balance over-closed are excluded (data not shown).

Figure 7 .
Figure 7. Difference in modeled and measured WUE for Katul-Zhou (a), Boese (b), and random forest (c) models.The random forest model was fit using R g , VPD, T air , GPP, and year.Thresholds designating dry, morning-shifted, and carbon-water uncoupled days were EF < 0.2, C * ET < −0.25, and DWCI < 25, respectively, for each day.The distributions span from the 5th to 95th percentiles, and the width of each gives an indication of the variance, which is larger in the subgroups compared to all points.Furthermore, the mean difference in WUE (black lines) tends to be shifted in dry and morning-shifted days, indicating a mean underestimation of WUE by the models mostly due to the long tails.Decoupled days show higher variance but no clear pattern in under-or overestimation.The percentage of days in each category is designated in parentheses next to the y axis label .