 Research Article
 Open Access
 Published:
Towards a seamlessly diagnosable expression for the energy flux associated with both equatorial and midlatitude waves
Progress in Earth and Planetary Science volume 4, Article number: 11 (2017)
Abstract
For midlatitude Rossby waves (RWs) in the atmosphere, the expression for the energy flux for use in a model diagnosis, and without relying on a Fourier analysis or a ray theory, has previously been derived using quasigeostrophic equations and is singular at the equator. By investigating the analytical solution of both equatorial and midlatitude waves, the authors derive an exact universal expression for the energy flux which is able to indicate the direction of the group velocity at all latitudes for linear shallow water waves. This is achieved by introducing a streamfunction as given by the inversion equation of Ertel’s potential vorticity, a novel aspect for considering the energy flux. For ease of diagnosis from a model, an approximate version of the universal expression is explored and illustrated for a forced/dissipative equatorial basin mode simulated by a singlelayer oceanic model that includes both midlatitude RWs and equatorial waves. Equatorial Kelvin Waves (KWs) propagate eastward along the equator, are partially redirected poleward as coastal KWs at the eastern boundary of the basin, and then shed midlatitude RWs that propagate westward into the basin interior. The connection of the equatorial and coastal waveguides has been successfully illustrated by the approximate expression of the groupvelocitybased energy flux of the present study. This will allow for tropicalextratropical interactions in oceanic and atmospheric model outputs to be diagnosed in terms of an energy cycle in a future study.
Introduction
A feature of many phenomena in the equatorial oceans is the role played by equatorial Kelvin waves (KWs), examples being El Niño Southern Oscillation (ENSO; Philander 1989) and the socalled Atlantic Niño (Merle 1980). KWs propagate along the equator and are partially redirected into coastal KWs at the eastern boundary, where they can influence offequatorial latitudes (e.g., Lübbecke et al. 2010) as well as excite extratropical Rossby waves (RWs) that subsequently propagate into the ocean interior (McPhaden and Ripa 1990; Isachsen et al. 2007). A striking example of this behavior is the equatorial basin mode (Cane and Moore 1981). For the gravest basin mode, the time scale is set by the time taken for an equatorial KW to propagate across the basin and for the reflected gravest long Rossby wave to return to the western boundary (that is 4L/c where L is the basin width and c is the phase propagation speed for KWs). In addition to waves that are trapped on the equator, equatorial basin modes also feature coastal KWs that propagate along the eastern boundary and extratropical RWs that are excited by these KWs and refocus on the equator, as described by Schopf et al. (1981). There is growing evidence that equatorial basin modes play an important role in equatorial ocean dynamics. For example, basin modes have been associated with the equatorial deep jets (Johnson and Zhang 2003; Brandt et al. 2011; Claus et al. 2016) and with the semiannual (Thierry et al. 2004) and annual cycles (Brandt et al. 2016) in the equatorial Atlantic. However, the energy cycle associated with equatorial basin modes has received little attention and is an important factor when considering the forced/dissipative basin modes that one can relate to observations. A particularly interesting example is the upward energy propagation associated with the Atlantic equatorial deep jets (Johnson and Zhang 2003; Brandt et al. 2011; Mathiessen et al. 2015). Yet, the detailed energy cycle associated with the jets remains largely unknown.
One way to approach the energy flux is to use ray theory. However, ray theory is linked to the dispersion relation of a single type of wave and is not suitable for investigating the sequential connection of different types of waves that are associated with a basin mode. Likewise, a Fourier analysis is not suitable for the investigation of waves near the coastal boundaries of the ocean. In fact, it is only for midlatitude inertiagravity waves (IGWs) that the flux of wave energy has been diagnosed from oceanic model output (Cummins and Oey 1997; Niwa and Hibiya 2004; Furuichi et al. 2008). On the other hand, in the atmospheric literature, the model diagnosis of pseudomomentum (or wave activity) flux has been more popular than the model diagnosis of the energy flux (Hoskins et al. 1983; Plumb 1986; Takaya and Nakamura 1997; Nakamura and Solomon 2011).
Here, we seek a general expression that can be used to diagnose the energy flux associated with linear shallow water waves at all latitudes from model output. This manuscript is organized as follows. First we provide the theoretical background. Then, we present an analytical investigation that leads to a general expression for the energy flux that can indicate the exact profile of the group velocity times wave energy for both equatorial and midlatitude waves. The utility of the universal expression of energy flux as a model diagnostic is illustrated for a forced/dissipative equatorial basin mode simulated by a singlelayer model. The model diagnosis is achieved by introducing an inversion for the linearized version of Ertel’s potential vorticity. This is a novel aspect for considering the energy flux in the presence of a coastal waveguide that connects the equatorial and midlatitude regions.
Theoretical background
We use the shallow water equations for a single vertical normal mode (Gill 1982) appropriate to linear waves in a rotating frame of reference and in the absence of a mean flow. Let an arbitrary variable with an associated physical dimension be expressed by A ^{∗}, and let Cartesianhorizontal coordinates be labelled by the set of independent variables x ^{∗},y ^{∗},t ^{∗}, where each of x ^{∗}, y ^{∗} increases eastward and northward, respectively, and u ^{∗},v ^{∗} are the corresponding horizontal components of velocity (a list of variables is given in Table 1)^{1}. The equations may then be written as
where \(f^{*} =f^{*}_{0} + \beta ^{*} y^{*}\) is the Coriolis parameter, p ^{∗}=p ^{∗}(x ^{∗},y ^{∗},t ^{∗}) corresponds to the pressure^{2} or geopotential, and c ^{∗} is a uniform constant representing the propagation speed of nonrotating gravity waves for a given mode. Manipulation of (1a)–(1c) yields a prognostic equation for the linearized version of Ertel’s potential vorticity (hereafter EPV and symbolized as q ^{∗}) to read
which is applicable to waves at all latitudes, such as midlatitude RWs, midlatitude IGWs, and equatorial waves [i.e., equatorial RWs and IGWs, equatorial Rossbygravity waves (RGWs, i.e., Yanai waves), and equatorial KWs; Matsuno 1966; Yanai and Maruyama 1966], understanding \(f^{*}_{0} = 0\) for an equatorial βplane and β ^{∗}=0 for a midlatitude fplane. Both midlatitude IGWs (i.e., β ^{∗}=0) and equatorial KWs (i.e., v ^{∗}=0) are characterized by q ^{∗}=0, as noted in Table 2.
On the other hand, a prognostic equation for wave energy may be derived from (1a)–(1c) as
where \(\nabla ^{*} \equiv \langle \!\langle \frac {\partial ~}{\partial {x^{*}}}, \frac {\partial ~}{\partial {y^{*}}}\rangle \!\rangle \) and the overbar symbol represents a phaseaverage operator (i.e., for a sinusoidal wave, \(\overline {A^{*}}=0\) for A ^{∗}=u ^{∗}, v ^{∗}, and p ^{∗}) or a lowpass time filter (for this reason, we retain the local time derivative in (3) to allow for slow time variations in the general case).
For midlatitude IGWs in the ocean and atmosphere, the group velocity vector points in the same direction as the energy flux vector in (3):
a property that has been exploited by Cummins and Oey (1997), Niwa and Hibiya (2004), and Furuichi et al. (2008) for a model diagnosis. However, for midlatitude RWs, the vector in (4) does not point in the direction of the group velocity of the waves (LonguetHiggins 1964; Masuda 1978; Cai and Huang 2013). In order to retrieve the correct direction for the energy flux associated with midlatitude RWs, Orlanski and Sheldon (1993, hereafter OS93) have suggested to modify (3), without affecting the horizontal divergence of the energy flux, as
where each of u ^{∗} and v ^{∗} should be the sum of the geostrophic and ageostrophic components and \(f^{*} = f^{*}_{0} + \beta ^{*} y^{*}\) is understood. The energy flux vector in (5) consists of two terms,
where \(\overline {\mathbf {V}^{*} p^{*}}\) is as in the gravity wave literature (i.e., V ^{∗} is the sum of the geostrophic and ageostrophic components of velocity). The second term in (6) is the additional rotational component required to reproduce the direction of the group velocity of midlatitude RWs (z is the upward vertical unit vector). In LonguetHiggins (1964), the second term of (6) has been expressed as \(\nabla ^{*} \times [f^{*} \overline {\psi ^{*2}}/2]\mathbf {z}\) where ψ ^{∗} is a streamfunction based on the assumption of horizontally nondivergent velocity. This assumption is hardly used in modern oceanography owing to the smallness of the deformation radius. In quasigeostrophic theory, ψ ^{∗}=p ^{∗}/f ^{∗} from which the connection with (6) is clear.
The question naturally arises as to whether or not it is possible to find a general expression for the additional rotational flux, R ^{∗}, that holds for waves at all latitudes and is such that the corresponding energy flux \(\overline {{\mathbf {V}}^{*} p^{*}} + \mathbf {R}^{*}\) always points in the direction of the group velocity and thus constitutes a general expression for the energy flux associated with waves at all latitudes. This is the main subject of the present study. In this study, we focus on wave types for which the group velocity has been well formulated in the literature/textbook, as listed in Table 2. Of particular interest is the energy flux associated with equatorial RWs given that the expression in (6) is singular at the equator. The assumption of horizontally nondivergent velocity in LonguetHiggins (1964) is also inappropriate for equatorial regions. In the next section, by investigating the analytical solution of equatorial waves, we derive an exact universal expression for the rotational flux which, after being added to \(\overline {{\mathbf {V}}^{*} p^{*}}\), is able to indicate the direction of the group velocity for linear waves at all latitudes.
Analytical investigation
We begin by revisiting analytical expressions for the profile of the energy flux associated with equatorial waves. This investigation allows us to derive an expression for the energy flux that points in the direction of the group velocity for waves at all latitudes.
Energy flux associated with equatorial waves
We assume linear waves in the absence of a mean flow on an equatorial βplane. As in Matsuno (1966) and Gill (1982), we use a time scale \(1/\sqrt {c^{*}\beta ^{*}}\) and a length scale \(\sqrt {c^{*}/\beta ^{*}}\) to nondimensionalize the equation system (1a)–(1c) to give
where symbols without an asterisk indicate nondimensionalized quantities and subscripts indicate partial differentiations. Manipulation of (7a)–(7c) yields prognostic equations for EPV and wave energy in a nondimensionalized form to read,
where \(\partial _{t} \equiv \frac {\partial ~}{\partial t}\), \(\nabla \equiv \langle \!\langle \frac {\partial ~}{\partial x}, \frac {\partial ~}{\partial y}\rangle \!\rangle \), and for A=u, v, or p, \(\overline {A}=0\) for sinusoidally varying waves.
In what follows, we assume v≢0 which is appropriate for equatorial RWs, RGWs, and IGWs (i.e., waves other than equatorial KWs). Then, we consider zonally propagating free waves with a relationship v∝ cosθ, u∝ sinθ, and p∝ sinθ where θ≡k x−ω t is wave phase with k and ω being wavenumber and wave frequency, respectively. Substitution of these relationships to (7a)–(7c), followed by some manipulation, yields a characteristic equation for the meridional structure of v to read,
Matsuno (1966) has derived a solution for (7a)–(7c) and (10) to yield,
where is wave amplitude and the symbol H ^{(n)} is the Hermite polynomial with n being the meridional mode number^{3}. The subscript θ represents partial differentiation in terms of the wave phase .
Substitution of (11a) to (10) yields,
which is a unified dispersion relation for equatorial RWs, RGWs, and IGWs. Partial differentiation of (12) with respect to wavenumber k yields a unified expression for the group velocity of equatorial waves,
where 2ω ^{3}/k in the denominator has often been ignored in previous studies when focusing on lowfrequency equatorial waves (e.g., equatorial RWs; Gill 1982).
We now investigate the energy flux associated with (7a)–(7c). It is known that, for zonally propagating equatorial waves, the meridional integral of \(\overline {up}\) is equal to the group velocity times the meridional integral of the wave energy (Philander 1989):
It should be noted that the identity (14a) does not hold if it is evaluated without the meridional integral:
For lowfrequency equatorial waves (with ω<1— see Fig. 1—, i.e., all equatorial RWs and westward propagating RGWs), the meridional profiles of \(\overline {up}\) and \((\partial \omega /\partial k)(\overline {u^{2}+v^{2}+p^{2}})/2\) are shown by the dashed green and solid black lines, respectively, in Fig. 2. It is clear that, when compared at a given latitude, \(\overline {up}\) is not equal to the group velocity times wave energy. In particular, the meridional profile of \(\overline {up}\) is signindefinite for lowfrequency equatorial waves (Fig. 2). On the other hand, as shown by the dashed green and solid black lines in Fig. 3 for highfrequency equatorial waves (with ω>1— see Fig. 1—, i.e., all equatorial IGWs and eastward propagating RGWs), the meridional profile of \(\overline {up}\) provides a much better approximation for the group velocity times wave energy. The solid blue line, dashed orange line, and purple dots in Figs. 2 and 3 are explained later in the manuscript.
Identification of the additional rotational flux associated with equatorial waves
It is useful to derive the analytical expression for the difference between the left and right hand sides of (14b). A first step for identifying the difference is to decompose the zonal component of \(\overline {up}\) into two parts, one that determines the meridional integral and one that does not affect it, as follows:
where the first equality has been derived using (11b)–(11c) and \(\overline {\sin \theta \sin \theta }=\overline {\cos \theta \cos \theta }\) and the second equality has been derived using (10). Note that it is the second of the two terms whose meridional integral is zero (noting that v and yv go to zero at large distances from the equator).
We now decompose the wave energy^{4} into two parts, one that determines the meridional integral and one does not. We then have
where the first equality has been derived using (11b)–(11c) and \(\overline {\sin \theta \sin \theta }=\overline {\cos \theta \cos \theta }\), and the third equality has been derived using (10). As before, it is the second of the two terms whose meridional integral is zero. Using (14c)–(14d), we now obtain an analytical expression for the difference between the right and left hand sides of (14b) to yield
where the first and second equalities have been derived using (13), the third equality has been derived using \(\overline {\cos \theta \cos \theta }=\overline {\sin \theta \sin \theta }\), and the last equality has been derived using (11b)–(11c). The last line of (14e) has been written as the meridional gradient of scalar quantities. Thus, the meridional integral of (14e) vanishes for equatorial waves (with a meridionally decaying structure) and is consistent with (14a).
Using (14e), we can now rewrite the zonal component of the group velocity times wave energy as
where the scalar quantity φ has been introduced. We have confirmed that, as long as φ is set by (15b), the meridional profile of the zonal energy flux, \(\overline {up}+ (\overline {p\varphi }/2 + \overline {u_{tt}\varphi })_{y}\), in (15a) is precisely identical to \((\partial \omega /\partial k) (\overline {u^{2}+v^{2}+p^{2}})/2\) for all types of equatorial waves in Figs. 2 and 3. Namely, all solid black lines in Figs. 2 and 3 may be drawn using either expression. As far as we know, (15a) and (15b) have not been mentioned in previous studies and therefore constitute a new result.
Inversion equations for Ertel’s potential vorticity
The definition of φ, as given by (15b), is based on a Fourier expansion. However, we have found that (15b) may be rewritten into an expression which contains none of θ, k, and ω to read
where ∇^{2}≡∂ _{ xx }+∂ _{ yy } is understood, the first line has been derived using (10), and the second line has been derived using (8) [i.e., q _{ t }=−ω q _{ θ }=−v and thus −ω q _{ θ θ }=ω q=−v _{ θ }]. The new Eq. (16) of EPV is the cornerstone of the present study, because it suggests a possibility for the scalar quantity φ to be estimated without using a Fourier analysis. This feature is important for identifying the direction of the energy flux of waves in the presence of coastal boundaries.
To summarize, in order to reproduce the profile of the group velocity times wave energy without relying on a Fourier analysis, we have obtained a new expression for the energy flux that has turned out to be associated with the streamfunction Eq. (16). Equation (16) may be rewritten into a dimensional form as
where \(\phantom {\dot {i}\!}\nabla ^{*}\equiv \langle \!\langle \partial _{x^{*}}, \partial _{y^{*}}\rangle \!\rangle \) and \(\phantom {\dot {i}\!}q^{*} = v^{*}_{x^{*}}u^{*}_{y^{*}}  (f^{*}/c^{*2}) p^{*}\). The exact profile of the group velocity times wave energy may be reproduced by the right hand side of (15a) and is here rewritten into a vector and dimensional form as
The additional rotational flux in (17b) corrects the profile of the energy flux, without affecting the divergence of the energy flux. The quantity φ ^{∗} in (17b) is the solution of the accurate streamfunction Eq. (17a) associated with EPV in a dimensional form. We note in passing that for zonally propagating equatorial waves, as given by (11a)–(11c), \(\overline {v^{*}p^{*}}\) vanishes owing to the phase relationship between v ^{∗} and p ^{∗} [see (11a) and (11c)] and the meridional component of the additional rotational flux, \((\overline {p^{*} \varphi ^{*}}/2 + \overline {u^{*}_{t^{*}t^{*}}\varphi ^{*}}/\beta ^{*})_{x^{*}}\), also vanishes.
Equatorial KWs
So far, we have not investigated the energy flux of equatorial KWs. Since KWs are gravity waves, \(\overline {\mathbf {V}^{*} p^{*}}\) becomes equal to the group velocity times wave energy. Namely, the additional rotational flux is absent. KWs are also characterized by q ^{∗}=0; hence, the EPV equation (17a) yields φ ^{∗}=0. The result is that, in the case of KWs, the expression for the energy flux, as given by (17b) reduces to \(\overline {\mathbf {V}^{*} p^{*}}\), which is consistent with the nature of gravity waves.
Boundary conditions and the connection to midlatitude regions
Consider a basin with closed zonal boundaries (i.e., the eastern and western coastlines of a basin of arbitrary shape). It is clear that the flux \(\overline {\mathbf {V}^{*} p^{*}}\) in (17b) has no component normal to the zonal boundaries. Hence, the additional rotational flux in (17b) should also have no component crossing the closed boundaries. This requirement is satisfied in the present study by solving (17a) with a boundary condition of
In a general situation in the ocean, waves propagating eastward along the equatorial waveguide are partially redirected poleward as KWs along the eastern boundary where they can shed RWs that then propagate westward into the ocean interior (Cane and Moore 1981; Philander 1989; Chelton and Schlax 1996; Isachsen et al. 2007).
We now investigate whether or not the set of (17a) and (17b) is applicable to offequatorial regions where smallamplitude perturbations are characterized by either midlatitude RWs or IGWs. For perturbations associated with midlatitude RWs, the solution φ ^{∗} of (17a) corresponds to the geostrophic streamfunction for which φ ^{∗}≃p ^{∗}/f ^{∗} is a reasonable approximation in an interior region (i.e., far from coastal boundaries), noting that ∇^{∗} ^{2} φ ^{∗} corresponds to \(v^{*}_{x^{*}}u^{*}_{y^{*}}\). The result is that the energy flux in (17b) automatically reduces to the expression of OS93 for midlatitude RWs^{5}. On the other hand, if perturbations associated with midlatitude IGWs are given, the inversion Eq. (17a) of EPV, which equals zero, yields, with φ ^{∗}=0 on the boundaries, φ ^{∗}=0 everywhere. Thus, the energy flux in (17b) automatically reduces to \(\overline {\mathbf {V}^{*} p^{*}}\) which represents the group velocity of midlatitude IGWs times wave energy. We conclude that the set of (17a) and (17b) can represent the exact profile of the group velocity times wave energy associated with both midlatitude IGWs and RWs, which may be reconfirmed using almost the same procedure as in the “Identification of the additional rotational flux associated with equatorial waves” section. See Appendix 1 for details.
Methods/Experimental
The rest of this manuscript presents an example illustrating the diagnosis of the energy flux from a model. To be useful for our discussion, the exact universal expression for both equatorial and midlatitude waves, as given by the set of (17a) and (17b), is hereafter referred to as the level0 energy flux. In practice, the level0 expression of the energy flux is not straightforward to compute from model output, since the secondorder time derivative term in (17a) makes it difficult to solve for φ ^{∗}.
For the present study, we investigate the consequence of artificially removing the secondorder time derivative term from (17a) to give
which may be justified at least for lowfrequency waves (e.g., both equatorial and midlatitude RWs) based on scale analysis. The superscript of φ ^{app∗} indicates that the solution of (18a) may be regarded as an approximation for the solution φ ^{∗} of the accurate streamfunction Eq. (17a) associated with EPV. Then, we replace φ ^{∗} in (17b) with φ ^{app∗} to read
which is diagnosable^{6} from model output and is referred to as the level1 expression of the energy flux in the present study. As shown by the dashed orange lines in Fig. 2, the level1 expression provides a nice approximation for the groupvelocitybased energy flux of lowfrequency equatorial waves, but not for highfrequency equatorial waves in Fig. 3. Next, with the form of the additional rotational flux \(\nabla ^{*} \times [\overline {{p^{*}}^{2}}/(2f^{*})]\mathbf {z}\) in (6) in mind, we investigate the consequence of simplifying (18b) as
which we refer to as the level2 expression for the energy flux. As shown by the solid blue lines in Figs. 2 and 3, the level2 expression provides an approximation for the groupvelocitybased energy flux of both low and highfrequency equatorial waves, although there can be some error. Further discussion of the level2 approximation is given in Appendices 2 and 3 where it is noted that the level2 approximation is comparable in accuracy to the pseudomomentum (or waveactivity) flux used in previous studies (Randel and Williamson 1990; Brunet and Haynes 1996; Fukutomi and Yasunari 2002; Wakata and Kitaya 2002; Kawatani et al. 2010).
We now contrast both the level1 and level2 energy fluxes with the expressions in previous studies, given by (6) and (4), using a solution from a linear shallow water model. This illustrates the potential of the expression given by (18b) and (18c) for use as a model diagnostic (see Table 3). Suitable for this purpose is an equatorial basin mode solution since it is associated with both equatorial and coastal waveguides as well as the radiation of midlatitude RWs into the basin interior. Furthermore, as noted in the Introduction section, the equatorial basin mode, first studied by Cane and Moore (1981), has recently attracted attention because of its importance in the dynamics of the equatorial Atlantic Ocean. Indeed, the annual cycle, the semiannual cycle, and the interannual variability associated with the Atlantic equatorial deep jets (Brandt et al. 2011) all appear to be resonant excitations of equatorial basin modes [see Brandt et al. (2016) and Claus et al. (2016) for more details].
Model setup
To illustrate the importance of dissipation for explaining the observed crossequatorial width of the equatorial deep jets, Greatbatch et al. (2012, hereafter G12) have simulated a forced/dissipative basin mode solution using a singlelayer reducedgravity linear model. The model is set up in spherical coordinates, with a rectangular domain in latitude/longitude space of roughly the same width as the Atlantic Ocean at the equator (that is 55° in longitude) and reaching to 10°N/S on either side of the equator^{7}. All lateral boundaries are closed. In both G12 and Claus et al. (2014, hereafter C14), the model has been forced by an idealized oscillatory forcing with a period of 4.5 years in the zonal momentum equation to mimic the forcing of the jets, together with a lateral mixing of momentum that provides dissipation. [See Ascani et al. (2015) for a discussion on the forcing of the equatorial deep jets, the details of which are not important here]. It should be noted that 4.5 years is roughly the time taken for an equatorial KW and the reflected long gravest equatorial RW, to travel across the basin for the vertical mode that is closest to resonance. As noted in G12 and C14, the (westward) propagation speed of equatorial long RWs is three times less than the (eastward) propagation speed of equatorial KWs [see the dispersion relation (12)].
Our model has been set up as in G12 and C14. The gravity wave speed is set equal to c ^{∗}=0.17 m/s [see the upper panel in Fig. 4 of C14]. The equatorial deformation radius becomes \(\sqrt {c^*/\beta ^*}=87~\text {km}\), with a consequence that disturbances further than a few degrees from the equator in our model experiment may be regarded as midlatitude RWs, even though they are part of the equatorial basin mode resonance. As in G12, our model has been formulated in a spherical coordinate system with a grid spacing of 0.1 ° in both longitude and latitude. The coefficient^{8} of eddy viscosity has been set to 10 m ^{2}/s. From an initial condition of no motion and no pressure anomaly, the model has been integrated for 20 cycles (i.e., 90 years) using the oscillatory forcing which is sufficient for a steady oscillatory state to be reached. Since the model code is fully nonlinear, we have set the amplitude of the forcing to a small value, 1.0 × 10^{−10} m/s ^{2} to ensure that linear dynamics prevails. Indeed, the magnitude of velocity associated with the gravest basin mode may be scaled as 10^{−10} m/s ^{2} × 4.5 years/(2π)=0.0023 m/s, which results in a Froude number of (0.0023 m/s)/c ^{∗}=0.014 (nondimensional). These parameters are summarized in Table 4. Below, we show results from an experiment which corresponds to the “full” case in G12. In particular, the oscillatory zonal forcing is spatially uniform and acts over the whole model domain. All the model results shown below are averages over the last model cycle.
Results and discussion
At each time step of the model output, we have calculated the EPVbased streamfunction φ ^{app∗} (contours in the left panels of Fig. 4) by solving the spherical coordinate version of (18a) with the boundary condition of φ ^{app∗}=0. The color shading in Fig. 4 shows the snapshots of thickness anomaly (left panels) and the zonal component of velocity u ^{∗} (right panels). The movie of the model experiment is found in Additional file 1. RWs are identified by the correlation (anticorrelation) between the EPVbased streamfunction and thickness anomaly in the northern (southern) hemisphere. This follows from the correspondence between the EPVbased streamfunction and the geostrophic streamfunction for the case of midlatitude RWs, as noted earlier. As noted in G12 and C14, the (westward) propagation speed of equatorial long RWs is three times smaller than the (eastward) propagation speed of equatorial KWs [see the dispersion relation (12)]. It takes a threequarter cycle (i.e., 3T ^{∗}/4) for equatorial long RWs to travel westward from the eastern boundary to the western boundary of the model domain (see red lines in Fig. 5 a). After reflection at the western boundary, it takes only a quarter of a cycle (i.e., T ^{∗}/4) for equatorial KWs to travel eastward to the eastern boundary of the model domain (see blue lines in Fig. 5 a), where some disturbances are deflected poleward along the eastern boundary to be the source of midlatitude RWs which then propagate westward (Fig. 5 b).
In Fig. 6, the divergence of the horizontal energy flux, given by \(\nabla ^* \cdot \overline {\mathbf {V}^* p^*}\), is shown for the whole model domain using color shading. Red indicates regions of a net energy input, and blue indicates regions of a net dissipation. It is clear that the main region of energy input is in the central part of the basin along the equator, where the strongest zonal velocities are found, and that the main regions of energy loss are associated with the RWs that radiate away from the eastern boundary. Arrows in Fig. 6 a show the energy flux used in the gravitywave literature, \(\overline {\mathbf {V}^*p^*}\), which is mostly westward along the equator and eastward in the immediate offequatorial region. This can be clearly seen in Fig. 7 a which shows a blowup of the eastern equatorial region. Figures 6 b and 7 b show the energy flux given by (6), which has been adapted from OS93, where only regions more than 1° latitude away from the equator are plotted to avoid the singularity in the Coriolis parameter f ^{∗} at the equator. From these figures (especially the blowup of the eastern equatorial region in Fig. 7 a, b), it is clear that the energy flux is strongly reversed when compared to \(\overline {\mathbf {V}^* p^*}\) in the immediate offequatorial region and is now strongly eastward in association with RWs that are radiated from the eastern boundary.
From Figs. 6 c and 7 c, it is clear that when the set of Eqs. (18a), (18c) and (17c) is used to estimate the energy flux, the westward flux associated with the offequatorial RWs is part of a recirculation of energy in the eastern part of the basin (Fig. 7 c) with eastward energy flux along the equator and westward energy flux off the equator. The eastward flux along the equator in Figs. 6 c and 7 c is in the opposite direction to the westward \(\overline {\mathbf {V}^* p^*}\) flux in Figs. 6 a and 7 a along the equator in the same region. This indicates the role of the rotational flux contribution in (18c) which counters the westward \(\overline {\mathbf {V}^*p^*}\) flux along the equator. This westward flux is associated with the equatorial RWs but represents an overestimation of the energy flux associated with these waves (see Fig. 2). When the rotational flux is added, what emerges is the eastward flux associated with the KW which, in turn, leads to a poleward flux arising from KWs propagating along the eastern boundary and, in turn, leads to the westward flux associated with the offequatorial RWs that are excited at the eastern boundary. Here, in terms of the transfer of wave energy, the equatorial waveguide has been connected to the eastern coastal waveguide and, in turn, to the basin interior at offequatorial latitudes, which is at the heart of the present study.
Finally, we note that the forcing period of T ^{∗}=4.5 years is much longer than the equatorial inertial period of \(2\pi /\sqrt {c^*\beta ^*}=37\) days. It can be said that the simulated equatorial basin mode consists of lowfrequency equatorial waves, as in Fig. 2, and midlatitude RWs. We recall the small difference between the solid blue and dashed orange lines in Fig. 2, the former and the latter of which may be written as \(\overline {u^* p^*} + (\overline {p^* \varphi ^{\mathrm {app*}}}/2)_{y^*}\) and \(\overline {u^* p^*} + (\overline {p^* \varphi ^{\mathrm {app*}}}/2+\overline {u^*_{t^* t^*}\varphi ^{\mathrm {app*}}}/\beta ^*)_{y^*}\), respectively, in a dimensional form (see level2 and level1, respectively, in Table 3). Since arrows in Figs. 6 c and 7 c have been plotted using the expression which corresponds to the solid blue line in Fig. 2, we have checked for any improvement by using the expression which corresponds to the dashed orange lines in the same figure. The checking has been done by comparing the distribution of \(\overline {p^* \varphi ^{\mathrm {app*}}}/2\) and \(\overline {u^*_{t^*t^*} \varphi ^{\mathrm {app*}}}/\beta ^*\), from which we have learned that the latter quantity (not shown) is three orders of magnitude smaller than the former. Thus, we conclude that, in the diagnosis of the simulated basin mode, the expression of the energy flux, as given by (18c), has provided a nice approximation for the group velocity times wave energy.
Conclusions
In previous studies of the ocean, the energy flux of waves in model output has been diagnosed using \(\overline {\mathbf {V}^* p^*}\), where V ^{∗} is the horizontal component of velocity perturbation and p ^{∗} corresponds to the pressure perturbation. This is appropriate for understanding the energy flux associated with midlatitude inertiagravity waves (IGWs). For midlatitude Rossby waves (RWs), however, the direction of \(\overline {\mathbf {V}^* p^*}\) differs from the group velocity and hence the energy flux, by a rotational vector flux with zero divergence. The rotational flux to be added to \(\overline {\mathbf {V}^* p^*}\) for estimating the group velocity of midlatitude RWs has previously been derived using quasigeostrophic equations and is singular at the equator.
By investigating the analytical solution of both equatorial waves (“Analytical investigation” section) and midlatitude waves (Appendix 1), we have derived an exact universal^{9} expression for the rotational flux which, after being added to \(\overline {\mathbf {V^*}p^*}\), is able to indicate the profile of the group velocity times wave energy for linear waves at all latitudes. This is what we call the level0 expression of the energy flux. The level0 energy flux is written using the solution φ ^{∗} of (17a), previously unmentioned in the literature, which we refer to as the accurate streamfunction associated with Ertel’s potential vorticity (EPV). Equation (17a) is the cornerstone of the present study, because it suggests a possibility for the energy flux to be estimated (i) without using a Fourier analysis nor ray theory and (ii) in the presence of coastal boundaries, which will allow for tropicalextratropical interactions in model output to be diagnosed in terms of an energy cycle in a future study. Presently, the level0 energy flux is not practical for use as a model diagnostic, since the secondorder time derivative term in (17a) makes it difficult to solve for φ ^{∗}. Thus, we hope that a future study is able to develop a numerical algorithm to solve (17a) for φ ^{∗}. We also note the need to extend the theory to a continuously stratified ocean and also to test out the theory in the presence of a sheared mean flow, both of which topics await a future study. This is a new step from the recent understanding of energetics in the atmosphere and ocean that had been focused on, for example, the global mapping of energy conversion rates associated with various physical processes (e.g., baroclinic and barotropic instabilities) and external forcing (Iwasaki 2001; Aiki and Richards 2008; Zhai et al. 2012).
The potential of our analysis as a model diagnostic is illustrated in the present study for a forced/dissipative equatorial basin mode simulated by a singlelayer model. The model result includes both midlatitude RWs (maintained by coastal KWs propagating poleward along the eastern boundary) and equatorial RWs (maintained by the reflection of equatorial KWs at the eastern boundary). We have used approximate expressions for the energy flux (what we call the level1 and level2 energy fluxes) that is based on the inversion equation (18a) of EPV and which is shown to be good approximations to the level0 expression in the case of the model run being considered. Since (18a) is seamlessly solvable at all latitudes with φ ^{app∗}=0 at coastlines, the source of the westward energy flux of midlatitude RWs in the model output has been successfully illustrated in the present study. To our knowledge, this is the first attempt to diagnose the energy cycle of a tropicalextratropical interaction associated with the connection of the equatorial and coastal waveguides.
Endnotes
^{1} While the energy flux of waves at all latitudes is considered in the present study, the pseudomomentum (or waveactivity) flux of waves at all latitudes is considered in Aiki et al. (2015, hereafter ATG15). Both the formulations of the present study and ATG15 may be reproduced even if a spherical coordinate system is used. The use of a Cartesian horizontal coordinate system in both the present study and ATG15 is for the purpose of simplicity, which will allow for the results of the two studies to be linked in a future study. A related discussion appears in Appendix 3.
^{2} What we call pressure, energy, and momentum in the present study are actually dynamic pressure, energy density, and momentum density, respectively, following ATG15.
^{3} d H ^{(n)}/d y=2n H ^{(n−1)}, H ^{(n+1)}=2y H ^{(n)}−2n H ^{(n−1)}, H ^{(0)}=1, H ^{(1)}=2y, H ^{(2)}=4y ^{2}−2, H ^{(3)}=8y ^{3}−12y, H ^{(4)}=16y ^{4}−48y ^{2}+12.
^{4} The factor ∂ ω/∂ k to calculate the energy flux is added in (14e).
^{5} The second term in the square brackets of (17b) vanishes as \(\overline {u^{*}_{t^{*}t^{*}}\varphi ^{*}}\simeq \overline {(p^{*}_{y^{*} t^{*} t^{*}}/f^{*})(p^{*}/f^{*})}=0\) where the phase relationship of plane waves is understood.
^{6} We use the term “diagnosable” to indicate that the quantity is readily estimated from quantities in model output without relying on a Fourier analysis.
^{7} In a related paper, Claus et al. (2014) also used this solution to investigate the influence of the barotropic mean flow on the Atlantic equatorial deep jets. The Atlantic equatorial deep jets are resonant with the gravest basin mode for a highorder baroclinic mode (typically the 15th vertical normal mode) and consist of vertically stacked zonal jets that oscillate at a given depth with a period of around 4.5 years.
^{8} This is lower than the value recommended by G12 for capturing the observed width of the deep jets but is chosen here since it is not so large as to prevent focusing of RWs on the equator. In the inviscid solution of Cane and Moore (1981), there is a singularity on the equator at the center of the basin due to RW focusing as described by Schopf et al. (1981).
^{9} In the present manuscript, we have used the term “exact” to refer to the level0 expression, in contrast to approximate expressions (i.e., level1 and 2). Likewise, we have used the term “universal” to indicate the ability to handle all wave types in Table 2, for which the group velocity has been well formulated in the literature/textbook.
^{10}Although it is not in the list of wave types in Table 2, IGWs on a midlatitude βplane may be characterized as α≪1,δ ^{2}≤1,γ ^{2}<1 where α≪1 corresponds to (19b). Thus, the net content in the square brackets on the last line of (24c) becomes O(1). Given α in front of \(c^* \overline {v^* v^*}\) on the last line of (24c), we may justify (23d) for IGWs on a midlatitude βplane. It can be said that the right hand side of (24c) becomes significantly nonzero when the assumption of plane waves in the meridional direction becomes inconsistent (Anderson and Gill 1979).
^{11}While the pseudomomentum flux itself \((\overline {E^*v^*v^*})\) is diagnosable from model output, the pseudomomentumfluxbased expression of the energy flux \((\overline {E^*v^*v^*})\omega ^*/k^*\) is not easily diagnosable from model output because of multiplication by the phase speed (see Appendix 3 for details).
Appendix 1
Is the streamfunction Eq. (17a) associated with EPV applicable to midlatitude waves?
Manipulation of the shallow water equation system (1a)–(1c) yields a characteristic equation associated with the meridional component of velocity to read
which is applicable to both midlatitude and equatorial regions. In what follows, we consider plane waves on either an fplane or a midlatitude βplane (i.e., \(f^* = f^{*}_{0}+ \beta ^* y^*\) and f0∗≫β ^{∗} y ^{∗}) and thus assume
Then, (19a) may be simplified as
The Coriolis parameter f0∗ in (19c) is constant that allows us to assume a horizontally monochromatic wave in a complex form
where i is the unit imaginary number, is wave amplitude, and θ=k ^{∗} x ^{∗}+l ^{∗} y ^{∗}−ω ^{∗} t ^{∗} is wave phase (k ^{∗} and l ^{∗} are the zonal and meridional components of a wavenumber vector, respectively, and ω ^{∗} is wave phase). For simplicity, all , k ^{∗}, l ^{∗}, and ω ^{∗} are assumed to be constant. Substitution of (20a) to both (1a) and (1c) yields a solution for u ^{∗} and p ^{∗} to read
where f ^{∗}=f0∗+β ^{∗} y ^{∗}. On the other hand, substitution of (20a) to (19c) yields
which is a universal expression for the dispersion relation of the various types of waves in midlatitude regions. For example, substitution of β ^{∗}=0 to (21) yields a classical dispersion relation for midlatitude IGWs (i.e., waves on an fplane), and substitution of ω ^{∗2}≪c ^{∗2} k ^{∗2} to (21) yields a classical dispersion relation for midlatitude RWs.
An expression for the zonal component of group velocity may be derived using (21) to read
We now identify the content of \((\overline {A^*B^*})_{y^*}\) in the following equation:
where each of A ^{∗} and B ^{∗} are quantities associated with the set of u ^{∗}, v ^{∗}, p ^{∗}, c ^{∗}, and f ^{∗}. A first step for investigating (22b) is to decompose \(\overline {u^* p^*}\) into two parts: one that is associated with the numerator of (22a) and one that is written as the meridional derivative of a scalar quantity, as follows:
where the first equality has been derived using both (20b)–(20c) and the set of \(\overline {v^* v^*}=\overline {v^*_{\theta } v^*_{\theta }}\) and \(\overline {v^*_{\theta } v^*}=0\) and the approximate equality in the middle has been derived using both the dispersion relation (21) and (19b). Then, we decompose the wave energy in (22b) into two parts, one that is associated with the denominator of (22a) and one that is written as the meridional derivative of a scalar quantity. We then have
where the first equality has been derived using both (20b)–(20c) and the set of \(\overline {v^* v^*}=\overline {v^*_{\theta } v^*_{\theta }}\) and \(\overline {v^*_{\theta } v^*}=0\) and the approximated equality in the middle has been derived using both the dispersion relation (21) and (19b). The set of (22c) and (22d) allows us to identify the content of \((\overline {A^* B^*})_{y^*}\) in (22b) to read
where the last equality has been derived using (20a)–(20c). Equation (22e) may be rewritten as
where
has been introduced. The definition of φ ^{∗}, as given by (23b), is based on a Fourier expansion and may be rewritten into an expression which contains none of θ, k ^{∗}, l ^{∗}, and ω ^{∗} to read
where the first equality has been derived using (20b) and the second equality has been derived using (2) [i.e., q t ^{∗}∗=−ω ^{∗} q θ∗=−β ^{∗} v ^{∗} and thus −ω ^{∗} q θ θ∗=ω ^{∗} q ^{∗}=−β ^{∗} v θ∗]. As far as we know, the set of (23a) and (23c) has not been mentioned in previous studies for midlatitude waves and has turned out to be almost the same as the set of (17b) and (17a) that has been derived for equatorial waves.
We now consider the meridional flux of wave energy. We would like to show that
It turns out that the second term on the left hand side, associated with the additional rotational flux, vanishes when evaluated using the analytical solution of waves [i.e., \((\overline {v^* v^*})_{x^*} = k\overline {v^*_{\theta } v^*}=0\)], which is as in LonguetHiggins (1964). This is attributed to the assumption of all , k ^{∗}, l ^{∗}, and ω ^{∗} being constant in particular in the zonal direction. An expression for the meridional component of group velocity may be derived from (21) to read
Then, we calculate the left hand side of (23d) using (20a)–(20b) as
where \(\overline {v^*_{\theta } v^*}=0\) has been used. We now calculate the difference of the meridional component of the group velocity times wave energy and \(\overline {v^* p^*}\) using the set of (22d), (24a), and (24b) to yield
where the last line has been written using the set of nondimensional parameters. These are defined as
It can be said that the last line of (24c) represents the contribution of higher order terms in an asymptotic expansion based on α, δ, and γ. This contribution should not be confused with the universal expression of the additional rotational flux which has already been clarified at (23a) and (23d). It should be also noted that the net content within the square brackets on the last line of (24c) is nondimensional, for which we shall make scale analysis in the next paragraph.
The quantity \(\alpha c^* \overline {v^* v^*}\) on the last line of (24c) may be interpreted as a reference for the magnitude of the energy flux of midlatitude RWs. Midlatitude RWs may be characterized as
Thus, the net content within the square brackets on the last line of (24c) approximates to zero, which justifies (23d) for midlatitude RWs. On the other hand, for midlatitude IGWs, \(c^* \overline {v^* v^*}\) on the last line of (24c) represents a reference for the magnitude of the energy flux. IGWs on an fplane may be characterized as
Thus, the last line of (24c) vanishes, which justifies (23d) for IGWs on an fplane^{10}.
To summarize, the streamfunction Eq. (17a) associated with EPV and the universal expression of the additional rotational flux in (17b) applies to both midlatitude and equatorial waves, in particular for wave types considered in the present study, as listed in Table 2.
Appendix 2
Approximate expressions for the energy flux
The exact profile of the group velocity times wave energy is given by the set of (15a) and (16), which is what we call the level0 energy flux. Owing to the last term on the left hand side of (16) that contains the secondorder partial differentiation with respect to time, the procedure of inverting EPV, without using a Fourier analysis, is still complicated.
Hence, we investigate the consequence of artificially removing the secondorder time derivative term from (16) as
where the superscript of φ ^{app} indicates that the solution of (26a) may be regarded as an approximation for the solution φ of the accurate streamfunction Eq. (16) associated with EPV. We have calculated the meridional profiles of
as shown by the dashed orange lines in Fig. 2 for lowfrequency equatorial waves (e.g., equatorial RWs) and in Fig. 3 for highfrequency equatorial waves (e.g., equatorial IGWs). Since this is an analytical investigation, we have used φ ^{app}=−v _{ θ }/(k−ω ^{3}) which has been derived from the EPV inversion Eq. (26a) with the use of the characteristic Eq. (10). All panels in Fig. 2 show a nice agreement between the dashed orange line given by (26b) and the solid black line, \((\partial \omega /\partial k)(\overline {u^2+v^2+p^{2}})\). By contrast, all panels in Fig. 3 show a finite disagreement between the dashed orange line given by (26b), \(\overline {up}+(\overline {p\varphi ^{\text {app}}}/2+\overline {u_{tt} \varphi ^{\text {app}}})_{y}\), and the solidblack line, \((\partial \omega /\partial k)(\overline {u^2+v^2+p^{2}})\).
It would be nice if there is a unified approximation for the energy flux that is able to represent the profile of the group velocity times the energy of both low and highfrequency equatorial waves. We have found that this requirement is roughly satisfied if (26b) is simplified as
where φ ^{app}=−v _{ θ }/(k−ω ^{3}) is the solution of (26a). The profile of (26c) is shown by the solid blue lines in Figs. 2 and 3 for low and highfrequency equatorial waves, respectively. This expression provides what we think is a potentially useful approximation for the group velocity times wave energy (the solid black lines) for all types of equatorial waves, as we show in the “Methods/Experimental” section.
In the present study, (26b) and its vector and dimensional form (18b) are referred to as the level1 energy flux. Likewise, (26c) and its vector and dimensional form (18c) are referred to as the level2 energy flux.
Why do we appreciate the level2 energy flux regardless of the error? An expression for pseudomomentum (or waveactivity) flux has long been used for the model diagnosis of the direction of the group velocity of waves in the atmosphere (and also the ocean), including in lowlatitude regions (Ripa 1982; Hoskins et al. 1983; Plumb 1986; Haynes 1988; Randel and Williamson 1990; Brunet and Haynes 1996; Fukutomi and Yasunari 2002; Wakata and Kitaya 2002; Kawatani et al. 2010). Using the analytical solution of equatorial waves, we have calculated the profile of the traditional pseudomomentum flux^{11} times the phase velocity of waves (see Appendix 3), as shown by the purple dots in Figs. 2 and 3. Interestingly, for lowfrequency waves, the profile of the pseudomomentumfluxbased expression (the purple dots) is almost the same as that of the level2 energy flux (the blue solid line). On the other hand, for highfrequency waves, the profile of the pseudomomentumfluxbased expression (the purple dots) is similar to that of the level1 energy flux (the orange dashed line) and quite different from the exact, level0 energy flux to which the level2 energy flux is a better approximation. Thus, the level2 energy flux is, in general, an improvement on the traditional model diagnosis of group velocity based on the pseudomomentum flux.
Concerning extension to midlatitude waves, both the level1 and level2 energy fluxes satisfy all conditions noted in the last paragraph of the “Boundary conditions and the connection to midlatitude regions” section. Note that the inversion Eq. (18a) of EPV is seamlessly solvable at all latitudes with the boundary condition of φ ^{app∗}=0. To summarize, the set of (18a) and (18c) [together with the boundary condition (17c)]—what we call the level2 expression—originates from a tradeoff between mathematical exactness and practical accessibility. The mathematical exactness for retrieving the group velocity of equatorial waves times wave energy has been achieved by the set of (17a) and (17b)—what we call the level0 expression. However, its accessibility is harmed by the secondorder time derivative term in the streamfunction equation (16) associated with EPV. On the other hand, concerning the practical accessibility, the set of (18a) and (18c)—the level2 expression—has the advantages that (i) it is seamlessly solvable at all latitudes and (ii) it provides a unified expression for all types of waves with which to estimate the direction of the group velocity. We have noted, for equatorial waves, that the profile of the level2 energy flux is somewhat better than that of the traditional pseudomomentum flux. It should be also noted that the energy flux given by (18c) satisfies the boundary condition of no flux through coastlines [using (17c)], an issue not considered in previous studies for the pseudomomentum flux. With these requirements in mind, we hope that future studies can lead to either an improved approximation or a numerical algorithm for the level0 energy flux.
Appendix 3
Similarity between the level2 energy flux of this study and the pseudomomentum flux in previous studies
Ripa (1982) has derived a conservation equation for pseudomomentum (or wave activity) associated with ageostrophic waves. His equation may be reproduced using (1a)–(1c) as
where the prognostic quantity may be referred to as the impulsebolus (IB) pseudomomentum (Aiki et al. 2015, hereafter ATG15) and E ^{∗} is the wave energy. Note that the IB pseudomomentum given here is the shallow water version of that given by Eq. (27a) in ATG15. It has been known that the expression of the flux in (27a) can indicate the direction of the group velocity of different types of waves, in particular, midlatitude RWs and IGWs (Hoskins et al. 1983; Plumb 1986; Haynes 1988). Another nice feature of the IB pseudomomentum Eq. (27a) is that it does not contain a singularity at the equator. In order to investigate the origin of these features, ATG15 have shown in their Eq. (18a) an identity between the IB pseudomomentum and the classical energybased (CE) pseudomomentum to read (again, written here for the shallow water equations)
which may be derived from (1a)–(1c) of the present study. Application of a lowpass temporal filter to (27b), and then, understanding the phase relationship between v ^{∗}=−q t ^{∗}∗/β ^{∗} and q ^{∗} yields
Substitution of (28b) to a lowpass timefiltered version of (28a) yields
which is a prognostic equation for the wave energy wherein the zonal component of the flux is proportional to that in the IB pseudomomentum equation (27a).
It is easy to expect that the expression of the flux in (29) can indicate the direction of the group velocity of midlatitude RWs and IGWs (Hoskins et al. 1983; Plumb 1986; Haynes 1988). For equatorial waves, here, we investigate the meridional profile of \((\overline {E^*v^*v^*})\omega ^*/k^*\) as shown by the purple dots in Figs. 2 and 3 for low and highfrequency waves, respectively. For lowfrequency waves (Fig. 2), the meridional profile of \((\overline {E^*v^*v^*})\omega ^*/k^*\) (the purple dots) is almost the same as that of the level2 energy flux (the blue solid line), showing that the level2 energy flux and the IB flux are closely related. For highfrequency waves (Fig. 3), the meridional profile of \((\overline {E^*v^*v^*})\omega ^*/k^*\) (the purple dots) is nearly the same as that of the level1 energy flux (the orange dashed line), indicating that the level2 energy flux is somewhat better than the IB flux.
In fact, without relying on the level0 expression, we have arrived at the level2 expression of the energy flux by extending the investigation of ATG15 concerning the algebraic structure of the IB flux (to be explained in a future study). ATG15 have addressed the importance of a waveinduced scalar quantity and symbolized it as Λ: it vanishes for midlatitude IGWs (i.e., waves with no perturbation of EPV) and becomes nonzero for midlatitude RWs (i.e., wave with a perturbation of EPV). Here, we suggest that \(\overline {\Lambda }=(\overline {p^* \eta ^* })_{y^*}/2\) is closely linked to \((\overline {p^* \varphi ^{\mathrm {app*}}})_{y^*}/2\) in the present study (η ^{∗} is meridional displacement). This is why the level2 expression for the energy flux in the present study can indicate the direction of the group velocity of different types of waves, an issue we shall discuss in a future study.
Note that the IB flux in (27a) has already been used for the model diagnosis of waves in lowlatitude regions (Randel and Williamson 1990; Brunet and Haynes 1996; Fukutomi and Yasunari 2002; Wakata and Kitaya 2002; Kawatani et al. 2010). We suggest that, despite the certain inaccuracy associated with equatorial waves as compared with the level0 expression, the level2 expression of the energy flux in the present study will be at least as useful as the IB flux which has long been used in the atmospheric (and oceanic) literature. For oceanic applications, the level2 energy flux brings two new advantages over the IB flux: (i) the level2 energy flux satisfies a nonormalflux boundary condition at coastlines, and (ii) the wave energy is a signdefinite quantity while the IB pseudomomentum is not.
Overall, we address the balance of (i) model accessibility, (ii) unified treatment for different types of waves, (iii) mathematical accuracy, and (iv) boundary conditions at coastlines. With these requirements in mind, we hope future studies can lead to either an improved approximation or a numerical algorithm for the level0 energy flux, wherein the profile of the IB flux will provide a reference for accuracy because the IB flux has long been used in previous studies.
Abbreviations
 EPV:

Ertel’s potential vorticity
 IGW:

Inertia gravity wave
 KW:

Kelvin wave
 RGW:

Mixed Rossbygravity wave
 RW:

Rossby wave
References
Aiki, H, Richards KJ (2008) Energetics of the global ocean: the role of layerthickness form drag. J Phys Oceanogr 38: 1845–1869.
Aiki, H, Takaya K, Greatbatch RJ (2015) A divergenceform waveinduced pressure inherent in the extension of the EliassenPalm theory to a threedimensional framework for waves at all latitudes. J Atmos Sci 72: 2822–2849.
Anderson, DLT, Gill AE (1979) Beta dispersion of inertial waves. J Geophys Res 84: 1836–1842.
Ascani, F, Firing E, McCreary JP, Brandt P, Greatbatch RJ (2015) The deep equatorial ocean circulation in windforced numerical solutions. J Phys Oceanogr 45: 1709–1734.
Brandt, P, Funk A, Hormann V, Dengler M, Greatbatch RJ (2011) Interannual atmospheric variability forced by the deep equatorial Atlantic Ocean. Nature 473: 497–500.
Brandt, P, Claus M, Greatbatch RJ, Kopte R, Toole JM, Johns WE (2016) Annual and semiannual cycle of equatorial Atlantic circulation associated with basin mode resonance. J Phys Oceanogr 46: 3011–3029.
Brunet, G, Haynes PH (1996) Lowlatitude reflection of Rossby wave trains. J Atmos Sci 53: 482–496.
Cai, M, Huang B (2013) A new look at the physics of Rossby waves: a mechanicalCoriolis oscillation. J Atmos Sci 70: 303–316.
Cane, MA, Moore DW (1981) A note on lowfrequency equatorial basin modes. J Phys Oceanogr 11: 1794–1806.
Chelton, DB, Schlax MG (1996) Global observations of oceanic Rossby waves. Science 272: 234–238.
Claus, M, Greatbatch RJ, Brandt P (2014) Influence of the barotropic mean flow on the width and the structure of the Atlantic equatorial deep jets. J Phys Oceanogr 44: 2485–2497.
Claus, M, Greatbatch RJ, Brandt P, Toole J (2016) Forcing of the Atlantic equatorial deep jets derived from observations. J Phys Oceanogr 46: 3549–3562.
Cummins, PF, Oey LY (1997) Simulation of barotropic and baroclinic tides off northern British Columbia. J Phys Oceanogr 27: 762–781.
Fukutomi, Y, Yasunari T (2002) Tropicalextratropical interaction associated with the 10–25day oscillation over the western Pacific during the northern summer. J Meteo Soc Japan 80: 311–331.
Furuichi, N, Hibiya T, Niwa Y (2008) Modelpredicted distribution of windinduced internal wave energy in the world’s oceans. J Geophys Res 113: C09034.
Gill, AE (1982) Atmosphere–ocean dynamics. Academic Press, London.
Greatbatch, RJ, Brandt P, Claus M, Didwischus SH, Fu Y (2012) On the width of the equatorial deep jets. J Phys Oceanogr 42: 1729–1740.
Haynes, PH (1988) Forced, dissipative generalizations of finiteamplitude waveactivity conservation relations for zontal and nonzonal basic flows. J Atmos Sci 45: 2352–2362.
Hoskins, BJ, James IN, White GH (1983) The shape, propagation and meanflow interaction of largescale weather systems. J Atmos Sci 40: 1595–1612.
Isachsen, PE, LaCasce JJ, Pedlosky J (2007) Rossby wave instability and apparent phase speeds in large ocean basins. J Phys Oceanogr 37: 1177–1191.
Iwasaki, T (2001) Atmospheric energy cycle viewed from wavemeanflow interaction and Lagrangian mean circulation. J Atmos Sci 58: 3036–3052.
Johnson, GC, Zhang D (2003) Structure of the Atlantic Ocean equatorial deep jets. J Phys Oceanogr 33: 600–609.
Kawatani, Y, Sato K, Dunkerton TJ, Watanabe S, Miyahara S, Takahashi M (2010) The roles of equatorial trapped waves and internal inertiagravity waves in driving the quasibiennial oscillation. Part II: threedimensional distribution of wave forcing. J Atmos Sci 67: 981–997.
Lübbecke, JF, Böning CW, Keenlyside N, Xie SP (2010) On the connection between Benguela and equatorial Atlantic Ninos and the role of the South Atlantic Anticyclone. J Geophys Res 115: C09015.
LonguetHiggins, MS (1964) On group velocity and energy flux in planetary wave motion. DeepSea Res 11: 35–42.
Masuda, A (1978) Group velocity and energy transport by Rossby waves. J Oceanogr Soc Jpn 34: 1–7.
Matsuno, T (1966) Quasigeostrophic motions in the equatorial area. J Meteo Soc Japan 44: 25–43.
Matthiessen, JD, Greatbatch RJ, Brandt P, Claus M, Didwischus SH (2015) Influence of the equatorial deep jets on the north equatorial countercurrent. Ocean Dyn 65: 1095–1102.
McPhaden, MJ, Ripa P (1990) Wavemean flow interactions in the equatorial ocean. Annu Rev Fluid Mech 20: 167–205.
Merle, J (1980) Annual and interannual variability of temperature in the eastern equatorial Atlantic—the hypothesis of an Atlantic El Nino. Oceanol Acta 3: 209–220.
Nakamura, N, Solomon A (2011) Finiteamplitude wave activity and mean flow adjustments in the atmospheric general circulation. Part II: analysis in the isentropic coordinates. J Atmos Sci 68: 2783–2799.
Niwa, Y, Hibiya T (2004) Threedimensional numerical simulation of M2 internal tides in the East China Sea. J Geophys Res 109: C04027.
Orlanski, I, Sheldon J (1993) A case of downstream baroclinic development over western north America. Mon Wea Rev 121: 2929–2950.
Philander, SGH (1989) El Nino, La Nina, and the Southern Oscillation. Academic Press, London.
Plumb, RA (1986) Threedimensional propagation of transient quasigeostrophic eddies and its relationship with the eddy forcing of the time mean flow. J Atmos Sci 43: 1657–1678.
Randel, WJ, Williamson DL (1990) A comparison of the climate simulated by the NCAR community climate model (CCM1:R15) with ECMWF analysis. J Climate 3: 608–633.
Ripa, P (1982) Nonlinear wavewave interactions in a onelayer reducedgravity model on the equatorial β plane. J Phys Oceanogr 12: 97–111.
Schopf, PS, Anderson DLT, Smith R (1981) Betadispersion of lowfrequency Rossby waves. Dyn Atmos Oceans 5: 187–214.
Takaya, K, Nakamura H (1997) A formulation of a wave activity flux for stationary Rossby waves on a zonally varying basic flow. Geophys Res Lett 24: 2985–2988.
Thierry, V, Treguier AM, Mercier H (2004) Numerical study of the annual and semiannual fluctuations in the deep equatorial Atlantic Ocean. Ocean Model 6: 1–30.
Wakata, Y, Kitaya S (2002) Annual variability of sea surface height and upper layer thickness in the Pacific Ocean. J Oceanogr 58: 439–450.
Yanai, M, Maruyama T (1966) Stratospheric wave disturbances propagating over the equatorial pacific. J Meteo Soc Japan 44: 291–294.
Zhai, X, Johnson HL, Marshall DP, Wunsch C (2012) On the wind power input to the ocean general circulation. J Phys Oceanogr 42: 1357–1365.
Acknowledgements
This manuscript has been improved by comments from two anonymous reviewers. HA thanks Paal Erik Isachsen for the helpful discussions and RJG is grateful to the GEOMAR for ongoing support.
Funding
This study was supported by JSPS KAKENHI Grant Numbers 26400474 and 15H02129 and also by the Deutsche Forschungsgemeinschaft as part of the Sonderforschungsbereich 754 “Climate  Biogeochemistry Interactions in the Tropical Ocean,” by the German Federal Ministry of Education and Research as part of the cooperative project SACUS (03G0837A), and by the European Union 7th Framework Programme (FP7 20072013) under grant agreement 603521 PREFACE project.
Authors’ contributions
HA proposed the topic and performed the analytical investigation. RJG helped write the manuscript. MC helped with the numerical investigation. All authors read and approved the final manuscript.
Competing interests
The authors declare that they have no competing interest.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Author information
Additional file
Additional file 1: Movie of the model experiment. See the caption of Fig. 4 for details. (MP4 2365 kb)
Additional file 1: Movie of the model experiment. See the caption of Fig. 4 for details. (MP4 2365 kb)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Aiki, H., Greatbatch, R. & Claus, M. Towards a seamlessly diagnosable expression for the energy flux associated with both equatorial and midlatitude waves. Prog. in Earth and Planet. Sci. 4, 11 (2017) doi:10.1186/s4064501701211
Received
Accepted
Published
DOI
Keywords
 Group velocity
 Model diagnosis
 Equatorial Rossby waves
 Equatorial mixed Rossbygravity waves
 Equatorial inertiagravity waves
 Equatorial Kelvin waves
 Coastal Kelvin waves
 Midlatitude Rossby waves
 Midlatitude inertiagravity waves
 Tropicalextratropical interactions