Towards a seamlessly diagnosable expression for the energy flux associated with both equatorial and mid-latitude waves

Aiki, Hidenori; Greatbatch, Richard J.; Claus, Martin

doi:10.1186/s40645-017-0121-1

Research Article
Open access
Published: 31 March 2017

Towards a seamlessly diagnosable expression for the energy flux associated with both equatorial and mid-latitude waves

Hidenori Aiki^1,2,
Richard J. Greatbatch^3,4 &
Martin Claus³

Progress in Earth and Planetary Science volume 4, Article number: 11 (2017) Cite this article

3636 Accesses
19 Citations
2 Altmetric
Metrics details

Abstract

For mid-latitude Rossby waves (RWs) in the atmosphere, the expression for the energy flux for use in a model diagnosis, and without relying on a Fourier analysis or a ray theory, has previously been derived using quasi-geostrophic equations and is singular at the equator. By investigating the analytical solution of both equatorial and mid-latitude waves, the authors derive an exact universal expression for the energy flux which is able to indicate the direction of the group velocity at all latitudes for linear shallow water waves. This is achieved by introducing a streamfunction as given by the inversion equation of Ertel’s potential vorticity, a novel aspect for considering the energy flux. For ease of diagnosis from a model, an approximate version of the universal expression is explored and illustrated for a forced/dissipative equatorial basin mode simulated by a single-layer oceanic model that includes both mid-latitude RWs and equatorial waves. Equatorial Kelvin Waves (KWs) propagate eastward along the equator, are partially redirected poleward as coastal KWs at the eastern boundary of the basin, and then shed mid-latitude RWs that propagate westward into the basin interior. The connection of the equatorial and coastal waveguides has been successfully illustrated by the approximate expression of the group-velocity-based energy flux of the present study. This will allow for tropical-extratropical interactions in oceanic and atmospheric model outputs to be diagnosed in terms of an energy cycle in a future study.

Introduction

A feature of many phenomena in the equatorial oceans is the role played by equatorial Kelvin waves (KWs), examples being El Niño Southern Oscillation (ENSO; Philander 1989) and the so-called Atlantic Niño (Merle 1980). KWs propagate along the equator and are partially redirected into coastal KWs at the eastern boundary, where they can influence off-equatorial latitudes (e.g., Lübbecke et al. 2010) as well as excite extratropical Rossby waves (RWs) that subsequently propagate into the ocean interior (McPhaden and Ripa 1990; Isachsen et al. 2007). A striking example of this behavior is the equatorial basin mode (Cane and Moore 1981). For the gravest basin mode, the time scale is set by the time taken for an equatorial KW to propagate across the basin and for the reflected gravest long Rossby wave to return to the western boundary (that is 4L/c where L is the basin width and c is the phase propagation speed for KWs). In addition to waves that are trapped on the equator, equatorial basin modes also feature coastal KWs that propagate along the eastern boundary and extratropical RWs that are excited by these KWs and refocus on the equator, as described by Schopf et al. (1981). There is growing evidence that equatorial basin modes play an important role in equatorial ocean dynamics. For example, basin modes have been associated with the equatorial deep jets (Johnson and Zhang 2003; Brandt et al. 2011; Claus et al. 2016) and with the semi-annual (Thierry et al. 2004) and annual cycles (Brandt et al. 2016) in the equatorial Atlantic. However, the energy cycle associated with equatorial basin modes has received little attention and is an important factor when considering the forced/dissipative basin modes that one can relate to observations. A particularly interesting example is the upward energy propagation associated with the Atlantic equatorial deep jets (Johnson and Zhang 2003; Brandt et al. 2011; Mathiessen et al. 2015). Yet, the detailed energy cycle associated with the jets remains largely unknown.

One way to approach the energy flux is to use ray theory. However, ray theory is linked to the dispersion relation of a single type of wave and is not suitable for investigating the sequential connection of different types of waves that are associated with a basin mode. Likewise, a Fourier analysis is not suitable for the investigation of waves near the coastal boundaries of the ocean. In fact, it is only for mid-latitude inertia-gravity waves (IGWs) that the flux of wave energy has been diagnosed from oceanic model output (Cummins and Oey 1997; Niwa and Hibiya 2004; Furuichi et al. 2008). On the other hand, in the atmospheric literature, the model diagnosis of pseudomomentum (or wave activity) flux has been more popular than the model diagnosis of the energy flux (Hoskins et al. 1983; Plumb 1986; Takaya and Nakamura 1997; Nakamura and Solomon 2011).

Here, we seek a general expression that can be used to diagnose the energy flux associated with linear shallow water waves at all latitudes from model output. This manuscript is organized as follows. First we provide the theoretical background. Then, we present an analytical investigation that leads to a general expression for the energy flux that can indicate the exact profile of the group velocity times wave energy for both equatorial and mid-latitude waves. The utility of the universal expression of energy flux as a model diagnostic is illustrated for a forced/dissipative equatorial basin mode simulated by a single-layer model. The model diagnosis is achieved by introducing an inversion for the linearized version of Ertel’s potential vorticity. This is a novel aspect for considering the energy flux in the presence of a coastal waveguide that connects the equatorial and mid-latitude regions.

Theoretical background

We use the shallow water equations for a single vertical normal mode (Gill 1982) appropriate to linear waves in a rotating frame of reference and in the absence of a mean flow. Let an arbitrary variable with an associated physical dimension be expressed by A ^∗, and let Cartesian-horizontal coordinates be labelled by the set of independent variables x ^∗,y ^∗,t ^∗, where each of x ^∗, y ^∗ increases eastward and northward, respectively, and u ^∗,v ^∗ are the corresponding horizontal components of velocity (a list of variables is given in Table 1)¹. The equations may then be written as

$$\begin{array}{@{}rcl@{}} && \frac{\partial u^{*}}{\partial t^{*}} - f^{*} v^{*} + \frac{\partial p^{*}}{\partial x^{*}} = 0, \end{array} $$

(1a)

Table 1 List of symbols, where A ^∗ and A are arbitrary quantities written dimensionally or non-dimensionally, respectively

Full size table

$$\begin{array}{@{}rcl@{}} && \frac{\partial v^{*}}{\partial t^{*}} + f^{*} u^{*} + \frac{\partial p^{*}}{\partial y^{*}} = 0, \end{array} $$

(1b)

$$\begin{array}{@{}rcl@{}} && \frac{\partial p^{*}}{\partial t^{*}} + {c^{*}}^{2} \left(\frac{\partial u^{*}}{\partial x^{*}}+\frac{\partial v^{*}}{\partial y^{*}}\right) = 0, \end{array} $$

(1c)

where $f^{*} =f^{*}_{0} + \beta ^{*} y^{*}$ is the Coriolis parameter, p ^∗=p ^∗(x ^∗,y ^∗,t ^∗) corresponds to the pressure² or geopotential, and c ^∗ is a uniform constant representing the propagation speed of nonrotating gravity waves for a given mode. Manipulation of (1a)–(1c) yields a prognostic equation for the linearized version of Ertel’s potential vorticity (hereafter EPV and symbolized as q ^∗) to read

$$\begin{array}{@{}rcl@{}} \frac{\partial}{\partial t^{*}} \Big(\underbrace{ \frac{\partial v^{*}}{\partial x^{*}}- \frac{\partial u^{*}}{\partial y^{*}} - \frac{f^{*}}{{c^{*}}^{2}} p^{*} }_{\equiv q^{*}} \Big) + v^{*} \beta^{*} = 0, \end{array} $$

(2)

which is applicable to waves at all latitudes, such as mid-latitude RWs, mid-latitude IGWs, and equatorial waves [i.e., equatorial RWs and IGWs, equatorial Rossby-gravity waves (RGWs, i.e., Yanai waves), and equatorial KWs; Matsuno 1966; Yanai and Maruyama 1966], understanding $f^{*}_{0} = 0$ for an equatorial β-plane and β ^∗=0 for a mid-latitude f-plane. Both mid-latitude IGWs (i.e., β ^∗=0) and equatorial KWs (i.e., v ^∗=0) are characterized by q ^∗=0, as noted in Table 2.

Table 2 Characteristics of different waves at various latitudes

Full size table

On the other hand, a prognostic equation for wave energy may be derived from (1a)–(1c) as

$$\begin{array}{@{}rcl@{}} \frac{\partial}{\partial t^{*}} \frac{1}{2}\left(\overline{{u^{*}}^{2} + {v^{*}}^{2} + \frac{{p^{*}}^{2}}{{c^{*}}^{2}}}\right) +\nabla^{*} \cdot \langle\!\langle \overline{u^{*} p^{*}}, \overline{v^{*} p^{*}}\rangle\!\rangle = 0, \end{array} $$

(3)

where $\nabla ^{*} \equiv \langle \!\langle \frac {\partial ~}{\partial {x^{*}}}, \frac {\partial ~}{\partial {y^{*}}}\rangle \!\rangle $ and the overbar symbol represents a phase-average operator (i.e., for a sinusoidal wave, $\overline {A^{*}}=0$ for A ^∗=u ^∗, v ^∗, and p ^∗) or a low-pass time filter (for this reason, we retain the local time derivative in (3) to allow for slow time variations in the general case).

For mid-latitude IGWs in the ocean and atmosphere, the group velocity vector points in the same direction as the energy flux vector in (3):

$$\begin{array}{@{}rcl@{}} \overline{{\mathbf{V}}^{*} p^{*}} = \langle\!\langle \overline{u^{*} p^{*}},\overline{v^{*} p^{*}} \rangle\!\rangle, \end{array} $$

(4)

a property that has been exploited by Cummins and Oey (1997), Niwa and Hibiya (2004), and Furuichi et al. (2008) for a model diagnosis. However, for mid-latitude RWs, the vector in (4) does not point in the direction of the group velocity of the waves (Longuet-Higgins 1964; Masuda 1978; Cai and Huang 2013). In order to retrieve the correct direction for the energy flux associated with mid-latitude RWs, Orlanski and Sheldon (1993, hereafter OS93) have suggested to modify (3), without affecting the horizontal divergence of the energy flux, as

$$\begin{array}{@{}rcl@{}} &&\frac{\partial}{\partial t^{*}} \frac{1}{2}\left(\overline{{u^{*}}^{2} + {v^{*}}^{2} + \frac{{p^{*}}^{2}}{{c^{*}}^{2}}} \right) +\\ &&\nabla^{*} \cdot \Big\langle\!\Big\langle \overline{u^{*} p^{*}} + \frac{\partial}{\partial y^{*}} \left(\frac{\overline{{p^{*}}^{2}}}{2 f^{*}}\right), \overline{v^{*} p^{*}} - \frac{\partial}{\partial x^{*}}\left(\frac{\overline{{p^{*}}^{2}}}{2f^{*}}\right) \Big\rangle\!\Big\rangle = 0,\\ \end{array} $$

(5)

where each of u ^∗ and v ^∗ should be the sum of the geostrophic and ageostrophic components and $f^{*} = f^{*}_{0} + \beta ^{*} y^{*}$ is understood. The energy flux vector in (5) consists of two terms,

$$\begin{array}{@{}rcl@{}} \overline{\mathbf{V}^{*} p^{*}} -\nabla^{*} \times [\overline{{p^{*}}^{2}}/(2f^{*})]\mathbf{z}, \end{array} $$

(6)

where $\overline {\mathbf {V}^{*} p^{*}}$ is as in the gravity wave literature (i.e., V ^∗ is the sum of the geostrophic and ageostrophic components of velocity). The second term in (6) is the additional rotational component required to reproduce the direction of the group velocity of mid-latitude RWs (z is the upward vertical unit vector). In Longuet-Higgins (1964), the second term of (6) has been expressed as $-\nabla ^{*} \times [f^{*} \overline {\psi ^{*2}}/2]\mathbf {z}$ where ψ ^∗ is a streamfunction based on the assumption of horizontally nondivergent velocity. This assumption is hardly used in modern oceanography owing to the smallness of the deformation radius. In quasi-geostrophic theory, ψ ^∗=p ^∗/f ^∗ from which the connection with (6) is clear.

The question naturally arises as to whether or not it is possible to find a general expression for the additional rotational flux, R ^∗, that holds for waves at all latitudes and is such that the corresponding energy flux $\overline {{\mathbf {V}}^{*} p^{*}} + \mathbf {R}^{*}$ always points in the direction of the group velocity and thus constitutes a general expression for the energy flux associated with waves at all latitudes. This is the main subject of the present study. In this study, we focus on wave types for which the group velocity has been well formulated in the literature/textbook, as listed in Table 2. Of particular interest is the energy flux associated with equatorial RWs given that the expression in (6) is singular at the equator. The assumption of horizontally nondivergent velocity in Longuet-Higgins (1964) is also inappropriate for equatorial regions. In the next section, by investigating the analytical solution of equatorial waves, we derive an exact universal expression for the rotational flux which, after being added to $\overline {{\mathbf {V}}^{*} p^{*}}$, is able to indicate the direction of the group velocity for linear waves at all latitudes.

Analytical investigation

We begin by revisiting analytical expressions for the profile of the energy flux associated with equatorial waves. This investigation allows us to derive an expression for the energy flux that points in the direction of the group velocity for waves at all latitudes.

Energy flux associated with equatorial waves

We assume linear waves in the absence of a mean flow on an equatorial β-plane. As in Matsuno (1966) and Gill (1982), we use a time scale $1/\sqrt {c^{*}\beta ^{*}}$ and a length scale $\sqrt {c^{*}/\beta ^{*}}$ to nondimensionalize the equation system (1a)–(1c) to give

$$\begin{array}{@{}rcl@{}} &&u_{t} - y v + p_{x} = 0, \end{array} $$

(7a)

$$\begin{array}{@{}rcl@{}} &&v_{t} + y u + p_{y} = 0, \end{array} $$

(7b)

$$\begin{array}{@{}rcl@{}} && p_{t} + u_{x} + v_{y} = 0, \end{array} $$

(7c)

where symbols without an asterisk indicate nondimensionalized quantities and subscripts indicate partial differentiations. Manipulation of (7a)–(7c) yields prognostic equations for EPV and wave energy in a nondimensionalized form to read,

$$\begin{array}{@{}rcl@{}} \partial_{t} (\underbrace{v_{x}-u_{y} - y p}_{\equiv q}) + v = 0, \end{array} $$

(8)

$$\begin{array}{@{}rcl@{}} \partial_{t} (\overline{u^{2}+v^{2}+p^{2}}) /2 + \nabla \cdot \langle\!\langle \overline{up},\overline{vp}\rangle\!\rangle = 0, \end{array} $$

(9)

where $\partial _{t} \equiv \frac {\partial ~}{\partial t}$, $\nabla \equiv \langle \!\langle \frac {\partial ~}{\partial x}, \frac {\partial ~}{\partial y}\rangle \!\rangle $, and for A=u, v, or p, $\overline {A}=0$ for sinusoidally varying waves.

In what follows, we assume v≢0 which is appropriate for equatorial RWs, RGWs, and IGWs (i.e., waves other than equatorial KWs). Then, we consider zonally propagating free waves with a relationship v∝ cosθ, u∝ sinθ, and p∝ sinθ where θ≡k x−ω t is wave phase with k and ω being wavenumber and wave frequency, respectively. Substitution of these relationships to (7a)–(7c), followed by some manipulation, yields a characteristic equation for the meridional structure of v to read,

$$\begin{array}{@{}rcl@{}} &&v_{yy} + (\omega^{2} -k^{2} - k/\omega - y^{2}) v = 0. \end{array} $$

(10)

Matsuno (1966) has derived a solution for (7a)–(7c) and (10) to yield,

(11a)

$$\begin{array}{@{}rcl@{}} && u = (\omega y v_{\theta} - k v_{y\theta})/(\omega^{2}-k^{2}), \end{array} $$

(11b)

$$\begin{array}{@{}rcl@{}} && p = (k y v_{\theta} - \omega v_{y\theta})/(\omega^{2}-k^{2}), \end{array} $$

(11c)

where is wave amplitude and the symbol H ⁽ⁿ⁾ is the Hermite polynomial with n being the meridional mode number³. The subscript θ represents partial differentiation in terms of the wave phase .

Substitution of (11a) to (10) yields,

$$\begin{array}{@{}rcl@{}} &&\omega^{3} - (k^{2} + 2n + 1)\omega - k = 0, \end{array} $$

(12)

which is a unified dispersion relation for equatorial RWs, RGWs, and IGWs. Partial differentiation of (12) with respect to wavenumber k yields a unified expression for the group velocity of equatorial waves,

$$\begin{array}{@{}rcl@{}} \frac{\partial \omega}{\partial k} = \frac{2 k \omega + 1 }{3 \omega^{2} -(k^{2} +2n +1)}= \frac{2 \omega^{2} + \omega/k}{2\omega^{3}/k+1}, \end{array} $$

(13)

where 2ω ³/k in the denominator has often been ignored in previous studies when focusing on low-frequency equatorial waves (e.g., equatorial RWs; Gill 1982).

We now investigate the energy flux associated with (7a)–(7c). It is known that, for zonally propagating equatorial waves, the meridional integral of $\overline {up}$ is equal to the group velocity times the meridional integral of the wave energy (Philander 1989):

$$\begin{array}{@{}rcl@{}} \int^{+\infty}_{-\infty} \overline{up}~dy= (\partial \omega/\partial k) \int^{+\infty}_{-\infty} (\overline{u^{2}+v^{2}+p^{2}})/2~dy. \\ \end{array} $$

(14a)

It should be noted that the identity (14a) does not hold if it is evaluated without the meridional integral:

$$\begin{array}{@{}rcl@{}} \overline{up} \neq (\partial \omega/\partial k) (\overline{u^{2}+v^{2}+p^{2}})/2. \end{array} $$

(14b)

For low-frequency equatorial waves (with ω<1— see Fig. 1—, i.e., all equatorial RWs and westward propagating RGWs), the meridional profiles of $\overline {up}$ and $(\partial \omega /\partial k)(\overline {u^{2}+v^{2}+p^{2}})/2$ are shown by the dashed green and solid black lines, respectively, in Fig. 2. It is clear that, when compared at a given latitude, $\overline {up}$ is not equal to the group velocity times wave energy. In particular, the meridional profile of $\overline {up}$ is sign-indefinite for low-frequency equatorial waves (Fig. 2). On the other hand, as shown by the dashed green and solid black lines in Fig. 3 for high-frequency equatorial waves (with ω>1— see Fig. 1—, i.e., all equatorial IGWs and eastward propagating RGWs), the meridional profile of $\overline {up}$ provides a much better approximation for the group velocity times wave energy. The solid blue line, dashed orange line, and purple dots in Figs. 2 and 3 are explained later in the manuscript.

Identification of the additional rotational flux associated with equatorial waves

It is useful to derive the analytical expression for the difference between the left and right hand sides of (14b). A first step for identifying the difference is to decompose the zonal component of $\overline {up}$ into two parts, one that determines the meridional integral and one that does not affect it, as follows:

$$ {\begin{aligned} &\overline{up}\\ &= \left[y^{2} \overline{vv}(\omega k) - y \overline{v_{y} v}\left(\omega^{2} + k^{2}\right) + \overline{v_{y} v_{y}}(\omega k) \right]/\left(\omega^{2} - k^{2}\right)^{2}\\ &= \left\{\overline{v_{yy} v}(\omega k) + \overline{vv}\left(\omega^{3} k - \omega k^{3} - k^{2}\right)\right.\\ & \quad\left.-\left[\left(y \overline{vv}/2\right)_{y} - \overline{v v}/2 \right]\left(\omega^{2} + k^{2}\right) + \overline{v_{y} v_{y}}(\omega k)\right\}/\left(\omega^{2} - k^{2}\right)^{2}\\ &= \left[\overline{vv}\left(2\omega^{3} k - 2\omega k^{3} - k^{2} + \omega^{2}\right)+ \left(\overline{v_{y} v}\right)_{y} (2\omega k) \right.\\ &\quad\left.-\left(y \overline{v v }\right)_{y} \left(\omega^{2} + k^{2}\right)\right] /\left[2\left(\omega^{2} - k^{2}\right)^{2}\right]\\ &=\overline{vv}\left(2\omega k + 1\right) /\left[2\left(\omega^{2} - k^{2}\right)\right] \\ &\quad+ \left[\overline{v_{y} v}(2\omega k) - y \overline{vv} \left(\omega^{2} + k^{2}\right)\right]_{y} /\left[2\left(\omega^{2} - k^{2}\right)^{2}\right], \end{aligned}} $$

(14c)

where the first equality has been derived using (11b)–(11c) and $\overline {\sin \theta \sin \theta }=\overline {\cos \theta \cos \theta }$ and the second equality has been derived using (10). Note that it is the second of the two terms whose meridional integral is zero (noting that v and yv go to zero at large distances from the equator).

We now decompose the wave energy⁴ into two parts, one that determines the meridional integral and one does not. We then have

$$ { \begin{aligned} &\left(\overline{u^{2}+v^{2}+p^{2}}\right)/2\\ &= \overline{vv}/2 + \left[\left(y^{2} \overline{vv} + \overline{v_{y} v_{y}}\right)\left(\omega^{2} + k^{2}\right)\right.\\ &\quad\left.-\left(y\overline{v_{y} v}\right)\left(4 k \omega\right)\right]/\left[2 \left(\omega^{2} - k^{2}\right)^{2}\right]\\ &= \overline{vv}/2 + \left\{\left[y^{2} \overline{vv} - \overline{v_{yy} v} + \left(\overline{v_{y} v}\right)_{y}\right]\left(\omega^{2} + k^{2}\right)\right.\\ &\quad\left.-\left(y \overline{vv}\right)_{y}\left(2 k \omega\right) + \left(\overline{vv}\right)\left(2 k \omega\right) \right\}/\left[2 \left(\omega^{2} - k^{2}\right)^{2}\right]\\ &= \left[\overline{vv}\left(k^{4} - 2 k^{2} \omega^{2} +\omega^{4} + \omega^{4} - k^{4} + k\omega - k^{3}/\omega\right)\right.\\ &\left.\quad+\left(\overline{v_{y} v}\right)_{y} \left(\omega^{2} + k^{2}\right)- \left(y\overline{vv}\right)_{y}\left(2 k \omega\right)\right]/\left[2 \left(\omega^{2} - k^{2}\right)^{2}\right]\\ &=\overline{vv} \left(2 \omega^{2} + k/\omega \right)/\left[2 \left(\omega^{2} - k^{2}\right)\right]\\ &\quad+\left[\overline{v_{y} v}\left(\omega^{2} + k^{2}\right)- y \overline{vv} (2 k \omega)\right]_{y}/\left[2 \left(\omega^{2} - k^{2}\right)^{2}\right], \end{aligned}} $$

(14d)

where the first equality has been derived using (11b)–(11c) and $\overline {\sin \theta \sin \theta }=\overline {\cos \theta \cos \theta }$, and the third equality has been derived using (10). As before, it is the second of the two terms whose meridional integral is zero. Using (14c)–(14d), we now obtain an analytical expression for the difference between the right and left hand sides of (14b) to yield

$$\begin{array}{@{}rcl@{}} &&(\partial \omega/\partial k) (\overline{u^{2}+v^{2}+p^{2}})/2 - \overline{up} \\ &&= \frac{(\overline{v_{y} v})_{y}}{2 (\omega^{2} - k^{2})^{2}}\left[\frac{(2\omega^{2}+\omega/k)(\omega^{2} + k^{2})}{2\omega^{3}/k + 1}-2\omega k\right] \\ &&\quad-\frac{(y\overline{vv})_{y}}{2 (\omega^{2} - k^{2})^{2}}\left[\frac{(2\omega^{2}+\omega/k) 2 k \omega}{2\omega^{3}/k + 1}- (\omega^{2} + k^{2}) \right] \\ &&= \frac{(\overline{v_{y} v})_{y}(2\omega^{4}+ 2\omega^{2} k^{2} + \omega^{3}/k + \omega k - 4\omega^{4} -2\omega k)}{2 (\omega^{2} - k^{2})^{2}(2\omega^{3}/k + 1)} \\ &&\quad-\frac{(y \overline{vv})_{y}(4\omega^{3} k+ 2\omega^{2} - 2 \omega^{5} /k - 2\omega^{3} k - \omega^{2} - k^{2})}{2 (\omega^{2} - k^{2})^{2}(2\omega^{3}/k + 1)} \\ &&= \frac{(\overline{v_{y\theta} v_{\theta}})_{y}(\omega/k - 2 \omega^{2})-(y \overline{v_{\theta} v_{\theta}})_{y}(1-2\omega^{3}/k)}{2 (\omega^{2} - k^{2})(2\omega^{3}/k + 1)} \\ && = \frac{[\overline{(\omega v_{y\theta} - k y v_{\theta})v_{\theta}}]_{y} + [\overline{(-kv_{y\theta}+\omega y v_{\theta}) v_{\theta}}]_{y}(2\omega^{2})}{2 k (\omega^{2} - k^{2})(2 \omega^{3}/k+1)} \\ && = \frac{-(\overline{pv_{\theta}})_{y} - (\overline{2 u_{tt} v_{\theta}})_{y}}{2k (1+2 \omega^{3}/k)}, \end{array} $$

(14e)

where the first and second equalities have been derived using (13), the third equality has been derived using $\overline {\cos \theta \cos \theta }=\overline {\sin \theta \sin \theta }$, and the last equality has been derived using (11b)–(11c). The last line of (14e) has been written as the meridional gradient of scalar quantities. Thus, the meridional integral of (14e) vanishes for equatorial waves (with a meridionally decaying structure) and is consistent with (14a).

Using (14e), we can now rewrite the zonal component of the group velocity times wave energy as

$$\begin{array}{@{}rcl@{}} && (\partial \omega/\partial k) (\overline{u^{2}+v^{2}+p^{2}})/2 = \overline{up} + (\overline{p\varphi}/2 + \overline{u_{tt}\varphi})_{y},{\phantom{11111}} \end{array} $$

(15a)

$$\begin{array}{@{}rcl@{}} &&\varphi \equiv -v_{\theta}/(k+2\omega^{3}), \end{array} $$

(15b)

where the scalar quantity φ has been introduced. We have confirmed that, as long as φ is set by (15b), the meridional profile of the zonal energy flux, $\overline {up}+ (\overline {p\varphi }/2 + \overline {u_{tt}\varphi })_{y}$, in (15a) is precisely identical to $(\partial \omega /\partial k) (\overline {u^{2}+v^{2}+p^{2}})/2$ for all types of equatorial waves in Figs. 2 and 3. Namely, all solid black lines in Figs. 2 and 3 may be drawn using either expression. As far as we know, (15a) and (15b) have not been mentioned in previous studies and therefore constitute a new result.

Inversion equations for Ertel’s potential vorticity

The definition of φ, as given by (15b), is based on a Fourier expansion. However, we have found that (15b) may be rewritten into an expression which contains none of θ, k, and ω to read

$$\begin{array}{*{20}l} \nabla^{2} \varphi -y^{2} \varphi -3\varphi_{tt} &= - v_{\theta}/\omega \\ &= q, \end{array} $$

(16)

where ∇²≡∂ _xx+∂ _yy is understood, the first line has been derived using (10), and the second line has been derived using (8) [i.e., q _t=−ω q _θ=−v and thus −ω q _{θ
θ}=ω q=−v _θ]. The new Eq. (16) of EPV is the cornerstone of the present study, because it suggests a possibility for the scalar quantity φ to be estimated without using a Fourier analysis. This feature is important for identifying the direction of the energy flux of waves in the presence of coastal boundaries.

To summarize, in order to reproduce the profile of the group velocity times wave energy without relying on a Fourier analysis, we have obtained a new expression for the energy flux that has turned out to be associated with the streamfunction Eq. (16). Equation (16) may be rewritten into a dimensional form as

$$\begin{array}{@{}rcl@{}} &&\nabla^{*2} \varphi^{*} - (f^{*}/c^{*})^{2} \varphi^{*} - (3/{c^{*}}^{2}) \varphi^{*}_{t^{*} t^{*}} = q^{*}, \end{array} $$

(17a)

where $\phantom {\dot {i}\!}\nabla ^{*}\equiv \langle \!\langle \partial _{x^{*}}, \partial _{y^{*}}\rangle \!\rangle $ and $\phantom {\dot {i}\!}q^{*} = v^{*}_{x^{*}}-u^{*}_{y^{*}} - (f^{*}/c^{*2}) p^{*}$. The exact profile of the group velocity times wave energy may be reproduced by the right hand side of (15a) and is here rewritten into a vector and dimensional form as

$$\begin{array}{@{}rcl@{}} \overline{\mathbf{V}^{*} p^{*}} - \nabla^{*} \times [(\overline{p^{*} \varphi^{*}})/2+ (\overline{u^{*}_{t^{*}t^{*}}\varphi^{*}})/\beta^{*}]\mathbf{z}. \end{array} $$

(17b)

The additional rotational flux in (17b) corrects the profile of the energy flux, without affecting the divergence of the energy flux. The quantity φ ^∗ in (17b) is the solution of the accurate streamfunction Eq. (17a) associated with EPV in a dimensional form. We note in passing that for zonally propagating equatorial waves, as given by (11a)–(11c), $\overline {v^{*}p^{*}}$ vanishes owing to the phase relationship between v ^∗ and p ^∗ [see (11a) and (11c)] and the meridional component of the additional rotational flux, $-(\overline {p^{*} \varphi ^{*}}/2 + \overline {u^{*}_{t^{*}t^{*}}\varphi ^{*}}/\beta ^{*})_{x^{*}}$, also vanishes.

Equatorial KWs

So far, we have not investigated the energy flux of equatorial KWs. Since KWs are gravity waves, $\overline {\mathbf {V}^{*} p^{*}}$ becomes equal to the group velocity times wave energy. Namely, the additional rotational flux is absent. KWs are also characterized by q ^∗=0; hence, the EPV equation (17a) yields φ ^∗=0. The result is that, in the case of KWs, the expression for the energy flux, as given by (17b) reduces to $\overline {\mathbf {V}^{*} p^{*}}$, which is consistent with the nature of gravity waves.

Boundary conditions and the connection to mid-latitude regions

Consider a basin with closed zonal boundaries (i.e., the eastern and western coastlines of a basin of arbitrary shape). It is clear that the flux $\overline {\mathbf {V}^{*} p^{*}}$ in (17b) has no component normal to the zonal boundaries. Hence, the additional rotational flux in (17b) should also have no component crossing the closed boundaries. This requirement is satisfied in the present study by solving (17a) with a boundary condition of

$$\begin{array}{@{}rcl@{}} \varphi^{*} = 0. \end{array} $$

(17c)

In a general situation in the ocean, waves propagating eastward along the equatorial waveguide are partially redirected poleward as KWs along the eastern boundary where they can shed RWs that then propagate westward into the ocean interior (Cane and Moore 1981; Philander 1989; Chelton and Schlax 1996; Isachsen et al. 2007).

We now investigate whether or not the set of (17a) and (17b) is applicable to off-equatorial regions where small-amplitude perturbations are characterized by either mid-latitude RWs or IGWs. For perturbations associated with mid-latitude RWs, the solution φ ^∗ of (17a) corresponds to the geostrophic streamfunction for which φ ^∗≃p ^∗/f ^∗ is a reasonable approximation in an interior region (i.e., far from coastal boundaries), noting that ∇^∗ ² φ ^∗ corresponds to $v^{*}_{x^{*}}-u^{*}_{y^{*}}$. The result is that the energy flux in (17b) automatically reduces to the expression of OS93 for mid-latitude RWs⁵. On the other hand, if perturbations associated with mid-latitude IGWs are given, the inversion Eq. (17a) of EPV, which equals zero, yields, with φ ^∗=0 on the boundaries, φ ^∗=0 everywhere. Thus, the energy flux in (17b) automatically reduces to $\overline {\mathbf {V}^{*} p^{*}}$ which represents the group velocity of mid-latitude IGWs times wave energy. We conclude that the set of (17a) and (17b) can represent the exact profile of the group velocity times wave energy associated with both mid-latitude IGWs and RWs, which may be reconfirmed using almost the same procedure as in the “Identification of the additional rotational flux associated with equatorial waves” section. See Appendix 1 for details.

Methods/Experimental

The rest of this manuscript presents an example illustrating the diagnosis of the energy flux from a model. To be useful for our discussion, the exact universal expression for both equatorial and mid-latitude waves, as given by the set of (17a) and (17b), is hereafter referred to as the level-0 energy flux. In practice, the level-0 expression of the energy flux is not straightforward to compute from model output, since the second-order time derivative term in (17a) makes it difficult to solve for φ ^∗.

For the present study, we investigate the consequence of artificially removing the second-order time derivative term from (17a) to give

$$\begin{array}{*{20}l} \nabla^{*2} \varphi^{\mathrm{app*}} - (f^{*}/c^{*})^{2} \varphi^{\mathrm{app*}} = q^{*}, \end{array} $$

(18a)

which may be justified at least for low-frequency waves (e.g., both equatorial and mid-latitude RWs) based on scale analysis. The superscript of φ ^app∗ indicates that the solution of (18a) may be regarded as an approximation for the solution φ ^∗ of the accurate streamfunction Eq. (17a) associated with EPV. Then, we replace φ ^∗ in (17b) with φ ^app∗ to read

$$\begin{array}{@{}rcl@{}} \overline{\mathbf{V}^{*} p^{*}} - \nabla^{*} \times [(\overline{p^{*} \varphi^{\mathrm{app*}}})/2+ (\overline{u^{*}_{t^{*}t^{*}}\varphi^{\mathrm{app*}}})/\beta^{*}]\mathbf{z}, \\ \end{array} $$

(18b)

which is diagnosable⁶ from model output and is referred to as the level-1 expression of the energy flux in the present study. As shown by the dashed orange lines in Fig. 2, the level-1 expression provides a nice approximation for the group-velocity-based energy flux of low-frequency equatorial waves, but not for high-frequency equatorial waves in Fig. 3. Next, with the form of the additional rotational flux $-\nabla ^{*} \times [\overline {{p^{*}}^{2}}/(2f^{*})]\mathbf {z}$ in (6) in mind, we investigate the consequence of simplifying (18b) as

$$\begin{array}{@{}rcl@{}} \overline{\mathbf{V}^{*} p^{*}}-\nabla^{*} \times (\overline{p^{*} {\varphi^{\text{app}}}^{*}}/2)\mathbf{z}, \end{array} $$

(18c)

which we refer to as the level-2 expression for the energy flux. As shown by the solid blue lines in Figs. 2 and 3, the level-2 expression provides an approximation for the group-velocity-based energy flux of both low- and high-frequency equatorial waves, although there can be some error. Further discussion of the level-2 approximation is given in Appendices 2 and 3 where it is noted that the level-2 approximation is comparable in accuracy to the pseudomomentum (or wave-activity) flux used in previous studies (Randel and Williamson 1990; Brunet and Haynes 1996; Fukutomi and Yasunari 2002; Wakata and Kitaya 2002; Kawatani et al. 2010).

We now contrast both the level-1 and level-2 energy fluxes with the expressions in previous studies, given by (6) and (4), using a solution from a linear shallow water model. This illustrates the potential of the expression given by (18b) and (18c) for use as a model diagnostic (see Table 3). Suitable for this purpose is an equatorial basin mode solution since it is associated with both equatorial and coastal waveguides as well as the radiation of mid-latitude RWs into the basin interior. Furthermore, as noted in the Introduction section, the equatorial basin mode, first studied by Cane and Moore (1981), has recently attracted attention because of its importance in the dynamics of the equatorial Atlantic Ocean. Indeed, the annual cycle, the semi-annual cycle, and the interannual variability associated with the Atlantic equatorial deep jets (Brandt et al. 2011) all appear to be resonant excitations of equatorial basin modes [see Brandt et al. (2016) and Claus et al. (2016) for more details].

Table 3 List of energy flux vectors and EPV-based streamfunctions in dimensional form and their location in the text and figures

Full size table

Model set-up

To illustrate the importance of dissipation for explaining the observed cross-equatorial width of the equatorial deep jets, Greatbatch et al. (2012, hereafter G12) have simulated a forced/dissipative basin mode solution using a single-layer reduced-gravity linear model. The model is set up in spherical coordinates, with a rectangular domain in latitude/longitude space of roughly the same width as the Atlantic Ocean at the equator (that is 55° in longitude) and reaching to 10°N/S on either side of the equator⁷. All lateral boundaries are closed. In both G12 and Claus et al. (2014, hereafter C14), the model has been forced by an idealized oscillatory forcing with a period of 4.5 years in the zonal momentum equation to mimic the forcing of the jets, together with a lateral mixing of momentum that provides dissipation. [See Ascani et al. (2015) for a discussion on the forcing of the equatorial deep jets, the details of which are not important here]. It should be noted that 4.5 years is roughly the time taken for an equatorial KW and the reflected long gravest equatorial RW, to travel across the basin for the vertical mode that is closest to resonance. As noted in G12 and C14, the (westward) propagation speed of equatorial long RWs is three times less than the (eastward) propagation speed of equatorial KWs [see the dispersion relation (12)].

Our model has been set up as in G12 and C14. The gravity wave speed is set equal to c ^∗=0.17 m/s [see the upper panel in Fig. 4 of C14]. The equatorial deformation radius becomes $\sqrt {c^*/\beta ^*}=87~\text {km}$, with a consequence that disturbances further than a few degrees from the equator in our model experiment may be regarded as mid-latitude RWs, even though they are part of the equatorial basin mode resonance. As in G12, our model has been formulated in a spherical coordinate system with a grid spacing of 0.1 ° in both longitude and latitude. The coefficient⁸ of eddy viscosity has been set to 10 m ²/s. From an initial condition of no motion and no pressure anomaly, the model has been integrated for 20 cycles (i.e., 90 years) using the oscillatory forcing which is sufficient for a steady oscillatory state to be reached. Since the model code is fully non-linear, we have set the amplitude of the forcing to a small value, 1.0 × 10⁻¹⁰ m/s ² to ensure that linear dynamics prevails. Indeed, the magnitude of velocity associated with the gravest basin mode may be scaled as 10⁻¹⁰ m/s ² × 4.5 years/(2π)=0.0023 m/s, which results in a Froude number of (0.0023 m/s)/c ^∗=0.014 (nondimensional). These parameters are summarized in Table 4. Below, we show results from an experiment which corresponds to the “full” case in G12. In particular, the oscillatory zonal forcing is spatially uniform and acts over the whole model domain. All the model results shown below are averages over the last model cycle.

Table 4 Parameters in the model experiment of the present study

Full size table

Results and discussion

At each time step of the model output, we have calculated the EPV-based streamfunction φ ^app∗ (contours in the left panels of Fig. 4) by solving the spherical coordinate version of (18a) with the boundary condition of φ ^app∗=0. The color shading in Fig. 4 shows the snapshots of thickness anomaly (left panels) and the zonal component of velocity u ^∗ (right panels). The movie of the model experiment is found in Additional file 1. RWs are identified by the correlation (anticorrelation) between the EPV-based streamfunction and thickness anomaly in the northern (southern) hemisphere. This follows from the correspondence between the EPV-based streamfunction and the geostrophic streamfunction for the case of mid-latitude RWs, as noted earlier. As noted in G12 and C14, the (westward) propagation speed of equatorial long RWs is three times smaller than the (eastward) propagation speed of equatorial KWs [see the dispersion relation (12)]. It takes a three-quarter cycle (i.e., 3T ^∗/4) for equatorial long RWs to travel westward from the eastern boundary to the western boundary of the model domain (see red lines in Fig. 5 a). After reflection at the western boundary, it takes only a quarter of a cycle (i.e., T ^∗/4) for equatorial KWs to travel eastward to the eastern boundary of the model domain (see blue lines in Fig. 5 a), where some disturbances are deflected poleward along the eastern boundary to be the source of mid-latitude RWs which then propagate westward (Fig. 5 b).

In Fig. 6, the divergence of the horizontal energy flux, given by $\nabla ^* \cdot \overline {\mathbf {V}^* p^*}$, is shown for the whole model domain using color shading. Red indicates regions of a net energy input, and blue indicates regions of a net dissipation. It is clear that the main region of energy input is in the central part of the basin along the equator, where the strongest zonal velocities are found, and that the main regions of energy loss are associated with the RWs that radiate away from the eastern boundary. Arrows in Fig. 6 a show the energy flux used in the gravity-wave literature, $\overline {\mathbf {V}^*p^*}$, which is mostly westward along the equator and eastward in the immediate off-equatorial region. This can be clearly seen in Fig. 7 a which shows a blow-up of the eastern equatorial region. Figures 6 b and 7 b show the energy flux given by (6), which has been adapted from OS93, where only regions more than 1° latitude away from the equator are plotted to avoid the singularity in the Coriolis parameter f ^∗ at the equator. From these figures (especially the blow-up of the eastern equatorial region in Fig. 7 a, b), it is clear that the energy flux is strongly reversed when compared to $\overline {\mathbf {V}^* p^*}$ in the immediate off-equatorial region and is now strongly eastward in association with RWs that are radiated from the eastern boundary.

From Figs. 6 c and 7 c, it is clear that when the set of Eqs. (18a), (18c) and (17c) is used to estimate the energy flux, the westward flux associated with the off-equatorial RWs is part of a recirculation of energy in the eastern part of the basin (Fig. 7 c) with eastward energy flux along the equator and westward energy flux off the equator. The eastward flux along the equator in Figs. 6 c and 7 c is in the opposite direction to the westward $\overline {\mathbf {V}^* p^*}$ flux in Figs. 6 a and 7 a along the equator in the same region. This indicates the role of the rotational flux contribution in (18c) which counters the westward $\overline {\mathbf {V}^*p^*}$ flux along the equator. This westward flux is associated with the equatorial RWs but represents an overestimation of the energy flux associated with these waves (see Fig. 2). When the rotational flux is added, what emerges is the eastward flux associated with the KW which, in turn, leads to a poleward flux arising from KWs propagating along the eastern boundary and, in turn, leads to the westward flux associated with the off-equatorial RWs that are excited at the eastern boundary. Here, in terms of the transfer of wave energy, the equatorial waveguide has been connected to the eastern coastal waveguide and, in turn, to the basin interior at off-equatorial latitudes, which is at the heart of the present study.

Finally, we note that the forcing period of T ^∗=4.5 years is much longer than the equatorial inertial period of $2\pi /\sqrt {c^*\beta ^*}=37$ days. It can be said that the simulated equatorial basin mode consists of low-frequency equatorial waves, as in Fig. 2, and mid-latitude RWs. We recall the small difference between the solid blue and dashed orange lines in Fig. 2, the former and the latter of which may be written as $\overline {u^* p^*} + (\overline {p^* \varphi ^{\mathrm {app*}}}/2)_{y^*}$ and $\overline {u^* p^*} + (\overline {p^* \varphi ^{\mathrm {app*}}}/2+\overline {u^*_{t^* t^*}\varphi ^{\mathrm {app*}}}/\beta ^*)_{y^*}$, respectively, in a dimensional form (see level-2 and level-1, respectively, in Table 3). Since arrows in Figs. 6 c and 7 c have been plotted using the expression which corresponds to the solid blue line in Fig. 2, we have checked for any improvement by using the expression which corresponds to the dashed orange lines in the same figure. The checking has been done by comparing the distribution of $\overline {p^* \varphi ^{\mathrm {app*}}}/2$ and $\overline {u^*_{t^*t^*} \varphi ^{\mathrm {app*}}}/\beta ^*$, from which we have learned that the latter quantity (not shown) is three orders of magnitude smaller than the former. Thus, we conclude that, in the diagnosis of the simulated basin mode, the expression of the energy flux, as given by (18c), has provided a nice approximation for the group velocity times wave energy.

Conclusions

In previous studies of the ocean, the energy flux of waves in model output has been diagnosed using $\overline {\mathbf {V}^* p^*}$, where V ^∗ is the horizontal component of velocity perturbation and p ^∗ corresponds to the pressure perturbation. This is appropriate for understanding the energy flux associated with mid-latitude inertia-gravity waves (IGWs). For mid-latitude Rossby waves (RWs), however, the direction of $\overline {\mathbf {V}^* p^*}$ differs from the group velocity and hence the energy flux, by a rotational vector flux with zero divergence. The rotational flux to be added to $\overline {\mathbf {V}^* p^*}$ for estimating the group velocity of mid-latitude RWs has previously been derived using quasi-geostrophic equations and is singular at the equator.

By investigating the analytical solution of both equatorial waves (“Analytical investigation” section) and mid-latitude waves (Appendix 1), we have derived an exact universal⁹ expression for the rotational flux which, after being added to $\overline {\mathbf {V^*}p^*}$, is able to indicate the profile of the group velocity times wave energy for linear waves at all latitudes. This is what we call the level-0 expression of the energy flux. The level-0 energy flux is written using the solution φ ^∗ of (17a), previously unmentioned in the literature, which we refer to as the accurate streamfunction associated with Ertel’s potential vorticity (EPV). Equation (17a) is the cornerstone of the present study, because it suggests a possibility for the energy flux to be estimated (i) without using a Fourier analysis nor ray theory and (ii) in the presence of coastal boundaries, which will allow for tropical-extratropical interactions in model output to be diagnosed in terms of an energy cycle in a future study. Presently, the level-0 energy flux is not practical for use as a model diagnostic, since the second-order time derivative term in (17a) makes it difficult to solve for φ ^∗. Thus, we hope that a future study is able to develop a numerical algorithm to solve (17a) for φ ^∗. We also note the need to extend the theory to a continuously stratified ocean and also to test out the theory in the presence of a sheared mean flow, both of which topics await a future study. This is a new step from the recent understanding of energetics in the atmosphere and ocean that had been focused on, for example, the global mapping of energy conversion rates associated with various physical processes (e.g., baroclinic and barotropic instabilities) and external forcing (Iwasaki 2001; Aiki and Richards 2008; Zhai et al. 2012).

The potential of our analysis as a model diagnostic is illustrated in the present study for a forced/dissipative equatorial basin mode simulated by a single-layer model. The model result includes both mid-latitude RWs (maintained by coastal KWs propagating poleward along the eastern boundary) and equatorial RWs (maintained by the reflection of equatorial KWs at the eastern boundary). We have used approximate expressions for the energy flux (what we call the level-1 and level-2 energy fluxes) that is based on the inversion equation (18a) of EPV and which is shown to be good approximations to the level-0 expression in the case of the model run being considered. Since (18a) is seamlessly solvable at all latitudes with φ ^app∗=0 at coastlines, the source of the westward energy flux of mid-latitude RWs in the model output has been successfully illustrated in the present study. To our knowledge, this is the first attempt to diagnose the energy cycle of a tropical-extratropical interaction associated with the connection of the equatorial and coastal waveguides.

Endnotes

¹ While the energy flux of waves at all latitudes is considered in the present study, the pseudomomentum (or wave-activity) flux of waves at all latitudes is considered in Aiki et al. (2015, hereafter ATG15). Both the formulations of the present study and ATG15 may be reproduced even if a spherical coordinate system is used. The use of a Cartesian horizontal coordinate system in both the present study and ATG15 is for the purpose of simplicity, which will allow for the results of the two studies to be linked in a future study. A related discussion appears in Appendix 3.

² What we call pressure, energy, and momentum in the present study are actually dynamic pressure, energy density, and momentum density, respectively, following ATG15.

³ d H ⁽ⁿ⁾/d y=2n H ⁽ⁿ⁻¹⁾, H ⁽ⁿ⁺¹⁾=2y H ⁽ⁿ⁾−2n H ⁽ⁿ⁻¹⁾, H ⁽⁰⁾=1, H ⁽¹⁾=2y, H ⁽²⁾=4y ²−2, H ⁽³⁾=8y ³−12y, H ⁽⁴⁾=16y ⁴−48y ²+12.

⁴ The factor ∂ ω/∂ k to calculate the energy flux is added in (14e).

⁵ The second term in the square brackets of (17b) vanishes as $\overline {u^{*}_{t^{*}t^{*}}\varphi ^{*}}\simeq \overline {(-p^{*}_{y^{*} t^{*} t^{*}}/f^{*})(p^{*}/f^{*})}=0$ where the phase relationship of plane waves is understood.

⁶ We use the term “diagnosable” to indicate that the quantity is readily estimated from quantities in model output without relying on a Fourier analysis.

⁷ In a related paper, Claus et al. (2014) also used this solution to investigate the influence of the barotropic mean flow on the Atlantic equatorial deep jets. The Atlantic equatorial deep jets are resonant with the gravest basin mode for a high-order baroclinic mode (typically the 15th vertical normal mode) and consist of vertically stacked zonal jets that oscillate at a given depth with a period of around 4.5 years.

⁸ This is lower than the value recommended by G12 for capturing the observed width of the deep jets but is chosen here since it is not so large as to prevent focusing of RWs on the equator. In the inviscid solution of Cane and Moore (1981), there is a singularity on the equator at the center of the basin due to RW focusing as described by Schopf et al. (1981).

⁹ In the present manuscript, we have used the term “exact” to refer to the level-0 expression, in contrast to approximate expressions (i.e., level-1 and -2). Likewise, we have used the term “universal” to indicate the ability to handle all wave types in Table 2, for which the group velocity has been well formulated in the literature/textbook.

¹⁰Although it is not in the list of wave types in Table 2, IGWs on a mid-latitude β-plane may be characterized as α≪1,δ ²≤1,γ ²<1 where α≪1 corresponds to (19b). Thus, the net content in the square brackets on the last line of (24c) becomes O(1). Given α in front of $c^* \overline {v^* v^*}$ on the last line of (24c), we may justify (23d) for IGWs on a mid-latitude β-plane. It can be said that the right hand side of (24c) becomes significantly nonzero when the assumption of plane waves in the meridional direction becomes inconsistent (Anderson and Gill 1979).

¹¹While the pseudomomentum flux itself $(\overline {E^*-v^*v^*})$ is diagnosable from model output, the pseudomomentum-flux-based expression of the energy flux $(\overline {E^*-v^*v^*})\omega ^*/k^*$ is not easily diagnosable from model output because of multiplication by the phase speed (see Appendix 3 for details).

Appendix 1

Is the streamfunction Eq. (17a) associated with EPV applicable to mid-latitude waves?

Manipulation of the shallow water equation system (1a)–(1c) yields a characteristic equation associated with the meridional component of velocity to read

$$\begin{array}{*{20}l} v^{*}_{t^{*} t^{*} t^{*}} - c^{*2}\left(v^{*}_{x^{*} x^{*}}+v^{*}_{y^{*} y^{*}}\right)_{t^{*}} + f^{*2} v^{*}_{t^{*}} - \beta^{*} c^{*2} v^{*}_{x^{*}}=0, \end{array} $$

(19a)

which is applicable to both mid-latitude and equatorial regions. In what follows, we consider plane waves on either an f-plane or a mid-latitude β-plane (i.e., $f^* = f^{*}_{0}+ \beta ^* y^*$ and |f0∗|≫|β ^∗ y ^∗|) and thus assume

$$\begin{array}{*{20}l} f^{*2} \simeq f^{*2}_{0}. \end{array} $$

(19b)

Then, (19a) may be simplified as

$$\begin{array}{*{20}l} & v^{*}_{t^{*} t^{*} t^{*}} - c^{*2}\left(v^{*}_{x^{*} x^{*}}+v^{*}_{y^{*} y^{*}}\right)_{t^{*}} + f^{*2}_{0} v^{*}_{t^{*}} - \beta^{*} c^{*2} v^{*}_{x^{*}}=0. \end{array} $$

(19c)

The Coriolis parameter f0∗ in (19c) is constant that allows us to assume a horizontally monochromatic wave in a complex form

(20a)

where i is the unit imaginary number, is wave amplitude, and θ=k ^∗ x ^∗+l ^∗ y ^∗−ω ^∗ t ^∗ is wave phase (k ^∗ and l ^∗ are the zonal and meridional components of a wavenumber vector, respectively, and ω ^∗ is wave phase). For simplicity, all , k ^∗, l ^∗, and ω ^∗ are assumed to be constant. Substitution of (20a) to both (1a) and (1c) yields a solution for u ^∗ and p ^∗ to read

$$\begin{array}{*{20}l} &u^{*} = \left(f^{*} \omega^{*} v^{*}_{\theta} + c^{*2} k^{*} l^{*} v^{*}\right)/\left(\omega^{*2} - c^{*2} k^{*2}\right), \end{array} $$

(20b)

$$\begin{array}{*{20}l} &p^{*} = \left(f^{*} k^{*} v^{*}_{\theta} + \omega^{*} l^{*} v^{*}\right)c^{*2}/\left(\omega^{*2} - c^{*2} k^{*2}\right), \end{array} $$

(20c)

where f ^∗=f0∗+β ^∗ y ^∗. On the other hand, substitution of (20a) to (19c) yields

$$\begin{array}{@{}rcl@{}} \omega^{*3} -c^{*2} \left(k^{*2} + l^{*2}\right) \omega^{*} - f^{*2}_{0} \omega^{*} - \beta^{*} c^{*2} k^{*} = 0, \end{array} $$

(21)

which is a universal expression for the dispersion relation of the various types of waves in mid-latitude regions. For example, substitution of β ^∗=0 to (21) yields a classical dispersion relation for mid-latitude IGWs (i.e., waves on an f-plane), and substitution of ω ^∗2≪c ^∗2 k ^∗2 to (21) yields a classical dispersion relation for mid-latitude RWs.

An expression for the zonal component of group velocity may be derived using (21) to read

$$\begin{array}{*{20}l} \frac{\partial \omega^{*}}{\partial k^{*}} &= \frac{2 c^{*2} k^{*} \omega^{*} + \beta^{*} c^{*2}}{3\omega^{*2} - c^{*2} \left(k^{*2} + l^{*2}\right) - f^{*2}_{0}} \\ &= \frac{2 c^{*2} \omega^{*2} k^{*} + \beta^{*} c^{*2} \omega^{*}}{2 \omega^{*3} + \beta^{*} c^{*2} k^{*}}. \end{array} $$

(22a)

We now identify the content of $(\overline {A^*B^*})_{y^*}$ in the following equation:

$$\begin{array}{*{20}l} \overline{u^{*} p^{*}} + \left(\overline{A^{*} B^{*}}\right)_{y^{*}} = \frac{\partial \omega^{*}}{\partial k^{*}}\frac{1}{2}\left(\overline{u^{*2}+v^{*2}+\frac{p^{*2}}{c^{*2}}}\right), \end{array} $$

(22b)

where each of A ^∗ and B ^∗ are quantities associated with the set of u ^∗, v ^∗, p ^∗, c ^∗, and f ^∗. A first step for investigating (22b) is to decompose $\overline {u^* p^*}$ into two parts: one that is associated with the numerator of (22a) and one that is written as the meridional derivative of a scalar quantity, as follows:

$$\begin{array}{*{20}l} &\overline{u^{*} p^{*}} = \frac{\overline{v^{*} v^{*}} \left(f^{*2} + c^{*2} l^{*2}\right) c^{*2} \omega^{*} k^{*}}{\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}\\ &\simeq \frac{\overline{v^{*} v^{*}} \left(\omega^{*3} -c^{*2} k^{*2} \omega^{*} -\beta^{*} c^{*2} k^{*}\right) c^{*2} k^{*}}{\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}} \\ & = \frac{\overline{v^{*} v^{*}}c^{*2} \omega^{*} k^{*}}{\left(\omega^{*2} -c^{*2} k^{*2}\right)}- \frac{\left(f^{*}\overline{v^{*} v^{*}}\right)_{y^{*}} c^{*4} k^{*2}}{\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}} \\ & = \frac{\overline{v^{*} v^{*}} \left(2c^{*2} \omega^{*} k^{*}+ \beta^{*} c^{*2}\right)}{2\left(\omega^{*2} -c^{*2} k^{*2}\right)} \\ &\quad\ -\frac{\left(f^{*} \overline{v^{*} v^{*}}\right)_{y^{*}} c^{*2} \left(\omega^{*2} + c^{*2} k^{*2}\right)}{2\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}, \end{array} $$

(22c)

where the first equality has been derived using both (20b)–(20c) and the set of $\overline {v^* v^*}=\overline {v^*_{\theta } v^*_{\theta }}$ and $\overline {v^*_{\theta } v^*}=0$ and the approximate equality in the middle has been derived using both the dispersion relation (21) and (19b). Then, we decompose the wave energy in (22b) into two parts, one that is associated with the denominator of (22a) and one that is written as the meridional derivative of a scalar quantity. We then have

$$ {\begin{aligned} & \frac{1}{2}\left(\overline{u^{*2} + v^{*2} + \frac{p^{*2}}{c^{*2}}}\right)\\ & =\frac{\overline{v^{*} v^{*}} \left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}{2\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}\\ &\quad+\frac{\overline{v^{*} v^{*}} \left(\omega^{*2} f^{*2} + c^{*4} k^{*2} l^{*2} + k^{*2} f^{*2} c^{*2} + \omega^{*2} l^{*2} c^{*2}\right)}{2\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}\\ & =\frac{\overline{v^{*} v^{*}} \left[\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2} + \left(f^{*2} + c^{*2} l^{*2}\right) \left(\omega^{*2} + c^{*2} k^{*2}\right)\right]}{2\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}\\ & \simeq\frac{\overline{v^{*} v^{*}}\left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}{2 \left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}\\ &\quad+\frac{\overline{v^{*} v^{*}}\left(\omega^{*2} - c^{*2} k^{*2} - \beta^{*} c^{*2} k^{*}/\omega^{*}\right) \left(\omega^{*2} + c^{*2} k^{*2}\right)}{2 \left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}\\ & =\frac{\overline{v^{*} v^{*}} \omega^{*2}}{\left(\omega^{*2} - c^{*2} k^{*2}\right)} - \frac{\left(f^{*} \overline{v^{*} v^{*}}\right)_{y^{*}} c^{*2} k^{*} \left(\omega^{*2} + c^{*2} k^{*2}\right)}{2\omega^{*} \left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}}\\ & =\frac{\overline{v^{*} v^{*}} \left(2\omega^{*3}+\beta^{*} c^{*2} k^{*}\right)}{2\omega^{*} \left(\omega^{*2} - c^{*2} k^{*2}\right)} -\frac{\left(f^{*} \overline{v^{*} v^{*}}\right)_{y^{*}} 2 c^{*2} \omega^{*} k^{*}}{2 \left(\omega^{*2} - c^{*2} k^{*2}\right)^{2}},\\ \end{aligned}} $$

(22d)

where the first equality has been derived using both (20b)–(20c) and the set of $\overline {v^* v^*}=\overline {v^*_{\theta } v^*_{\theta }}$ and $\overline {v^*_{\theta } v^*}=0$ and the approximated equality in the middle has been derived using both the dispersion relation (21) and (19b). The set of (22c) and (22d) allows us to identify the content of $(\overline {A^* B^*})_{y^*}$ in (22b) to read

$$ { \begin{aligned} & \frac{\partial \omega^{*}}{\partial k^{*}}\frac{1}{2}\left(\overline{u^{*2}+v^{*2}+\frac{p^{*2}}{c^{*2}}}\right) - \overline{u^{*} p^{*}}\\ & \simeq \frac{-(f^{*}\overline{v^{*} v^{*}})_{y^{*}} c^{*2}}{2 (\omega^{*2} - c^{*2} k^{*2})^{2}}\Big\{\frac{(2c^{*2}\omega^{*2} k^{*}+\beta^{*} c^{*2} \omega) 2 k^{*} \omega^{*}}{2\omega^{*3} + \beta^{*} c^{*2} k^{*}}\\ &\quad-(\omega^{*2} + c^{*2} k^{*2}) \Big\}\\ & =\frac{-(f^{*} \overline{v^{*} v^{*}})_{y^{*}} c^{*2}} {2(\omega^{*2} - c^{*2}k^{*2})^{2}}\Big\{\frac{(4c^{*2}\omega^{*3} k^{*2}+ 2\beta^{*} c^{*2} \omega^{*2}k^{*})}{2\omega^{*3} +\beta^{*} c^{*2} k^{*}}\\ &\quad+\frac{(-2 \omega^{*5} - 2 c^{*2} \omega^{*3} k^{*2} - \beta^{*} c^{*2} \omega^{*2} k^{*} - \beta^{*} c^{*4} k^{*3})}{ 2\omega^{*3} +\beta^{*} c^{*2} k^{*}}\Big\}\\ & =\frac{-(f^{*} \overline{v^{*}_{\theta} v^{*}_{\theta}})_{y^{*}} c^{*2} (\beta^{*} c^{*2} k-2\omega^{*3})}{2 (\omega^{*2} - c^{*2} k^{*2})(2\omega^{*3} + \beta^{*} c^{*2} k^{*})}\\ & =\frac{-(f^{*} \overline{v^{*}_{\theta} v^{*}_{\theta}})_{y^{*}} c^{*2} [1-2\omega^{*3}/(\beta^{*} c^{*2} k^{*})]}{2 (\omega^{*2} - c^{*2} k^{*2})[2\omega^{*3}/(\beta^{*} c^{*2} k^{*}) + 1]}\\ & = \frac{-[\overline{(f^{*} k^{*} v^{*}_{\theta} + \omega^{*} l^{*} v^{*})c^{*2} v^{*}_{\theta}}]_{y^{*}}} {2 k^{*} (\omega^{*2} - c^{*2} k^{*2})[2 \omega^{*3}/(\beta^{*} c^{*2} k^{*})+1]}\\ &\quad+\frac{[\overline{(f^{*} \omega^{*} v^{*}_{\theta} + c^{*2} k^{*} l^{*} v^{*}) v^{*}_{\theta}}]_{y^{*}}2\omega^{*2}/\beta^{*}}{2 k^{*} (\omega^{*2} - c^{*2} k^{*2})[2 \omega^{*3}/(\beta^{*} c^{*2} k^{*})+1]}\\ & = \frac{-(\overline{p^{*} v^{*}_{\theta}})_{y^{*}} - (\overline{2 u^{*}_{t^{*} t^{*}} v^{*}_{\theta}})_{y^{*}}/\beta^{*}}{2k^{*} [1+2 \omega^{*3}/(\beta^{*} c^{*2} k^{*})]}, \end{aligned}} $$

(22e)

where the last equality has been derived using (20a)–(20c). Equation (22e) may be rewritten as

$$\begin{array}{*{20}l} \overline{u^{*} p^{*}} + &(\overline{p^{*} \varphi^{*}}/2 + \overline{u^{*}_{t^{*} t^{*}} \varphi^{*}}/\beta^{*})_{y^{*}} \\ &= \frac{\partial \omega^{*}}{\partial k^{*}}\frac{1}{2}\left(\overline{u^{*2} + v^{*2} + \frac{p^{*2}}{c^{*2}}}\right), \end{array} $$

(23a)

where

$$\begin{array}{*{20}l} &\varphi^{*} \equiv \frac{- v^{*}_{\theta}}{k^{*} + 2 \omega^{*3}/(\beta^{*} c^{*2})}, \end{array} $$

(23b)

has been introduced. The definition of φ ^∗, as given by (23b), is based on a Fourier expansion and may be rewritten into an expression which contains none of θ, k ^∗, l ^∗, and ω ^∗ to read

$$\begin{array}{*{20}l} \nabla^{*2} \varphi^{*} - (f^{*}_{0}/c^{*})^{2} \varphi^{*} - (3/c^{*2}) \varphi^{*}_{t^{*} t^{*}} &= - \beta^{*} v^{*}_{\theta}/\omega^{*} \\ &= q^{*}, \end{array} $$

(23c)

where the first equality has been derived using (20b) and the second equality has been derived using (2) [i.e., q t ^∗∗=−ω ^∗ q θ∗=−β ^∗ v ^∗ and thus −ω ^∗ q θ θ∗=ω ^∗ q ^∗=−β ^∗ v θ∗]. As far as we know, the set of (23a) and (23c) has not been mentioned in previous studies for mid-latitude waves and has turned out to be almost the same as the set of (17b) and (17a) that has been derived for equatorial waves.

We now consider the meridional flux of wave energy. We would like to show that

$$\begin{array}{*{20}l} \overline{v^{*} p^{*}} - &\underbrace{(\overline{p^{*} \varphi^{*}}/2 + \overline{u^{*}_{t^{*} t^{*}} \varphi^{*}}/\beta^{*})_{x^{*}}}_{0} \\ &= \frac{\partial \omega^{*}}{\partial l^{*}} \frac{1}{2} \left(\overline{u^{*2}+v^{*2}+\frac{p^{*2}}{c^{*2}}}\right). \end{array} $$

(23d)

It turns out that the second term on the left hand side, associated with the additional rotational flux, vanishes when evaluated using the analytical solution of waves [i.e., $(\overline {v^* v^*})_{x^*} = k\overline {v^*_{\theta } v^*}=0$], which is as in Longuet-Higgins (1964). This is attributed to the assumption of all , k ^∗, l ^∗, and ω ^∗ being constant in particular in the zonal direction. An expression for the meridional component of group velocity may be derived from (21) to read

$$\begin{array}{*{20}l} \frac{\partial \omega^{*}}{\partial l^{*}} &= \frac{2 c^{*2} l^{*} \omega^{*}}{3\omega^{*2} - c^{*2} (k^{*2} + l^{*2}) - f^{*2}_{0}} \\ &= \frac{2 c^{*2} l^{*} \omega^{*2}}{2 \omega^{*3}+ \beta^{*} c^{*2} k^{*}}. \end{array} $$

(24a)

Then, we calculate the left hand side of (23d) using (20a)–(20b) as

$$\begin{array}{*{20}l} \overline{v^{*} p^{*}} = \frac{\overline{v^{*} v^{*}} c^{*2} \omega^{*}l^{*}}{\omega^{*2} -c^{*2} k^{*2}}, \end{array} $$

(24b)

where $\overline {v^*_{\theta } v^*}=0$ has been used. We now calculate the difference of the meridional component of the group velocity times wave energy and $\overline {v^* p^*}$ using the set of (22d), (24a), and (24b) to yield

$$\begin{array}{*{20}l} &\frac{\partial \omega^{*}}{\partial l^{*}} \frac{1}{2} \left(\overline{u^{*2}+v^{*2}+\frac{p^{*2}}{c^{*2}}}\right) - \overline{v^{*} p^{*}} \\ &= - \frac{2 c^{*4} k^{*} l^{*} \omega^{*3}}{(2\omega^{*3} + \beta^{*} c^{*2} k^{*})}\frac{(f \overline{v^{*} v^{*}})_{y^{*}}}{(\omega^{*2}-c^{*2}k^{*2})^{2}} \\ &= - \frac{2 \beta^{*} c^{*3} k^{*} l^{*}}{(2 + \beta^{*} c^{*2} k^{*}/\omega^{*3})}\frac{c^{*} \overline{v^{*} v^{*}}}{\omega^{*4}(1-c^{*2}k^{*2}/\omega^{*2})^{2} } \\ &= - \left[\frac{2 \delta^{2}}{(2 + \alpha \delta^{2} \gamma)(1-\gamma^{2})^{2}}\frac{c^{*2} k^{*} l^{*}}{\omega^{*2}}\right] \alpha c^{*} \overline{v^{*}v^{*}}, \end{array} $$

(24c)

where the last line has been written using the set of nondimensional parameters. These are defined as

$$\begin{array}{*{20}l} \alpha \equiv \beta^{*} c^{*}/f^{*2}_{0},\;\;\delta \equiv f^{*}_{0} / \omega^{*},\;\; \gamma \equiv c^{*}k^{*}/\omega^{*}. \end{array} $$

(24d)

It can be said that the last line of (24c) represents the contribution of higher order terms in an asymptotic expansion based on α, δ, and γ. This contribution should not be confused with the universal expression of the additional rotational flux which has already been clarified at (23a) and (23d). It should be also noted that the net content within the square brackets on the last line of (24c) is nondimensional, for which we shall make scale analysis in the next paragraph.

The quantity $\alpha c^* \overline {v^* v^*}$ on the last line of (24c) may be interpreted as a reference for the magnitude of the energy flux of mid-latitude RWs. Mid-latitude RWs may be characterized as

$$\begin{array}{*{20}l} |\alpha \gamma| = \frac{\beta^{*} c^{*2}/f_{0}^{*2}}{|\omega^{*}/k^{*}|}\geq 1,\;\; \delta^{2} \gg 1,\;\;\gamma^{2} \gg 1. \end{array} $$

(25a)

Thus, the net content within the square brackets on the last line of (24c) approximates to zero, which justifies (23d) for mid-latitude RWs. On the other hand, for mid-latitude IGWs, $c^* \overline {v^* v^*}$ on the last line of (24c) represents a reference for the magnitude of the energy flux. IGWs on an f-plane may be characterized as

$$\begin{array}{*{20}l} \alpha = 0,\;\; \delta^{2} \leq 1,\;\; \gamma^{2} < 1. \end{array} $$

(25b)

Thus, the last line of (24c) vanishes, which justifies (23d) for IGWs on an f-plane¹⁰.

To summarize, the streamfunction Eq. (17a) associated with EPV and the universal expression of the additional rotational flux in (17b) applies to both mid-latitude and equatorial waves, in particular for wave types considered in the present study, as listed in Table 2.

Appendix 2

Approximate expressions for the energy flux

The exact profile of the group velocity times wave energy is given by the set of (15a) and (16), which is what we call the level-0 energy flux. Owing to the last term on the left hand side of (16) that contains the second-order partial differentiation with respect to time, the procedure of inverting EPV, without using a Fourier analysis, is still complicated.

Hence, we investigate the consequence of artificially removing the second-order time derivative term from (16) as

$$\begin{array}{@{}rcl@{}} \nabla^{2} \varphi^{\text{app}} -y^{2} \varphi^{\text{app}} &=& q, \end{array} $$

(26a)

where the superscript of φ ^app indicates that the solution of (26a) may be regarded as an approximation for the solution φ of the accurate streamfunction Eq. (16) associated with EPV. We have calculated the meridional profiles of

$$\begin{array}{@{}rcl@{}} \overline{up}+(\overline{p\varphi^{\text{app}}}/2+\overline{u_{tt} \varphi^{\text{app}}})_{y}, \end{array} $$

(26b)

as shown by the dashed orange lines in Fig. 2 for low-frequency equatorial waves (e.g., equatorial RWs) and in Fig. 3 for high-frequency equatorial waves (e.g., equatorial IGWs). Since this is an analytical investigation, we have used φ ^app=−v _θ/(k−ω ³) which has been derived from the EPV inversion Eq. (26a) with the use of the characteristic Eq. (10). All panels in Fig. 2 show a nice agreement between the dashed orange line given by (26b) and the solid black line, $(\partial \omega /\partial k)(\overline {u^2+v^2+p^{2}})$. By contrast, all panels in Fig. 3 show a finite disagreement between the dashed orange line given by (26b), $\overline {up}+(\overline {p\varphi ^{\text {app}}}/2+\overline {u_{tt} \varphi ^{\text {app}}})_{y}$, and the solid-black line, $(\partial \omega /\partial k)(\overline {u^2+v^2+p^{2}})$.

It would be nice if there is a unified approximation for the energy flux that is able to represent the profile of the group velocity times the energy of both low- and high-frequency equatorial waves. We have found that this requirement is roughly satisfied if (26b) is simplified as

$$\begin{array}{@{}rcl@{}} \overline{up}+ (\overline{p\varphi^{\text{app}}}/2)_{y}, \end{array} $$

(26c)

where φ ^app=−v _θ/(k−ω ³) is the solution of (26a). The profile of (26c) is shown by the solid blue lines in Figs. 2 and 3 for low- and high-frequency equatorial waves, respectively. This expression provides what we think is a potentially useful approximation for the group velocity times wave energy (the solid black lines) for all types of equatorial waves, as we show in the “Methods/Experimental” section.

In the present study, (26b) and its vector and dimensional form (18b) are referred to as the level-1 energy flux. Likewise, (26c) and its vector and dimensional form (18c) are referred to as the level-2 energy flux.

Why do we appreciate the level-2 energy flux regardless of the error? An expression for pseudomomentum (or wave-activity) flux has long been used for the model diagnosis of the direction of the group velocity of waves in the atmosphere (and also the ocean), including in low-latitude regions (Ripa 1982; Hoskins et al. 1983; Plumb 1986; Haynes 1988; Randel and Williamson 1990; Brunet and Haynes 1996; Fukutomi and Yasunari 2002; Wakata and Kitaya 2002; Kawatani et al. 2010). Using the analytical solution of equatorial waves, we have calculated the profile of the traditional pseudomomentum flux¹¹ times the phase velocity of waves (see Appendix 3), as shown by the purple dots in Figs. 2 and 3. Interestingly, for low-frequency waves, the profile of the pseudomomentum-flux-based expression (the purple dots) is almost the same as that of the level-2 energy flux (the blue solid line). On the other hand, for high-frequency waves, the profile of the pseudomomentum-flux-based expression (the purple dots) is similar to that of the level-1 energy flux (the orange dashed line) and quite different from the exact, level-0 energy flux to which the level-2 energy flux is a better approximation. Thus, the level-2 energy flux is, in general, an improvement on the traditional model diagnosis of group velocity based on the pseudomomentum flux.

Concerning extension to mid-latitude waves, both the level-1 and level-2 energy fluxes satisfy all conditions noted in the last paragraph of the “Boundary conditions and the connection to mid-latitude regions” section. Note that the inversion Eq. (18a) of EPV is seamlessly solvable at all latitudes with the boundary condition of φ ^app∗=0. To summarize, the set of (18a) and (18c) [together with the boundary condition (17c)]—what we call the level-2 expression—originates from a trade-off between mathematical exactness and practical accessibility. The mathematical exactness for retrieving the group velocity of equatorial waves times wave energy has been achieved by the set of (17a) and (17b)—what we call the level-0 expression. However, its accessibility is harmed by the second-order time derivative term in the streamfunction equation (16) associated with EPV. On the other hand, concerning the practical accessibility, the set of (18a) and (18c)—the level-2 expression—has the advantages that (i) it is seamlessly solvable at all latitudes and (ii) it provides a unified expression for all types of waves with which to estimate the direction of the group velocity. We have noted, for equatorial waves, that the profile of the level-2 energy flux is somewhat better than that of the traditional pseudomomentum flux. It should be also noted that the energy flux given by (18c) satisfies the boundary condition of no flux through coastlines [using (17c)], an issue not considered in previous studies for the pseudomomentum flux. With these requirements in mind, we hope that future studies can lead to either an improved approximation or a numerical algorithm for the level-0 energy flux.

Appendix 3

Similarity between the level-2 energy flux of this study and the pseudomomentum flux in previous studies

Ripa (1982) has derived a conservation equation for pseudomomentum (or wave activity) associated with ageostrophic waves. His equation may be reproduced using (1a)–(1c) as

$$\begin{array}{*{20}l} &\frac{\partial}{\partial t^{*}} \underbrace{\left(\frac{p^{*} u^{*}}{{c^{*}}^{2}}-\frac{{q^{*}}^{2}}{2\beta^{*}} \right)}_{\sf IB\ pseudomomentum} + \nabla^{*}\cdot \underbrace{ \langle\!\langle E^{*}-v^{*}v^{*},\ v^{*} u^{*} \rangle\!\rangle }_{\sf IB\ flux} = 0, \end{array} $$

(27a)

$$\begin{array}{*{20}l} & E^{*} \equiv \frac{1}{2}\left({u^{*}}^{2}+{v^{*}}^{2}+\frac{{p^{*}}^{2}}{{c^{*}}^{2}}\right), \end{array} $$

(27b)

where the prognostic quantity may be referred to as the impulse-bolus (IB) pseudomomentum (Aiki et al. 2015, hereafter ATG15) and E ^∗ is the wave energy. Note that the IB pseudomomentum given here is the shallow water version of that given by Eq. (27a) in ATG15. It has been known that the expression of the flux in (27a) can indicate the direction of the group velocity of different types of waves, in particular, mid-latitude RWs and IGWs (Hoskins et al. 1983; Plumb 1986; Haynes 1988). Another nice feature of the IB pseudomomentum Eq. (27a) is that it does not contain a singularity at the equator. In order to investigate the origin of these features, ATG15 have shown in their Eq. (18a) an identity between the IB pseudomomentum and the classical energy-based (CE) pseudomomentum to read (again, written here for the shallow water equations)

$$\begin{array}{*{20}l} \underbrace{\frac{E^{*}}{(\omega^{*}/k^{*})}}_{\sf CE\ pseudomomentum} =&\underbrace{\frac{p^{*} u^{*}}{{c^{*}}^{2}}-\frac{{q^{*}}^{2}}{2\beta^{*}}}_{\sf IB\ pseudomomentum} \\ &- \frac{\partial}{\partial y^{*}} \left(\frac{u^{*} q^{*}}{2 \beta^{*}}\right) + \frac{\partial}{\partial x^{*}} \left(\frac{v^{*} q^{*}}{2 \beta^{*}}\right), \end{array} $$

(28a)

which may be derived from (1a)–(1c) of the present study. Application of a low-pass temporal filter to (27b), and then, understanding the phase relationship between v ^∗=−q t ^∗∗/β ^∗ and q ^∗ yields

$$\begin{array}{*{20}l} \frac{\overline{E^{*}}}{(\omega^{*}/k^{*})} = \overline{\frac{p^{*} u^{*}}{{c^{*}}^{2}}-\frac{{q^{*}}^{2}}{2\beta^{*}}} - \frac{\partial}{\partial y^{*}} \left(\frac{\overline{u^{*} q^{*}}}{2 \beta^{*}}\right). \end{array} $$

(28b)

Substitution of (28b) to a low-pass time-filtered version of (28a) yields

$$\begin{array}{*{20}l} &\frac{\partial}{\partial t^{*}} \overline{E^{*}} + \\ &\frac{\omega^{*}}{k^{*}} \nabla^{*}\cdot \Big\langle\!\Big\langle \overline{E^{*}-v^{*}v^{*}},\ \overline{v^{*} u^{*}}+\frac{\partial}{\partial t^{*}} \left(\frac{\overline{u^{*} q^{*}}}{2 \beta^{*}}\right) \Big\rangle\!\Big\rangle = 0, \end{array} $$

(29)

which is a prognostic equation for the wave energy wherein the zonal component of the flux is proportional to that in the IB pseudomomentum equation (27a).

It is easy to expect that the expression of the flux in (29) can indicate the direction of the group velocity of mid-latitude RWs and IGWs (Hoskins et al. 1983; Plumb 1986; Haynes 1988). For equatorial waves, here, we investigate the meridional profile of $(\overline {E^*-v^*v^*})\omega ^*/k^*$ as shown by the purple dots in Figs. 2 and 3 for low- and high-frequency waves, respectively. For low-frequency waves (Fig. 2), the meridional profile of $(\overline {E^*-v^*v^*})\omega ^*/k^*$ (the purple dots) is almost the same as that of the level-2 energy flux (the blue solid line), showing that the level-2 energy flux and the IB flux are closely related. For high-frequency waves (Fig. 3), the meridional profile of $(\overline {E^*-v^*v^*})\omega ^*/k^*$ (the purple dots) is nearly the same as that of the level-1 energy flux (the orange dashed line), indicating that the level-2 energy flux is somewhat better than the IB flux.

In fact, without relying on the level-0 expression, we have arrived at the level-2 expression of the energy flux by extending the investigation of ATG15 concerning the algebraic structure of the IB flux (to be explained in a future study). ATG15 have addressed the importance of a wave-induced scalar quantity and symbolized it as Λ: it vanishes for mid-latitude IGWs (i.e., waves with no perturbation of EPV) and becomes nonzero for mid-latitude RWs (i.e., wave with a perturbation of EPV). Here, we suggest that $\overline {\Lambda }=(\overline {p^* \eta ^* })_{y^*}/2$ is closely linked to $(\overline {p^* \varphi ^{\mathrm {app*}}})_{y^*}/2$ in the present study (η ^∗ is meridional displacement). This is why the level-2 expression for the energy flux in the present study can indicate the direction of the group velocity of different types of waves, an issue we shall discuss in a future study.

Note that the IB flux in (27a) has already been used for the model diagnosis of waves in low-latitude regions (Randel and Williamson 1990; Brunet and Haynes 1996; Fukutomi and Yasunari 2002; Wakata and Kitaya 2002; Kawatani et al. 2010). We suggest that, despite the certain inaccuracy associated with equatorial waves as compared with the level-0 expression, the level-2 expression of the energy flux in the present study will be at least as useful as the IB flux which has long been used in the atmospheric (and oceanic) literature. For oceanic applications, the level-2 energy flux brings two new advantages over the IB flux: (i) the level-2 energy flux satisfies a no-normal-flux boundary condition at coastlines, and (ii) the wave energy is a sign-definite quantity while the IB pseudomomentum is not.

Overall, we address the balance of (i) model accessibility, (ii) unified treatment for different types of waves, (iii) mathematical accuracy, and (iv) boundary conditions at coastlines. With these requirements in mind, we hope future studies can lead to either an improved approximation or a numerical algorithm for the level-0 energy flux, wherein the profile of the IB flux will provide a reference for accuracy because the IB flux has long been used in previous studies.

Abbreviations

EPV:: Ertel’s potential vorticity
IGW:: Inertia gravity wave
KW:: Kelvin wave
RGW:: Mixed Rossby-gravity wave
RW:: Rossby wave

References

Aiki, H, Richards KJ (2008) Energetics of the global ocean: the role of layer-thickness form drag. J Phys Oceanogr 38: 1845–1869.
Article Google Scholar
Aiki, H, Takaya K, Greatbatch RJ (2015) A divergence-form wave-induced pressure inherent in the extension of the Eliassen-Palm theory to a three-dimensional framework for waves at all latitudes. J Atmos Sci 72: 2822–2849.
Article Google Scholar
Anderson, DLT, Gill AE (1979) Beta dispersion of inertial waves. J Geophys Res 84: 1836–1842.
Article Google Scholar
Ascani, F, Firing E, McCreary JP, Brandt P, Greatbatch RJ (2015) The deep equatorial ocean circulation in wind-forced numerical solutions. J Phys Oceanogr 45: 1709–1734.
Article Google Scholar
Brandt, P, Funk A, Hormann V, Dengler M, Greatbatch RJ (2011) Interannual atmospheric variability forced by the deep equatorial Atlantic Ocean. Nature 473: 497–500.
Article Google Scholar
Brandt, P, Claus M, Greatbatch RJ, Kopte R, Toole JM, Johns WE (2016) Annual and semi-annual cycle of equatorial Atlantic circulation associated with basin mode resonance. J Phys Oceanogr 46: 3011–3029.
Article Google Scholar
Brunet, G, Haynes PH (1996) Low-latitude reflection of Rossby wave trains. J Atmos Sci 53: 482–496.
Article Google Scholar
Cai, M, Huang B (2013) A new look at the physics of Rossby waves: a mechanical-Coriolis oscillation. J Atmos Sci 70: 303–316.
Article Google Scholar
Cane, MA, Moore DW (1981) A note on low-frequency equatorial basin modes. J Phys Oceanogr 11: 1794–1806.
Article Google Scholar
Chelton, DB, Schlax MG (1996) Global observations of oceanic Rossby waves. Science 272: 234–238.
Article Google Scholar
Claus, M, Greatbatch RJ, Brandt P (2014) Influence of the barotropic mean flow on the width and the structure of the Atlantic equatorial deep jets. J Phys Oceanogr 44: 2485–2497.
Article Google Scholar
Claus, M, Greatbatch RJ, Brandt P, Toole J (2016) Forcing of the Atlantic equatorial deep jets derived from observations. J Phys Oceanogr 46: 3549–3562.
Article Google Scholar
Cummins, PF, Oey LY (1997) Simulation of barotropic and baroclinic tides off northern British Columbia. J Phys Oceanogr 27: 762–781.
Article Google Scholar
Fukutomi, Y, Yasunari T (2002) Tropical-extratropical interaction associated with the 10–25-day oscillation over the western Pacific during the northern summer. J Meteo Soc Japan 80: 311–331.
Article Google Scholar
Furuichi, N, Hibiya T, Niwa Y (2008) Model-predicted distribution of wind-induced internal wave energy in the world’s oceans. J Geophys Res 113: C09034.
Article Google Scholar
Gill, AE (1982) Atmosphere–ocean dynamics. Academic Press, London.
Google Scholar
Greatbatch, RJ, Brandt P, Claus M, Didwischus S-H, Fu Y (2012) On the width of the equatorial deep jets. J Phys Oceanogr 42: 1729–1740.
Article Google Scholar
Haynes, PH (1988) Forced, dissipative generalizations of finite-amplitude wave-activity conservation relations for zontal and nonzonal basic flows. J Atmos Sci 45: 2352–2362.
Article Google Scholar
Hoskins, BJ, James IN, White GH (1983) The shape, propagation and mean-flow interaction of large-scale weather systems. J Atmos Sci 40: 1595–1612.
Article Google Scholar
Isachsen, PE, LaCasce JJ, Pedlosky J (2007) Rossby wave instability and apparent phase speeds in large ocean basins. J Phys Oceanogr 37: 1177–1191.
Article Google Scholar
Iwasaki, T (2001) Atmospheric energy cycle viewed from wave-mean-flow interaction and Lagrangian mean circulation. J Atmos Sci 58: 3036–3052.
Article Google Scholar
Johnson, GC, Zhang D (2003) Structure of the Atlantic Ocean equatorial deep jets. J Phys Oceanogr 33: 600–609.
Article Google Scholar
Kawatani, Y, Sato K, Dunkerton TJ, Watanabe S, Miyahara S, Takahashi M (2010) The roles of equatorial trapped waves and internal inertia-gravity waves in driving the quasi-biennial oscillation. Part II: three-dimensional distribution of wave forcing. J Atmos Sci 67: 981–997.
Article Google Scholar
Lübbecke, JF, Böning CW, Keenlyside N, Xie S-P (2010) On the connection between Benguela and equatorial Atlantic Ninos and the role of the South Atlantic Anticyclone. J Geophys Res 115: C09015.
Article Google Scholar
Longuet-Higgins, MS (1964) On group velocity and energy flux in planetary wave motion. Deep-Sea Res 11: 35–42.
Google Scholar
Masuda, A (1978) Group velocity and energy transport by Rossby waves. J Oceanogr Soc Jpn 34: 1–7.
Article Google Scholar
Matsuno, T (1966) Quasi-geostrophic motions in the equatorial area. J Meteo Soc Japan 44: 25–43.
Google Scholar
Matthiessen, J-D, Greatbatch RJ, Brandt P, Claus M, Didwischus S-H (2015) Influence of the equatorial deep jets on the north equatorial countercurrent. Ocean Dyn 65: 1095–1102.
Article Google Scholar
McPhaden, MJ, Ripa P (1990) Wave-mean flow interactions in the equatorial ocean. Annu Rev Fluid Mech 20: 167–205.
Article Google Scholar
Merle, J (1980) Annual and interannual variability of temperature in the eastern equatorial Atlantic—the hypothesis of an Atlantic El Nino. Oceanol Acta 3: 209–220.
Google Scholar
Nakamura, N, Solomon A (2011) Finite-amplitude wave activity and mean flow adjustments in the atmospheric general circulation. Part II: analysis in the isentropic coordinates. J Atmos Sci 68: 2783–2799.
Article Google Scholar
Niwa, Y, Hibiya T (2004) Three-dimensional numerical simulation of M2 internal tides in the East China Sea. J Geophys Res 109: C04027.
Article Google Scholar
Orlanski, I, Sheldon J (1993) A case of downstream baroclinic development over western north America. Mon Wea Rev 121: 2929–2950.
Article Google Scholar
Philander, SGH (1989) El Nino, La Nina, and the Southern Oscillation. Academic Press, London.
Google Scholar
Plumb, RA (1986) Three-dimensional propagation of transient quasi-geostrophic eddies and its relationship with the eddy forcing of the time mean flow. J Atmos Sci 43: 1657–1678.
Article Google Scholar
Randel, WJ, Williamson DL (1990) A comparison of the climate simulated by the NCAR community climate model (CCM1:R15) with ECMWF analysis. J Climate 3: 608–633.
Article Google Scholar
Ripa, P (1982) Nonlinear wave-wave interactions in a one-layer reduced-gravity model on the equatorial β plane. J Phys Oceanogr 12: 97–111.
Article Google Scholar
Schopf, PS, Anderson DLT, Smith R (1981) Beta-dispersion of low-frequency Rossby waves. Dyn Atmos Oceans 5: 187–214.
Article Google Scholar
Takaya, K, Nakamura H (1997) A formulation of a wave activity flux for stationary Rossby waves on a zonally varying basic flow. Geophys Res Lett 24: 2985–2988.
Article Google Scholar
Thierry, V, Treguier AM, Mercier H (2004) Numerical study of the annual and semi-annual fluctuations in the deep equatorial Atlantic Ocean. Ocean Model 6: 1–30.
Article Google Scholar
Wakata, Y, Kitaya S (2002) Annual variability of sea surface height and upper layer thickness in the Pacific Ocean. J Oceanogr 58: 439–450.
Article Google Scholar
Yanai, M, Maruyama T (1966) Stratospheric wave disturbances propagating over the equatorial pacific. J Meteo Soc Japan 44: 291–294.
Google Scholar
Zhai, X, Johnson HL, Marshall DP, Wunsch C (2012) On the wind power input to the ocean general circulation. J Phys Oceanogr 42: 1357–1365.
Article Google Scholar

Download references

Acknowledgements

This manuscript has been improved by comments from two anonymous reviewers. HA thanks Paal Erik Isachsen for the helpful discussions and RJG is grateful to the GEOMAR for ongoing support.

Funding

This study was supported by JSPS KAKENHI Grant Numbers 26400474 and 15H02129 and also by the Deutsche Forschungsgemeinschaft as part of the Sonderforschungsbereich 754 “Climate - Biogeochemistry Interactions in the Tropical Ocean,” by the German Federal Ministry of Education and Research as part of the cooperative project SACUS (03G0837A), and by the European Union 7th Framework Programme (FP7 2007-2013) under grant agreement 603521 PREFACE project.

Authors’ contributions

HA proposed the topic and performed the analytical investigation. RJG helped write the manuscript. MC helped with the numerical investigation. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interest.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Institute for Space-Earth Environmental Research, Nagoya University, Aichi, Nagoya City, 464-8601, Japan
Hidenori Aiki
Application Laboratory, Japan Agency for Marine-Earth Science and Technology, Yokohama, Japan
Hidenori Aiki
GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel, Kiel, Germany
Richard J. Greatbatch & Martin Claus
Faculty of Mathematics and Natural Sciences, University of Kiel, Kiel, Germany
Richard J. Greatbatch

Authors

Hidenori Aiki
View author publications
You can also search for this author in PubMed Google Scholar
Richard J. Greatbatch
View author publications
You can also search for this author in PubMed Google Scholar
Martin Claus
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hidenori Aiki.

Additional file

Additional file 1: Movie of the model experiment. See the caption of Fig. 4 for details. (MP4 2365 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Aiki, H., Greatbatch, R. & Claus, M. Towards a seamlessly diagnosable expression for the energy flux associated with both equatorial and mid-latitude waves. Prog. in Earth and Planet. Sci. 4, 11 (2017). https://doi.org/10.1186/s40645-017-0121-1

Download citation

Received: 28 June 2016
Accepted: 06 March 2017
Published: 31 March 2017
DOI: https://doi.org/10.1186/s40645-017-0121-1

Towards a seamlessly diagnosable expression for the energy flux associated with both equatorial and mid-latitude waves

Abstract

Introduction

Theoretical background

Analytical investigation

Energy flux associated with equatorial waves

Identification of the additional rotational flux associated with equatorial waves

Inversion equations for Ertel’s potential vorticity

Equatorial KWs

Boundary conditions and the connection to mid-latitude regions

Methods/Experimental

Model set-up

Results and discussion

Conclusions

Endnotes

Appendix 1

Is the streamfunction Eq. (17a) associated with EPV applicable to mid-latitude waves?

Appendix 2

Approximate expressions for the energy flux

Appendix 3

Similarity between the level-2 energy flux of this study and the pseudomomentum flux in previous studies

Abbreviations

References

Acknowledgements

Funding

Authors’ contributions

Competing interests

Publisher’s Note

Author information

Authors and Affiliations

Corresponding author

Additional file

Rights and permissions

About this article

Cite this article

Share this article

Keywords