1. Introduction
After more than a century of vigorous investigation, Rayleigh–Bénard convection remains a quintessential distilled model for convective instability, pattern formation and turbulence in buoyancy-driven flows (Chassignet, Cenedese & Verron Reference Chassignet, Cenedese and Verron2012). Likewise, rotationally influenced Rayleigh–Bénard convection serves a similar classic role in modelling many natural environments such as oceanic, stellar and planetary systems (Ecke & Shishkina Reference Ecke and Shishkina2023).
Four non-dimensional control parameters characterise Rayleigh–Bénard convection in a closed rotating vessel. Historically, the Rayleigh number, Taylor number, Prandtl number and aspect ratio provide the most common choice,

These measure the strength of the characteristic buoyancy force, the strength of rotation, the fluid’s dissipative properties and the geometry. Here,
$g$
gives the gravitational acceleration,
$\alpha = - \rho ^{-1}(\partial \rho /\partial T)|_{T_{0}, P_{0}}$
is the thermal expansion coefficient,
$\Delta T$
is the vertical temperature contrast,
$\nu$
represents the kinematic viscosity,
$\kappa$
represents the thermal diffusivity,
$\Omega$
gives the background rotation rate,
$H$
represents the container depth and
$L$
gives a characteristic lateral extent (or diameter) of the container. The Ekman and convective Rossby numbers also provide useful equivalents to
${\textit{Ta}}$
and
${\textit{Ra}}$
. That is,

where
${\textit{E}}$
measures the size of viscous boundary layers, and
${\textit{Ro}}$
measures the relative influence of buoyancy and rotation. In rapidly rotating systems, both
${\textit{E}}\ll 1$
and
${\textit{Ro}} \ll 1$
.
A significant partition in Rayleigh–Bénard convection modelling exists between large-aspect-ratio and finite-aspect-ratio systems. In large-aspect-ratio systems, sidewall boundaries play no significant role in the dynamics. Theoretical efforts in this area focus on linear stability analyses and numerical simulations with periodic boundary conditions, a useful simplification in many instances. Chandrasekhar (Reference Chandrasekhar1953) first solved the linear stability theory for large-aspect-ratio rotating Rayleigh–Bénard convection; Chandrasekhar (Reference Chandrasekhar1961) discusses the subject in thorough detail.
For moderate Prandtl number, Niiler & Bisshopp (Reference Niiler and Bisshopp1965) further clarified the influence of flow boundary conditions on the top and bottom plates in the rapidly rotating limit. For the present discussion, the most relevant result of these efforts is that in the rapidly rotating limit (large
${\textit{Ta}}$
) with no-slip boundary conditions, the critical onset Rayleigh number scales as

assuming
$\sigma = {\mathcal{O}}(1)$
(see Clune & Knobloch Reference Clune and Knobloch1993; Dawes Reference Dawes2001). The
${\mathcal{O}}( {\textit{Ta}}^{7/12} )$
term in (1.3) arises because of Ekman pumping effects correlating with leading-order convective modes (Zhang & Roberts Reference Zhang and Roberts1998). Buell & Catton (Reference Buell and Catton1983) first attempted to solve the linear stability problem in a closed cylinder, finding notable decreases in the critical Rayleigh number compared with the extended-layer modes considered by Chandrasekhar (Reference Chandrasekhar1953). They also pointed out the presence of a wall-localised mode, implying a tendency for instability at a reduced Rayleigh number for any aspect ratio. However, their analysis only considered steady onset, precluding the preferred types of propagating modes discovered later. The laboratory work of Nakagawa & Frenzen (Reference Nakagawa and Frenzen1955) gave the first qualitative stability agreement and flow visualisation for rotating convection.
Pioneering experimental studies, with an approximately 2 : 1 aspect ratio, found interesting discrepancies from the large-aspect ratio theory (Rossby Reference Rossby1969). Without a means to visualise the interior flow, heat transport data was the primary quantitative outcome of these experiments. In discussing the convective onset in water, Rossby noted
We find excellent agreement between theory and experiment for the critical Rayleigh number at all Taylor numbers less than
$5\times 10^{4}$
; beyond this, the fluid becomes unstable at lower Rayleigh numbers. At a Taylor number
$=10^{8}$
, for example, the measured critical Rayleigh number is about one-third the expected value. We do not understand why this should be. It is quite reproducible; i.e. if one changes the depth of the fluid, the instability will occur at the same Rayleigh number for a given Taylor number (Rossby Reference Rossby1969, p. 322–323).
Motivated by the theory of Veronis (Reference Veronis1968) for subcritical phenomena, Rossby conjectured at the possibility of finite-amplitude effects and speculated about the possible modes for the ‘subcritical’ instability. Pfotenhauer, Niemela & Donnelly (Reference Pfotenhauer, Niemela and Donnelly1987) found similar onset phenomena in cryogenic helium and speculated that the lower onset state was related to linear stability calculations of the static wall mode from Buell & Catton (Reference Buell and Catton1983). Inspired by precision laboratory work in the 1990s (Zhong, Ecke & Steinberg Reference Zhong, Ecke and Steinberg1991; Ecke, Zhong & Knobloch Reference Ecke, Zhong and Knobloch1992; Ning & Ecke Reference Ning and Ecke1993; Zhong, Ecke & Steinberg Reference Zhong, Ecke and Steinberg1993), several almost simultaneous efforts helped clarify our theoretical understanding of Rossby’s paradox: sidewalls lead to distinct modes of linear instability at lower Rayleigh numbers than in a large-aspect-ratio system.
Goldstein et al. (Reference Goldstein, Knobloch, Mercader and Net1993, Reference Goldstein, Knobloch, Mercader and Net1994) conducted the first comprehensive linear stability analysis for a cylindrical vessel with a range of
${\textit{Ra}},\,{\textit{Ta}},\,\sigma ,\,\Gamma$
. This work demonstrated that in a rapidly rotating finite-aspect ratio system, the onset of convection occurs via two distinct mode sets. The first type comprises non-propagating modes supported within the bulk and corresponds to the periodic modes in Chandrasekhar’s large-aspect-ratio analyses and Buell’s and Catton’s finite-domain modes. These solutions remain stationary in the rotating coordinate frame.
Goldstein et al. (Reference Goldstein, Knobloch, Mercader and Net1993) also found a second type of instability that proceeds via a set of faster precessing modes (retrograde with respect to system rotation). In contrast to the bulk dynamics, these waves exist in a thin boundary layer attached to the container’s sidewall. For large enough background rotation rates, the wall-localised modes emerge as the preferred pattern of instability in a regime subcritical to the bulk-mode instability. In the rapidly rotating limit,

Herrmann & Busse (Reference Herrmann and Busse1993) derived the leading-order term in this expression. Assuming stress-free top and bottom boundary conditions, Liao, Zhang & Chang (Reference Liao, Zhang and Chang2006) computed an
${\mathcal{O}}( {\textit{Ta}}^{\,1/3} )$
correction distinguishing between different sidewall velocity boundary conditions. The
${\mathcal{O}}( {\textit{Ta}}^{\,5/12} )$
correction term also arises from no-slip top and bottom boundary conditions. As far as we know, this term is absent from the literature. In § 5.2, we account for Ekman pumping (similar to Zhang & Roberts Reference Zhang and Roberts1998) to calculate this term, along with the other associated corrections.
Along with the linear stability theory, the advent of practical optical shadowgraph techniques allowed much more detailed studies of the flow structures contained in experiments (Boubnov & Golitsyn Reference Boubnov and Golitsyn1986; Kuo & Cross Reference Kuo and Cross1993; Zhong et al. Reference Zhong, Ecke and Steinberg1993). A large number of studies – for example, Rossby (Reference Rossby1969), Boubnov & Golitsyn (Reference Boubnov and Golitsyn1990), Ning & Ecke (Reference Ning and Ecke1993), Julien et al. (Reference Julien, Legg, Mcwilliams and Werne1996), Sakai (Reference Sakai1997), Liu & Ecke (Reference Liu and Ecke1997), Hart, Kittelman & Ohlsen (Reference Hart, Kittelman and Ohlsen2002), Ahlers, Grossmann & Lohse (Reference Ahlers, Grossmann and Lohse2009), King et al. (Reference King, Stellmach, Noir, Hansen and Aurnou2009), Kunnen, Geurts & Clercx (Reference Kunnen, Geurts and Clercx2009), Zhan et al. (Reference Zhan, Liao, Zhu and Zhang2009), Niemela, Babuin & Sreenivasan (Reference Niemela, Babuin and Sreenivasan2010), Zhong & Ahlers (Reference Zhong and Ahlers2010), Weiss & Ahlers (Reference Weiss and Ahlers2011), Liu & Ecke (Reference Liu and Ecke2011), Stevens et al. (Reference Stevens, Overkamp, Lohse and Clercx2011) – examine the heat-transport and flow properties in rotating Rayleigh–Bénard systems significantly above the onset of bulk convection (see figure 1 for a sample of experiment parameter ranges). Alternatively, we focus on the parameter range below onset for bulk convection, where wall-mode convection exists in isolation. Most recently, Aurnou et al. (Reference Aurnou, Bertin, Grannan, Horn and Vogt2018) investigated the regime before bulk onset for
${\textit{Ta}} \approx 10^{10}$
. We point out that in the limit
${\textit{Ta}}\rightarrow \infty$
, wall-mode convection can reach values of
${\textit{Ra}}/{\textit{Ra}}_{\textrm{{c},{wall}}}={\mathcal{O}}({\textit{Ta}}^{1/6})$
before exciting the bulk mode. This indicates the potential development of strongly nonlinear dynamics in the cross-hashed region of figure 1.

Figure 1. Approximate stability regions and experimental parameters. The respective sources are all laboratory experiments: R69 (Rossby Reference Rossby1969) ZES93 (Zhong et al. Reference Zhong, Ecke and Steinberg1993); LE11 (Liu & Ecke Reference Liu and Ecke2011); KSNHA09 (King et al. Reference King, Stellmach, Noir, Hansen and Aurnou2009); KSOSHC11 (Kunnen et al. Reference Kunnen, Stevens, Overkamp, Sun, van Heijst and Clercx2011); ABGHV18 (Aurnou et al. Reference Aurnou, Bertin, Grannan, Horn and Vogt2018). In each case, the current author gathered the information from the respective sources by hand. All values are approximate and intended to get an idea of the relative ranges of data. The cross-hatched region shows the wall-mode regime between the asymptotic stability scalings
${\textit{Ra}}_{\,{bulk}} \approx 8.7 \, {\textit{Ta}}^{2/3}$
and
${\textit{Ra}}_{\,{wall}} \approx 31.8 \, {\textit{Ta}}^{1/2}$
. The 31.8 coefficient is roughly an upper bound; the actual value somewhat lower in a finite-aspect-ratio cylinder (see figure 5). Except for
${\textit{ABGHV18}}$
, all experiments use moderate Prandtl numbers, approximately between 4 and 7. The steep dashed line shows the convective Rossby number threshold
${\textit{Ro}} \approx 1$
for Prandtl
$\approx 7$
. The
${\textit{ABGHV18}}$
experiment used gallium with Prandtl
$\approx 0.025$
with markedly different onset behaviour, but still well below
${\textit{Ro}} = 1$
.
More recently, several studies have examined the cross-hatched region and its vicinity with direct numerical simulations (Kunnen et al. Reference Kunnen, Stevens, Overkamp, Sun, van Heijst and Clercx2011; Horn & Schmid Reference Horn and Schmid2017; Zhang et al. Reference Zhang, van Gils, Horn, Wedi, Zwirner, Ahlers, Ecke, Weiss, Bodenschatz and Shishkina2020; Favier & Knobloch Reference Favier and Knobloch2020; Ecke, Zhang & Shishkina Reference Ecke, Zhang and Shishkina2022; de Wit et al. Reference de Wit, Boot, Madonia, Aguirre Guzmán and Kunnen2023; Ravichandran & Wettlaufer Reference Ravichandran and Wettlaufer2023; Zhang et al. Reference Zhang, Reiter, Shishkina and Ecke2024). One significant result from these efforts is that wall mode dynamics can undergo various degrees of wall separation and even generate interior turbulence below the threshold for bulk convection onset (Albaiz & Hart Reference Albaiz and Hart1990; Zhong et al. Reference Zhong, Ecke and Steinberg1991; Marques & Lopez Reference Marques and Lopez2008; Li et al. Reference Li, Liao, Chan and Zhang2008; Rubio, Lopez & Marques Reference Rubio, Lopez and Marques2009; Favier & Knobloch Reference Favier and Knobloch2020; Zhang et al. Reference Zhang, Reiter, Shishkina and Ecke2024). The generation of boundary zonal flows has also been an area of interest (Zhang et al. Reference Zhang, van Gils, Horn, Wedi, Zwirner, Ahlers, Ecke, Weiss, Bodenschatz and Shishkina2020; de Wit et al. Reference de Wit, Aguirre Guzmán, Madonia, Cheng, Clercx and Kunnen2020; Zhang, Ecke & Shishkina Reference Zhang, Ecke and Shishkina2021; Wedi et al. Reference Wedi, Moturi, Funfschilling and Weiss2022; Ecke et al. Reference Ecke, Zhang and Shishkina2022). With the rich interplay of wall-localised and interior dynamics, many questions remain. What is genuinely wall-generated behaviour versus what is wall-catalysed bulk dynamics? Without getting into a regime of
${\textit{Ta}}$
far exceeding current simulations and experiments, there is not enough dynamic range to disentangle various phenomena.
Within the wall-dominated cross-hatch region, the convective Rossby number satisfies
${\textit{Ro}} = {\mathcal{O}}( {\textit{E}}^{\,1/2} )$
indicating strong rotational influence. Sprague et al. (Reference Sprague, Julien, Knobloch and Werne2006) and Julien & Knobloch (Reference Julien and Knobloch2007) show that regimes of low
${\textit{E}}, {\textit{Ro}}$
exhibit balanced dynamics for bulk convection and are prime candidates for dynamical reductions capturing nonlinear evolution through the application of multiple-scale perturbation theory. In this paper, we derive a systematic dynamical reduction for wall-mode convection.
1.1. Outline
We organise the paper as follows:
-
§ 2 outlines the parameter choices and model equation set-up;
-
§ 3 derives the fundamental equations, including:
-
§ 4 gives a concise summary of all derived results in (4.5)–(4.10);
-
§ 5 describes linear theory, including:
-
§ 5.1 for Cartesian coordinates near a semi-infinite domain;
§ 5.2 the leading-order correction to the critical Rayleigh number;
§ 5.3 direct numerical validation of critical parameters;
§ 5.4 within a closed finite cylinder, including the tall-aspect-ratio limit;
§ 5.5 a local baroclinic instability;
-
-
§ 6 reports on nonlinear results in cylindrical geometry, including:
-
§ 7 gives conclusions and discusses generalisations, including:
2. Model set-up
We start our multiscale asymptotic analysis from the Boussinesq equations (Spiegel & Veronis Reference Spiegel and Veronis1960). We pose and analyse the system in a local Cartesian coordinate system
$(x,y,z)$
along a flat vertical wall, with
$x$
increasing into the wall at
$x=\Gamma$
, the coordinate
$y$
tangential along the wall and
$z$
vertical. We choose this coordinate system to easily translate to more general situations, e.g. an upright cylinder,
$(x,y,z) \to (r,\theta ,z)$
.
We pick our non-dimensionalisation to agree mostly with the standard thermal diffusion scaling. For box depth,
$H$
, and thermal diffusivity,
$\kappa$
, we scale
$\text{length} \sim H$
,
$\text{velocity} \sim \kappa / H$
and
$\text{time} \sim H^{2} / \kappa$
.
For the temperature and pressure, the known properties of wall-mode dynamical balances (Herrmann & Busse Reference Herrmann and Busse1993) imply a slightly non-standard normalisation. We define a linear reference temperature profile,

where the domain comprises
$0\leq z/H \leq 1$
, and
$T_{{top}}$
and
$T_{{bot}}=T_{{top}}+\Delta T$
are the fixed temperatures at the top and bottom boundary. We use the temperature perturbation,
$\Theta = T - T_{{ref}}(z)$
, as a dynamical variable. Similarly, we evolve the kinematic pressure perturbation,
$p = P/\rho$
, where
$P$
is the dynamic pressure and
$\rho$
is the mass density. We also subtract the hydrostatic reference profile balancing buoyancy from the background
$T_{{ref}}(z)$
. From now on, we refer to
$p$
simply as ‘pressure’.
It is common to scale
$p \sim \nu \kappa / H^{2}$
and
$\Theta \sim \Delta T$
. However, we pick

which allow the pressure to balance buoyancy and rotation; the temperature maintains thermal wind balance.
These scalings naturally introduce a ‘reduced’ Rayleigh number as the buoyancy control parameter,

Our non-dimensional system takes the form,





where

The system models the dynamics of a weakly compressible fluid in an upright container with uniform height
$0 \le z \le 1$
and arbitrary horizontal geometry. The system rotates at a fixed rate about an axis anti-aligned with gravity; see, e.g. figure 2. The triplet
$(u,v,w)$
represents the fluid velocities in the
$(x,y,z)$
directions, respectively. The quantity
$\Theta$
represents the temperature departure from a constant reference value and
$p$
gives the dynamic pressure.

Figure 2. Schematic diagram of the basic domain. Each region, I–III, comprises different length, time and amplitude scalings in terms of powers of
$\varepsilon = {\textit{E}}^{\,1/3}$
. The asymptotic analysis considers each region separately and connects each through a series of matching conditions, eventually being left entirely with
${\mathcal{O}}( 1 )$
dynamics in the bulk (Region I). We refer to region II as the Stewartson sidewall layer and region III as the Ekman layer (at both the top and bottom of the domain). We conduct the asymptotic analysis in Cartesian coordinates
$(x,\,y,\,z)$
without loss of generality. After obtaining the final results, it is straightforward to use general coordinates and re-pose the system in cylindrical polar coordinates with
$x \to r$
,
$y \to \phi$
and
$z \to z$
.
In this simplified geometry, we address both no-slip (NS) and stress-free (SF) boundary conditions on all walls:




Differences between no-slip and stress-free conditions on the sidewalls produce no leading-order effect on the final result, while the top/bottom boundaries show important differences.
Given that
$\Theta$
is the perturbation from the reference profile,
$T_{{ref}}(z)$
, the thermal boundary conditions on both plates are

The thermal sidewall condition influences the dynamics more than any other. We use perfectly insulating sidewalls,

Relaxing the insulating condition to include finite conduction stabilises the wall mode because of heat losses. In the infinite conductivity limit, the container walls short-circuit the instability. In that case, the critical
${\textit{Ra}} \sim {\mathcal{O}}({\textit{E}}^{\,-4/3})$
along with the bulk convection. This regime nonetheless contains rich wall-mode dynamics with investigation ongoing (e.g. Ravichandran & Wettlaufer Reference Ravichandran and Wettlaufer2023; Ecke, Zhang & Shishkina Reference Ecke, Zhang and Shishkina2024). It is also possible to match the final asymptotic equations to external thermal field with arbitrary heat capacity and conductivity. A large conductivity will increase the critical Rayleigh number.
3. Multiscale asymptotic analysis
We aim to find reduced equations capturing the leading-order nonlinear behaviour of the wall-mode instability in the limit of an infinitesimal Ekman number,
${\textit{E}} \ll 1$
. We have already discussed the relevant pressure amplitude and Rayleigh number scaling. We now describe the a priori known properties of the boundary layer thicknesses and flow amplitudes.
-
(i) A thermally forced side-wall Stewartson boundary layer (Stewartson Reference Stewartson1957; Barcilon & Pedlosky Reference Barcilon and Pedlosky1967a , Reference Barcilon and Pedloskyb ; Hashimoto Reference Hashimoto1976; Fein Reference Fein1978; Albaiz & Hart Reference Albaiz and Hart1990; Pedlosky Reference Pedlosky2009; Kunnen, Clercx & van Heijst Reference Kunnen, Clercx and van Heijst2013) drives the dynamics from within a characteristic layer thickness
(3.1)\begin{eqnarray} \,\text{d}{x} \sim {\textit{E}}^{\,1/3}. \end{eqnarray}
-
(ii) For no-slip boundary conditions, top and bottom Ekman layers (Greenspan Reference Greenspan1969; Pedlosky Reference Pedlosky1987) influence the dynamics from within a characteristic layer thickness
(3.2)\begin{eqnarray} \,\text{d}{z} \sim {\textit{E}}^{\,1/2}. \end{eqnarray}
-
(iii) Relative to the thermal diffusion velocity normalisation
$\kappa /H$ , the fluid velocities near the wall carry a large amplitude
(3.3)while the interior velocities remain\begin{eqnarray} \left (u,v,w\right ) \sim {\textit{E}}^{\,-1/3}, \end{eqnarray}
${\mathcal{O}}(1)$ in these units.
Throughout this paper, the Prandtl number remains a fixed order-unity number and does not scale with
${\textit{E}}$
in any way. Allowing
$\sigma$
to scale with
${\textit{E}}$
can lead to several interesting possibilities (Dawes Reference Dawes2001; Zhang, Liao & Busse Reference Zhang, Liao and Busse2007; Zhang & Liao Reference Zhang and Liao2009), but this remains beyond our current scope.
Assumptions (i) and (ii) introduce multiple scales, where all variables depend on both small and large length scales. For example, with the pressure

For the duration of the asymptotic derivation, we use capital-letter variables to denote coordinates varying on the bulk-interior scale, i.e.
$(X, Y, Z) = (x^{*}, y^{*}, z^{*})/H$
, where e.g.
$x^{*}$
is a physical coordinate. Also, for the duration of the asymptotic analysis, the lower-case variables represent boundary layers. In the horizontal directions,
$(x,y) = {\textit{E}}^{\,-1/3}\,(x^{*},y^{*})/H$
and
$z = {\textit{E}}^{\,-1/2} z^{*}/H$
. After establishing the final asymptotic equations, all remaining coordinates will revert to bulk-only. We will, therefore, restore all coordinates to simply
$(x,y,z)$
only, representing bulk physical scales.
Multiple-scale theory dictates (Kevorkian & Cole Reference Kevorkian and Cole1981) that each argument varies independently of the others. This assumption allows redefining the derivative operators

The capital-letter coordinates
$(X, Y, Z)$
now label position within the ‘large-scale’ interior (outside boundary layers). The small-letter coordinates
$(x, z)$
label position within the Stewartson and Ekman boundary layers, respectively. We exclude asymptotically small-scale dynamics in the
$Y$
-direction along the sidewall. In the bulk-convection regime (i.e.
$R \sim {\textit{E}}^{\,-1/3}$
), both horizontal directions adopt
${\mathcal{O}}( {\textit{E}}^{\,1/3 } )$
length scales (Julien & Knobloch Reference Julien and Knobloch2007).
The
${\mathcal{O}}( {\textit{E}}^{\,1/3} )$
length scale controls the overall system, suggesting the expansion parameter,

The Ekman layers on the top and bottom thus require half-integer powers
${\textit{E}}^{\,1/2}=\varepsilon ^{3/2}$
.
Equation (3.6) and the above mentioned scalings of pressure and velocity with
${\textit{E}}$
imply the following asymptotic series for the dynamical variables:





Defining pure interior variables aids in the following analysis. Capital-letter variables (e.g.
$U, V, W, P$
) denote quantities depending only on large-scale interior coordinates
$(X, Y, Z)$
. These connect to the boundary layer variables via a matching principle. For example, with the leading-order pressure,

and so on. The analysis in §§ 3.1–3.3 shows that the temperature is an interior variable, removing the need to start with a lowercase variable.
The three following subsections determine the leading-order dynamics. We examine each region shown in figure 2 and connect each through appropriate matching conditions.
We comment on a key aspect to keep in mind throughout the derivation. In our final equation, the only explicit time dependence comes from interior temperature and barotropic vorticity evolution; the full system contains many more time derivatives in the momentum equation.
In many systems, it is common for time-dependence to be subdominant in some part of the domain. In particular, it is common for boundary layers to have direct diagnostic constraints (compared with prognostic evolution). In boundary-layer theory, the idea is often that the time-evolving interior provides the boundary layer with a state that does not satisfy boundary conditions at the physical wall. The thinness of the boundary layer implies that diffusion time scales are extremely fast and can relax an arbitrary far-field to the required balance virtually instantaneously, which obviates the need for explicit
$\partial _{t}$
.
3.1. Region-I (Interior)
Given the scaling of (2.4)–(2.7), we can read off the leading-order interior dynamical equations by inspection. That is, for the bulk interior, all variables retain their original amplitude scalings, and the Coriolis and hydrostatic terms are the only remaining in (2.4)–(2.7) as
${\textit{E}} \to 0$
. The bulk dynamics then only contains
${\mathcal{O}}( \varepsilon ^{0} )$
amplitudes and the only time derivative occurs in the temperature equation (2.8). It is possible to derive this explicitly by considering the leading-order balances of the full asymptotic expansion. From the momentum equations and continuity,

The pressure gives the horizontal velocities and temperature in a thermal-wind balance (i.e. a joint hydrostatic and geostrophic balance).
The temperature equation retains its original nonlinear form (except for vertical advection),

The combined (3.13) and (3.14) almost form a closed system, with two important caveats.
The first major caveat is that the temperature evolution equation contains no obvious sources. Defining the horizontal velocity vector,
$U_{\bot } = (U_{0}, V_{0})$
, the available thermal energy evolution,

where
$\,\text{d}{\ell }$
is the integration measure along the boundary,
$\partial {\mathcal{A}}$
, and
$\hat {n}$
is the outward unit normal vector. Equation (3.15) shows that the boundary terms must force the system strongly enough to overcome persistent dissipation. Without the boundary terms (like in conventional bulk convection), the available thermal energy would monotonically decrease. Driving (3.14) requires forcing from the sidewall boundary layers, which we explain in § 3.2.
The second caveat is that (3.14) determines the evolution only for the temperature. Obtaining the velocities requires the pressure and obtaining the pressure requires solving hydrostatic balance (3.13). The difficulty comes from not knowing the depth-independent, or barotropic, component of the pressure,

We derive an equation for
$\!\left \langle P_{0} \right \rangle$
by considering the top and bottom boundaries together with their velocity boundary conditions in § 3.3.
3.2. Region-II (thermal Stewartson layer)
In the Stewartson sidewall layers, focusing on
$X=\Gamma$
and
$- \infty \lt x \le 0$
, the physical container boundary corresponds to
$x=0$
and
$X=\Gamma$
, while the boundary of the interior corresponds to
$X=\Gamma$
and
$x \to - \infty$
. The leading-order momentum equation (2.4) in the
$X$
-direction gives

The leading-order diffusion of temperature from (2.8) gives

which implies a constant leading-order temperature across the boundary layer. Strictly speaking, (3.18) contains a linearly growing solution in
$x$
. The possibility of unbounded amplitudes as
$|x| \to \infty$
preclude this solution. Therefore, the leading-order temperature exists as a pure interior dynamical variable,

While (3.19) implies that
$\Theta _{0}$
does not vary across the boundary layer, there exist temperature fluctuations in the directions along the wall both in
$Y$
and
$Z$
. The interior thermally forces the boundary layer via tangential gradients.
Given a temperature field in the interior, the leading-order momentum and continuity equations follow:




The system is similar to the Stewartson layer balances derived in other work (Stewartson Reference Stewartson1957; van Heijst Reference van Heijst1983; Kunnen et al. Reference Kunnen, Clercx and van Heijst2013). The main difference is that (3.20)–(3.23) have enough derivatives to satisfy all velocity boundary conditions at the physical wall and thermal anomalies force the whole boundary layer from the interior. There is no need for the classical
${\textit{E}}^{\, 1/4}$
exterior layer typically required to satisfy all velocity boundary conditions.
The interior forces the vertical velocity via (3.22). Leading-order balances also imply
$\partial _{x}^{2} \Theta _{1/2} = 0$
. To see how the boundary layer feeds back to the interior, the next-order temperature equation for
$\Theta _{1}$
,

The crucial connection between the interior and boundary layer comes from the full form of the insulating thermal boundary condition at the physical sidewall,

This couples
$\Theta _{0}$
, from the interior, with
$\Theta _{1}$
, from (3.24). Integrating (3.24) over boundary layer coordinate,

gives

where the
$\partial _{x}\Theta _{1}\big |_{x\to -\infty }$
term vanishes because
$v_{-1}$
and
$w_{-1}$
decay exponentially away from
$x=0$
(see later), and there is no linearly growing homogeneous solution in
$x$
with bounded amplitude as
$x\to -\infty$
. Equation (3.27) expresses energy conservation between the boundary layer and the interior.
The total momentum fluxes within the sidewall layers are

These purely surface quantities depend on the tangent directions,
$(Y, Z)$
. They couple to the
${\mathcal{O}}( \varepsilon ^{0} )$
dynamics because while
$v_{-1}, \, w_{-1} \sim {\mathcal{O}}( \varepsilon ^{-1} )$
, the boundary layer width,
$\,\text{d}{x} \sim {\mathcal{O}}( \varepsilon )$
. The tangential velocities act exactly like Dirac-
$\delta$
distributions at the boundary of the bulk,
$X=\Gamma$
.
To further constrain
$q_{Y}$
and
$q_{Z}$
, we integrate the continuity equation

The integration limit
$u_{0}\big |_{x=0} = 0$
because of impenetrability at the physical container wall. Equation (3.29) expresses mass conservation between the boundary layer and the interior.
To close the system, we need another relation between
$q_{Y}$
and
$q_{Z}$
, which comes from directly solving (3.20)–(3.23). We represent the horizontal velocities strictly in terms of the pressure


The following coupled system relates the pressure and vertical velocity:


which collapses into a single system for the vertical velocity,

3.2.1. Bulk onset comparison
Equation (3.34) should look familiar; it is half of the traditional asymptotic stability condition in the rapidly rotating regime when
${\textit{Ra}} \sim {\textit{E}}^{\,4/3}$
. In terms of the
$R$
and
$\varepsilon$
parameters, the full bulk leading-order balance,

If
$\varepsilon \, R \sim {\mathcal{O}}( 1 )$
, the left-hand side balances with
$\partial _{x}^{2} \to -k^{2}$
,
$\partial _{Z}^{2} \to -\pi ^{2}$
and we have the traditional stability condition for rotating convection (Chandrasekhar Reference Chandrasekhar1961),

Here,
$R_{c}$
is the critical reduced Rayleigh number and
$k_{c}$
is the critical wavenumber for instability.
Equation (3.36) represents a triple balance between vortex stretching (i.e.
$\partial _{Z}^{2}$
) and diffusion (i.e.
$\partial _{x}^{6}$
) on the left, with buoyancy on the right. However, if
$R \sim {\mathcal{O}}( 1 )$
, the right-hand side of (3.35) drops out and the bulk no longer supports harmonic
$x$
dependence. In that case, the only possible solutions happen with exponential
$x$
dependence, which can only remain bounded near a wall.
3.2.2. Boundary layer solutions
We solve the system for no-slip physical sidewall boundary conditions,

We could alternatively use stress-free conditions; however, doing so gives exactly the same final relationship between
$q_{Y}$
and
$q_{Z}$
. We also impose
$w_{-1} = 0$
at
$Z=0,1$
.
Solving (3.32) and (3.33) requires Fourier decomposing the velocities, pressure and temperature in the
$Z$
direction,





Solving for the Fourier coefficients, which decay as
$x\to -\infty$
and satisfy either no-slip or stress-free boundary conditions at
$x=0$
, gives




where
$\alpha _{n} = (n\pi )^{1/3}$
. The boundary layer profile function,

where the phase
$\delta = \pm \pi /6$
depends on the tangential boundary conditions. In both cases,

which satisfies the normal velocity,
$\hat {u}_{n} = 0$
at
$x=0$
. For the tangential velocities,


Figure 3 shows the velocity amplitudes for the no-slip case.
The capital-letter variables,
$\widehat {U}_{n}$
,
$\widehat {P}_{n}$
and
$\widehat {\Theta }_{n}$
are the Fourier transforms of the interior bulk quantities that satisfy thermal wind balance at
$X=\Gamma$
,

The overall amplitude satisfies

which is the Fourier-space version of mass conservation at the bulk boundary (3.29).

Figure 3. Thermal Stewartson boundary layer solutions for no-slip tangential velocities: the blue line shows
$1-\mathcal{X}(x)$
. The orange line shows
$\mathcal{X}'(x)$
. The orange line compares well with figure 2(b) from Buell & Catton (Reference Buell and Catton1983) for
${\textit{Ta}} \approx 1400^{2}$
.
There are a couple of notable comments regarding (3.43)–(3.46). The first is that both
$\hat {v}_{n}$
and
$\hat {w}_{n}$
exist strictly within the boundary layer and do not continue into the bulk interior. The second comment is that
$\hat {p}_{n}$
and
$\hat {u}_{n}$
both contain components that are independent of
$x$
. These terms take precisely the same form as the interior geostrophic and hydrostatic balance found in § 3.1.
The most important comment regarding (3.45) and (3.46) is that the coefficients of
$\hat {v}_{k,n}$
and
$\hat {w}_{k,n}$
take identical form,

and

That is, the
$\widehat {Q}_{n}$
amplitude gives the Fourier coefficients for both
$q_{Y}$
and
$q_{Z}$
, closing the system of equations. However, we still need to collect all the relevant information and pose the system in a local abstract form, not in Fourier space.
Finally, regarding the bulk, all relevant quantities do not depend on the value of
$\delta = \pm \pi /6$
. This observation accords with the findings from Liao et al. (Reference Liao, Zhang and Chang2006) that the leading-order stability is independent of the sidewall velocity boundary conditions.
3.2.3. Hilbert transform
While the Fourier amplitudes of
$v_{-1}$
,
$q_{Y}$
,
$w_{-1}$
and
$q_{Z}$
are all identical, the local functions are not equal because
$v_{-1}$
and
$q_{Y}$
are Fourier cosine series in
$Z$
, but
$w_{-1}$
and
$q_{Z}$
are Fourier sine series. We need an operator to transform between the different parities, which naturally leads to the Hilbert transform,
$\mathcal{H}$
, in the
$Z$
direction. For the present analysis, the utility of the Hilbert transform lies in its action on a Fourier sine and cosine basis. For
$n \ne 0$
,

which alters the parity of the basis elements in the same manner as a derivative, but without the multiplication by the wavenumber,
$n \pi$
. Additionally, the Hilbert transform possesses the following useful properties

The Benjamin–Ono equation (Benjamin Reference Benjamin1967; Ono Reference Ono1975) for small-amplitude surface gravity waves over an infinitely deep interior employs the Hilbert transform for reasons similar to the current context. In particular, (3.56) implies

for trigonometric functions. Therefore, we may factor the Laplacian-like operator in (3.34)

which separates modes that decay as
$x \to \infty$
versus
$x \to -\infty$
.
Our final closure for the boundary momentum fluxes is

which means they share the same Fourier coefficients but with different vertical parity.
3.2.4. Barotropic mode
Finally, we derive properties of the barotropic mode in the Stewartson layer. We first take the vertical average of the horizontal velocities



Therefore,
$\!\left \langle p_{0}\right \rangle = \!\left \langle P_{0}\right \rangle$
within the sidewall layer, with
$\!\left \langle u_{0}\right \rangle = -\partial _{Y} \!\left \langle P_{0}\right \rangle$
. Impenetrability at the physical container boundary, along with fixing the overall pressure gauge condition, implies

which is the only sidewall boundary condition needed for no-slip top and bottom boundary conditions.
Free-slip top and bottom boundaries also require the next-order barotropic momentum balances and continuity in the sidewall layer,



where

The tangential solution in the sidewall layer,

A no-slip boundary condition,
$\!\left \langle v_{0}\right \rangle = 0$
, at
$x=0$
,
$X=\Gamma$
implies


Computing the coefficients,

Both coefficients are independent of
$\delta = \pm \pi /6$
. Putting everything together,

3.3. Region-III (Ekman layers)
We still need to fix the barotropic pressure,
$\!\left \langle P_{0}\right \rangle$
, in the interior, which requires considering the vertical vorticity along with the top and bottom boundary conditions. Geostrophic balance relates the vertical vorticity and the pressure,

Taking the horizontal curl of the momentum equations and using the continuity equation,

Notably, the leading-order rotational terms enforce

The first non-constant vertical velocity is
$W_{3}$
.
3.3.1. Stress-free boundaries
For stress-free boundaries (2.11), applying the averaging operator to (3.74) gives

The
$\partial _{Z}$
terms on the right-hand side vanish because of impenetrability and vanishing stress,


Equation (3.76) determines the missing depth-independent pressure component needed to obtain the velocities in (3.14). The depth-dependent pressure (hence temperature) forces (3.76) through the nonlinear baroclinic interactions, e.g.
$\!\left \langle U_{0} \zeta _{0}\right \rangle - \!\left \langle U_{0}\right \rangle \!\left \langle \zeta _{0}\right \rangle$
, and the two systems are coupled. We solve (3.76) together with boundary conditions
$\!\left \langle P_{0}\right \rangle =0$
and (3.72).
3.3.2. No-slip boundaries
For no-slip top and bottom boundaries, one expects an Ekman-pumping effect of the order (Zhang & Roberts Reference Zhang and Roberts1998)

Together, (3.75) and (3.79) imply a depth-independent Ekman velocity,

The Ekman layers determine the barotropic pressure.
Given an interior flow, the Ekman boundary layer response follows as a well-known exercise (Greenspan Reference Greenspan1969; Pedlosky Reference Pedlosky1987). Region-III lies beyond the sidewall Stewartson layer, and hence
$\partial _{x} \to 0$
. Within the top and bottom boundary layers,




The solutions to (3.81)–(3.84) are well known for
$Z=0$
and the boundary layer coordinate
$0 \le z \lt \infty$
, and for
$Z=1$
and the boundary layer coordinate
$-\infty \lt z \le 0$
. In both cases,

where
$P_{0}$
is evaluated at
$Z=0$
or
$Z=1$
depending on the boundary layer and the functions,

Both
$\text{se}(0)=\text{ce}(0)=0$
.
For the vertical velocity at either boundary,

The Ekman pumping velocity must connect to the interior as
$|z| \to \infty$
,

However, in the interior,
$\partial _{Z}W_{3/2} = 0$
. Therefore,

which indirectly fixes the barotropic pressure. In terms of the local pressure,

where
$\Phi (X, Y)$
represents a two-dimensional harmonic function (the factor of two provides a later simplification). The Stewartson layer supports no barotropic pressure, and
$\Phi$
ensures
$\left \langle P_{0}\right \rangle$
vanishes at the sidewall.
Given hydrostatic balance,
$\partial _{Z}P_{0}(Z) = \Theta _{0}(Z)$
, the pressure decomposes into barotropic and baroclinic components,

Applying (3.90), the barotropic pressure is

The Stewartson layer equations imply
$\!\left \langle P_{0}\right \rangle = 0$
within the side wall layer, thus fixing the harmonic function,
$\Phi (X, Y)$
, when applying (3.92) at
$X=\Gamma$
.
While the Ekman layers in the bulk dictate leading-order dynamics of the barotropic mode, the Ekman layers within the sidewall Stewartson layers generate a lower-order passive response. That is, it is possible to compute the effects of Ekman pumping after the fact (e.g § 5.2), but we can press ahead with calculating the leading-order dynamics if we are not interested in the correction terms.
There is, however, one notable aspect. The balances in the sidewall naturally produce velocities
$u_{0}, v_{-1}, w_{-1}$
. That is, the tangential velocities are
${\mathcal{O}}( {\textit{E}}^{\,-1/3} )$
compared with
${\mathcal{O}}( 1 )$
for the normal component. However, the normal component achieves its largest amplitudes in the form of an Ekman response.
Like before,
$\partial _{z}p_{0} = 0$
. Now, the horizontal velocities are isotropic,


The Ekman flux is now larger than within the bulk, but is still subdominant compared with the
$w_{-1}$
convection; i.e.
$\partial _{x} u_{-1} + \partial _{z} w_{-1/2} =0$
,

The
$w_{-1/2}$
does not automatically satisfy the no-slip condition at
$x=0$
, which requires an
${\mathcal{O}}( {\textit{E}}^{\,1/2} )$
corner layer (Kerswell & Barenghi Reference Kerswell and Barenghi1995) that remains passive with respect to the leading-order dynamics. Solving the corner layer requires a somewhat technical direct numerical calculation (Burns et al. Reference Burns, Fortunato, Julien and Vasil2022). While the sidewall Ekman response remains passive in the fully asymptotic regime, we should expect noticeable corrections to heat transport as Rayleigh number transitions out of a strongly rotationally dominated (asymptotic) regime (Stellmach et al. Reference Stellmach, Lischper, Julien, Vasil, Cheng, Ribeiro, King and Aurnou2014; Julien et al. Reference Julien, Aurnou, Calkins, Knobloch, Marti, Stellmach and Vasil2016).
4. System summary
To summarise our multiscale asymptotic analysis, we here report the final equations in dimensional form, with all explicit parameter values restored. All equations and variables are either purely bulk quantities or supported on the bulk boundary. All boundary layers have been eliminated and act only backstage. We, therefore, drop the notation of lowercase and capital letters distinguishing between different scales. All coordinates are lowercase and all variables are in their dimensional physical form. The relevant physical variables are velocity,
$u$
, temperature,
$T$
, vertical vorticity,
$\zeta = \hat {z} \boldsymbol{\cdot} \boldsymbol{\nabla} \times u$
, and wall fluxes,
$q_{\ell },\, q_{z}$
.
4.1. Geometry and operators
The container is now a general upright direct product of a
$z$
direction with a smooth but otherwise arbitrary horizontal area,

with e.g. Cartesian horizontal coordinates
$(x,y) \in \mathcal{A}$
, and vertical coordinate
$z \in [0,H]$
, with local unit vector,
$\hat {z}$
. The boundary of the horizontal area is
$\partial \mathcal{A}$
with the local outward normal vector,
$\hat {n}$
. In the case of a general smooth
$\partial \mathcal{A}$
, the analysis in § 3.2 would follow the same reasoning with
$x$
representing a local signed-distance coordinate (Hester & Vasil Reference Hester and Vasil2023). We represent the coordinate along the tangential direction to
$\partial \mathcal{A}$
as
$\ell$
, with local unit vector,
$\hat {\ell } = \hat {z} \times \hat {n}$
, assuming a right-handed coordinate system. For short-hand, we denote derivatives,
$\partial _{z} = \hat {z} \boldsymbol{\cdot} \boldsymbol{\nabla}$
,
$\partial _{n} = \hat {n} \boldsymbol{\cdot} \boldsymbol{\nabla}$
and
$\partial _{\ell } = \hat {\ell } \boldsymbol{\cdot} \boldsymbol{\nabla}$
. In the two most relevant cases,


We define the respective barotropic average and Hilbert transform,

where p.v. denotes the principle value in the Hilbert transform integral.
4.2. Dynamics
For
$(x,y) \in \mathcal{A}$
,


For
$(x,y) \in \partial \mathcal{A}$
,


For vertical no-slip,

For vertical stress-free,

4.3. Heat transport
Heat transport is an essential diagnostic in convection-dominated systems, with much work for laboratory and natural systems still ongoing (Doering, Toppaladoddi & Wettlaufer Reference Doering, Toppaladoddi and Wettlaufer2019; Schumacher & Sreenivasan Reference Schumacher and Sreenivasan2020; Vasil, Julien & Featherstone Reference Vasil, Julien and Featherstone2021; Lohse & Shishkina Reference Lohse and Shishkina2024). Rapid rotation adds many additional nuances (Julien et al. Reference Julien, Knobloch, Rubio and Vasil2012; Stellmach et al. Reference Stellmach, Lischper, Julien, Vasil, Cheng, Ribeiro, King and Aurnou2014; Julien et al. Reference Julien, Aurnou, Calkins, Knobloch, Marti, Stellmach and Vasil2016). In studies of bulk convection, heat transport from wall modes tends to confound measurements, and efforts have attempted to quantify and/or limit their influence (Ecke et al. Reference Ecke, Zhang and Shishkina2022; Terrien, Favier & Knobloch Reference Terrien, Favier and Knobloch2023; Zhang et al. Reference Zhang, Reiter, Shishkina and Ecke2024).
At first glance, the reduced wall-mode equations do not appear to contain any means for vertical heat transport; there is no bulk vertical velocity. The resolution occurs when taking into account the sidewall heat fluxes. All the vertical heat transport happens in the sidewall layer, which manifests within the bulk via
$q_{z}$
.
Defining the horizontal average,
$\overline {T}$
, the mean thermal equation,

The integrated boundary term follows straightforwardly from integrating the bulk equation and applying (4.7). When the system is in a statistically steady state, the Nusselt number is a global constant with

where in this case,
$\Theta = T - \overline {T}$
.
When considering the boundary terms in (3.15), we also now find the non-trivial wall-modes ‘power integral’ (Howard Reference Howard1963) that regulates overall input and dissipation,

In both cases, it is clear the vertical velocity behaves precisely as a Dirac-
$\delta$
distribution supported on the boundary; e.g. in local Cartesian coordinates with the wall at
$x=0$
,

where the
$\delta$
-function replaces the detailed behaviour within the sidewall layer.
5. Linear instability
From now on, we revert to the same non-dimensionalisation used throughout the asymptotic analysis. However, we do not use mixtures of lowercase and capital letters to distinguish between boundary layer and bulk quantities. All variables and coordinates are bulk-only; hence, we use their standard nomenclature.
For the temperature perturbations linearised around a vertical gradient,
$T = R\, (1-z) + \theta (x,y,z)$
,


The linearised equations collapse into a single equation and set of boundary conditions for the temperature perturbations,


where
$|\partial _{z}| = \mathcal{H}\, \partial _{z}$
. For the top and bottom boundary conditions,

5.1. Semi-infinite channel
This section reproduces the final results of Herrmann & Busse (Reference Herrmann and Busse1993) with the advantage that now, all aspects are physically apparent. We assume a semi-infinite domain with the wall-normal coordinate,
$- \infty \lt x \le 0$
, and harmonic dependence along the wall,
$\sim e^{i k y}$
, with wavenumber
$k$
.
The following exponential profile satisfies the boundary condition at
$x=0$
:

Assuming
$R\gt 0$
, the temperature perturbations decays as
$x\to -\infty$
. The bulk evolution equation gives the dispersion relation for the complex-valued frequency

The real-valued growth rate,

The growth rate is positive for

Zhang et al. (Reference Zhang, Reiter, Shishkina and Ecke2024) fit a quadratic curve through their stability data near the minimum of the form

They find
$\xi \approx 0.18$
for
${\textit{E}} = 10^{-6}$
. Equation (5.9) implies
$\xi = \sqrt {2 - \sqrt {3}}/\pi \approx 0.164769$
.
The respective critical wavenumber and frequency are

The sign convention for frequency means that
$\omega \gt 0$
moves retrograde to the background rotation,
$\Omega$
. The respective phase and group speeds,

within
$\approx \,$
20 % of the group velocity of
$-2.1$
reported in the laboratory experiments of Ning & Ecke (Reference Ning and Ecke1993) for
${\textit{E}}=10^{-3}$
. Notably, the group speed is always prograde above onset.
Even though we remove the explicit Stewartson boundary layer, at onset, the temperature perturbations remain quite localised near the wall (on the bulk scale,
$x$
), where

In physical units, the critical wavelength along the wall is approximately
$\lambda _{y} \approx 1.04\, H$
and the
$e$
-folding length away from the wall is
$\lambda _{x} \approx 0.13\, H$
.
For large
$R$
, the growth rate,

The fastest-growing wavenumber,
$k$
maximises the second term with
$k \sim 3^{1/4} \sqrt {R}$
leaving

The severe dependence of
$\gamma$
on
$R$
makes nonlinear simulations very stiff at high values of criticality. For example, when
$R = 5 \, R_{{c}}(k_{{c}})$
,
$\gamma \approx 2500$
. Fast time scales exist for the propagation as well. As
$R\to \infty$
, the phase and group speeds are exactly opposite,

Reintroducing physical parameters implies the dimensional growth rate in the large-
$R$
limit,

which is notably independent of the container size.
5.2. Ekman-pumping correction
In § 1, we report the critical Rayleigh numbers, including the leading-order corrections due to Ekman pumping effects in (1.3) and (1.4). The
${\mathcal{O}}({\textit{Ta}}^{7/12})$
correction to the bulk
${\textit{Ra}}$
has an interesting history. The
${\mathcal{O}}({\textit{Ta}}^{5/12})$
correction to the wall
${\textit{Ra}}$
has not, to our knowledge, been calculated before.
In his famous book, Chandrasekhar (Reference Chandrasekhar1961) calculated the leading-order contribution for bulk modes (unaware of wall modes at the time) using sound asymptotic methods,

However, when comparing to numerical calculations up to
${\textit{Ta}} \approx 10^{12}$
, he noticed a discrepancy for no-slip boundary conditions that appeared like an offset in the numerical proportionality factor, commenting ‘…the discussion is not “fine” enough to account for the differences in the constants of proportionality…’ (Chandrasekhar Reference Chandrasekhar1961 § 27d ‘The origin of the
$T^{{2}/{3}}$
-law’). The resolution of the issue is that the relative leading-order correction is
$\propto {\textit{Ta}}^{-1/12}$
, which produces a noticeable change for realistic numerical and laboratory parameter values. Ekman pumping manifests especially clearly in heat-transport enhancement (King et al. Reference King, Stellmach, Noir, Hansen and Aurnou2009; Stellmach et al. Reference Stellmach, Lischper, Julien, Vasil, Cheng, Ribeiro, King and Aurnou2014; Julien et al. Reference Julien, Aurnou, Calkins, Knobloch, Marti, Stellmach and Vasil2016). The upshot is that Chandrasekhar’s discussion was ‘fine’ enough; it simply did not go far enough.
The issue became fully resolved when Zhang & Roberts (Reference Zhang and Roberts1998) showed the leading-order correction results from considering the Ekman pumping boundary conditions of the form we use in § 3.3 (3.88), together with a leading-order interior equation of the form as (3.35). The critical bulk Rayleigh number and wavenumber,


as
${\textit{Ta}} \to \infty$
. We use analogous techniques to find the equivalent correction for wall modes.
The Ekman flux in the bulk is much too small to produce a significant correction. However, in the sidewall,




The boundary condition is the key difference from the leading order,

Projecting the system on to the relevant
$2\cos (n\pi Z)$
and
$2\sin (n\pi Z)$
functions,




where recalling
$\alpha _{n} = (n\pi )^{1/3}$
. The right-hand side results from integrating by parts in
$Z$
and using the Ekman boundary conditions,

where by definition,

The solution to the system is straightforward but messy. However, we only need the relationship between
$q_{Y}$
and
$q_{Z}$
to close the system.
The final result in general form (dropping the
$X, Y, Z$
bulk notation),

The right-hand side contains the highly non-local operator

We can now continue to find the
${\mathcal{O}}( \varepsilon ^{1/2} )$
perturbation to
$R, \omega ,k$
from the semi-infinite stability problem:





The system has an extra
$R_{1/2}$
term along with an
${\mathcal{O}}( \varepsilon ^{1/2} )$
time scale,
$\partial _{\tau }$
. For a given wavenumber along the wall,
$k$
,

Optimising the combined reduced Rayleigh number,
$R \sim R_{0}(k) + \sqrt {\varepsilon }\, R_{1/2}(k)$
, over
$k$
yields



as
$\varepsilon \to 0$
. The next section computes the critical onset parameters directly.
5.3. Direct numerical calculations
We validate the asymptotic correction from § 5.2 with high-precision direct numerical calculations of the critical Rayleigh number, wave number and precession frequency over a range of very small but finite Ekman numbers. We solve a linearised version of (2.4)–(2.8) in a finite Cartesian channel assuming
$y$
-direction Fourier dependence with a complex-valued growth rate, i.e.





Nominally, the system has no-slip boundary conditions on all walls. For the top and sides of the domain,

However, vertical reflection symmetry,
$z \to 1-z$
, implies we can save computational cost by restricting to only the upper-half domain with mid-plane conditions,

The perfectly insulating thermal boundary condition at
$x=\Gamma$
implies
$\partial _{x} \theta = 0$
. We impose perfectly conducting conditions at all other walls,
$\theta = 0$
at
$x=0$
and
$z=1$
. The conducting condition at
$x=0$
isolates the dynamics only to the
$x=\Gamma$
boundary with quasi-exponential decay into the interior.
We use Dedalus to implement a continuous Galerkin spectral-element solver for (5.43)–(5.49) with a
$3 \times 3$
array of elements scaled to capture the extreme boundary-layer behaviour for
${\textit{E}} \ll 1$
. Each subdomain uses a tensor-product discretisation with between 20 and 120 polynomial modes in each direction, chosen so that the solutions in each element are resolved to a truncation error of approximately
$ 10^{-6}$
.
We find the minimum onset value for
$R$
as a function of
$k$
at fixed
$E$
by solving the simultaneous equations for the growth rate,

We solve these equations numerically by computing
$\gamma (k,R)$
on a
$3\times 3$
grid of
$(k,R)$
input values in the vicinity of the leading-order solution and interpolating with a local polynomial approximation of the form

We update the approximate solution to (5.50) from the polynomial fit via

The true solution need not lie exactly within the original
$3\times 3$
grid; the polynomial approximation can extrapolate slightly outside the fitting range. We perform the procedure with both Chebyshev and Legendre polynomial discretisations to estimate the numerical uncertainty in the critical parameters, which are in the fourth digit or smaller. The asymptotic results are independent of
$\sigma$
. We set the Prandtl number
$\sigma = 1$
throughout. We use both
$\Gamma = 2, 4$
over the whole range of
${\textit{E}}$
. The results agree to within the other uncertainties of the calculation.
Table 1. Direct spectral element calculations of the stability threshold at various Ekman numbers. Values of
$R$
,
$k$
and
$\omega$
have estimated respective uncertainties of
$4\times10^{-4}$
,
$3\times10^{-4}$
and
$3\times10^{-3}$
, with asymptotic analytical results reported to the same number of digits.

We scan over a range of powers of Ekman number with
$- \log _{10} {\textit{E}} \ge 6$
. These go to 11. Table 1 reports the onset critical parameters,
$(R, k, \omega )$
, from the scans over
${\textit{E}}$
. We perform five-term fits for the critical parameters in powers of
$\varepsilon = {\textit{E}}^{\,1/3}$
as




Figure 4. Plots of critical parameters versus Ekman number for
$6 \le -\log _{10} {\textit{E}} \le 11$
from the spectral-element calculations of (5.43)–(5.49). (a) Scaled Rayleigh number,
$R$
, (b) tangential wave number,
$k$
, and (c) precession frequency,
$\omega$
. Table 2 gives the coefficients of the best-fit expansions in powers of
$E^{1/6}$
. The orange dashed lines show the asymptotic predictions in (5.40)–(5.42).
Figure 4 shows plots over the values covered along with the respective best-fit curves. Table 2 shows the best-fit parameters. The
$R_{0}, \,R_{1/2}$
,
$k_{0}, \,k_{1/2}$
and
$\omega _{0}, \,\omega _{1/2}$
all agree with the predictions in § 5.2 to the number of reported digits.
5.4. Finite cylinder
In the finite cylinder case, we use polar coordinates,
$(r,\phi ,z)$
, where the radius
$0 \le r \le \Gamma / 2$
and azimuth angle
$0 \le \phi \lt 2 \pi$
, where
$\Gamma$
denotes the non-dimensional diameter or aspect ratio.
In this case, we solve the bulk equation first and substitute the result into the boundary condition. The radial dependence uses modified Bessel functions of the first kind,
$I_{m}$
, with integer wavenumber,
$m \in \mathbb{Z}$
, for
$\beta = \sqrt {\pi ^{2} + i\, \omega }$
,

The dispersion relation follows from the boundary condition applied at the outer radius,

Given
$m$
and
$\Gamma$
, we solve
$\textrm{Im}(R) = 0$
using Newton’s method for real-valued
$\omega$
. We define
$R_{{c}}$
and
$\omega _{{c}}$
as their respective values when minimising
$R$
over integer
$m$
values.

Figure 5. Critical parameters for a finite cylinder with aspect ratio,
$\Gamma$
. In each case, the dashed lines show the value for the semi-infinite channel. The critical wavenumber is typically within one of
$m \approx \lfloor \Gamma \,k_{{c}}/2 \rfloor$
, where
$k_{{c}}$
is the critical wavenumber from the semi-infinite case. Starting from
$m=1$
, for low aspect ratio, each cusp feature in
$R_{{c}}$
indicates a unit increase in
$m$
. The overall minimum happens for
$\Gamma \approx 0.366196$
and
$m=1$
with
$R \approx 26.3789$
and
$\omega \approx 82.9854$
.

Figure 6. Mid-plane linear temperature perturbations from two critical modes at aspect ratios
$1$
and
$3$
, respectively. The
$\Gamma =1$
case prefers
$m=3$
angular wavenumber, while
$\Gamma = 3$
prefers
$m=9$
.
Figure 5 shows the dependence of
$R_{{c}}$
and
$\omega _{{c}}$
on aspect ratio,
$\Gamma$
. The cusp-like features in both plots correspond to discrete changes in the optimal
$m$
value. At the lowest aspect ratios, the optimal
$m=1$
and scales approximately as
$m \sim \lfloor \Gamma \,k_{{c}}/2 \rfloor$
. Compared with the various Ekman-number corrections, the finite cylinder has a much stronger,
${\mathcal{O}}( 1 )$
, influence on the onset Rayleigh number.
Figure 6 shows two critical mid-plane temperature perturbation profiles for respective aspect ratios,
$\Gamma = 1, 3$
. We report the solutions from applying Newton’s method to (5.56):
for
$\Gamma = 1$
,
$R_{{c}} \approx 29.26899820255448$
,
$\omega _{{c}} \approx 68.93822823201539$
with
$m_{{c}} = 3$
;
for
$\Gamma = 3$
,
$R_{{c}} \approx 30.84620681276749$
,
$\omega _{{c}} \approx 66.29068835812032$
with
$m_{{c}} = 9$
.
We also compare to finite
${\textit{E}}=10^{-6}$
calculations of Zhang et al. (Reference Zhang, Reiter, Shishkina and Ecke2024) with
$\Gamma =1/2$
, which find
$R_{c}\approx 28$
and
$\omega _{c} \approx 73.5$
. For asymptotically low
${\textit{E}}$
, with
$\Gamma = 1/2$
, we find
$R_{{c}} \approx 27.100728659192075$
,
$\omega _{{c}} \approx 73.83743415332742$
with
$m_{{c}} = 1$
.
5.4.1. Tall aspect ratio
In the past few years, multiple research groups (e.g. Julien et al. Reference Julien, Aurnou, Calkins, Knobloch, Marti, Stellmach and Vasil2016; de Wit et al. Reference de Wit, Aguirre Guzmán, Madonia, Cheng, Clercx and Kunnen2020; Pandey & Sreenivasan Reference Pandey and Sreenivasan2024) have started using tall-aspect-ratio cylinders for rapidly rotating convection experiments; the reason being rotation’s tendency to produce elongated vertical structures. The linear stability result in (5.57) simplifies considerably as
$\Gamma \to 0$
. Assuming
$\omega \,\Gamma \sim {\mathcal{O}}(1)$
,

Imposing
$\textrm{Im}(R) = 0$
and minimising over
$m$
implies

with
$m_{c} = 1$
. Notably, the aspect ratio and scaled Rayleigh number exactly cancel the cylinder depth dependence in the onset condition,

where
$D$
is the cylindrical diameter. Furthermore, as
$ \Gamma \sim {\textit{E}}^{\,1/3}$
, the critical Rayleigh number scales the same as for bulk onset, also with identical horizontal scales,
${\mathcal{O}}({\textit{E}}^{\,1/3})$
.
5.5. Local baroclinic instability
When assembled in one place, it becomes apparent that the wall-mode system is closely akin to the well-known planetary geostrophy (PG) equations, which typically model global-scale stratified ocean dynamics away from coastlines (Vallis Reference Vallis2017, Ch. 5). The PG system has rich dynamical structure. However, a common puzzle is the lack of self-consistent forcing and eventual rundown (Schonbek & Vallis Reference Schonbek and Vallis1999). Several works have considered coupling to smaller-scale quasi-geostrophic (QG) dynamics as an intriguing alternative (see Grooms, Julien & Fox-Kemper Reference Grooms, Julien and Fox-Kemper2011).
This context clarifies that the wall-mode convection equations are a sideways forced PG-QG system that relies on non-hydrostatic QG (similar to Sprague et al. Reference Sprague, Julien, Knobloch and Werne2006; Julien & Knobloch Reference Julien and Knobloch2007) near lateral boundaries and interacts with a barotropic QG system in the stress-free case. However, we can also study the wall-mode equations without the lateral boundary interactions.
In a highly reduced setting, the combined baroclinic temperature and barotropic vorticity equations have a rudimentary form of baroclinic instability. To demonstrate, we neglect all diffusion and boundary conditions. Hence,

The baroclinic equation has a family of exact nonlinear solutions,

where
$V_{0}(z)$
is an arbitrary
$y$
-direction velocity profile (without loss of generality,
$\!\left \langle V_{0}\right \rangle = 0$
),
$P_{0}(z)$
is and arbitrary hydrostatic pressure,
$A(y+c\,t)$
is an arbitrary complex-valued amplitude function and
$c$
is a complex-valued phase speed.
The barotropic equation requires the solvability condition,

For example,
$V_{0} = z-1/2$
,
$c = \pm i / \sqrt {12}$
, or
$V_{0} = \cos (\pi z)$
,
$c = \pm i/\sqrt {2}$
. When
$A(y) = A_{0}\,e^{iky}$
for wavenumber
$k$
, then the growth rate is
$k \| V_{0} \|$
.
6. Nonlinear simulations
Using Dedalus (Burns et al. Reference Burns, Vasil, Oishi, Lecoanet and Brown2020), we simulate the full nonlinear system of reduced wall-mode equations summarised in § 4. We use a full cylinder with radius
$0 \le r \le \Gamma /2$
, azimuth
$0 \le \phi \lt 2 \pi$
and height
$0 \le z \le 1$
. We use a spectral basis of generalised Zernike polynomials that naturally represent scalars, vectors and tensors in the full unit disk, including
$r=0$
with no singularities; see Vasil et al. (Reference Vasil, Burns, Lecoanet, Olver, Brown and Oishi2016) for details.
6.1. Set-up for numerical implementation
Our Dedalus implementation corresponds to the following description. We use the same non-dimensional form as the asymptotic analysis with control parameters
$R$
,
$\sigma$
and
$\Gamma$
. We use
$p(r,\phi ,z)$
as the primary dynamical variable for the bulk. The respective thermal wind relations follow:

We represent
$p(r,\phi ,z)$
in a vertical cosine series,
$\cos (n \pi z)$
. The temperature perturbations,
$\Theta (r,\phi ,z)$
, are naturally a sine series
$\sin (n \pi z)$
. For
$n \ge 1$
, the baroclinic evolution equation

We use a second-order accurate multi-stage, mixed implicit-explicit time-stepping scheme (SBDF scheme from Wang & Ruuth Reference Wang and Ruuth2008), with linear terms on the left-hand side treated implicitly and the nonlinear right-hand side treated explicitly. The time-step size follows a Courant–Friedrichs–Lewy criterion with a 0.2 safety factor. Because of the fine radial grid spacing near the walls, normal flow through the outer boundary principally determines the time-step size.
For the outer boundary conditions at
$r=\Gamma /2$
, we use
$q_{z}(\phi ,z)$
as an additional variable and define
$q_{\phi } = -\mathcal{H} [ q_{z} ]$
. The boundary conditions are


where
$\boldsymbol{\nabla} _{\!\phi \,} = \hat {\phi } \boldsymbol{\cdot} \boldsymbol{\nabla} _{\!\bot \,} = r^{-1} \partial _{\phi }$
. As before, all linear terms are time-implicit and nonlinear terms are explicit.
The
$n=0$
barotropic pressure mode has different dynamics depending on the velocity boundary conditions on the top and bottom. In both cases, we define the vorticity,
$\zeta = \nabla _{\!\bot \,}^{2} p$
.
For stress-free, we evolve the
$n=0$
vorticity equation,

with, again, the usual linear–nonlinear implicit–explicit time splitting. For boundary conditions at
$r =\Gamma /2$
,

For no-slip, the
$n=0$
mode involves a bit of subtlety. The Ekman boundary condition directly mixes barotropic and baroclinic variables, and parity mixes sine and cosine series. Vasil, Brummell & Julien (Reference Vasil, Brummell and Julien2008a
, Reference Vasil, Brummell and Julien
b
) discusses issues like this detail. The problem is that we need to represent expressions like (3.92) spectrally using sine/cosine series. We fix the barotropic pressure via

with the boundary condition
$\!\langle p\rangle =0$
at
$r = \Gamma /2$
. We use the finite spectral representation of
$1/2 -z$
,

A sufficiently large resolution parameter,
$N$
, captures the integral exactly. That is,

for all
$n \le N$
. The advantage is that
$K_{N}$
is a natural sine series having a natural inner product with
$\Theta$
.
6.2. Weakly nonlinear theory
Weakly nonlinear theory (see, e.g. Hoyle Reference Hoyle2006) is a useful diagnostic for validating calculations slightly above onset. We define the perturbation parameter,

As
$\delta \to 0$
, we expect pressure perturbations of the form,

with a complex-valued amplitude varying on a slow time scale

The pressure ansatz contains the leading-order convective modes derived from linear theory and the next-order convective feedback. At a given aspect ratio, we pick the most unstable wavenumber,
$m$
, defining the non-dimensional radius,
$a=\Gamma /2$
. The radial functions are normalised modified Bessel functions,

with the coefficients,



with
$P_{n,-j}(r)= P^{*}_{n,j}(r)$
. The normalisations ensure the solution satisfies all horizontal boundary conditions, along with the no-slip barotropic condition (6.7); the stress-free barotropic condition would be considerably more complicated.
The weakly nonlinear amplitude satisfies a complex Ginzburg–Landau equation,

where


for
$\Phi _{2,2} = P_{2,2}(a) (r/a)^{2m}$
. Linear theory is sufficient to determine
$c_{0}$
. The nonlinear coefficient,
$c_{1}$
, requires a messy perturbation series that closes at
${\mathcal{O}}(\delta ^{3/2})$
. There is no spatial modulation because of the discrete critical wavenumber,
$m$
.
The amplitude in (6.17) converges to a travelling wave,

with time-independent parameters,

The weakly nonlinear solution also gives the Nusselt number near onset. In particular,

where we define the convective conductivity,

As the aspect ratio
$\Gamma \to 0$
, the calculation simplifies considerably and
$K \to 12/7$
. As the aspect ratio,
$\Gamma \to \infty$
, we can use a derivation along a flat Cartesian wall such that

Like the conductivity, we find similar behaviour for the nonlinear frequency,
$\omega = \omega _{c} + \Omega$
. We define the convective Doppler shift,

As
$\Gamma \to 0$
,
$\Upsilon \to 173/252$
. Likewise,

Figure 7 shows plots of
$K$
and
$\Upsilon$
as a function of aspect ratio.

Figure 7. Nusselt number and relative frequency slopes as a function of aspect ratio in a finite cylinder. For the Nusselt number, the horizontal dashed grey line indicates the analytical value of
$12/7$
as
$\Gamma \to 0$
and the dashed grey curve shows the asymptotic scaling
$\sim\! 0.701822/\Gamma$
as
$\Gamma \to \infty$
derived from a flat Cartesian wall bounding an infinite bulk interior. For the relative frequency slope, the dashed grey curves indicate the analytical value
$173/252$
as
$\Gamma \to 0$
and the Cartesian result
$\sim \!1.533634$
as
$\Gamma \to \infty$
. The jagged behaviour in both plots results from jumps in the critical mode number,
$m_{c}$
, as a function of aspect ratio.
6.3. Fully nonlinear results
We first use the methods described previously to simulate the reduced wall-mode equations with no-slip boundary conditions,
$R=2R_{{c}}$
, and
$\Gamma =4/5$
. For this choice of
$\Gamma$
, the most unstable mode at
$R=R_{{c}}$
has an azimuthal wavenumber
$m=2$
(figure 5). We initialise the simulation with random low-amplitude noise, which triggers the wall-mode instability. At
$R=2R_{{c}}$
, the modes
$m=2,3,4,5,6$
all have positive growth rates, and
$m=3,4$
have the largest growth rates with
$\gamma /\omega _{{c}} \approx 2.3$
and
$\omega /\omega _{{c}} \approx 3.1, 2.7$
, respectively.
The instability saturates nonlinearly, producing characteristic patterns which propagate retrograde. In figure 8, we plot the temperature near the wall and at mid-height as a function of
$\phi$
and
$t$
. The downward propagation of the constant temperature stripes shows the retrograde motion. Initially, the dominant mode has
$m=3$
, but a coarsening event at
$t\approx 0.075$
results in a steady
$m=2$
pattern that persists for the rest of the simulation. Others have also found coarsening dynamics in similar systems (Ecke et al. Reference Ecke, Zhong and Knobloch1992; Plaut Reference Plaut2003; Scheel et al. Reference Scheel, Paul, Cross and Fischer2003; Choi et al. Reference Choi, Prasad, Camassa and Ecke2004; Lopez et al. Reference Lopez, Marques, Mercader and Batiste2007; Favier & Knobloch Reference Favier and Knobloch2020). In particular, Ning & Ecke (Reference Ning and Ecke1993), Liu & Ecke (Reference Liu and Ecke1997, Reference Liu and Ecke1999) study the coarsening dynamics for wall modes in great detail. They find that increasing the Rayleigh number slowly (as we do later) maintains
$m$
, but a rapid change in Rayleigh (e.g. starting a simulation with
$R/R_{{c}}=2$
from noise) usually results in a decrease in
$m$
. We chose
$\Gamma =4/5$
because the simulations yield an
$m=2$
pattern, implying more azimuthal resolution elements per wavelength than simulations with the same resolution but higher
$m$
.

Figure 8. Temperature space–time propagation diagram as a function of azimuth,
$\phi$
, and time,
$t$
. The pattern shows the
$R/R_{{c}} = 2$
, no-slip case. The simulation starts with low-amplitude noise. The modes
$m=2,3,4,5,6$
are all unstable, with
$m=3, 4$
having the largest growth rates. The pattern starts with approximately three full wavelengths, undergoes a coarsening defect and settles into a consistent two-wave pattern.
Having found a steady propagating solution for no-slip boundary conditions and
$R=2R_{{c}}$
, we bootstrap simulations to progressively higher and lower
$R$
, and simulations with stress-free boundary conditions. We run each simulation until it reaches a statistically steady state. Starting from the saturated state with a similar
$R$
means that our subsequent simulations run much shorter than figure 8. In each case, we find the
$m=2$
structure persists. Table 3 lists all simulations described in this paper and their spatial resolution.
Table 3. Summary of all simulation parameters. In every case, for
$\Gamma = 4/5$
, the critical scaled Rayleigh number,
$R_{{c}} = 28.237851887421087$
, and the critical frequency,
$\omega _{{c}} = 69.02735982770973$
. For the weakly nonlinear parameters,
$K = 0.8028404111646181$
,
$\Upsilon = 1.0933189545586768$
. By convention,
$\omega \gt 0$
corresponds to retrograde propagation, consistent with the linear instability calculation. For top/bottom boundary conditions, NS is no-slip, and SF is stress-free. While
$R,\Gamma ,\sigma$
are input control parameters, figure 12 shows the output parameters,
$\omega$
and
${\textit{Nu}}$
. Note that the Prandtl number,
$\sigma$
, drops out of the system in the no-slip case.

Figure 9 illustrates the sidewall temperature pattern for different boundary conditions and
$R$
. In all cases, alternating hot/cold spots exist near the top/bottom of the domain. These patterns extend towards the mid-plane of the cylinder as
$\phi$
increases past the location of the hot/cold spot. The height of these patterns is smaller for no-slip boundary conditions than for stress-free boundary conditions. As
$R$
increases, the edges of the hot/cold spots sharpen, producing front-like features that require increasingly higher azimuthal resolution for increased
$R$
. The stress-free simulations have sharper, more vertically aligned fronts. For this reason, we could only run simulations up to
$R=4R_{{c}}$
for stress-free boundary conditions, while we were able to reach higher values
$R=6R_{{c}}$
in the no-slip simulations.

Figure 9. Temperature patterns at the sidewall: (a) the no-slip case and (b) the stress-free case. The rows show
$R/R_{{c}} = 2,3,4$
, respectively.
To show the horizontal structure of the modes, we plot the pressure at the top of the cylinder in figure 10. The pressure perturbations extend significantly into the cylinder, although the features become increasingly localised to the walls at
$R$
increases. Recall that these visualisations do not directly show the Stewartson layers, which are infinitely thin in this analysis. For stress-free boundary conditions, the pressure fields do not change substantially as
$R$
increases, other than becoming more localised near the boundary. However, there are more interesting differences between different
$R$
for the no-slip boundary conditions. For low
$R$
, the contours of constant pressure slope away from the walls in the counter-clockwise direction, as for the stress-free simulations. However, for higher
$R$
, the no-slip simulations have contours of constant pressure that slope away from the walls in the clockwise direction. This change in morphology may be related to the differences in propagation direction between the no-slip and stress-free simulations at higher
$R$
.

Figure 10. Pressure patterns at the top: (a) the no-slip case and (b) the stress-free case. The rows show
$R/R_{{c}} = 2,3,4$
, respectively.
Figure 11 visualises the momentum fluxes within the Stewartson layers by plotting
$\boldsymbol{\nabla} \boldsymbol{\cdot} q$
and
$\boldsymbol{\nabla} \times q$
. For the divergence,
$\boldsymbol{\nabla} \boldsymbol{\cdot} q = u_{n}$
, the normal velocity at the sidewall. For the curl,
$\boldsymbol{\nabla} \times q = \partial _{\ell } q_{z} - \partial _{z}q_{\ell }$
. The temperature at the sidewalls shows sharper azimuthal features in the stress-free simulations than in the no-slip simulations. In particular,
$\boldsymbol{\nabla} \times q$
is very sharp and vertically aligned in the stress-free simulation. In the no-slip simulation,
$\boldsymbol{\nabla} \boldsymbol{\cdot} q$
and
$\boldsymbol{\nabla} \times q$
are much wider and show the same types of sloped features near the top and bottom of the cylinder as the temperature field.

Figure 11. Boundary-layer divergence and curl patterns at the sidewall: (a) the no-slip case and (b) the stress-free case. Both cases correspond to
$R/R_{{c}} =4$
.
Next, we focus on heat transport. In the reduced wall-mode equations, the bulk has no vertical velocity. Within the asymptotic approximation, the diffusive heat transport happens entirely in the bulk, while the convective heat transport happens entirely within the Stewartson layer. In polar coordinates,

where
$|\mathcal{A}| = \pi \Gamma ^{2}/4 = 4\pi /25 \approx 0.502$
and
$\Theta = T - \overline {T}$
. These fluxes sum to the total
$F_{t} = F_{c}+F_{d}$
. The dynamical equations (6.2)–(6.4) have the mean linear conduction temperature profile subtracted out, i.e.
$- \partial _{z} T_{0} = R$
; therefore, the Nusselt number
${\textit{Nu}} = F_{t}/R$
.
Figure 12(a) shows
${\textit{Nu}}$
as a function of
$R/R_{{c}}$
for both no-slip and stress-free simulations. As
$R$
increases from
$R_{{c}}$
, we find that
${\textit{Nu}}$
also increases linearly, as
${\textit{Nu}} - 1 \approx 0.807\,(R/R_{c}-1)$
, shown as a dashed line in the plot. Weakly nonlinear theory predicts a coefficient
$\approx 0.8028404111646181$
; see § 6.2 and figure 7. As
$R$
increases further,
${\textit{Nu}}$
appears to level off, although we have not been able to run at sufficiently high
$R/R_{{c}}$
to probe the asymptotic behaviour as
$R$
becomes large.
Figure 12(b) shows the different depth-dependent heat fluxes for the no-slip and stress-free simulations with
$R=4R_{{c}}$
. The convective heat flux is zero at the top and bottom boundaries (
$q_z=0$
), but becomes significant and roughly constant away from the horizontal boundaries. The diffusive heat flux is large and positive in the top and bottom thermal boundary layers, but intriguingly becomes negative near the mid-plane. The (vertical) middle of the cylinder becomes stably stratified due to the wall modes. All of our simulations are thermally equilibrated, where the total heat flux is constant with height.

Figure 12. Diagrams of heat transport. (a) Nusselt number,
${\textit{Nu}}$
, versus the supercriticality,
$R/R_{{c}}$
; by definition
${\textit{Nu}}_{\,{c}} = 1$
at onset. The dashed line shows the approximate linear dependence
${\textit{Nu}}-1 \approx 0.807\, (R/R_{c}-1)$
near onset; weakly nonlinear predicts a slope of
$\approx \,$
0.803 (an
$\approx \,$
0.5 % difference). (b) Depth-dependent diffusive (
$F_{d}$
, blue) and convective fluxes (
$F_{c}$
, red), along with their total (
$F_{t} = F_{c} + F_{d}$
, gold) for simulations with
$R=4R_{{c}}$
. The solid lines correspond to the no-slip case, and the dashed lines correspond to stress-free. The independence of
$F_{t}$
on depth,
$z$
, implies the systems are in statistically steady state with a well-defined
${\textit{Nu}}$
.
Figure 8 shows the instability of the wall modes saturates into nonlinear travelling waves. At onset, we calculate the propagation rate via linear theory; for
$\Gamma =4/5$
,
$\omega _{{c}} \approx 69$
in the retrograde direction. Previous work has shown the propagation rate varies with
$R$
as expected for a supercritical Hopf bifurcation (e.g. Zhong et al. Reference Zhong, Ecke and Steinberg1991; Liu & Ecke Reference Liu and Ecke1999; Favier & Knobloch Reference Favier and Knobloch2020). Figure 13 shows the propagation rate as a function of
$R/R_{{c}}$
for no-slip and stress-free simulations. As
$R$
increases from
$R_{{c}}$
, the propagation rate
$\omega$
also increases approximately linearly. At
$R=2R_{{c}}$
, the propagation rates are similar for no-slip and stress-free simulations. However, for larger
$R$
, the propagation rate increases monotonically for stress-free simulations, whereas the propagation rate decreases for simulations with no-slip boundary conditions. The experiments of Zhong et al. (Reference Zhong, Ecke and Steinberg1993) at
${\textit{E}} \approx 10^{-4}$
similarly found a maximum propagation rate for
$R/R_{c} \approx \text{2-3}$
. At
$R\approx 5R_{{c}}$
, there is a steady nonlinear solution with no propagation and for higher
$R$
, the solution propagates in the prograde direction. This switch in propagation direction may relate to the change in the azimuthal structure of the pressure perturbations shown in figure 10. Horn & Schmid (Reference Horn and Schmid2017), Ravichandran & Wettlaufer (Reference Ravichandran and Wettlaufer2023) and Zhang et al. (Reference Zhang, Reiter, Shishkina and Ecke2024) all find mixtures of both prograde and retrograde propagation, depending on parameter values. Note that the structure of the temperature perturbations at the boundary (e.g. shown up to
$R=4R_{{c}}$
in figure 9) does not change significantly as
$R$
increases, other than becoming increasingly sharp.

Figure 13. Propagation rate,
$\omega / \omega _{{c}}$
versus the supercriticality,
$R/R_{{c}}$
. The dashed line shows the approximate linear dependence
$\omega /\omega _c-1 \approx 1.05\, (R/R_{c}-1)$
near onset; compared with a predicted slope of approximately 1.09. The rates continue to increase up to
$R/R_{{c}} \approx 2$
. At that point, the stress-free case continues the trend, while the no-slip rate declines and eventually becomes prograde beyond
$R/R_{{c}} \approx 5$
.
Finally, figure 14 shows the mean zonal flow for no-slip and stress-free simulations with
$R=4R_{{c}}$
. At this
$R$
, the modes propagate retrograde in both simulations. However, the mean zonal flow at the boundary is positive in the no-slip simulation, while it is (as expected) negative in the stress-free simulation. The change in propagation direction for simulations with no-slip boundary conditions may be related to the development of a prograde mean zonal flow.

Figure 14. Mean zonal flow in meridional planes: (a) no-slip case and (b) stress-free case. Both cases correspond to
$R/R_{{c}} =4$
.
7. Discussion and conclusions
This work derives a balanced set of reduced equations governing the nonlinear development of convective wall-mode instabilities in rapidly rotating systems. Focusing on the essential dynamics within the bulk interior and the Stewartson boundary layers at the sidewalls captures the multiscale nature of wall-mode convection. The dynamics of the bulk interior diagnostically determine the small-scale behaviour within the boundary layers. In contrast, the sidewall layers feedback onto the interior through a nonlinear lateral heat-flux boundary condition, providing a closed and self-consistent system. The final reduced dynamics occupies the bulk interior and comprises geometrically prescribed scales. All boundary layers exist behind the scenes, their presence felt via a series of non-trivial boundary conditions that include nonlinear transport and non-local integral transforms.
The bulk-boundary connection offers a clearer understanding of the interplay between multiscale dynamics, often obscured in full models due to their complexity. Systematically eliminating secondary effects isolates the dominant physical processes, enhancing explanatory power and providing avenues for investigating wall-mode convection in the strongly nonlinear regime. A key difference between our studies and previous high-resolution wall-mode simulations is that we find no evidence for secondary instability of the sharp propagating thermal structures. We do not know if this is because we have not reached high enough supercriticality (other work found secondary instabilities for
$R/R_{{c}}\gtrsim 10$
; Favier & Knobloch Reference Favier and Knobloch2020; de Wit et al. Reference de Wit, Boot, Madonia, Aguirre Guzmán and Kunnen2023; Zhang et al. Reference Zhang, Reiter, Shishkina and Ecke2024) or because the equations filter baroclinic inertial dynamics. We imagine future work methodically reintroducing dynamic ingredients that model various aspects of vortex separation, which can populate the interior with pseudo-convective structures. The first additional effect worth including is the
${\mathcal{O}}( {\textit{E}}^{\,1/6} )$
Ekman flux correction in (5.32).
We also reiterate that the final system of equations resembles boundary-forced planetary geostrophic baroclinic dynamics coupled with barotropic quasi-geostrophic vorticity. These connections point to ways systems can include more general effects, such as container topography. We also expect to include centrifugal buoyancy effects in future studies. We have also not considered the effect of a low Prandtl number, which would much more strongly emphasise inertia. Furthermore, these equations can incorporate additional physical effects, as described later.
7.1. Double diffusion
Given the simple physical principles involved, we can, in one way, generalise the thermal wall-mode instability almost by inspection. In the case of thermal-solutal buoyancy, we simply add an equation for the ‘salt’ concentration,
$s(t,x,y,z)$
. The updated hydrostatic balance,

The relevant linear bulk equation and boundary condition,

The total set of control parameters includes the solutal scaled Rayleigh number and the Lewis number,

where
$\lambda$
is the solute diffusivity,
$\alpha _{S}$
is solutal the expansion coefficient and
$\Delta S$
is the total imposed concentration jump. As defined,
$R_{T} \gt 0$
and
$R_{S} \gt 0$
both typically imply destabilising configurations. For a neutral-density background state,
$R_{T} + R_{S} = 0$
. In many common situations,
${\textit{Le}} \gg 1$
; e.g. for salt and water,
${\textit{Le}} \approx 10^{3}$
.
Carrying out a nearly identical analysis to the purely thermal case, the condition for marginal stability is

where
$k$
is the
$y$
-direction Fourier wavenumber and
$\omega$
is the complex-valued frequency. In this case, the dispersion relation shows similarities and differences with both the fingering and oscillatory regimes of double-diffusive convection (Garaud Reference Garaud2018). We leave the exploration of the critical
$R_{T}, R_{S}$
curve for later work.
7.2. Magnetism
Finally, we point out that magnetic wall modes have attracted significant recent attention (Busse Reference Busse2008; Liu, Krasnov & Schumacher Reference Liu, Krasnov and Schumacher2018; Grannan et al. Reference Grannan, Cheng, Aggarwal, Hawkins, Xu, Horn, Sánchez-Álvarez and Aurnou2022; Xu, Horn & Aurnou Reference Xu, Horn and Aurnou2023; McCormack et al. Reference McCormack, Teimurazov, Shishkina and Linkmann2023; Teimurazov et al. Reference Teimurazov, McCormack, Linkmann and Shishkina2024). There are many ways these dynamics will interact with rapid rotational effects.
In magnetohydrodynamics (MHD), the Chandrasekhar number (Chandrasekhar Reference Chandrasekhar1961) measures the influence of an imposed background magnetic field,

where
$B_{0}$
is the vertical field strength,
$\mu _{0}$
is the permeability,
$\rho _{0}$
is the average density and
$\eta$
is the magnetic diffusivity. In the large-
$Q$
limit, several interesting overlaps occur with rapidly rotating parameters that point to extremely rich unexplored dynamics.
In particular, the equivalent MHD sidewall interactions will couple with the rotating version, provided the boundary layers have the same scaling in the same Rayleigh number regime. Indeed, Busse (Reference Busse2008) shows wall-mode convection exists when


In both cases, the velocity amplitudes in the boundary layers scale
$v,w \sim \,\text{d}{x}^{-1}$
. Therefore, when
$Q \sim {\textit{Ta}}^{2/3}$
, we expect interesting non-trivial interactions with
${\mathcal{O}}( 1 )$
bulk dynamics coupled to the sidewall via nonlinear and non-local boundary conditions. We should also expect magnetic double-diffusive effects (Silvers et al. Reference Silvers, Vasil, Brummell and Proctor2009).
However, even more fascinating complications can occur in the ‘magnetostrophic’ regime (Horn & Aurnou Reference Horn and Aurnou2022). In this case, convection can arise purely in the bulk on
${\mathcal{O}}(1)$
length scales without reference to sidewalls. This regime requires a joint balance

Convection sets in when
$Q^{2} \sim 3 {\textit{Ta}}$
and
${\textit{Ra}} {\textit{Ta}}^{-1/2} \sim 3 \sqrt {3}\, \pi ^{2} \approx 51.284$
, or
$R/R_{{c}} \approx 1.6$
in terms of the wall modes. The implication is that magnetostrophic convection will interact with the sidewall layers with a lower magnetic field strength than purely sidewall-catalysed magnetoconvection. If the wall modes can drive a kinematic dynamo, this is likely the regime in which the system would end up.
Acknowledgements
The authors thank J. Aurnou, B. Ecke, B. Favier, I. Grooms, S. Horn, E. King, E. Knobloch, M. Linkmann and M. McCormack for many helpful and insightful conversations on wall modes and related topics.
We dedicate this work to our dear friend, K. Julien – you’ve done so much for so many in far too short a time.
Data availability
We use the Dedalus code and additional analysis tools written in Python and Mathematica. Beyond the main Dedalus installation, all scripts are available at GitHub (https://github.com/geoffvasil/rapidly_rotating_wall_modes)
Funding
K.J.B., D.L., J.S.O. and B.P.B. acknowledge support from the NASA HTMS grant 80NSSC20K1280 and the NASA OSTFL grant 80NSSC22K1738. D.L. is partially supported by Sloan Foundation grant FG-2024-21548 and Simons Foundation grant SFI-MPS-T-MPS-00007353.
Declaration of interests
The authors report no conflict of interest.