1. Introduction
The Burger’s equation

where
$u:\mathbb R_+\times \mathbb R \to \mathbb R$, is a particularly simple example of a conservation law in one spatial dimension. As a simple model, which resembles (both in its form as well as its behaviour) many distinguished equations in mathematical physics, it is an ubiquitous object in the modern dynamical systems theory. The Cauchy problem, along with many other properties, has been explored in the literature. In particular, and specifically for this model, the Cole–Hopf transformation has played an important role as it transforms (1.1) into a linear problem. More specifically, the assignment
$u=-2\frac{w_x}{w}$ reduces (1.1) to the linear heat equation for w, namely

In this way, one can write explicit solutions and derive particular properties of the conservation laws, which are not transparent in the usual general implicit formulations.
Along with the Burgers equation, related models have been analysed. The KdV–Burgers model

which describes, among other things, bores in viscous fluids [Reference Su and Gardner12], and weak plasma shocks propagating transversely to a magnetic field. Many similar models were analysed recently, among them the following general model [Reference Barker, Bronski, Hur and Yang2],

Here,
${\mathcal L}$ is a multiplier operator, given by
$\widehat{{\mathcal L} f}(k) = l(k) \hat{f}(k)$. Now, we must impose assumptions on
${\mathcal L}$. Namely, it is required that
${\mathcal L}$ maps real-valued functions into real-valued functions, and also its quadratic form is non-positive. That is, it satisfies


The condition (1.5) can be alternatively stated as
$l[0]=0$ and l is continuous at zero, but the form (1.5) is more suitable in this context. Note that (1.3) incorporates (1.2), but includes much more. In fact, we list several physically relevant cases: the KdV–Burgers,
${\mathcal L}=\nu \partial_{xxx}$, the Benjamin–Ono–Burgers,
${\mathcal L}=\partial_x |\partial_x|$, the Hilbert–Burgers case
${\mathcal L}= \mathcal H=\partial_x|\partial_x|^{-1}$, the fractional Burgers Footnote equation 1 with
${\mathcal L}=-\sum_{j=1}^N a_j (-\partial_{xx})^{\alpha_j}, \ \ \ 0 \lt \alpha_1 \lt \alpha_2 \lt \ldots \lt \alpha_N \lt 1, a_j \gt 0$, etc. All these could be of interest.
Understanding of the long-time dynamics of such models is clearly important, and as it turns out a challenging problem. As is usual in conservation laws, the standard background is starting with heteroclinic data with enough decay towards the limiting values at
$\pm \infty$. To this end, assume that the initial data u 0 satisfies

where
$u_- \gt u_+$ are constants.
1.1. Galilean transformation: reductions and existence of fronts
Let us record two elementary properties of Equation (1.3). For the Burgers equation, i.e. the model (1.3) with
${\mathcal L}=0$, we have true invariances with respect to the Galilean transformation

as well as the scaling transformation

These two transformations, in general, do not commute with the flow of (1.3), but they transform it into a model of the same type, i.e. it still satisfies (1.4), (1.5). Since our goal is to understand the long-term dynamics of heteroclinic initial data, we can use these transformations to our advantage as follows: given
$u_{\pm}, u_- \gt u_+$, introduce
$c=\frac{u_- + u_+}{u_--u_+}, \lambda=\frac{u_- - u_+}{2} \gt 0$ and a function v, so that

Then, the function U will satisfy the Footnote equation 2

where
$\widehat{{\mathcal L}_1 g}(k)=\lambda^{-2} l(\lambda k) \hat{g}(k)$,
$\lim_{x\to \mp \infty} U(x) =\pm 1$. As we shall see,
${\mathcal L}_1$ also satisfies the relevant assumptions, (1.4), (1.5), so it will suffice to work with the model, where
$U_{\mp}=\pm 1$. Naturally, and by the choice of the transformations, this corresponds to a case, where c = 0 as well. Thus, we restrict our attention to the steady-state solutions of (1.3) with end values
$\mp 1$, which in this case are functions
$\phi:\mathbb R\to \mathbb R$,

The existence theory for such a problem is, of course, highly non-trivial. In fact, as in previous papers, we shall make assumptions to this effect. We mention in this regard that for the KdV–Burger’s model, the existence of such monotone ϕ was proved in [Reference Bona, Rajopadhye and Schonbek4, Reference Bona and Schonbek5]. It is worth adding that these fronts are monotone if and only if
$|\nu|\leq \frac{1}{4}$, [Reference Bona, Rajopadhye and Schonbek4]. In addition, such solutions are unique (up to a translation), and if one specifies the values
$u_{\pm}$. In fact, in the case
$\nu=-\frac{1}{10}$, there is the explicit formulaFootnote 3

1.2. Asymptotic attraction towards heteroclinic fronts
We now review the literature regarding earlier asymptotic stability results. Pego [Reference Pego11] has shown that the monotonic fronts are orbitally asymptotically stable. More specifically, he showed that any (small) perturbation of the front converges to some translate of the front. In later works, Khodja [Reference Khodja8] and Naumkin–Shishmarev [Reference Naumkin and Shishmarev9, Reference Naumkin and Shishmarev10] showed that the KdV–Burgers fronts are asymptotically stable, even for some “slightly” non-monotonic fronts, i.e.
$\nu: 0 \lt \nu-\frac{1}{4} \lt \lt 1$.
Next, we introduce the results of Barker–Bronski–Hur–Yang [Reference Barker, Bronski, Hur and Yang2], which represents a real breakthrough in the study of this problem. Specifically, the authors improved the aforementioned results in at least three important directions. Firstly, they consider all (non-necessarily monotonic) fronts, subject to a natural and specific spectral condition,Footnote 4 see Theorem 1.1. Secondly, they extend the range of applicability to a wide range of examples, driven by possibly a mixture of dissipative and/or dispersive linear operators, subject to (1.4). Thirdly, and most impressively, they allow large data perturbations for asymptotic attraction type of result vis-a-vis asymptotic stability. That is, as long as the initial data approaches the fixed heteroclinics with good decay properties, and the fronts exist (with the prescribed spectral condition), the solution will stay close to translates (defined dynamically) of the front, all in L 2 norm.
We state the precise result below.
Theorem 1.1 (Barker–Bronski–Hur–Yang [Reference Barker, Bronski, Hur and Yang2])
Suppose that
${\mathcal L}$ satisfies the dispersivity property (1.4), while the front ϕ satisfies the profile equation (1.8) and in addition.

Finally, assume that for some
$\epsilon\in (0,1)$, the Schrödinger operator

Assume that u 0 is a heteroclinic data, so that
$ \lim_{x\to \mp \infty} u_0(x)=\pm 1$ and more precisely,
$u_0(x)-\phi\in L^2(\mathbb R)$. Then, there exists a continuous function
$x_0(t): \mathbb R_+\to \mathbb R$, so that the solution to (1.3) can be written in the form

so that
$t\to \|v(t, \cdot)\|_{L^2(\mathbb R)}$ is a monotonically decreasing function of time and

Let us record an energy estimate, which came up in the proof of Theorem 1.1. More precisely, and under the assumptions of Theorem 1.1, and specifically (1.10), we have

We shall make extensive use of (1.13) in the sequel, but note right away that the monotonicity of the mapping
$t\to \|v(t, \cdot)\|_{L^2(\mathbb R)}$ is a direct consequence of it.
Next, a few remarks are in order.
(1) Note that the results in Theorem 1.1 do not require smallness of the perturbation
$u-\phi$. In other words, this is a strong asymptotic attraction result rather than a mere asymptotic stability result.
(2) Related to the previous point, this implies a rather strong uniqueness (up to the usual translation) theorem for such fronts ϕ. Namely, fronts ϕ with the properties (1.9), (1.10), if any, are unique. The uniqueness result was certainly explored in the literature in certain contexts (see for example [Reference Jeffrey and Mohamad7] for the KdV–Burgers case), but not in the generality of the model (1.3).
(3) In the statement of Theorem 1.1, the particular values of the limits
$\phi_{\pm}=\lim_{x\to \pm\infty} \phi(x)$, are set to be
$\phi_{\pm}=\mp 1$ only out of convenience. Assuming that solutions exist for some other pair , subject to
$\phi_{\pm}: \phi_- \gt \phi_+$, one can retrace the proof in [Reference Barker, Bronski, Hur and Yang2] to the same results, as stated in Theorem 1.1.
(4) Note that since
$\int_{-\infty}^{+\infty}\phi_x(x) dx=-2$, the Schrödinger operators
$\mathcal H_\epsilon$ do have negative spectrum, which consists of eigenvalues. The condition (1.10) requires that the spectrum is minimal, in the sense that it consists of only one eigenvalue.
(5) Related to the previous item, we note that the spectral condition (1.10) has been rigorously verified in [Reference Barker, Bronski, Hur and Yang2], for the (unique) monotone KdV–Burger’s fronts, i.e. for the case
$|\nu| \lt \frac{1}{4}$. In the companion paper [Reference Barker, Bronski, Hur and Yang3], the authors further explored the validity of (1.10) beyond the monotone cases. In fact, informal numerics show that (1.10) holds all the way up to
$|\nu| \lt 4.1$, while rigorous numerics confirm this up to
$|\nu| \lt 3.9$.
1.3. Main result: pointwise decay for the KdV–Burgers equation
Our first result concerns the standard KdV–Burgers model. We bootstrap upon the results of [Reference Barker, Bronski, Hur and Yang2] to obtain pointwise in time bounds for various Lebesgue norms. The precise statement is below.
Theorem 1.2 (Pointwise decay to the fronts for the KdV–Burgers model)
For the KdV–Burgers model, i.e.
${\mathcal L}=\nu \partial_{xxx}$, under the assumptions of Theorem 1.1 and

the decomposition (1.11) still holds, with

In addition, for every
$q\in (1,2)$, there is a decay
$\lim_{t\to \infty} \|v(t, \cdot)\|_{L^q}=0$, and in fact for every δ > 0, there exists
$C=C_\delta$, so that

For p > 2, the bound is

Some important remarks to be made herein.
(1) While we do require a more localized data, see (1.14), we still allow such perturbations to be large.
(2) The decay rate (1.15) is faster, compared to the linear heat equation, where we only have
$\|v(t)\|_{L^2(\mathbb R)}\sim t^{-\frac{1}{4}}$. This enhanced decay rate can be thought of as an anomalous dissipation of sorts, caused by the non-trivial background around the front solution and the lack of edge resonance.
(3) The estimate (1.17) is obtained as an interpolation between the L 2 bounds (1.15) and the uniform
$L^\infty$ bound (1.12), obtained earlier in [Reference Barker, Bronski, Hur and Yang2].
1.4. Main result: pointwise decay for the fractional Burgers equation
For clarity, we consider the fractional Burgers equation to be the evolution problem

subject to initial data
$u(0,x)=u_0(x)$. Clearly, the evolution of (1.18) preserves odd initial data. We have the following, more precise than (1.12) result.
Theorem 1.3. Suppose that ϕ is an odd stationary front of the fractional Burger equation

where
$0 \lt \alpha_1 \lt \alpha_2 \lt \ldots \lt \alpha_N \lt 1, a_j\geq 0$. In addition, we assume that ϕ satisfies
$\phi_x, \phi_{xx} \in L^2(\mathbb R)$ and the spectral assumption (1.10).
Suppose now that
$u_0=\phi+v_0$, where
$v_0\in L^1\cap L^2(\mathbb R)$ is an odd function. Then, the solution u of (1.3), with initial data u 0 can be written as

where for t > 2, one has the bounds


• Note that as before, the perturbation v 0 may be large, as long as it is well-localized, i.e.
$v_0\in L^1\cap L^2(\mathbb R)$.
• Related to the previous item, it is worth noting that our proof fails under an assumption such as
$v_0\in L^q\cap L^2, 1 \lt q$. That is, we cannot establish the algebraic decay rates (for any
$p\in (1, \infty]$!) in (1.20), (1.21), without assuming first that
$v_0\in L^1(\mathbb R)$. At this point, it is unclear whether this is a technical obstacle or one indeed needs to require L 1 decay of the perturbation.
• As before, Theorem 1.3 provides a strong uniqueness property of the odd fronts ϕ satisfying (1.19).
• It is an open question whether or not the algebraic decay rates stated in (1.20) are sharp. While the logarithmic corrections are most certainly not, one might wonder about the other bounds in (1.20), (1.21).
• The assumption for existence of solution of (1.19), which in addition satisfies
$\phi_x, \phi_{xx} \in L^2(\mathbb R)$ and the spectral assumption (1.10) is a non-trivial one. In some earlier works, see for example [Reference Achleitner, Cuesta and Hittmeir1], stationary fronts were constructed for similar fractional KdV–Burger’s problems, but with a non-trivial KdV term. The scheme in [Reference Achleitner, Cuesta and Hittmeir1] can be applied to (1.19). More specifically, assume that we have only one fractional term in (1.19),
$(-\partial_{xx})^\alpha=D^{2\alpha}, 0 \lt \alpha \lt \frac{1}{2}$. Then, we look for solution
$\phi:\lim_{x\to \pm \infty}\phi(x)=\mp 1$,
(1.22)\begin{equation} \phi_x =-\partial_{x}[-\partial_{xx}+D^{2\alpha}]^{-1} [\phi_x \phi]=K\ast [\phi_x\phi], \ \ \hat{K}(\xi)= \frac{2\pi i \xi}{4\pi^2 \xi^2 +(2\pi)^{2\alpha} |\xi|^{2\alpha}}. \end{equation}
It can be shown that
$|K(x)|\leq C |x|^{-(2-2\alpha)}, |x| \gt \gt 1$. We conjecture that this equation is solvable and the precise decay rate is inherited by ϕx, due to the integral formulation (1.22). That is
$|\phi_x(x)|\leq C |x|^{-(2-2\alpha)}, |x| \gt \gt 1$, and so
$\phi_x\in L^1(\mathbb R)$, whereas
$|\phi_{xx}(x)|\leq C |x|^{-(3-2\alpha)}, |x| \gt \gt 1$, etc. In such a way, ϕx will never satisfy the assumptionFootnote 5
$(1+|x|)\phi_x(x)\in L^1(\mathbb R)$. On the other hand, the symmetric operator
$\mathcal H_\epsilon=-(1-\epsilon)\partial_{xx}+\frac{1}{2}\phi_x$ may still satisfy the assumption (1.10), which is the only one used in the proof of Theorem 1.1.
We plan our paper as follows. In Section 2, we introduce some basic notations, as well as the Sobolev embedding and various related interpolation inequalities. We introduce the notion of a diffusive semigroup, as this allows us for an effective energy estimates for the fractional Burgers evolution in
$L^p, p \gt 1$ norms. In Section 3, we prove Theorem 1.2. Specifically, for the case of the KdV–Burgers model, we show enhanced energy estimates for the weighted norms
$\left(\int_{\mathbb R} |v(t,x)|^2 |x| dx\right)^{\frac{1}{2}}$, which allows us to bootstrap the results of [Reference Barker, Bronski, Hur and Yang2] to pointwise in time decay estimates for
$\|v(t, \cdot)\|_{L^2}$. In Section 4, we prove Theorem 1.3. More specifically, we use the integration by parts techniques of [Reference Chamorro and Lemarié-Rieusset6] to derive energy estimates for
$\|v(t, \cdot)\|_{L^p}, p \gt 1$ and ultimately for p = 1. We again use in a critical way the energy estimate (1.13) from [Reference Barker, Bronski, Hur and Yang2], and in conjunction with the a priori estimates for
$\|v(t, \cdot)\|_{L^1}$, we conclude pointwise in time decay for the solution norms.
2. Preliminaries
We shall use standard notations for the Lebesgue spaces
$L^p, 1\leq p\leq \infty$. More generally, the Sobolev spaces
$W^{k,p}(\mathbb R)$ for integer
$k, 1 \lt p \lt \infty$, are defined via the norms

2.1. Fourier transform, fractional Burger’s and the Bernstein inequality
We take the Fourier transform and its inverse in the form

In this notation, we have
$\widehat{f''}(\xi)=-4\pi^2 \xi^2 \hat{f}(\xi)$. More generally, we introduce the fractional operators
$(-\partial_{xx})^\alpha$ as acting on Schwartz functions via the symbol
$(2\pi|\xi|)^{2\alpha}$. More precisely,

In fact, this allows one to introduce a companion spaces to
$W^{s,p}$, where
$s \gt 0, 1 \lt p \lt \infty$ is not necessarily an integer. More precisely, these are given by the norm

These spaces are equivalent to the previously defined
$W^{k,p}$ for integer k. The Sobolev inequality in one spatial dimension takes the simple form

Next, we discuss Bernstein’s inequality, which is a relatively crude, but helpful form of Sobolev embedding. Consider a function
$f: {\mathbf R}^n\to {\mathbf R}^n$ and a set of finite Lebesgue measure A. Let
$f_A: \widehat{f_A}(\xi)=\hat{f}(\xi) \chi_A(\xi)$, where
$\chi_A(x)=\left\{\begin{array}{cc}
1 & x\in A\\
0 & x\notin A
\end{array}
\right..$ Then, for all
$1\leq p\leq \infty$,

An easy estimate, which follows immediately from Plancherel’s identity, is

We will use the rough Fourier cutoff projections

We also need the Gagliardo–Nirenberg estimate (or the log-convexity of the map
$p\to \|f\|_p$). More precisely, for every
$1\leq p \lt q \lt r\leq \infty: \frac{1}{q}=\frac{\theta}{p}+\frac{1-\theta}{r}$,

Finally, there is the estimate,
$
\|f\|_{L^\infty}^2 \leq C \|f'\|_{L^2} \|f\|_{L^2},
$ which will also be useful.
2.2. Diffusion semigroups
The notion of diffusive semigroups will play a role in our considerations. Shortly, we say that
$\{T(t)\}_{t\geq 0}$ is a C 0 semigroup on a Banach space, if
$T(t)\in B(X)$,
$T(t+s)=T(t)T(s), T(0)=Id$ and for every
$f\in X: \lim_{t\to 0+} \|T(t) f - f\|_X=0$. It is standard that one can always associate with such objects a generator L, a generally unbounded operator, which can be introduced on a domain
$D(L)=\{f\in X: \lim_{t\to 0+} \frac{T(t) f - f}{t} \ \ \textrm{exists}\}$ as

Naturally,
$T(t) f\in D(L)$ solves the X valued differential equation
$x'=L x, x(0)=f$ for any
$f\in X$, so we informally denote it by
$T(t)=e^{t L}$.
Definition 2.1. We say that a C 0 semigroup on
$L^2({\mathbf R}^n)$, T(t), is a diffusive semigroup, if
$T(t), t\geq 0$ is self-adjoint, and given by a probability density kernel, i.e.

We observe that the generator L in such cases is necessarily a non-positive operator, since for all
$t\geq 0$ and a test function f, we have

whence
$\langle L f,f \rangle \leq \limsup_{t\to 0+} \frac{\langle T(t) f - f,f \rangle}{t}\leq 0$.
It is standard that the operator
$L=\partial_{xx}$ is a generator of such semigroup. More generally, the fractional Burgers operators
$L=-\sum_{j=1}^N a_j (-\partial_{xx})^{\alpha_j}, a_j\geq 0, 0 \lt \alpha_1 \lt \ldots \alpha_N\leq 1$ also generate diffusion semigroups,Footnote 6 see for example [Reference Chamorro and Lemarié-Rieusset6]. An important property of such semigroups is the following.
Proposition 2.2 (Reference Chamorro and Lemarié-Rieusset6, Theorem 3.2)
Assume that L generates a diffusive semigroup. Then, for all
$1\leq p \lt \infty$, we have

If, in addition,
$ 2\leq p \lt \infty$, then

In particular, we have that for each
$0 \lt \alpha\leq 1, 2\leq p \lt \infty$

3. Attraction to fronts: the Burger-KdV model
We start by revisiting some of the results obtained in the proof of Theorem 1.1. Indeed, in the course of the proof, the authors set
$\dot{x}_0(t)=\gamma \int_{-\infty}^\infty \phi_x(x) v(x) dx$, withFootnote 7 γ > 1. This is done to ensure that the rank-one perturbation
$\mathcal H_\gamma=-\partial_{xx}+\frac{1}{2}\phi_x+\gamma \langle \phi_x,\cdot \rangle \phi_x$ of the important linearized operatorFootnote 8
$\mathcal H=-\partial_{xx}+\frac{1}{2}\phi_x$ has the property
$\mathcal H_\gamma\geq 0$. Two important features of the construction are


We make substantial use of these two facts in our subsequent analysis. Next, we write the equation for the residual
$v: u(x,t)=\phi(x-x_0(t))+v(t, x-x_0(t))$. This takes the form

We setup an energy estimate for v in the space
$L^2_1(\mathbb R)$. More precisely, we shall show that

Of course, the global solution v produced in [Reference Barker, Bronski, Hur and Yang2] does not a priori live in the space
$L^2_1(\mathbb R)$, so one begins by only formally setting the energy estimate for it. We will ignore the details of this justification, as this is standard by considering instead a quantity of the form
$\int_{-\infty}^\infty |v(t,x)|^2 |x| \zeta(x/N)dx$ for
$N \gt \gt 1$ and some smooth cutoff function ζ. With these caveats, we proceed to bound the
$L^2_1(\mathbb R)$ norm. Taking a dot product of (3.3) with
$v |x|$, we have

We now proceed to estimate various terms, by keeping in mind the relation (3.2), which guides us about the acceptable error terms. After integration by parts and algebraic manipulations, we have

Also,

Here, in addition to Sobolev embedding and Gagliardo–Nirenberg estimates, we used the bound
$\|v(t, \cdot)\|_{L^2}\leq C$. Next,

Now, note that
$-\phi(x) sgn(x) \gt 0$ for all x and also there exists A, so that
$-\phi(x) sgn(x) \gt \frac{1}{2}$ for
$|x| \gt A$. This last statement follows from
$\lim_{x\to \pm \infty} \phi(x) = \mp 1$. So, we further estimate

Finally, for the term
$\int v_x'' v |x| dx$, we have

Putting it all together and estimating
$-\|v(t, \cdot)\|_{L^\infty}^2 \geq - C\|v_x(t, \cdot)\|_{L^2} \|v(t, \cdot)\|_{L^2}$, we obtain

We are almost ready to derive an estimate on the desired norms, except that it is hard to control
$|v_x(t,0)| $. Indeed, the easiest way out would be to use the norm
$\|v_x(t, \cdot)\|_{L^\infty}$, for which we unfortunately lack a priori estimates. Instead, we employ a simple averaging argument to reduce to the quantity
$\|v_x(t, \cdot)\|_{L^2}$, for which we do have a priori bound, see (3.2).
Indeed, instead of finding the bound for
$ \int v^2(t,x) |x| dx$, we might employ an estimate for
$ \int v^2(t,x) |x-a| dx$, where
$a\in (-1,1)$. Running through identical estimates, we arrive at

Integrating on the time interval
$(0,t)$ brings about

Taking an average of the previous estimate, i.e. integrating the previous estimate
$\int_{-1}^1 \ldots da$, and in doing so, recall the formula

This yields the estimate

Now, based on (3.2) and Hölder’s inequality

All in all, we conclude that

We can hide
$\|v\|_{L^2_{t,x}}^{\frac{3}{2}}$ on the right-hand side behind
$\frac{1}{2} \|v\|_{L^2_{t,x}}^2$. More precisely, by Young’s inequality, for any choice of ϵ > 0,

With an appropriately small choice of ϵ, we enter this on the right-hand side of (3.4), which leads us to the bound

for each T > 0. This bound has two important consequences. Firstly, using the Hölder inequality, we have that for each
$1 \lt p \lt 2$,

for which we have good bounds from (3.5). It follows that for any p > 1,
$\sup_{0 \lt t \lt T} \int |v(x)|^p\leq C_p$. Note, however, that the constant Cp blows up as
$p\to 1+$, due to
$\lim_{p\to 1+} \left(\int_{|x|\geq 1} |x|^{-\frac{p}{2-p}}dx \right)=+\infty$.
Secondly, by taking into account (3.1) and (3.5), we have that for each t > 0,

whence it follows that v obeys the bound
$\|v(t, \cdot)\|_{L^2}\leq C t^{-\frac{1}{2}}$. Interpolating this with the uniform bounds for
$\|v(t, \cdot)\|_{L^p}, 1 \lt p \lt 2$, obtained just before, we have that for every δ > 0, there is
$C=C(\delta)$, so that

4. Asymptotic attraction to odd fronts for the generalized Burgers model
In this section, we present an extension of the results in [Reference Barker, Bronski, Hur and Yang2], when the fronts and the respective perturbations are odd.
Now that
${\mathcal L}=-\sum_{j=1}^N a_j (-\partial_{xx})^{\alpha_j} $ preserves the parity of the inputs, it is clear that if u 0 is odd, so is
$v_0=u_0-\phi$. Clearly, the evolution preserves the odd solutions, whence
$x\to v(t,x)$ is odd for all
$t\geq 0$. Recall that by the construction in [Reference Barker, Bronski, Hur and Yang2], the translation function x 0 was selected (see formula (20) there) via the formula
$
\dot{x}_0(t)=\gamma \langle \phi_x,v \rangle=0,
$ since ϕx is even, while v is odd. It follows by Theorem 1.1 that for this particular front and odd perturbations u 0,
$u(t,x)=\phi(x)+v(t,x)$, where

Recall that v also satisfies the energy estimate (1.13).
We now aim to establish uniform estimates for the norm
$\|v(t, \cdot)\|_{L^1(\mathbb R)}$. To this end, take arbitrary large T and assume
$0\leq t \lt T$. Note that v satisfies

For each p > 1, multiply (4.2) by
$v |v|^{p-2}$ and integrate in x. We obtain,

Integration by parts yields for each p > 1,

whence

By (2.4), we have the coercivity estimates

Next,

All in all, we get

Denoting
$M_T:=\sup_{0 \lt t \lt T} \|v(t, \cdot)\|_{L^\infty}$, and integrating the above expression in
$(0,t)$, we obtain

Note that all the constants in (4.3) are written out explicitly. This allows us to take a limit
$p\to 1+$. We obtain

and this is valid for all T > 0, so (4.4) is a global estimate.
This provides another important ingredient to the estimates (4.1), which we have borrowed from Theorem 1.1. Fix ϵ > 0. By Plancherel’s identity

We now use (1.13) to conclude

It follows that

Multiplying by
$e^{C \epsilon^2 t}$ and integrating yields

However, integration by parts yields

Plugging this back in (4.5) and multiplying by
$e^{-C \epsilon^2 t}$, yields

Now, we apply the Bernstein’s inequality for
$A=(-\epsilon, \epsilon)$, and (4.4) to obtain

Adding
$I_{ \lt \epsilon}(t)$ on both sides of (4.6) and using (4.7), we obtain

for some absolute constant C 1, i.e. independent on
$\epsilon, t$.
This inequality is true for every fixed ϵ > 0, and every t, but since clearly the best strategy is to optimize (4.8) in ϵ given t, we should proceed with some care. To this end, fix large
$t \gt \gt 1$, and then select the unique
$\epsilon \lt \lt 1: e^{- C_1 \epsilon^2 t}=\epsilon$. Clearly, such choice of ϵ is possible, let us compute its asymptotic in terms of t. Since we need to solve,

its unique solution obeys the asymptotic

It follows that

We conclude that
$\|v(t, \cdot)\|_{L^2}\leq C \|v_0\|_{L^2\cap L^1} \left(\frac{\ln(t)}{t}\right)^{\frac{1}{4}}$, as stated. Interpolating between the bounds (4.10) and (4.4), the Gagliardo-Nirenberg’s inequality now yields

for intermediate values of q. While one can similarly interpolate with the bound (4.1), this will not be a particularly good decay, especially near
$q=\infty$.
Instead, we use an alternative estimate based again on (1.13). Indeed from (3.2) and (1.13) , it is clear that for every T > 0

where the last estimate follows from the L 2 bounds (4.10), Also, for every
$T \gt \gt 1$, we have that there exists
$T_0\in (T, 2T)$, so that

In other words,

For large T 0, denote
$\delta:=\left(\frac{\ln(T_0)}{T_0}\right)^{\frac{1}{2}} \gt \gt C \frac{\sqrt{\ln(T_0)}}{T_0^{\frac{3}{2}}}$. A this point, we refer to the argument in [Reference Barker, Bronski, Hur and Yang2], which establishes that starting with (4.11), we conclude

Since this can be done for at least one point
$T_0\in (2^k, 2^{k+1}), k \gt \gt 1$, we obtain the bound

By Sobolev embedding, followed by the Gagliardo-Nirenberg’s inequality we conclude
