
Global well-posedness and soliton resolution for the half-wave maps equation with rational data

Published online by Cambridge University Press:  20 November 2025

Patrick Gérard
Affiliation:
Université Paris-Saclay, France; E-mail: patrick.gerard@universite-paris-saclay.fr
Enno Lenzmann*
Affiliation:
University of Basel, Switzerland
*
E-mail: enno.lenzmann@unibas.ch (Corresponding author)

Abstract

We study the energy-critical half-wave maps equation:

$$\begin{align*}\partial_t \mathbf{u} = \mathbf{u} \times |D| \mathbf{u} \end{align*}$$

for $\mathbf {u} : [0, T) \times \mathbb {R} \to \mathbb {S}^2$. Our main result establishes the global existence and uniqueness of solutions for all rational initial data $\mathbf {u}_0 : \mathbb {R} \to \mathbb {S}^2$. This demonstrates global well-posedness for a dense subset within the scaling-critical energy space $ \dot {H}^{1/2}(\mathbb {R}; \mathbb {S}^2) $. Furthermore, we prove soliton resolution for a dense subset of initial data in the energy space with uniform bounds for all higher Sobolev norms $\dot {H}^s$ for $s> 0$.

Our analysis utilizes the Lax pair structure of the half-wave maps equation on Hardy spaces in combination with an explicit flow formula. Extending these results, we establish global well-posedness for rational initial data (along with a soliton resolution result) for a generalized class of matrix-valued half-wave maps equations with target spaces in the complex Grassmannians $ \mathsf {Gr}_k(\mathbb {C}^d) $. Notably, this includes the complex projective spaces $ \mathbb {CP}^{d-1} \cong \mathsf {Gr}_1(\mathbb {C}^d) $ thereby extending the classical case of the target $\mathbb {S}^2 \cong \mathbb {CP}^1 $.

Information

Type: Analysis

Creative Commons
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

1. Introduction and main results

This paper is devoted to the half-wave maps equation posed on the real line, which reads

(HWM) $$ \begin{align} \partial_t \mathbf{u} = \mathbf{u} \times |D| \mathbf{u} \end{align} $$

with $\mathbf {u} : [0,T) \times \mathbb {R} \to \mathbb {S}^2$ . Here $\mathbb {S}^2$ denotes the standard unit two-sphere embedded in $\mathbb {R}^3$ and $\times $ stands for the vector/cross product in $\mathbb {R}^3$ . Formally, the operator $|D|$ is given by its Fourier multiplier $|\xi |$ corresponding to the half-Laplacian $|D|=\sqrt {-\Delta }$ on $\mathbb {R}$ . Equivalently in our setting, we can write $|D| = \mathsf {H} \partial _x$ where

(1.1) $$ \begin{align} (\mathsf{H} f)(x) = \frac{1}{\pi} \mathrm{p.v.} \int_{\mathbb{R}} \frac{f(y)}{x-y} \, dy \end{align} $$

denotes the Hilbert transform on the real line. The main physical motivation for studying (HWM) stems from the fact that it can be seen as a continuum version of discrete completely integrable so-called spin Calogero–Moser models; see [Reference Zhou and Stone37, Reference Lenzmann and Sok27]. See also [Reference Lenzmann and Schikorra26] for a complete classification of traveling solitary waves of (HWM) in relation to (nonfree) minimal surfaces of disk type, as well as the studies [Reference Berntson, Klabbers and Langmann3, Reference Matsuno29] of the dynamics of rational solutions of (HWM) in the applied math literature.

As shown in [Reference Gérard and Lenzmann14], the half-wave maps equation is a completely integrable Hamiltonian PDE in the sense of having a Lax pair structure that yields an infinite set of conserved quantities and also shows that rationality is preserved by the flow of (HWM). We remark that its Hamiltonian energy functional is easily found to be

(1.2) $$ \begin{align} E(\mathbf{u}) = \frac{1}{2} \int_{\mathbb{R}} \mathbf{u} \cdot |D| \mathbf{u} \, dx = \frac{1}{4 \pi} \int_{\mathbb{R}} \! \int_{\mathbb{R}} \frac{|\mathbf{u}(x)-\mathbf{u}(y)|^2}{|x-y|^2} \, dx \, dy \,. \end{align} $$

Note that the scaling $\mathbf {u}(t,x) \mapsto \mathbf {u}(\lambda t, \lambda x)$ with some constant $\lambda> 0$ preserves solutions of (HWM) as well as the energy $E(\mathbf {u})$ . Thus we see that (HWM) is energy-critical.
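Indeed, both claims can be checked directly: if $\mathbf{u}$ solves (HWM), then $\mathbf{u}_\lambda(t,x) := \mathbf{u}(\lambda t, \lambda x)$ satisfies $\partial_t \mathbf{u}_\lambda = \lambda \, (\partial_t \mathbf{u})(\lambda t, \lambda x) = \mathbf{u}_\lambda \times |D| \mathbf{u}_\lambda$ by the homogeneity of $|D|$ , while the change of variables $(x,y) \mapsto (\lambda x, \lambda y)$ in (1.2) gives

$$ \begin{align*} E(\mathbf{u}_\lambda(t, \cdot)) = \frac{1}{4 \pi} \int_{\mathbb{R}} \! \int_{\mathbb{R}} \frac{|\mathbf{u}(\lambda t, \lambda x)-\mathbf{u}(\lambda t, \lambda y)|^2}{|x-y|^2} \, dx \, dy = E(\mathbf{u}(\lambda t, \cdot)) \,. \end{align*} $$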

However, the question of existence (or nonexistence) of global-in-time solutions for (HWM) – even for smooth and sufficiently small data – has been left completely open so far. Here one of the major obstacles lies in the nondispersive nature of the half-wave operator $|D|$ in one space dimension occurring in the quasi-linear evolution problem (HWM). In fact, this situation prevents us from adapting known tools developed to prove global well-posedness results for other dispersive geometric PDEs such as the Schrödinger maps and wave maps equations; see, for example, [Reference Tao35, Reference Koch, Tataru and Visan22] and references therein. We refer also to [Reference Krieger and Sire23, Reference Kiesenhofer and Krieger20, Reference Liu28] for small data global existence for (HWM) in the nonintegrable case with space dimensions at least $N \geq 3$ , where dispersive estimates can be used which are not available for our setting here.

In the present paper, we shall develop an entirely different approach that will lead to global well-posedness for all rational initial data, which are shown to form a dense subset in the scaling-critical energy space $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathbb {S}^2)$ for (HWM). Our proof will be based on the Lax pair structure on suitable Hardy spaces together with an explicit flow formula for (HWM) akin to the explicit formulae recently found by the first author for the Benjamin–Ono equation. Furthermore, as a byproduct of our analysis, we will also study the long-time behavior of rational solutions, leading to a general result on soliton resolution in this setting. In particular, this result yields a rigorous proof of the numerical findings for (HWM) that have been recently presented in [Reference Berntson, Klabbers and Langmann3].

Global well-posedness for rational data

We consider (HWM) with initial data that are given by rational functions. To this end, we define the set

$$ \begin{align*}\mathcal{R}at(\mathbb{R}; \mathbb{S}^2) := \left \{ \mathbf{u} : \mathbb{R} \to \mathbb{S}^2 \mid \mathbf{u}(x) \text{ is rational in } x \in \mathbb{R} \right \} \,. \end{align*} $$

Explicit examples of rational maps $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ are easily constructed by means of the (inverse) stereographic projection from the extended complex plane $\mathbb {C} \cup \{ \infty \}$ to $\mathbb {S}^2$ ; see Section 8 below for details.
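For instance, composing a rational function $w : \mathbb{R} \to \mathbb{C}$ without real poles with the inverse stereographic projection (in one common normalization; Section 8 may use a different convention) gives

$$ \begin{align*} \mathbf{u}(x) = \left ( \frac{2 \, \mathrm{Re} \, w(x)}{1+|w(x)|^2}, \ \frac{2 \, \mathrm{Im} \, w(x)}{1+|w(x)|^2}, \ \frac{1-|w(x)|^2}{1+|w(x)|^2} \right ) \in \mathbb{S}^2 \, , \end{align*} $$

which belongs to $\mathcal{R}at(\mathbb{R}; \mathbb{S}^2)$ since $\mathrm{Re} \, w$ , $\mathrm{Im} \, w$ and $|w|^2$ are rational functions of $x \in \mathbb{R}$ ; for example, $w(x) = x$ yields $\mathbf{u}(x) = \big( \frac{2x}{1+x^2}, 0, \frac{1-x^2}{1+x^2} \big)$ with $\lim_{x \to \pm \infty} \mathbf{u}(x) = (0,0,-1)$ .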

In fact, rational maps play a distinguished role in the analysis of (HWM), as they occur in the complete classification of traveling solitary waves. Furthermore, due to its Lax pair structure (detailed below) and a Kronecker-type theorem for Hankel operators (see Section 4 below), another essential feature of (HWM) is that rationality is preserved by the flow, see [Reference Gérard and Lenzmann14]. For any $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ , we readily verify the following properties:

  • Limit: $\displaystyle \lim _{x \to \pm \infty } \mathbf {u}(x) = \mathbf {p}$ for some unit vector $\mathbf {p} \in \mathbb {S}^2$ .

  • Smoothness: $\mathbf {u} \in \dot {H}^\infty = \bigcap _{s>0} \dot {H}^s$ .

In addition, we record the following nontrivial fact.

Theorem 1.1. $\mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ is a dense subset in $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathbb {S}^2)$ .

Remark. Due to the nonlinear constraint of taking values in the unit sphere $\mathbb {S}^2$ , this density result is far from obvious. For the proof of Theorem 1.1, we refer to Appendix A below.

Our first main result shows that rational data always lead to unique global-in-time solutions of (HWM).

Theorem 1.2 (GWP for Rational Data).

For every $\mathbf {u}_0 \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ , there exists a unique global-in-time solution $\mathbf {u} \in C(\mathbb {R}; H^\infty _\bullet (\mathbb {R}; \mathbb {S}^2))$ of (HWM) with initial datum $\mathbf {u}(0) = \mathbf {u}_0$ .

Remarks. 1) The global solutions $\mathbf {u} : \mathbb {R} \times \mathbb {R} \to \mathbb {S}^2$ constructed above are of the form

$$ \begin{align*}\mathbf{u}(t) = \mathbf{u}_\infty + \mathbf{v}(t) \in \mathbb{S}^2 + C(\mathbb{R}; H^\infty(\mathbb{R}; \mathbb{R}^3)) \end{align*} $$

with the point $\mathbf {u}_\infty = \lim _{|x| \to \infty } \mathbf {u}_0(x) \in \mathbb {S}^2$ given by the initial datum $\mathbf {u}_0 \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ . See also below for the definition of the space $H^\infty _\bullet (\mathbb {R}; \mathbb {S}^2)$ .

2) Our result establishes global well-posedness of (HWM) for initial data belonging to a dense subset in the scaling-critical energy space $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathbb {S}^2)$ . Hence any finite-time blowup solution for (HWM) in the energy space – provided such solutions exist at all – must be highly unstable.

3) The solutions of Theorem 1.2 exhibit an infinite set of conserved quantities

$$ \begin{align*}I_{p}(\mathbf{u}(t)) = I_{p}(\mathbf{u}_0) \quad \text{for } p \geq 1 \end{align*} $$

due to the Lax structure for (HWM). In particular, we obtain conservation of energy $E(\mathbf {u}(t)) = E(\mathbf {u}_0) \sim I_2(\mathbf{u}_0)$ . As a consequence of Peller’s theorem, we obtain the infinite family of a priori bounds

$$ \begin{align*}\| \mathbf{u}(t) \|_{\dot{B}^{1/p}_p} \lesssim_p \| \mathbf{u}_0 \|_{\dot{B}^{1/p}_p} \quad \text{for } p \geq 1 \end{align*} $$

with the homogeneous Besov semi-norms $\| \cdot \|_{\dot {B}^{1/p}_p}$ ; see Section 3 for details. However, these bounds do not seem to provide strong enough control to deduce global existence. In this paper, we thus use an entirely different approach based on an explicit flow formula for (HWM).

4) In [Reference Berntson, Klabbers and Langmann3], the authors study the dynamics for rational initial data $\mathbf {u}_0 : \mathbb {R} \to \mathbb {S}^2$ with simple poles and derive a self-consistent system of ordinary differential equations of spin Calogero–Moser type. However, by following this approach, it still remains unclear whether such rational solutions can be extended globally in time, since the poles could cease to be simple in finite time, rendering the simple-pole ansatz invalid.

Soliton resolution and nonturbulence

As our next main result, we discuss the long-time behavior of rational solutions provided by Theorem 1.2 above. Here a suitable spectral condition will enter the scene as follows. For $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ , we define the Toeplitz operator by

$$ \begin{align*}T_{\mathbf{U}} f = \Pi_+ ( \mathbf{U} f) \quad \text{for } f \in L^2_+(\mathbb{R}; \mathbb{C}^2) \,. \end{align*} $$

Here $\Pi _+ : L^2(\mathbb {R}; \mathbb {C}^2) \to L^2_+(\mathbb {R}; \mathbb {C}^2)$ is the Cauchy–Szegő projection onto the vector-valued Hardy space defined as

$$ \begin{align*}L^2_+(\mathbb{R}; \mathbb{C}^2) := \left \{ f \in L^2(\mathbb{R}; \mathbb{C}^2) \mid \mathrm{supp} \, \widehat{f}_k \subset [0, \infty) \text{ for } k=1,2 \right \}\,. \end{align*} $$

The symbol in $T_{\mathbf {U}}$ is given by the matrix-valued function $\mathbf {U} : \mathbb {R} \to \mathbb {C}^{2 \times 2}$ with

(1.3) $$ \begin{align} \mathbf{U} = \mathbf{u} \cdot \boldsymbol{\sigma} = \sum_{k=1}^3 u_k \sigma_k = \left ( \begin{array}{cc} u_3 & u_1 - \mathrm{i} u_2 \\ u_1 + \mathrm{i} u_2 & -u_3 \end{array} \right ) , \end{align} $$

where $\boldsymbol {\sigma }=(\sigma _1, \sigma _2, \sigma _3)$ contains the standard Pauli matrices. For later use, we also remark that, by introducing the matrix-valued function $\mathbf {U} = \mathbf {u} \cdot \boldsymbol {\sigma }$ , we can equivalently rewrite (HWM) as

(1.4) $$ \begin{align} \partial_t \mathbf{U} = -\frac{\mathrm{i}}{2} [\mathbf{U}, |D| \mathbf{U}] \, , \end{align} $$

where $[X,Y] \equiv XY-YX$ is the commutator of matrices; see also [Reference Gérard and Lenzmann14] for more details on this.
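For the reader's convenience, the equivalence of (1.4) with (HWM) can be checked from the standard Pauli matrix identity $(\mathbf{a} \cdot \boldsymbol{\sigma})(\mathbf{b} \cdot \boldsymbol{\sigma}) = (\mathbf{a} \cdot \mathbf{b}) \, \mathrm{Id} + \mathrm{i} \, (\mathbf{a} \times \mathbf{b}) \cdot \boldsymbol{\sigma}$ for $\mathbf{a}, \mathbf{b} \in \mathbb{R}^3$ : indeed,

$$ \begin{align*} -\frac{\mathrm{i}}{2} [\mathbf{U}, |D| \mathbf{U}] = -\frac{\mathrm{i}}{2} \cdot 2 \mathrm{i} \, (\mathbf{u} \times |D| \mathbf{u}) \cdot \boldsymbol{\sigma} = (\mathbf{u} \times |D| \mathbf{u}) \cdot \boldsymbol{\sigma} = (\partial_t \mathbf{u}) \cdot \boldsymbol{\sigma} = \partial_t \mathbf{U} \, , \end{align*} $$

and the same identity with $\mathbf{a} = \mathbf{b} = \mathbf{u}$ gives the pointwise relation $\mathbf{U}^2 = |\mathbf{u}|^2 \, \mathrm{Id} = \mathrm{Id}$ used below.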

In fact, by recasting (HWM) in terms of the matrix-valued function $\mathbf {U}$ , we will be able to fully exploit the Lax structure initially found in [Reference Gérard and Lenzmann14]. Also, we note that $\mathbf {U}(x) =\mathbf {U}(x)^* \in \mathbb {C}^{2 \times 2}$ takes values in the Hermitian matrices subject to the algebraic constraint that $\mathbf{U}(x)^2 = \mathrm{Id}$ . As a consequence, the Toeplitz operator $T_{\mathbf {U}}=T_{\mathbf {U}}^*$ is self-adjoint with operator norm $\| T_{\mathbf {U}} \| \leq 1$ . Moreover, it turns out that $T_{\mathbf {U}}$ will be a Lax operator along the flow. Hence its spectrum $\sigma (T_{\mathbf {U}(t)})$ will be preserved in time for solutions $\mathbf {u}(t)$ of (HWM). As another key fact, we mention that the discrete spectrum

$$ \begin{align*}\sigma_{\mathrm{d}}(T_{\mathbf{U}}) = \{ \lambda \in \sigma(T_{\mathbf{U}}) \mid \lambda \text{ is isolated and has finite multiplicity} \} \end{align*} $$

is finite if and only if the function $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ is rational; see Section 4 for a detailed discussion of the spectral properties of $T_{\mathbf {U}}$ for general $\mathbf {u} \in \dot {H}^{\frac 1 2}(\mathbb {R}; \mathbb {S}^2)$ .

Our next main result will prove that simplicity of the discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ implies scattering of the corresponding global rational solution $\mathbf {u} \in C(\mathbb {R}; H^\infty _\bullet (\mathbb {R}; \mathbb {S}^2))$ into a sum of traveling ground state solitary waves receding from each other, that is, we obtain soliton resolution in this case. From [Reference Lenzmann and Schikorra26] we recall that traveling solitary waves for (HWM) are, by definition, finite-energy solutions of the form

(1.5) $$ \begin{align} \mathbf{u}_{v}(t,x) = \mathbf{q}_v(x- vt) \end{align} $$

with some profile $\mathbf {q}_v \in \dot {H}^{\frac {1}{2}}(\mathbb {R}; \mathbb {S}^2)$ and where $v \in \mathbb {R}$ corresponds to the traveling velocity. From the complete classification result in [Reference Lenzmann and Schikorra26] we recall that any such profile $\mathbf {q}_v$ can be expressed in terms of a finite Blaschke product, whence it follows that $\mathbf {q}_v \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ holds. Moreover, the energy is quantized according to

(1.6) $$ \begin{align} E(\mathbf{q}_v) = (1-v^2) \cdot m \pi \quad \text{with some } m=0,1,2, \ldots \end{align} $$

where the integer m corresponds to the degree of the Blaschke product. The case $m=0$ corresponds to the trivial case of constant $\mathbf {q}_v$ , whereas for nonconstant profiles $\mathbf {q}_v$ we must have that

(1.7) $$ \begin{align} |v| < 1 \,. \end{align} $$

Note also that the special case $v=0$ yields static solutions of (HWM) and the profiles $\mathbf {q}_{v=0}$ are then so-called half-harmonic maps, see also [Reference Da Lio and Rivière8, Reference Millot and Sire31].

In view of (1.6), we refer to the case $m=1$ as ground state solitary waves, since these are nontrivial with the least possible energy for a given velocity. From the explicit classification in [Reference Lenzmann and Schikorra26] we can deduce that profiles $\mathbf {q}_v \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ for ground state solitary waves are exactly rational functions of the form

(1.8) $$ \begin{align} \mathbf{q}_v(x) = \mathbf{q}_\infty + \frac{\mathbf{s}}{x-z} + \frac{\overline{\mathbf{s}}}{x-\overline{z}} \end{align} $$

with some $\mathbf {q}_\infty \in \mathbb {S}^2$ , $z \in \mathbb {C}_-$ , and $\mathbf {s} \in \mathbb {C}^3 \setminus \{ 0 \}$ satisfying the nonlinear constraints

(1.9) $$ \begin{align} \mathbf{s} \cdot \mathbf{s} = 0 \quad \text{and} \quad \mathbf{s} \cdot \left ( \mathbf{q}_\infty + \frac{\overline{\mathbf{s}}}{z-\overline{z}} \right ) = 0 \,. \end{align} $$

Here $\mathbf {a} \cdot \mathbf {b} = \sum _{k=1}^3 a_k b_k$ denotes the non-Hermitian dot product of $\mathbf {a}, \mathbf {b} \in \mathbb {C}^3$ . We remark that (1.9) is easily seen to be equivalent (by partial fraction expansion) to the geometric constraint that $\mathbf {q}_v(x) \in \mathbb {S}^2$ for all $x \in \mathbb {R}$ . Moreover, the real part $\mathrm {Re} \, z$ corresponds to the spatial center of the solitary wave profile $\mathbf {q}_v$ , whereas $E(\mathbf {q}_v)= (\mathbf {s} \cdot \overline {\mathbf {s}}) \cdot \pi = (1-v^2) \cdot \pi $ yields its energy. For more details on $\mathbf {q}_v$ , we refer to the discussion in Appendix C below.

We are now ready to state our second main result.

Theorem 1.3 (Soliton Resolution and Non-Turbulence).

Let $\mathbf {u}_0 \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ and suppose the corresponding Toeplitz operator $T_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathbb {C}^2) \to L_+^2(\mathbb {R}; \mathbb {C}^2)$ has simple discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}_0})=\{ v_1, \ldots , v_N \}$ .

Then the corresponding solution $\mathbf {u} \in C(\mathbb {R}; H^\infty _\bullet (\mathbb {R}; \mathbb {S}^2))$ of (HWM) with initial datum $\mathbf {u}(0) = \mathbf {u}_0$ satisfies

$$ \begin{align*}\lim_{t \to \pm \infty} \| \mathbf{u}(t) - \mathbf{u}^\pm(t) \|_{\dot{H}^s} = 0 \quad \text{for all } s> 0 \, , \end{align*} $$

where

$$ \begin{align*}\mathbf{u}^{\pm}(t,x) = \sum_{j=1}^N \mathbf{q}_{v_j}(x-v_jt) - (N-1) \mathbf{u}_\infty \,. \end{align*} $$

Here each $\mathbf {q}_{v_j} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ is a profile of a ground state solitary wave for (HWM) with traveling velocity $v_j$ and it is given by

$$ \begin{align*}\mathbf{q}_{v_j}(x) = \mathbf{u}_\infty + \frac{\mathbf{s}_j}{x-y_j+\mathrm{i} \delta_j} + \frac{\overline{\mathbf{s}}_j}{x-y_j-\mathrm{i} \delta_j} \end{align*} $$

with some complex vectors $\mathbf {s}_{1}, \ldots , \mathbf {s}_N \in \mathbb {C}^3 \setminus \{ 0 \}$ , some real numbers $y_1, \ldots , y_N \in \mathbb {R}$ , some positive real numbers $\delta _1, \ldots , \delta _N> 0$ , and the point $\mathbf {u}_\infty = \lim _{|x| \to \infty } \mathbf {u}_0(x) \in \mathbb {S}^2$ .

Moreover, we have the a priori bounds

$$ \begin{align*}\sup_{t \in \mathbb{R}} \| \mathbf{u}(t) \|_{\dot{H}^s} \leq C(\mathbf{u}_0,s) \quad \text{for all } s>0 \,. \end{align*} $$

Remarks. 1) Obtaining a priori bounds on all higher Sobolev norms $\| \mathbf {u}(t) \|_{\dot {H}^s}$ is a remarkable fact, since the infinite hierarchy of conservation laws given by the Lax structure for (HWM) only provides a priori control over the weaker homogeneous Besov norms $\| \mathbf {u}(t) \|_{\dot {B}^{1/p}_{p}}$ for $1 \leq p < \infty $ . The latter fact follows from Peller’s theorem applied to the Hankel operator $H_{\mathbf {U}}$ and the conserved quantities given by the operator traces $\mathrm {Tr}(|H_{\mathbf {U}}|^p)$ ; see [Reference Gérard and Lenzmann14] for more details.

2) Note that the scattering profile $\mathbf {u}^{\pm }(t)$ is the same for both $t \to -\infty $ and $t \to +\infty $ , which can be seen as triviality of the scattering map in this setting.

3) It would be interesting to prove or disprove the existence of rational initial data $\mathbf {u}_0$ leading to turbulent behavior in the sense of growth of higher Sobolev norms such that $\| \mathbf {u}(t) \|_{\dot {H}^s} \to +\infty $ as $t \to \infty $ for some $s> \frac 1 2$ . Of course, the discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}_0})$ for such data must have degenerate eigenvalues.

4) It is interesting to compare our result to other completely integrable equations with a Lax pair structure on Hardy spaces: In [Reference Gérard and Lenzmann15], turbulent rational global-in-time solutions have been constructed for the Calogero–Moser derivative NLS on the real line. For the cubic Szegő equation on the real line, turbulent rational solutions have been proven to exist in [Reference Gérard and Pushnitski18] along with their genericity.

We conclude this subsection by establishing that the spectral assumption in Theorem 1.3 for the Toeplitz operator $T_{\mathbf {U}_0}$ holds on a dense subset in $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathbb {S}^2)$ , thereby showing that the soliton resolution above holds on a dense subset in the energy space.

Theorem 1.4. The subset

$$ \begin{align*}\mathcal{R}at_{\mathrm{s}}(\mathbb{R};\mathbb{S}^2) := \left \{ \mathbf{u} \in \mathcal{R}at(\mathbb{R}; \mathbb{S}^2) \mid \sigma_{\mathrm{d}}(T_{\mathbf{U}}) \text{ is simple} \right \} \end{align*} $$

is dense in $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathbb {S}^2)$ .

We remark that the nonlinear constraint of taking values in $\mathbb {S}^2$ poses serious challenges when proving this density result. Also, the reader should avoid the fallacy of claiming that rational functions $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ with simple poles will always lead to simple discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ . We refer to Section 4 for more details.

Generalized half-wave maps equation

We now discuss a natural geometric generalization of (HWM) beyond the target $\mathbb {S}^2$ . The reader who is mainly interested in the $\mathbb {S}^2$ -valued case may skip this subsection at first reading.

For a given integer $d \geq 2$ , we let $M_d(\mathbb {C}) \equiv \mathbb {C}^{d \times d}$ denote the vector space of complex $d \times d$ -matrices. For matrix-valued maps $\mathbf {U} : [0,T) \times \mathbb {R} \to M_d(\mathbb {C})$ , we introduce the generalized half-wave maps equation given by

(HWM d ) $$\begin{align} \partial_t \mathbf{U} = -\frac{\mathrm{i}}{2} [\mathbf{U}, |D| \mathbf{U}] \, , \end{align} $$

subject to the initial condition $\mathbf {U}(0) = \mathbf {U}_0 : \mathbb {R} \to M_d(\mathbb {C})$ satisfying the algebraic constraints such that

(1.10) $$ \begin{align} \mathbf{U}_0(x) = \mathbf{U}_0(x)^* \quad \text{and} \quad \mathbf{U}_0(x)^2 = \mathrm{Id} \quad \text{for a. e. } x \in \mathbb{R} \,. \end{align} $$

We readily check that these properties of $\mathbf {U}_0$ are formally preserved along the flow of (HWM d ). At this point, we also mention that (HWM d ) can be formally seen as the zero-dispersion limit of the so-called spin Benjamin–Ono equation recently introduced in [Reference Berntson, Langmann and Lenells4]; see also below for further remarks on this.

The matrix-valued generalization of (HWM) above also has a straightforward geometric meaning as follows. Let $\mathsf {Gr}_k(\mathbb {C}^d)$ denote the complex Grassmannian consisting of the k-dimensional subspaces of the complex vector space $\mathbb {C}^d$ . We recall that $\mathsf {Gr}_k(\mathbb {C}^d)$ can be canonically identified with the space of self-adjoint projections $P =P^* \in M_d(\mathbb {C})$ with $\mathrm {rank}(P)=k$ . Since $\mathrm {Tr}(P)=\mathrm {rank}(P)$ for such projections P, we find

(1.11) $$ \begin{align} \mathsf{Gr}_k(\mathbb{C}^d) = \left \{ P \in M_d(\mathbb{C}) \mid P^*=P = P^2 \text{ and } \mathrm{Tr}(P)=k \right \}. \end{align} $$

We remark that $\mathsf {Gr}_k(\mathbb {C}^d)$ is a compact submanifold of real dimension $2k(d-k)$ embedded in $M_d(\mathbb {C})$ . In fact, we have that $\mathsf {Gr}_k(\mathbb {C}^d)$ is a compact complex Kähler manifold, see also below.

Thanks to the elementary affine relation

(1.12) $$ \begin{align} \mathbf{U} = \mathrm{Id} - 2 P \, , \quad \text{that is,} \quad P = \frac{1}{2} \left( \mathrm{Id} - \mathbf{U} \right) , \end{align} $$

we obtain the natural identification of the complex Grassmannians such that

(1.13) $$ \begin{align} \mathsf{Gr}_k(\mathbb{C}^d) \cong \left \{ \mathbf{U} \in M_d(\mathbb{C}) \mid \mathbf{U}^* = \mathbf{U} , \ \mathbf{U}^2 = \mathrm{Id} , \ \mathrm{Tr}(\mathbf{U}) = d - 2k \right \} \end{align} $$

for all $k=0, \ldots , d$ . With the simple relation (1.12) in mind, we will use the slight abuse of notation and identify elements in the right-hand side in (1.13) as elements in $\mathsf {Gr}_k(\mathbb {C}^d)$ in what follows. Moreover, throughout our discussion we will also include the trivial cases when $k=0$ or $k= d$ corresponding to $\mathbf{U} \equiv \mathrm{Id}$ or $\mathbf{U} \equiv -\mathrm{Id}$ , respectively.
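Indeed, as a quick check of this identification: if $P = P^* = P^2$ with $\mathrm{Tr}(P) = k$ , then $\mathbf{U} := \mathrm{Id} - 2P$ satisfies

$$ \begin{align*} \mathbf{U}^* = \mathbf{U} \, , \qquad \mathbf{U}^2 = \mathrm{Id} - 4P + 4P^2 = \mathrm{Id} \, , \qquad \mathrm{Tr}(\mathbf{U}) = d - 2 \, \mathrm{Tr}(P) = d - 2k \, , \end{align*} $$

and conversely every Hermitian matrix $\mathbf{U}$ with $\mathbf{U}^2 = \mathrm{Id}$ and $\mathrm{Tr}(\mathbf{U}) = d-2k$ arises in this way from the orthogonal projection $P = \frac{1}{2}(\mathrm{Id} - \mathbf{U})$ onto its eigenspace for the eigenvalue $-1$ , which has dimension $k$ .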

In addition to the constraints (1.10), it is easy to see that the matrix trace $\mathrm {Tr}(\mathbf {U}(t,x))$ is formally preserved in time along the flow of (HWM d ). Hence we can view solutions of (HWM d ) as maps

$$ \begin{align*}\mathbf{U} : [0,T) \times \mathbb{R} \to \mathsf{Gr}_k(\mathbb{C}^d) \, , \end{align*} $$

provided that the initial condition $\mathbf {U}_0 : \mathbb {R} \to M_d(\mathbb {C})$ satisfies the pointwise condition

(1.14) $$ \begin{align} \mathrm{Tr}(\mathbf{U}_0(x))= d-2k \quad \text{for a. e. } x \in \mathbb{R} \end{align} $$

in addition to the constraints (1.10) above.

Remarks. 1) For $d=2$ and $k=1$ , we see that (HWM d ) reduces to (HWM) in accordance with the classical fact that $\mathsf {Gr}_1(\mathbb {C}^2) \cong \mathbb {CP}^1 \cong \mathbb {S}^2$ .

2) For general $d \geq 2$ and $k=1$ , we recall that $\mathsf {Gr}_1(\mathbb {C}^d) \cong \mathbb {CP}^{d-1}$ . In particular, our global well-posedness result below will apply to the generalized half-wave maps equation with target being the complex projective spaces $\mathbb {CP}^{d-1}$ for any $d \geq 2$ .

We will prove that (HWM d ) also possesses a Lax structure on suitable $L^2$ -based Hardy spaces, which will be discussed in Section 3 below. For $d\geq 2$ and $0 \leq k \leq d$ given, we observe that the natural energy space for (HWM d ) reads

$$ \begin{align*}\dot{H}^{\frac 1 2}(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) := \left\{ \mathbf{U} \in \dot{H}^{\frac 1 2}(\mathbb{R}; M_d(\mathbb{C})) \mid \mathbf{U}(x) \in \mathsf{Gr}_k(\mathbb{C}^d) \text{ for a. e. } x \in \mathbb{R} \right\} \, \end{align*} $$

equipped with the natural Gagliardo semi-norm $\| \cdot \|_{\dot {H}^{\frac 1 2}}$ whose square is (up to a multiplicative constant) the energy functional for (HWM d ) given by

(1.15) $$ \begin{align} E(\mathbf{U}) = \frac{1}{2} \| \mathbf{U} \|_{\dot{H}^{\frac 1 2}}^2 = \frac{1}{4 \pi} \int_{\mathbb{R}} \int_{\mathbb{R}} \frac{ | \mathbf{U}(x)- \mathbf{U}(y)|_F^2}{|x-y|^2} \, dx \, dy \,. \end{align} $$

Here $| A |_F = \sqrt {\mathrm {Tr}(A^* A)}$ denotes the natural Frobenius norm for matrices $A \in M_d(\mathbb {C})$ .

In analogy to our analysis of (HWM), we define the set

$$ \begin{align*}\mathcal{R}at(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) := \left \{ \mathbf{U} : \mathbb{R} \to \mathsf{Gr}_k(\mathbb{C}^d) \mid \mathbf{U}(x) \text{ is rational} \right \}. \end{align*} $$

We have the following global well-posedness result about (HWM d ) for rational initial data, which includes Theorem 1.2 as a special case.

Theorem 1.5 (GWP of (HWM d ) for Rational Data).

Let $d \geq 2$ and $0 \leq k\leq d$ be integers. Then, for every initial datum $\mathbf {U}_0 \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ , there exists a unique global-in-time solution $\mathbf {U} \in C(\mathbb {R}; H^\infty _\bullet (\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d)))$ of (HWM d ) with $\mathbf {U}(0) = \mathbf {U}_0$ .

Generalizing the density result in Theorem 1.1, we have the following result proven in Appendix A.

Theorem 1.6. For every $d\geq 2$ and $0 \leq k \leq d$ , the subset $\mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ is dense in $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ .

Remark. The reader may wonder about finding explicit elements in $\mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ . Indeed, in the case $\mathsf {Gr}_1(\mathbb {C}^d) \cong \mathbb {CP}^{d-1}$ , we can easily construct rational maps as follows. Let $P_1, \ldots , P_d \in \mathbb {C}[X]$ be polynomials such that $f(x) := (P_1(x), \ldots , P_d(x)) \in \mathbb {C}^d \setminus \{ 0 \}$ for all $x \in \mathbb {R}$ . Evidently, the map $P : \mathbb {R} \to M_d(\mathbb {C})$ with

$$ \begin{align*}P(x) := \frac{f(x) \overline{f}(x)^t}{\langle f(x), f(x) \rangle_{\mathbb{C}^d}} \end{align*} $$

satisfies $P(x)=P(x)^*=P(x)^2$ with $\mathrm {Tr}(P(x)) \equiv 1$ . Thus $P$ belongs to $\mathcal {R}at(\mathbb {R}; \mathsf {Gr}_1(\mathbb {C}^d))$ .
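As a concrete instance of this construction (an illustrative example in the simplest case $d=2$ ), taking $f(x) = (1, x)$ gives

$$ \begin{align*} P(x) = \frac{1}{1+x^2} \left ( \begin{array}{cc} 1 & x \\ x & x^2 \end{array} \right ) , \end{align*} $$

a rational family of rank-one orthogonal projections with $\mathrm{Tr}(P(x)) \equiv 1$ and with the limit $P(x) \to \mathrm{diag}(0,1)$ as $|x| \to \infty$ .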

Next, we will extend Theorem 1.3 to the setting of half-wave maps with target $\mathsf {Gr}_k(\mathbb {C}^d)$ . Here, for a given initial datum $\mathbf {U}_0 \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ , the corresponding Toeplitz operator $T_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathbb {C}^d) \to L^2_+(\mathbb {R};\mathbb {C}^d)$ is analogously defined via $T_{\mathbf {U}_0} f = \Pi _+(\mathbf {U}_0 f)$ . Furthermore, the notion of traveling solitary waves for (HWM d ) is defined in the obvious manner: We say that a finite-energy solution to (HWM d ) of the form

$$ \begin{align*}\mathbf{U}_v(t,x) = \mathbf{Q}_v(x-vt) \end{align*} $$

is a traveling solitary wave with profile $\mathbf {Q}_v \in \dot {H}^{\frac 1 2}(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ and velocity $v \in \mathbb {R}$ . We have the following result.

Theorem 1.7 (Soliton Resolution and Non-Turbulence for (HWM d )).

Let $d \geq 2$ and $0 \leq k \leq d$ be given. Suppose that $\mathbf {U}_0 \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ and that its Toeplitz operator $T_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathbb {C}^d) \to L^2_+(\mathbb {R}; \mathbb {C}^d)$ has simple discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}_0})= \{v_1, \ldots , v_N \}$ .

Then the corresponding solution $\mathbf {U} \in C(\mathbb {R}; H^\infty _\bullet (\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d)))$ of (HWM d ) with initial datum $\mathbf {U}(0) = \mathbf {U}_0$ satisfies

$$ \begin{align*}\lim_{t \to \pm \infty} \| \mathbf{U}(t) - \mathbf{U}^\pm(t) \|_{\dot{H}^s} = 0 \quad \text{for all } s> 0 \, , \end{align*} $$

where

$$ \begin{align*}\mathbf{U}^{\pm}(t,x) = \sum_{j=1}^N \mathbf{Q}_{v_j}(x-v_j t) - (N-1) \mathbf{U}_\infty \,. \end{align*} $$

Here each $\mathbf {Q}_{v_j} \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ is a profile of a traveling solitary wave for (HWM d ) with velocity $v_j$ and it is given by

$$ \begin{align*}\mathbf{Q}_{v_j}(x) = \mathbf{U}_\infty + \frac{A_j}{x- y_j + \mathrm{i} \delta_j} + \frac{A_j^*}{x-y_j- \mathrm{i} \delta_j} \, , \end{align*} $$

with some matrices $A_j \in M_d(\mathbb {C})$ with $\mathrm {rank}(A_j)=1$ and $A_j^2 = 0$ for $j =1, \ldots , N$ , some real numbers $y_1, \ldots , y_N \in \mathbb {R}$ , some positive real numbers $\delta _1, \ldots , \delta _N> 0$ , and the constant matrix $\mathbf {U}_\infty = \lim _{|x| \to \infty } \mathbf {U}_0(x) \in \mathsf {Gr}_k(\mathbb {C}^d)$ .

Moreover, we have the a priori bounds

$$ \begin{align*}\sup_{t \in \mathbb{R}} \| \mathbf{U}(t) \|_{\dot{H}^s} \leq C(\mathbf{U}_0,s) \quad \text{for all } s>0 \,. \end{align*} $$

Remarks. 1) In the particular case $\mathsf {Gr}_1(\mathbb {C}^2) \cong \mathbb {CP}^1 \cong \mathbb {S}^2$ , we recover Theorem 1.3 above, except that Theorem 1.3 additionally identifies the traveling solitary wave profiles as being of ground state type in this case. For general targets $\mathsf {Gr}_k(\mathbb {C}^d)$ with $(k,d) \neq (1,2)$ , the complete classification of traveling solitary waves is open and hence we can only conclude that the profiles $\mathbf {Q}_{v_j}$ above give rise to traveling solitary waves for (HWM d ) with velocity $v_j$ .

2) It would be desirable to extend the density result for the simplicity condition on the discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}_0})$ stated in Theorem 1.4 to general targets $\mathsf {Gr}_k(\mathbb {C}^d)$ .

Strategy of proofs

Let us briefly outline the main ideas used in this paper.

The starting point of our analysis is a detailed study of the Lax pair structure of (HWM d ). In particular, this will largely extend the previous results found in [Reference Gérard and Lenzmann14] for (HWM) with target $\mathbb {S}^2$ . More precisely, we will show that, given a sufficiently smooth solution $\mathbf {U} : [0,T] \times \mathbb {R} \to \mathsf {Gr}_k(\mathbb {C}^d)$ of the matrix-valued (HWM d ), we obtain the following Lax equation

(1.16) $$ \begin{align} \frac{d}{dt} T_{\mathbf{U}(t)} = \big [B^+_{\mathbf{U}(t)}, T_{\mathbf{U}(t)} \big ] \,. \end{align} $$

Here $T_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R};\mathcal {V})$ denotes the Toeplitz operator given by

$$ \begin{align*}T_{\mathbf{U}} f = \Pi_+( \mathbf{U} f) \quad \text{for } f \in L^2_+(\mathbb{R}; \mathcal{V})\, , \end{align*} $$

where $\mathcal {V}$ either stands for

$$ \begin{align*}\mathcal{V}=\mathbb{C}^d \quad \text{or} \quad \mathcal{V}=M_d(\mathbb{C}) \, , \end{align*} $$

equipped with their canonical scalar products, see below. In fact, we shall use both choices of $\mathcal {V}$ in the course of our analysis below. Moreover, we remark that $T_{\mathbf {U}} = T_{\mathbf {U}}^*$ is self-adjoint and bounded with operator norm $\| T_{\mathbf {U}} \| = \| \mathbf {U} \|_{L^\infty }=1$ thanks to the algebraic constraints imposed on the matrix-valued function $\mathbf {U}$ . The second operator appearing in (1.16) reads

(1.17) $$ \begin{align} B^+_{\mathbf{U}} = \frac{\mathrm{i}}{2} \left ( D \circ T_{\mathbf{U}} + T_{\mathbf{U}} \circ D \right ) - \frac{\mathrm{i}}{2} T_{|D| \mathbf{U}} \, , \end{align} $$

which is an unbounded skew-adjoint operator with $\mathrm {dom}(B_{\mathbf {U}}) = H^1_+(\mathbb {R}; \mathcal {V})$ as its operator domain.

Now, another decisive feature of the Lax structure for (HWM d ) enters, which again is due to the algebraic constraints satisfied by the matrix-valued function $\mathbf {U}$ . Notably, we can derive the following key identity

(1.18) $$ \begin{align} \boxed{ T_{\mathbf{U}}^2 = \mathrm{Id} - H_{\mathbf{U}}^* H_{\mathbf{U}}} \end{align} $$

where $H_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_-(\mathbb {R}; \mathcal {V})$ denotes the Hankel operator given by

$$ \begin{align*}H_{\mathbf{U}} f = \Pi_- (\mathbf{U} f) \quad \text{for } f \in L^2_+(\mathbb{R}; \mathcal{V}) \end{align*} $$

where $\Pi _- := \mathrm {Id}-\Pi _+$ denotes the projection in $L^2(\mathbb {R};\mathcal {V})$ onto orthogonal complement of the Hardy space $L^2_+(\mathbb {R}; \mathcal {V})$ . By the Lax evolution (1.16) combined with (1.18), we obtain the infinite set of conserved quantities for (HWM d ) of the form

(1.19) $$ \begin{align} I_p(\mathbf{U}) = \mathrm{Tr}(|K_{\mathbf{U}}|^{p/2}) \quad \text{for any } p>0 \, , \end{align} $$

with the nonnegative operator $K_{\mathbf {U}} = H_{\mathbf {U}}^* H_{\mathbf {U}}$ . In particular, for $p=2$ , we obtain the trace norm of $K_{\mathbf {U}}$ , which is easily seen to be comparable to the square $\| \mathbf {U} \|_{\dot {H}^{\frac {1}{2}}}^2$ of the scaling-critical energy (semi-)norm; see Section 3 for more details. Furthermore, we see from (1.18) that $T_{\mathbf {U}}$ is Fredholm with index 0. We will make use of this fact further below in our analysis.
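For the reader's convenience, let us sketch why (1.18) is a direct consequence of the pointwise constraints on $\mathbf{U}$ : since $\mathbf{U} = \mathbf{U}^*$ , the adjoint Hankel operator acts as $H_{\mathbf{U}}^* g = \Pi_+(\mathbf{U} g)$ for $g \in L^2_-(\mathbb{R}; \mathcal{V})$ , and hence, using $\mathbf{U}^2 = \mathrm{Id}$ pointwise, we find for every $f \in L^2_+(\mathbb{R}; \mathcal{V})$ that

$$ \begin{align*} T_{\mathbf{U}}^2 f + H_{\mathbf{U}}^* H_{\mathbf{U}} f = \Pi_+ \big( \mathbf{U} \, \Pi_+(\mathbf{U} f) \big) + \Pi_+ \big( \mathbf{U} \, \Pi_-(\mathbf{U} f) \big) = \Pi_+ ( \mathbf{U}^2 f ) = f \,. \end{align*} $$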

However, as we have already mentioned above, the infinite family of conserved quantities $\{ I_p(\mathbf {U}) \}_{p \geq 1}$ does not seem to yield sufficient control to obtain global solutions for (HWM d ), even for smooth and sufficiently small initial data (i.e., small perturbations of a constant). To overcome this obstruction, we shall derive an explicit flow formula for sufficiently smooth solutions of (HWM d ), which is akin to the result discovered in [Reference Gérard10] for the Benjamin–Ono equation. More precisely, for solutions of (HWM d ) of the form

$$ \begin{align*}\mathbf{U}(t) = \mathbf{U}_\infty + \mathbf{V}(t) \in M_d(\mathbb{C}) \oplus C([0,T]; H^s(\mathbb{R}; M_d(\mathbb{C}))) \quad \text{with } s> \frac{3}{2} \, , \end{align*} $$

we derive that

(1.20) $$ \begin{align} \boxed{\Pi_+ \mathbf{V}(t,z) = \frac{1}{2\pi \mathrm{i}} I_+ \left ( (X^* + t T_{\mathbf{U}_0} -z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}_0 \right ) \quad \text{for } t \in [0,T] \text{ and } z \in \mathbb{C}_+} \end{align} $$

where $\mathbf {U}_0 = \mathbf {U}_\infty + \mathbf {V}_0 \in M_d(\mathbb {C}) \oplus H^s(\mathbb {R}; M_d(\mathbb {C}))$ denotes the initial datum for (HWM d ). In this formula, we emphasize the fact that $T_{\mathbf {U}_0}$ is now regarded as a Toeplitz operator acting on the Hardy space $L^2_+(\mathbb {R}; M_d(\mathbb {C}))$ with functions taking values in the space of complex $d \times d$ -matrices $M_d(\mathbb {C})$ . Furthermore, in analogous fashion to [Reference Gérard10], the operators $I_+$ and $X^*$ are given by

$$ \begin{align*}I_+(f) = \lim_{\xi \to 0^+} \widehat{f}(\xi) \quad \text{and} \quad \widehat{(X^* f)}(\xi) = \mathrm{i} \frac{d \widehat{f}}{d \xi}(\xi) \end{align*} $$

defined on their suitable domains $\mathrm {dom}(I_+)$ and $\mathrm {dom}(X^*)$ in $L^2_+(\mathbb {R};\mathcal {V})$ ; see Section 2 below for details. Now, the main challenge is to decide whether we can exploit this explicit representation to deduce that these strong solutions can be extended to all (forward) times, that is, whether $\mathbf {U} \in C([0,\infty ); H^s_\bullet (\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d)))$ holds. Surprisingly, this turns out to be a rather delicate question whose affirmative answer must necessarily exploit the algebraic constraints satisfied by the matrix-valued function $\mathbf {U}$ solving (HWM d ). By contrast, we remark that the explicit formula (up to an inessential rescaling of t) for the dispersionless limit of the scalar-valued Benjamin–Ono equation on the line reads the same as (1.20), with $T_{\mathbf {U}_0}$ simply replaced by the Toeplitz operator $T_{u_0} : L^2_+(\mathbb {R};\mathbb {C}) \to L^2_+(\mathbb {R}; \mathbb {C})$ associated with the bounded scalar-valued function $u_0 \in L^2(\mathbb {R}) \cap L^\infty (\mathbb {R})$ . However, for the dispersionless limit of (BO), it is known [Reference Gérard12] that strong continuity of the flow breaks down in finite time (corresponding to the development of shocks). Thus we cannot expect to derive global-in-time existence for (HWM d ) by a naive use of (1.20) that neglects the algebraic constraints on $\mathbf {U}$ .
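As a basic consistency check of the flow formula (1.20), we note that setting $t=0$ and using the reproducing identity (2.4) derived in Section 2 below yields

$$ \begin{align*} \frac{1}{2\pi \mathrm{i}} I_+ \left ( (X^* - z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}_0 \right ) = \Pi_+ \mathbf{V}_0(z) \quad \text{for } z \in \mathbb{C}_+ \, , \end{align*} $$

so that (1.20) indeed reproduces the initial datum at $t=0$ .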

In order to further exploit the fact that the initial data $\mathbf {U}_0$ for (HWM d ) are valued in $\mathsf {Gr}_k(\mathbb {C}^d)$ , we appeal again to the key identity (1.18). As a direct consequence, we obtain the natural orthogonal decomposition of the underlying Hardy space of the form

$$ \begin{align*}L^2_+(\mathbb{R}; \mathcal{V}) = \mathfrak{H}_0 \oplus \mathfrak{H}_1 \end{align*} $$

with the closed subspace

$$ \begin{align*}\mathfrak{H}_0 := \mathrm{ker}(K_{\mathbf{U}_0}) \quad \text{and} \quad \mathfrak{H}_1 := \mathfrak{H}_0^\perp = \overline{\mathrm{ran}(K_{\mathbf{U}_0})} \, , \end{align*} $$

where we recall the definition of the trace-class operator $K_{\mathbf {U}_0}=H_{\mathbf {U}_0}^* H_{\mathbf {U}_0}$ . Now, it turns out that $\Pi _+ \mathbf {V}_0 \in \mathfrak {H}_1$ and, in addition, that $\mathfrak {H}_1$ is an invariant subspace of both $T_{\mathbf {U}_0}$ and the semigroup generated by $X^*$ . As a consequence, the resolvent appearing on the right-hand side of (1.20) satisfies the mapping property $(X^* + t T_{\mathbf {U}_0} - z \mathrm {Id})^{-1} : \mathfrak {H}_1 \to \mathfrak {H}_1$ for any $t \in \mathbb {R}$ . Hence the explicit flow formula for (HWM d ) effectively takes place only on the invariant subspace $\mathfrak {H}_1$ . This is a great deal of information which can be used to deduce global existence of strong solutions! In particular, an adaptation of the classical Kronecker theorem for Hankel operators shows that

$$ \begin{align*}\dim(\mathfrak{H}_1) < +\infty \quad \text{if and only if} \quad \mathbf{U}_0 \text{ is a rational map} \,. \end{align*} $$

Thanks to this fact, the proof of global existence of strong solutions via (1.20) for rational initial data $\mathbf {U}_0$ amounts to showing that $M(t)=X^* + t T_{\mathbf {U}_0}$ has no real eigenvalues on $\mathfrak {H}_1$ for any $t \in \mathbb {R}$ ; indeed, this gives the injectivity of $M(t) - \lambda$ on $\mathfrak {H}_1$ for every $\lambda \in \mathbb {R}$ and hence, because $\mathfrak {H}_1$ is finite-dimensional in this setting, also its surjectivity.

Remark. The case of nonrational initial data $\mathbf {U}_0$ , which implies that $\dim \mathfrak {H}_1 = +\infty $ , and the question of global well-posedness for (HWM d ) will be studied in our companion work [Reference Gérard and Lenzmann16] posed on the torus.

Finally, let us briefly comment on the strategy behind the proofs of our further main results stated as Theorems 1.3 and 1.7 concerning the long-time behaviour of rational solutions. Inspired by our recent study of N-solitons for the Calogero–Moser derivative NLS in [Reference Gérard and Lenzmann15], the main idea rests on using the explicit flow formula combined with a perturbation analysis of the family of (bounded) operators

$$ \begin{align*}\varepsilon X^* + T_{\mathbf{U}_0} : \mathfrak{H}_1 \to \mathfrak{H}_1 \end{align*} $$

with $\varepsilon = \frac {1}{t}$ in the regime where $\varepsilon \to 0$ , under the assumption that $T_{\mathbf {U}_0} : \mathfrak {H}_1 \to \mathfrak {H}_1$ has simple spectrum. However, in striking contrast to the analysis in [Reference Gérard and Lenzmann15], we will find that turbulence (i.e., growth of higher Sobolev norms) can be ruled out for rational solutions of (HWM d ) provided that the Lax operator $T_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathbb {C}^d) \to L^2_+(\mathbb {R}; \mathbb {C}^d)$ has simple discrete spectrum.

Links to Schrödinger maps and spin Benjamin–Ono equation

In order to put (HWM d ) in a broader geometric perspective, we recall the well-known fact that $\mathsf {Gr}_k(\mathbb {C}^d)$ is a Kähler manifold of complex dimension $k(d-k)$ . Its complex structure $J_A$ on the tangent space $T_A \mathsf {Gr}_k(\mathbb {C}^d)$ at a point $A \in \mathsf {Gr}_k(\mathbb {C}^d)$ can be expressed as the matrix commutator

$$ \begin{align*}J_A(X) = -\frac{\mathrm{i}}{2}[A, X] \,. \end{align*} $$

Thus we see that (HWM d ) can be written as a Schrödinger maps-type equation of the form

(SM) $$ \begin{align} \partial_t \mathbf{U} = J_{\mathbf{U}} |D| \mathbf{U} \end{align} $$

with the first-order pseudo-differential operator $|D|$ . However, we will not further exploit this geometric point of view in our analysis here.

On the other hand, we also mention the remarkable fact that (HWM d ) can be formally seen as the zero-dispersion limit of the spin Benjamin–Ono equation (sBO), which was recently introduced by Berntson–Langmann–Lenells in [Reference Berntson, Langmann and Lenells4]. In our choice of units, this equation can be written as

(sBO) $$ \begin{align} \partial_t \mathbf{V} = \frac{1}{2} \partial_x ( |D| \mathbf{V} - \mathbf{V}^2) - \frac{\mathrm{i}}{2} [\mathbf{V}, |D| \mathbf{V}] \, , \end{align} $$

for the matrix-valued map $\mathbf {V} : [0,T) \times \mathbb {R} \to M_d(\mathbb {C})$ ; see also [Reference Gérard11] where a Lax pair structure for (sBO) was found. We notice that, in the special case of real-valued maps $\mathbf {V}(t,x) \in \mathbb {R}$ , we obtain the classical Benjamin–Ono equation (apart from trivial rescaling of t compared to the standard form of this equation).

At least on a formal level, we see that replacing $|D|$ by $\varepsilon |D|$ with $\varepsilon> 0$ in (sBO) and forcing the condition that $\mathbf{V}^2 = \mathrm{Id}$ , we are led to (HWM d ) when formally taking the zero-dispersion limit as $\varepsilon \to 0$ . For a rigorous analysis of the zero-dispersion limit of the scalar Benjamin–Ono equation, we refer to the recent work in [Reference Gérard12]. However, as already mentioned above, we will encounter a striking difference in our analysis here due to the algebraic constraint that is absent in the scalar case. From an operator theoretic point of view, this remarkable difference stems from the fact that Toeplitz operators $T_{\mathbf {U}}$ with matrix-valued symbols $\mathbf {U} : \mathbb {R} \to \mathbb {C}^{d \times 2}$ for $d \geq 2$ can have entirely different spectral properties compared to Toeplitz operators $T_f$ with scalar-valued symbols $f : \mathbb {R} \to \mathbb {C}$ . The interested reader will find more details on this difference further below.

2. Preliminaries and notation

In this section, we set up some definitions and notation used throughout this paper.

Sobolev-type spaces

For the study of the generalized half-wave maps equations (HWM $_d$ ), we introduce the following Sobolev-type spaces for matrix-valued functions. For an integer $d \geq 2$ , we use $M_d(\mathbb {C}) \equiv \mathbb {C}^{d \times d}$ to denote the Hilbert space of complex $d \times d$ -matrices equipped with the inner product

$$ \begin{align*}\langle A, B \rangle_F := \mathrm{Tr}( A B^*) \quad \text{for } A,B \in M_d(\mathbb{C}) \end{align*} $$

and the corresponding Frobenius norm of a matrix $A \in M_d(\mathbb {C})$ will be denoted by $| A |_F = \sqrt {\langle A, A \rangle _F}$ .

The Lebesgue spaces $L^p(\mathbb {R}; M_d(\mathbb {C}))$ and $L^p_{\mathrm {loc}}(\mathbb {R}; M_d(\mathbb {C}))$ are defined in an obvious manner. For $s> 0$ , we use the Sobolev spaces

$$ \begin{align*}\dot{H}^s(\mathbb{R}; M_d(\mathbb{C})) := \left \{ \mathbf{U} \in L^1_{\mathrm{loc}}(\mathbb{R}; M_d(\mathbb{C})) \mid \| \mathbf{U} \|_{\dot{H}^s} := \| |D|^s \mathbf{U} \|_{L^2} < +\infty \right \} \, , \end{align*} $$
$$ \begin{align*}H^s(\mathbb{R}; M_d(\mathbb{C})) := \left \{ \mathbf{V} \in L^1_{\mathrm{loc}}(\mathbb{R}; M_d(\mathbb{C})) \mid \| \mathbf{V} \|_{H^s} := \| \langle D \rangle^s \mathbf{V} \|_{L^2} < +\infty \right \} \,. \end{align*} $$

We set $\dot {H}^\infty (\mathbb {R}; M_d(\mathbb {C})) := \cap _{s> 0} \dot {H}^s(\mathbb {R}; M_d(\mathbb {C}))$ and $H^\infty (\mathbb {R}; M_d(\mathbb {C})) := \cap _{s>0} H^s(\mathbb {R}; M_d(\mathbb {C}))$ . Note that $\| \cdot \|_{\dot {H}^s}$ is a semi-norm, since nontrivial constant maps also belong to $\dot {H}^s(\mathbb {R}; M_d(\mathbb {C}))$ for $s>0$ . Furthermore, for $0 \leq k \leq d$ given, we define the spaces

$$ \begin{align*}\dot{H}^s(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) := \left \{ \mathbf{U} \in \dot{H}^s(\mathbb{R}; M_d(\mathbb{C})) \mid \mathbf{U}(x) \in \mathsf{Gr}_k(\mathbb{C}^d) \text{ for a. e. } x \in \mathbb{R} \right \} \,. \end{align*} $$

Note that the scaling-critical energy space associated to (HWM $_d$ ) with target $\mathsf {Gr}_k(\mathbb {C}^d)$ is $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ equipped with the Gagliardo semi-norm $\| \cdot \|_{\dot {H}^{\frac 1 2}}$ such that

(2.1) $$ \begin{align} \| \mathbf{U} \|_{\dot{H}^{\frac 1 2}}^2 = \| |D|^{\frac 1 2} \mathbf{U} \|_{L^2}^2 = \frac{1}{2 \pi} \int_{\mathbb{R}} \! \int_{\mathbb{R}} \frac{|\mathbf{U}(x)- \mathbf{U}(y)|_F^2}{|x-y|^2} \, dx \, dy \,. \end{align} $$

Note that $E(\mathbf {U}) = \frac {1}{2} \| \mathbf {U} \|_{\dot {H}^{\frac 1 2}}^2$ is the Hamiltonian energy functional for (HWM $_d$ ) with the natural symplectic form for maps defined on $\mathbb {R}$ with values in the complex Grassmannian $\mathsf {Gr}_k(\mathbb {C}^d)$ .

In addition to the $\dot {H}^s$ -spaces, it turns out to be convenient to introduce the following family of affine inhomogeneous Sobolev-type spaces given by

$$ \begin{align*}H^s_\bullet(\mathbb{R}; M_d(\mathbb{C})) := M_d(\mathbb{C}) \oplus H^s(\mathbb{R}; M_d(\mathbb{C})) \end{align*} $$

and we define $H^\infty _\bullet (\mathbb {R}; M_d(\mathbb {C})) := \cap _{s> 0} H^s_\bullet (\mathbb {R};M_d(\mathbb {C}))$ . Furthermore, we set

$$ \begin{align*}H^s_\bullet(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) := \left \{ \mathbf{U}\in H^s_\bullet(\mathbb{R}; M_d(\mathbb{C})) \mid \mathbf{U}(x) \in \mathsf{Gr}_k(\mathbb{C}^d) \text{ for a. e. } x\in \mathbb{R} \right \} \,. \end{align*} $$

It is easy to see that the following strict inclusions hold true:

$$ \begin{align*}\mathcal{R}at(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) \subsetneq H^s_\bullet(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) \subsetneq \dot{H}^s(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) \subsetneq L^\infty(\mathbb{R}; M_d(\mathbb{C})) \, , \end{align*} $$

where we recall that $\mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ denotes the space of rational maps from $\mathbb {R}$ to $\mathsf {Gr}_k(\mathbb {C}^d)$ .

For (HWM) with maps valued in the unit sphere $\mathbb {S}^2 \subset \mathbb {R}^3$ , we make use of the corresponding Sobolev spaces $H^s(\mathbb {R}; \mathbb {R}^3)$ and $\dot {H}^s(\mathbb {R}; \mathbb {R}^3)$ , where the energy space is

$$ \begin{align*}\dot{H}^{\frac 1 2}(\mathbb{R}; \mathbb{S}^2) := \left \{ \mathbf{u} \in \dot{H}^{\frac 1 2}(\mathbb{R}; \mathbb{R}^3) \mid \mathbf{u}(x) \in \mathbb{S}^2 \text{ for a. e. } x \in \mathbb{R} \right \} \end{align*} $$

endowed with the Gagliardo semi-norm $\| \cdot \|_{\dot {H}^{\frac 1 2}}$ such that

$$ \begin{align*}\| \mathbf{u} \|_{\dot{H}^{\frac 1 2}}^2 = \| |D|^{\frac 1 2} \mathbf{u} \|_{L^2}^2 = \frac{1}{2 \pi} \int_{\mathbb{R}} \! \int_{\mathbb{R}} \frac{|\mathbf{u}(x)- \mathbf{u}(y)|^2}{|x-y|^2} \, dx \, dy \,. \end{align*} $$

From the introduction above, we recall that unit vectors $\mathbf {u} \in \mathbb {S}^2$ can be equivalently encoded by using the standard Pauli matrices $(\sigma _1, \sigma _2, \sigma _3)$ via the relation

$$ \begin{align*}\mathbf{U} = \mathbf{u} \cdot \boldsymbol{\sigma} = u_1 \sigma_1 +u_2 \sigma_2 + u_3 \sigma_3 = \left ( \begin{array}{cc} u_3 & u_1 - \mathrm{i} u_2 \\ u_1 + \mathrm{i} u_2 & -u_3 \end{array} \right ) \, , \end{align*} $$

where we easily check that $\mathbf {U} = \mathbf {U}^*$ with $\mathbf{U}^2 = \mathrm{Id}$ and $\mathrm {Tr}(\mathbf {U}) = 0$ . Also, we find that $u_k= \frac {1}{2} \mathrm {Tr}(\mathbf {U} \sigma _k) = \frac {1}{2} \langle \mathbf {U}, \sigma _k \rangle _F$ for $k=1,2,3$ . Thus, by means of the relation $\mathbf {U} = \mathbf {u} \cdot \boldsymbol {\sigma }$ , we find the equivalence of (semi)-norms $\| \mathbf {u} \|_{\dot {H}^{s}} \sim \| \mathbf {U} \|_{\dot {H}^{s}}$ for all $s>0$ .
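In fact, using that the Pauli matrices satisfy $\mathrm{Tr}(\sigma_j \sigma_k) = 2 \delta_{jk}$ , one even has the exact identity

$$ \begin{align*} | \mathbf{a} \cdot \boldsymbol{\sigma} |_F^2 = 2 \sum_{k=1}^3 |a_k|^2 \quad \text{for } \mathbf{a} \in \mathbb{C}^3 \, , \qquad \text{whence} \quad \| \mathbf{U} \|_{\dot{H}^s}^2 = 2 \, \| \mathbf{u} \|_{\dot{H}^s}^2 \quad \text{for all } s> 0 \,. \end{align*} $$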

Hardy spaces, Toeplitz and Hankel operators

We consider the Hilbert space $L^2(\mathbb {R}; \mathcal {V})$ for maps on $\mathbb {R}$ with values in the finite-dimensional Hilbert spaces

$$ \begin{align*}\mathcal{V} = \mathbb{C}^d \quad \text{or} \quad \mathcal{V} = M_d(\mathbb{C}) \, , \end{align*} $$

which we endow with their natural inner products and norms, that is,

$$ \begin{align*}\langle u,v \rangle_{\mathcal{V}} = \sum_{k=1}^d u_k \overline{v}_k \quad \text{if } \mathcal{V}= \mathbb{C}^d, \qquad \langle A,B \rangle_{\mathcal{V}} = \mathrm{Tr}(A B^*) \quad \text{if } \mathcal{V} = M_d(\mathbb{C}) \,. \end{align*} $$

The Cauchy–Szegő projection $\Pi _+ : L^2(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R};\mathcal {V})$ onto the Hardy space

$$ \begin{align*}L^2_+(\mathbb{R}; \mathcal{V}) := \{ f \in L^2(\mathbb{R}; \mathcal{V}) \mid \mathrm{supp} \, \widehat{f} \subset [0, \infty) \} \end{align*} $$

is given by

$$ \begin{align*}(\Pi_+ f)(x) := \frac{1}{2 \pi} \int_0^{+\infty} \mathrm{e}^{\mathrm{i} x \xi} \widehat{f}(\xi) \, d\xi \quad \text{with} \quad \widehat{f}(\xi) = \int_{-\infty}^{+\infty} f(x) \mathrm{e}^{-\mathrm{i} \xi x} \, dx \, , \end{align*} $$

or, equivalently, we have $\widehat{\Pi_+ f}(\xi) = \mathbf{1}_{[0, \infty)}(\xi) \, \widehat{f}(\xi)$ on the Fourier side. We use

$$ \begin{align*}\Pi_- := \mathrm{Id} - \Pi_+ \end{align*} $$

to denote projection onto the orthogonal complement

$$ \begin{align*}L^2_-(\mathbb{R}; \mathcal{V}) := (L^2_+(\mathbb{R};\mathcal{V}))^\perp = \{ f \in L^2(\mathbb{R};\mathcal{V}) \mid \mathrm{supp} \, \widehat{f}(\xi) \subset (-\infty,0] \}\,. \end{align*} $$

From standard Paley–Wiener theory we recall that elements $f \in L^2_+(\mathbb {R}; \mathcal {V})$ can be naturally identified with holomorphic functions defined on the complex upper half-plane $\mathbb {C}_+$ such that

$$ \begin{align*}L^2_+(\mathbb{R}; \mathcal{V}) \cong\left \{ f \in \mathrm{Hol}(\mathbb{C}_+; \mathcal{V}) \mid \sup_{y>0} \int_{\mathbb{R}} |f(x+ \mathrm{i} y)|_{\mathcal{V}} ^2 \, dx < +\infty \, \right \} \, , \end{align*} $$

where $|\cdot |_{\mathcal {V}}$ denotes the norm on $\mathcal {V}$ . Throughout this paper, we will freely make use of this fact and we thus regard elements $f \in L^2_+(\mathbb {R}; \mathcal {V})$ as holomorphic functions $f=f(z)$ with $z \in \mathbb {C}_+$ . We use $\mathsf {H} = -\mathrm {i} \Pi _+ + \mathrm {i} \Pi _-$ to denote the Hilbert transform on $L^2(\mathbb {R};\mathcal {V})$ , which can also be written as the singular integral operator

$$ \begin{align*}(\mathsf{H} f)(x) = \frac{1}{\pi} \mathrm{p.v.} \int_{\mathbb{R}} \frac{f(y)}{x-y} \, dy. \end{align*} $$

For a bounded matrix-valued function $\mathbf {U} \in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ , we define the corresponding Toeplitz operator as

$$ \begin{align*}T_{\mathbf{U}} : L^2_+(\mathbb{R}; \mathcal{V}) \to L^2_+(\mathbb{R};\mathcal{V}), \quad f \mapsto T_{\mathbf{U}} f := \Pi_+(\mathbf{U} f) \,. \end{align*} $$

Likewise, the corresponding Hankel operator is given by

$$ \begin{align*}H_{\mathbf{U}} : L^2_+(\mathbb{R}; \mathcal{V}) \to L^2_-(\mathbb{R}; \mathcal{V}), \quad f \mapsto H_{\mathbf{U}} f := \Pi_-(\mathbf{U} f ) \,. \end{align*} $$

We remark that we adapt the definition of $H_{\mathbf {U}}$ from Peller’s book [Reference Peller32]; another (equivalent) definition of Hankel operators can be achieved by anti-linear operators (see, e.g., [Reference Gérard and Pushnitski17]). However, for studying the Lax pair structure for (HWM $_d$ ), we have found it more convenient to use the present convention for $H_{\mathbf {U}}$ .

A central fact about Hankel operators used in this paper is Kronecker’s theorem, which relates the rationality of the symbol $\mathbf {U}$ with the property that $H_{\mathbf {U}}$ has finite rank. We refer the reader to Section 4 for details. Furthermore, we remark that $H_{\mathbf {U}}$ is Hilbert-Schmidt if and only if $\mathbf {U} \in \dot {H}^{\frac 1 2}$ ; see again Section 4 for a detailed discussion.

The operators $X^*$ , X, and $I_+$

On the Hardy space $L^2_+(\mathbb {R}; \mathcal {V})$ , we recall that the adjoint Lax–Beurling semigroup $\{ S(\eta )^* \}_{\eta \geq 0}$ is given by

$$ \begin{align*}(S(\eta)^* f)(x) = \Pi_+( e^{-\mathrm{i} \eta x} f(x)) \quad \text{for } f \in L^2_+(\mathbb{R}; \mathcal{V}) \text{ and } \eta \geq 0 \, , \end{align*} $$

which corresponds to the contraction semigroup of left shifts on $L^2_+(\mathbb {R}; \mathcal {V})$ . We remark that $S(\eta )^* = e^{-\mathrm {i} \eta X^*}$ , where its generator $X^*$ is given by the unbounded operator

$$ \begin{align*}\widehat{(X^* f)}(\xi) = \mathrm{i} \frac{d \widehat{f}}{d \xi}(\xi) \end{align*} $$

with the operator domain

$$ \begin{align*}\mathrm{dom}(X^*) = \big \{ f \in L^2_+(\mathbb{R}; \mathcal{V}) \mid \frac{d \widehat{f}}{d\xi} \in L^2(\mathbb{R}_+; \mathcal{V}) \big \} \,. \end{align*} $$

It is straightforward to check that all rational functions $f\in L^2_+(\mathbb {R}; \mathcal {V})$ belong to $\mathrm {dom}(X^*)$ . For $z_0 \in \mathbb {C}_+$ , the action of the resolvent $(X^*-z_0)^{-1}$ is easily found to be

$$ \begin{align*}((X^*-z_0)^{-1} f)(z) = \frac{f(z)-f(z_0)}{z-z_0} \, , \end{align*} $$

for all $f \in L^2_+(\mathbb {R}; \mathcal {V})$ . We remark that $X^*$ is the adjoint of the unbounded operator

$$ \begin{align*}(X f)(x)= x f \end{align*} $$

corresponding to multiplication with $x \in \mathbb {R}$ and its operator domain is given by

$$ \begin{align*} \mathrm{dom}(X) & = \big \{ f \in L^2_+(\mathbb{R}; \mathcal{V}) \mid x f \in L^2(\mathbb{R}; \mathcal{V}) \big \} \\ & = \big \{ f \in L^2_+(\mathbb{R}; \mathcal{V}) \mid \frac{d \widehat{f}}{d\xi} \in L^2(\mathbb{R}_+; \mathcal{V}) \text{ and } \widehat{f}(0) = 0 \big \} \,. \end{align*} $$

Note that X is the generator of the Lax–Beurling semigroup $\{ S(\eta ) \}_{\eta \geq 0}$ corresponding to right shifts on $L^2_+(\mathbb {R}; \mathcal {V})$ , that is, we have

$$ \begin{align*}(S(\eta) f)(x) = e^{\mathrm{i} \eta x} f(x) \quad \text{for } f \in L^2_+(\mathbb{R}; \mathcal{V}) \text{ and } \eta \geq 0 \,. \end{align*} $$

We will sometimes use the notation $S(\eta ) = e^{\mathrm {i} \eta X}$ . Note that the strict inclusion $\mathrm {dom}(X) \subsetneq \mathrm {dom}(X^*)$ holds: for example, the rational function $\frac {1}{x+\mathrm {i}}$ belongs to $\mathrm {dom}(X^*)$ but not to $\mathrm {dom}(X)$ . Further details on the generators $X^*$ and X can be found in [Reference Gérard and Pushnitski17] in the scalar-valued case when $\mathcal {V}$ is replaced by $\mathbb {C}$ , but the necessary adaptations to our setting are elementary.
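As a quick formal check of the resolvent formula above, fix $z_0 \in \mathbb{C}_+$ and set $g(x) := \frac{f(x)-f(z_0)}{x-z_0}$ , which belongs to $L^2_+(\mathbb{R}; \mathcal{V})$ since $|x - z_0| \geq \mathrm{Im} \, z_0> 0$ for $x \in \mathbb{R}$ and the singularity at $z_0$ is removable. For every $h \in \mathrm{dom}(X)$ we have $h \in L^1(\mathbb{R}; \mathcal{V})$ with $\int_{\mathbb{R}} h \, dx = \widehat{h}(0) = 0$ , and therefore

$$ \begin{align*} \langle g, (X - \overline{z}_0) h \rangle = \int_{\mathbb{R}} (x-z_0) \, \langle g(x), h(x) \rangle_{\mathcal{V}} \, dx = \int_{\mathbb{R}} \langle f(x) - f(z_0), h(x) \rangle_{\mathcal{V}} \, dx = \langle f, h \rangle \, , \end{align*} $$

which means that $(X^* - z_0) g = f$ in the weak sense.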

In addition to the generator $X^*$ , another important operator in our analysis is given by the (unbounded) linear operator

$$ \begin{align*}I_+ : \mathrm{dom}(X^*) \subset L^2_+(\mathbb{R}; \mathcal{V}) \to \mathcal{V}, \quad f \mapsto I_+(f):= \widehat{f}(0^+) = \lim_{\xi \to 0^+} \widehat{f}(\xi) \,. \end{align*} $$

Note that the definition of $I_+$ as the one-sided limit of $\widehat {f}(\xi )$ as $\xi \to 0^+$ makes sense for any $f \in \mathrm {dom}(X^*)$ by the standard trace theorem for Sobolev functions in $H^1(\mathbb {R}_+)$ . An alternative and useful expression for the action of $I_+$ is found by using the approximate identity $\chi _\varepsilon $ with

$$ \begin{align*}\chi_\varepsilon(x) := \frac{1}{1- \mathrm{i} \varepsilon x} \in L^2_+(\mathbb{R}; \mathbb{C}) \quad \text{for } \varepsilon> 0 \,. \end{align*} $$

Let $\mathbf {v} \in \mathcal {V}$ be a fixed vector. By Plancherel’s identity, we have

$$ \begin{align*}\lim_{\varepsilon \to 0} \langle f, \mathbf{v} \chi_\varepsilon \rangle = \lim_{\varepsilon \to 0} \frac{1}{\varepsilon} \int_0^\infty \langle \widehat{f}(\xi), \mathbf{v} \rangle_{\mathcal{V}} \, \mathrm{e}^{-\xi/\varepsilon} \, d \xi = \langle \widehat{f}(0^+), \mathbf{v} \rangle_{\mathcal{V}} = \langle I_+(f), \mathbf{v} \rangle_{\mathcal{V}} \,. \end{align*} $$
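Here we used Plancherel’s theorem in the form $\langle f, g \rangle = \frac{1}{2\pi} \int_0^\infty \langle \widehat{f}(\xi), \widehat{g}(\xi) \rangle_{\mathcal{V}} \, d\xi$ for $f, g \in L^2_+(\mathbb{R}; \mathcal{V})$, together with the elementary formula $\widehat{\chi}_\varepsilon(\xi) = \frac{2\pi}{\varepsilon} \, \mathrm{e}^{-\xi/\varepsilon}$ for $\xi > 0$ (and $\widehat{\chi}_\varepsilon(\xi) = 0$ for $\xi < 0$), which in particular confirms that $\chi_\varepsilon \in L^2_+(\mathbb{R}; \mathbb{C})$.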

Thus, for any orthonormal basis $(\mathbf {v}_1, \ldots , \mathbf {v}_N)$ in $\mathcal {V}$ with $N = \dim \mathcal {V}$ , we obtain

(2.2) $$ \begin{align} I_+(f) = \lim_{\varepsilon \to 0} \sum_{k=1}^{N} \langle f, \mathbf{v}_k \chi_\varepsilon \rangle \mathbf{v}_k \,. \end{align} $$

For later use, we also record the following formula

(2.3) $$ \begin{align} \mathrm{Im} \langle X^* f, f \rangle = -\frac{1}{4 \pi} | I_+(f)|^2_{\mathcal{V}} \quad \text{for } f \in \mathrm{dom}(X^*) \, , \end{align} $$

which is a simple consequence of Plancherel’s identity and integration by parts.
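For the reader’s convenience, we sketch the short computation: recalling that $\widehat{X^* f}(\xi) = \mathrm{i} \, \frac{d \widehat{f}}{d\xi}(\xi)$ for $\xi > 0$ and using the Plancherel identity as above together with $\mathrm{Im}(\mathrm{i} w) = \mathrm{Re}(w)$, we find

$$ \begin{align*} \mathrm{Im} \langle X^* f, f \rangle = \frac{1}{2\pi} \int_0^\infty \mathrm{Re} \, \Big\langle \frac{d \widehat{f}}{d\xi}(\xi), \widehat{f}(\xi) \Big\rangle_{\mathcal{V}} d\xi = \frac{1}{4\pi} \int_0^\infty \frac{d}{d\xi} |\widehat{f}(\xi)|^2_{\mathcal{V}} \, d\xi = -\frac{1}{4 \pi} |I_+(f)|^2_{\mathcal{V}} \, , \end{align*} $$

where we also used that $\widehat{f}(\xi) \to 0$ as $\xi \to +\infty$ for $f \in \mathrm{dom}(X^*)$.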

Finally, we record another elementary fact involving the operators $I_+$ and $X^*$ as follows. Let $f \in L^2_+(\mathbb {R}, \mathcal {V})$ be given. As before, we suppose that $(\mathbf {v}_1, \ldots , \mathbf {v}_N)$ is an orthonormal basis of $\mathcal {V}$ with $N = \dim \mathcal {V}$ . Thus we can write

$$ \begin{align*}f(x) = \sum_{k=1}^N f_k(x) \mathbf{v}_k \end{align*} $$

with $f_k(x) = \langle f(x), \mathbf {v}_k \rangle _{\mathcal {V}} \in L^2_+(\mathbb {R}; \mathbb {C})$ for $k=1, \ldots , N$ . Since $\widehat {f}(\xi )= \sum _{k=1}^N \widehat {f}_k(\xi ) \mathbf {v}_k$ , we find

$$ \begin{align*} \widehat{f}(\xi) = \sum_{k=1}^N \lim_{\varepsilon \to 0} \left ( \int_{\mathbb{R}} \mathrm{e}^{-\mathrm{i} x \xi} \frac{ f_k(x) }{1+\mathrm{i} \varepsilon x} \, dx \right ) \mathbf{v}_k = \sum_{k=1}^N \lim_{\varepsilon \to 0} \langle S(\xi)^* f, \mathbf{v}_k\chi_\varepsilon \rangle \, \mathbf{v}_k \,. \end{align*} $$

By taking the inverse Fourier transform, we obtain, for any $z \in \mathbb {C}_+$ , that

$$ \begin{align*} f(z) & = \frac{1}{2 \pi} \int_0^{\infty} \mathrm{e}^{\mathrm{i} z \xi} \left ( \sum_{k=1}^N \lim_{\varepsilon \to 0} \langle S(\xi)^* f, \mathbf{v}_k\chi_\varepsilon \rangle \, \mathbf{v}_k \right ) d \xi \\ & = \frac{1}{2 \pi} \lim_{\varepsilon \to 0} \sum_{k=1}^N \left ( \int_0^{\infty} \langle e^{\mathrm{i} z \xi -\mathrm{i} \xi X^*} f, \mathbf{v}_k\chi_\varepsilon \rangle \, d \xi \right ) \mathbf{v}_k \\ & = \frac{1}{2 \pi \mathrm{i}} \lim_{\varepsilon \to 0} \sum_{k=1}^N \left \langle (X^*- z \mathrm{Id})^{-1} f, \mathbf{v}_k \chi_\varepsilon \right \rangle \mathbf{v}_k \,. \end{align*} $$

In view of (2.2), we therefore deduce the identity

(2.4) $$ \begin{align} \boxed{f(z) = \frac{1}{2 \pi \mathrm{i}} I_+[(X^*- z \mathrm{Id})^{-1} f] } \end{align} $$

which is valid for any $f \in L^2_+(\mathbb {R}; \mathcal {V})$ and $z \in \mathbb {C}_+$ .
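To illustrate identity (2.4), consider the simple rational example $f(x) = \frac{\mathbf{v}}{x+\mathrm{i}}$ with a fixed vector $\mathbf{v} \in \mathcal{V}$. By the resolvent formula above,

$$ \begin{align*} ((X^*- z \mathrm{Id})^{-1} f)(w) = \frac{1}{w-z} \Big( \frac{1}{w+\mathrm{i}} - \frac{1}{z+\mathrm{i}} \Big) \mathbf{v} = -\frac{\mathbf{v}}{(z+\mathrm{i})(w+\mathrm{i})} \, , \end{align*} $$

and since $\widehat{(x+\mathrm{i})^{-1}}(0^+) = -2\pi\mathrm{i}$, we obtain $I_+[(X^*- z \mathrm{Id})^{-1} f] = \frac{2\pi\mathrm{i}}{z+\mathrm{i}} \mathbf{v}$. Hence the right-hand side of (2.4) equals $\frac{\mathbf{v}}{z+\mathrm{i}} = f(z)$, as it should.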

3. Lax pair structure

In this section, we substantially extend the results from [Reference Gérard and Lenzmann14], where a Lax pair structure for (HWM) was discovered. In fact, we consider the generalized matrix-valued equation (HWM $_d$ ) in what follows.

Let $d \geq 2$ and $0 \leq k \leq d$ be fixed integers. We consider solutions $\mathbf {U} : [0,T] \times \mathbb {R} \to M_d(\mathbb {C})$ to the initial-value problem for the generalized matrix-valued half-wave maps equation which is given by

(HWM d ) $$\begin{align} \partial_t \mathbf{U} = -\frac{\mathrm{i}}{2} [ \mathbf{U}, |D| \mathbf{U} ], \quad \mathbf{U}(0) = \mathbf{U}_0 \in H^s(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) \,. \end{align}$$

For local well-posedness of (HWM d ) with initial data in the inhomogeneous Sobolev-type spaces $H^s_\bullet (\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ with $s> \frac {3}{2}$ , we refer the reader to Section 5 below. Note that in (HWM d ) we use $[X,Y] \equiv XY-YX$ to denote the commutator of $d \times d$ -matrices and the operator $|D|$ is supposed to act on each component of the matrix-valued function $\mathbf {U}$ .
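For orientation, we note that the classical case (HWM) corresponds to $d=2$ and $k=1$: writing $\mathbf{U} = \mathbf{u} \cdot \boldsymbol{\sigma} = u_1 \sigma_1 + u_2 \sigma_2 + u_3 \sigma_3$ with the standard Pauli matrices $\sigma_1, \sigma_2, \sigma_3$, we have $\mathbf{U}^* = \mathbf{U}$ and $\mathbf{U}^2 = |\mathbf{u}|^2 \mathds{1} = \mathds{1}$, and the commutator identity $[\mathbf{a} \cdot \boldsymbol{\sigma}, \mathbf{b} \cdot \boldsymbol{\sigma}] = 2 \mathrm{i} \, (\mathbf{a} \times \mathbf{b}) \cdot \boldsymbol{\sigma}$ shows that (HWM $_d$ ) for $\mathbf{U}$ is equivalent to (HWM) for $\mathbf{u} : [0,T] \times \mathbb{R} \to \mathbb{S}^2$, in accordance with the identification $\mathbb{S}^2 \cong \mathsf{Gr}_1(\mathbb{C}^2)$ (cf. Remark 4.1 below).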

We introduce some notation as follows. We recall that $\mathcal {V}$ either denotes $\mathbb {C}^d$ or $M_d(\mathbb {C})$ , equipped with their natural inner products and norms. For a bounded matrix-valued function $\mathbf {F} \in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ , we let $\mu _{\mathbf {F}}$ denote the corresponding multiplication operator acting on $L^2(\mathbb {R}; \mathcal {V})$ , that is, we set

$$ \begin{align*}(\mu_{\mathbf{F}} f)(x) = \mathbf{F}(x) f(x) \,. \end{align*} $$

This distinction between $\mathbf {F}$ and its multiplication operator $\mu _{\mathbf {F}}$ will be needed for better clarity in this section.Footnote 2

Given a map $\mathbf {U} : [0,T] \times \mathbb {R} \to \mathsf {Gr}_k(\mathbb {C}^d)$ and some time $t \in [0,T]$ , we denote the corresponding (bounded) multiplication operator by

$$ \begin{align*}\mu_{\mathbf{U}(t)} : L^2(\mathbb{R}, \mathcal{V}) \to L^2(\mathbb{R}, \mathcal{V}), \quad f \mapsto \mu_{\mathbf{U}(t)} f \,. \end{align*} $$

Since $\mathbf {U}(t,x)^* = \mathbf {U}(t,x)$ for a. e. $x \in \mathbb {R}$ , we readily see that $\mu _{\mathbf {U}(t)}= \mu _{\mathbf {U}(t)}^*$ is self-adjoint.

We have the following result about (HWM d ), which establishes a general Lax pair structure.

Lemma 3.1 (Lax equation).

Let $s> \frac {3}{2}$ and suppose $\mathbf {U} \in C([0,T], H^s_\bullet (\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d)))$ is a solution of (HWM d ). Then for any operator $L_{\mathbf {U}(t)} \in \{ \mu _{\mathbf {U}(t)}, \Pi _+, \Pi _- \}$ , it holds that

$$ \begin{align*}\frac{d}{dt} L_{\mathbf{U}(t)} = [B_{\mathbf{U}(t)}, L_{\mathbf{U}(t)}] \, , \end{align*} $$

with the operator

$$ \begin{align*}B_{\mathbf{U}} = -\frac{\mathrm{i}}{2} ( \mu_{\mathbf{U}} \circ |D| + |D| \circ \mu_{\mathbf{U}}) + \frac{\mathrm{i}}{2} \mu_{|D| \mathbf{U}} \,. \end{align*} $$

Remarks. 1) From the assumed regularity of $\mathbf {U}=\mathbf {U}(t,x)$ above, we readily infer that $B_{\mathbf {U}}$ is a pseudo-differential operator of order one with operator domain $\mathrm {dom}(B_{\mathbf {U}}) = H^1(\mathbb {R}; \mathcal {V})$ , and that it is essentially skew-adjoint, that is, it has a unique skew-adjoint extension satisfying $B_{\mathbf {U}}^* = -B_{\mathbf {U}}$ . See Appendix D for details.

2) The fact that $\mu _{\mathbf {U}(t)}$ together with the orthogonal projections $\Pi _{\pm }$ are Lax operators for the same $B_{\mathbf {U}}$ allows us to restrict the Lax structure to the Hardy space $L^2_+(\mathbb {R}; \mathcal {V})$ involving Toeplitz and Hankel operators; see below for more details.

Proof. We divide the proof into the following cases.

Case: $\boldsymbol {L=\mu }_{\mathbf {U}}$ . Using (HWM d ), we directly find

(3.1) $$ \begin{align} \partial_t \mu_{\mathbf{U}} = -\frac{\mathrm{i}}{2} [\mu_{\mathbf{U}}, \mu_ {|D| \mathbf{U}}] = \frac{\mathrm{i}}{2} [\mu_{|D| \mathbf{U}}, \mu_{\mathbf{U}}] \,. \end{align} $$

In view of the expression for $B_{\mathbf {U}}$ , it remains to show that

(3.2) $$ \begin{align} [\mu_{\mathbf{U}} \circ |D| + |D| \circ \mu_{\mathbf{U}}, \mu_{\mathbf{U}}] = 0 \,. \end{align} $$

Indeed, by using that $(\mu _{\mathbf {U}})^2 = \mu _{\mathbf {U}^2} = \mathrm {Id}$ since $\mathbf {U}(t,x)^2 = \mathds {1}$ holds pointwise, we readily check that

$$ \begin{align*} [\mu_{\mathbf{U}} \circ |D| + |D| \circ \mu_{\mathbf{U}}, \mu_{\mathbf{U}}] & = (\mu_{\mathbf{U}} \circ |D| + |D| \circ \mu_{\mathbf{U}}) \circ \mu_{\mathbf{U}} - \mu_{\mathbf{U}} \circ (\mu_{\mathbf{U}} \circ |D| + |D| \circ \mu_{\mathbf{U}}) \\ & =\mu_{\mathbf{U}} \circ |D| \circ \mu_{\mathbf{U}} + |D| - |D| - \mu_{\mathbf{U}} \circ |D| \circ \mu_{\mathbf{U}} = 0 \,. \end{align*} $$

Case: $\boldsymbol {L=\Pi _{\pm }}$ . Here it is convenient to show that the Hilbert transform $\mathsf {H}$ is a Lax operator for $B_{\mathbf {U}}$ . The claim then readily follows for $\Pi _{\pm }=\frac 1 2 (\mathrm {Id} \pm \mathrm {i} \mathsf {H})$ , since $\mathrm {Id}$ commutes with any operator. Since $\frac {d}{dt} \mathsf {H} \equiv 0$ , we need to show that

$$ \begin{align*}[B_{\mathbf{U}},\mathsf{H}] = 0. \end{align*} $$

From the well-known product identity

$$ \begin{align*}\mathsf{H}(fg) = (\mathsf{H} f) g + f (\mathsf{H} g) + \mathsf{H}(\mathsf{H} f \mathsf{H} g) \end{align*} $$

and using that $\mathsf {H} |D|=-\partial _x$ , we readily find that

$$ \begin{align*}[\mathsf{H},\mu_{|D| \mathbf{U}}] = -\mu_{\partial_x \mathbf{U}} - \mathsf{H} \mu_{\partial_x \mathbf{U}} \mathsf{H}. \end{align*} $$
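Indeed, applying the product identity entrywise with the first factor $|D| \mathbf{U}$ gives

$$ \begin{align*} \mathsf{H}\big( (|D| \mathbf{U}) f \big) = (\mathsf{H} |D| \mathbf{U}) f + (|D| \mathbf{U}) \, \mathsf{H} f + \mathsf{H}\big( (\mathsf{H} |D| \mathbf{U}) \, \mathsf{H} f \big) = -(\partial_x \mathbf{U}) f + (|D| \mathbf{U}) \, \mathsf{H} f - \mathsf{H}\big( (\partial_x \mathbf{U}) \, \mathsf{H} f \big) \, , \end{align*} $$

and subtracting $(|D| \mathbf{U}) \, \mathsf{H} f = \mu_{|D| \mathbf{U}} \mathsf{H} f$ yields the previous display.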

Hence we get

$$ \begin{align*} [\mathsf{H}, |D| \circ \mu_{\mathbf{U}} + \mu_{\mathbf{U}} \circ |D| ] & = \mathsf{H} \circ ( |D| \circ \mu_{\mathbf{U}} + \mu_{\mathbf{U}} \circ |D| ) - (|D| \circ \mu_{\mathbf{U}} + \mu_{\mathbf{U}} \circ |D|) \circ \mathsf{H} \\ & = -\partial_x \circ \mu_{\mathbf{U}} + \mathsf{H} \circ \mu_{\mathbf{U}} \circ \mathsf{H} \partial_x - \mathsf{H} \partial_x \circ \mu_{\mathbf{U}} \circ \mathsf{H} + \mu_{\mathbf{U}} \circ \partial_x \\ & = \mu_{\mathbf{U}} \circ \partial_x - \partial_x \circ \mu_{\mathbf{U}} + \mathsf{H} \circ \mu_{\mathbf{U}} \circ \partial_x \mathsf{H} - \mathsf{H} \partial_x \circ \mu_{\mathbf{U}} \circ \mathsf{H} \\ & = [\mu_{\mathbf{U}},\partial_x] + \mathsf{H} [\mu_{\mathbf{U}},\partial_x] \mathsf{H} \\ & = -\mu_{\partial_x \mathbf{U}} - \mathsf{H} \circ \mu_{\partial_x \mathbf{U}} \circ \mathsf{H}. \end{align*} $$

Therefore, we find

$$ \begin{align*} [\mathsf{H},B_{\mathbf{U}}] & = -\frac{\mathrm{i}}{2} [\mathsf{H}, |D| \circ \mu_{\mathbf{U}} + \mu_{\mathbf{U}} \circ |D| ] + \frac{\mathrm{i}}{2} [\mathsf{H}, \mu_{|D| \mathbf{U}}] \\ & = -\frac{\mathrm{i}}{2} (-\mu_{\partial_x \mathbf{U}} - \mathsf{H} \circ \mu_{\partial_x \mathbf{U}} \circ \mathsf{H} ) + \frac{\mathrm{i}}{2} ( -\mu_{\partial_x \mathbf{U}} - \mathsf{H} \circ \mu_{\partial_x \mathbf{U}} \circ \mathsf{H} ) = 0 \,. \end{align*} $$

This completes the proof of Lemma 3.1.

From the Leibniz rule for commutators $[X,YZ]=Y[X,Z] + [X,Y]Z$ and the corresponding rule for derivatives $\frac {d}{dt} (XY) = \dot {X}Y+ X \dot {Y}$ , we immediately observe from Lemma 3.1 that all finite linear combinations of products involving the operators $\{ \mu _{\mathbf {U}}, \Pi _+, \Pi _- \}$ are Lax operators too. For instance, in view of $\mathsf {H} = -\mathrm {i} \Pi _+ + \mathrm {i} \Pi _-$ , we recover the following Lax operator of commutator-type with

$$ \begin{align*}L_{\mathbf{U}} = [\mathsf{H}, \mu_{\mathbf{U}}] = \mathsf{H} \mu_{\mathbf{U}} - \mu_{\mathbf{U}} \mathsf{H} \, , \end{align*} $$

which was already found in [Reference Gérard and Lenzmann14]. By taking traces of powers of $L_{\mathbf {U}}$ , we obtain the conserved quantities

$$ \begin{align*}\mathrm{Tr}(|L_{\mathbf{U}(t)}|^p) = \text{const.} \quad \text{for } 0 < p < \infty. \end{align*} $$

Thus, by adapting Peller’s theorem, we obtain the a priori bounds

$$ \begin{align*}\mathrm{Tr}(|L_{\mathbf{U}(t)}|^p) \sim_p \| \mathbf{U}(t) \|_{\dot{B}^{1/p}_{p}}^p \sim \| \mathbf{U}(0) \|_{\dot{B}^{1/p}_p}^p \end{align*} $$

for the homogeneous Besov-type norms $\| \cdot \|_{\dot {B}^{1/p}_p}$ for solutions of (HWM $_d$ ).Footnote 3 However, these a priori bounds are not known to provide sufficient control to deduce global-in-time existence of solutions.
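In particular, for $p=2$ these bounds express the conservation of the energy norm: since $\dot{B}^{1/2}_{2}$ coincides with $\dot{H}^{1/2}$ and $\mathrm{Tr}(|L_{\mathbf{U}(t)}|^2) = \| [\mathsf{H}, \mu_{\mathbf{U}(t)}] \|_{HS}^2$ is a constant multiple of $\| \mathbf{U}(t) \|_{\dot{H}^{1/2}}^2$ (see the double-integral formula recalled in the proof of Lemma 4.1 below), we obtain $\| \mathbf{U}(t) \|_{\dot{H}^{1/2}} = \| \mathbf{U}(0) \|_{\dot{H}^{1/2}}$ as long as the solution exists.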

In order to further exploit the Lax pair structure attached to (HWM $_d$ ), we make the following observation involving operator analysis on Hardy spaces. Notice that, for a bounded matrix-valued function $\mathbf {F} \in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ , the corresponding Toeplitz and Hankel operators with symbol $\mathbf {F}$ can be written as $T_{\mathbf {F}} f = \Pi _+ (\mu _{\mathbf {F}} f)$ and $H_{\mathbf {F}} f = \Pi _- (\mu _{\mathbf {F}} f)$ , where $\mu _{\mathbf {F}}$ denotes the multiplication operator with symbol $\mathbf {F}$ . Now, by using Lemma 3.1 together with $T_{\mathbf {U}} = \Pi _+ \mu _{\mathbf {U}} \Pi _+$ and $[B_{\mathbf {U}}, \Pi _{\pm }] \equiv 0$ (also by Lemma 3.1), we can easily deduce the following fact.

Corollary 3.1 (Toeplitz Lax Structure).

For $\mathbf {U}=\mathbf {U}(t,x)$ as in Lemma 3.1, we have the Lax equation

$$ \begin{align*}\frac{d}{dt} T_{\mathbf{U}(t)} = \left [ B^+_{\mathbf{U}(t)}, T_{\mathbf{U}(t)} \right ] \,. \end{align*} $$

Here $B^+_{\mathbf {U}} = \Pi _+ B_{\mathbf {U}} \Pi _+$ is the compression of $B_{\mathbf {U}}$ onto $L^2_+(\mathbb {R};\mathcal {V})$ which is given by

$$ \begin{align*}B^+_{\mathbf{U}} = -\frac{\mathrm{i}}{2} ( T_{\mathbf{U}} \circ D + D \circ T_{\mathbf{U}} ) + \frac{\mathrm{i}}{2} T_{|D| \mathbf{U}} \end{align*} $$

with $D=-\mathrm {i} \partial _x$ .

Remarks. 1) Note that the compressed operator $B^+_{\mathbf {U}}$ is a differential operator as $|D| f = Df = -\mathrm {i} \partial _x f$ for $f \in (H^1 \cap L^2_+)(\mathbb {R}; \mathcal {V})$ .

2) For $t \in [0,T]$ , let $\mathcal {U}(t) : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ denote the unitary operator generated by the skew-adjoint operator $B^+_{\mathbf {U}(t)}$ so that

(3.3) $$ \begin{align} \frac{d}{dt} \mathcal{U}(t) = B^+_{\mathbf{U}(t)} \mathcal{U}(t) \quad \text{for } t \in [0,T], \quad \mathcal{U}(0) = \mathrm{Id} \,. \end{align} $$

For existence and uniqueness of this operator-valued initial-value problem, we refer to Appendix D. As a direct consequence of Corollary 3.1, we find that $T_{\mathbf {U}(t)}$ and $T_{\mathbf {U}(0)}$ are given by unitary conjugation:

$$ \begin{align*}T_{\mathbf{U}(t)} = \mathcal{U}(t) T_{\mathbf{U}(0)} \mathcal{U}(t)^* \quad \text{for } t \in [0,T] \,. \end{align*} $$
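Indeed, using (3.3), the adjoint equation $\frac{d}{dt} \mathcal{U}(t)^* = -\mathcal{U}(t)^* B^+_{\mathbf{U}(t)}$, and Corollary 3.1, we find

$$ \begin{align*} \frac{d}{dt} \big( \mathcal{U}(t)^* T_{\mathbf{U}(t)} \mathcal{U}(t) \big) = \mathcal{U}(t)^* \Big( -B^+_{\mathbf{U}(t)} T_{\mathbf{U}(t)} + \big[ B^+_{\mathbf{U}(t)}, T_{\mathbf{U}(t)} \big] + T_{\mathbf{U}(t)} B^+_{\mathbf{U}(t)} \Big) \mathcal{U}(t) = 0 \, , \end{align*} $$

so that $\mathcal{U}(t)^* T_{\mathbf{U}(t)} \mathcal{U}(t) = T_{\mathbf{U}(0)}$ for all $t \in [0,T]$.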

In particular, we obtain the invariance of the spectrum $\sigma (T_{\mathbf {U}(t)}) = \sigma (T_{\mathbf {U}(0)})$ for $t \in [0,T]$ .

3) Of course, the Hankel operator $H_{\mathbf {U}(t)}$ also satisfies a corresponding Lax equation with $B_{\mathbf {U}(t)}^+$ replaced by the “twisted” compressed operator $\Pi _- B_{\mathbf {U}(t)} \Pi _+$ . But in what follows we shall only work with the Lax equation for $T_{\mathbf {U}(t)}$ , which allows us to conclude all the necessary facts for the arguments developed below.

For later use, we record the following commutator relations, where we remind the reader that we occasionally use $A.B$ to denote the matrix product $AB$ on $M_d(\mathbb {C})$ for better readability.

Lemma 3.2. Let $\mathbf {U} = \mathbf {U}_\infty + \mathbf {V} \in M_d(\mathbb {C}) \oplus (L^\infty \cap L^2)(\mathbb {R}; M_d(\mathbb {C}))$ . Then, for every $f \in \mathrm {dom}(X^*)$ , we have $T_{\mathbf {U}} f \in \mathrm {dom}(X^*)$ and

$$ \begin{align*}[X^*, T_{\mathbf{U}}] f = \frac{\mathrm{i}}{2 \pi} \Pi_+ \mathbf{V}.I_+(f) \,. \end{align*} $$

Moreover, it holds that

$$ \begin{align*}[X^*, T_{\mathbf{U}}^2] f = \frac{\mathrm{i}}{2 \pi} T_{\mathbf{U}}( \Pi_+ \mathbf{V}.I_+(f)) + \frac{\mathrm{i}}{2 \pi} \Pi_+ \mathbf{V}.I_+(T_{\mathbf{U}} f) \,. \end{align*} $$

Proof. First, we note that $[X^*, T_{\mathbf {U}_\infty }] = 0$ , since $\mathbf {U}_\infty \in M_d(\mathbb {C})$ is a constant matrix. Also, we evidently have that $T_{\mathbf {U}_\infty } f \in \mathrm {dom}(X^*)$ whenever $f \in \mathrm {dom}(X^*)$ .

Thus it remains to discuss the commutator $[X^*, T_{\mathbf {U}}] = [X^*, T_{\mathbf {V}}]$ with $\mathbf {V} \in (L^\infty \cap L^2)(\mathbb {R}; M_d(\mathbb {C}))$ . Indeed, by adapting the proof in [Reference Gérard and Pushnitski17, Lemma 2.3] to the matrix-valued symbol $\mathbf {V}$ , we find that

$$ \begin{align*}[X^*,T_{\mathbf{V}}] f = \frac{\mathrm{i}}{2 \pi} \Pi_+ \mathbf{V}.I_+(f) \end{align*} $$

noticing that $T_{\mathbf {V}} f \in \mathrm {dom}(X^*)$ for any $f \in \mathrm {dom}(X^*)$ . We leave the details to the reader.

The commutator identity for $[X^*, T_{\mathbf {U}}^2]$ simply follows from the first identity and the fact that $[A,BC] = B[A,C] + [A,B]C$ .

4. Spectral analysis of $T_{\mathbf {U}}$

As in the previous sections, we let $\mathcal {V}$ either stand for the Hilbert spaces $\mathbb {C}^d$ or $M_d(\mathbb {C})$ for some given integer $d \geq 2$ . The aim of this section is to derive some fundamental spectral properties of the Toeplitz operator

$$ \begin{align*}T_{\mathbf{U}} : L^2_+(\mathbb{R}; \mathcal{V}) \to L^2_+(\mathbb{R}; \mathcal{V}), \quad f \mapsto T_{\mathbf{U}} f= \Pi_+(\mathbf{U} f) \,. \end{align*} $$

Throughout the following we will always assume that

$$ \begin{align*}\mathbf{U}(x) = \mathbf{U}_\infty + \mathbf{V}(x) \in H^{\frac 1 2}_\bullet(\mathbb{R}; M_d(\mathbb{C})) \equiv M_d(\mathbb{C}) \oplus H^{\frac{1}{2}}(\mathbb{R}; M_d(\mathbb{C})) \end{align*} $$

together with the pointwise algebraic constraints

$$ \begin{align*}\mathbf{U}(x) = \mathbf{U}(x)^* \quad \text{and} \quad \mathbf{U}(x)^2 = \mathds{1} \quad \text{for a.e. } x \in \mathbb{R} \,. \end{align*} $$

As a consequence, we see that the corresponding Toeplitz operator $T_{\mathbf {U}}=T_{\mathbf {U}}^*$ is self-adjoint and bounded with operator norm $\| T_{\mathbf {U}} \| \leq 1$ . Moreover, we readily check that the following properties hold.

  1. (i) $\mathbf {U}_\infty ^* = \mathbf {U}_\infty $ and $\mathbf {U}_\infty ^2 = \mathds {1}$ .

  2. (ii) $\mathbf {U}, \mathbf {V} \in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ with $\|\mathbf {U} \|_{L^\infty } = 1$ and $\|\mathbf {V} \|_{L^\infty } \leq \|\mathbf {U}_\infty \|_{L^\infty } + \| \mathbf {U} \|_{L^\infty } = 2$ .

  3. (iii) $\mathbf {V} = \mathbf {V}_{+} + \mathbf {V}_{+}^*$ with $\mathbf {V}_{+} = \Pi _+ \mathbf {V}$ and $\mathbf {V}_{+}^* = \Pi _- \mathbf {V}$ .

Fredholm property and invariant subspaces

Recall that

$$ \begin{align*}H_{\mathbf{U}}: L^2_+(\mathbb{R}; \mathcal{V}) \to L^2_-(\mathbb{R}; \mathcal{V}), \quad H_{\mathbf{U}} f= \Pi_-(\mathbf{U} f) \end{align*} $$

denotes the corresponding (block) Hankel operator with matrix-valued symbol $\mathbf {U}$ . For later use, we remark that the adjoint Hankel operator is given by

$$ \begin{align*}H_{\mathbf{U}}^* : L^2_-(\mathbb{R}; \mathcal{V}) \to L^2_+(\mathbb{R};\mathcal{V}), \quad H_{\mathbf{U}}^* f = \Pi_+(\mathbf{U} f) \,. \end{align*} $$

Remark. Here we used the fact that $\mathbf {U}(x)^* = \mathbf {U}(x)$ almost everywhere. For general matrix-valued symbols $\mathbf {F}\in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ , the adjoint Hankel operator is $H_{\mathbf {F}}^* f = \Pi _+(\mathbf {F}^* f)$ for $f\in L^2_-(\mathbb {R};\mathcal {V})$ .

We have the following general fact for matrix-valued symbols $\mathbf {U}$ satisfying the assumptions stated above.

Lemma 4.1 (Key Identity and Fredholmness).

We have the identity

$$ \begin{align*}T_{\mathbf{U}}^2 = \mathrm{Id} - K_{\mathbf{U}} \quad \text{on } L^2_+(\mathbb{R}; \mathcal{V}) \, , \end{align*} $$

where the self-adjoint operator

$$ \begin{align*}K_{\mathbf{U}} := H^*_{\mathbf{U}} H_{\mathbf{U}} : L^2_+(\mathbb{R}; \mathcal{V}) \to L^2_+(\mathbb{R}; \mathcal{V}) \end{align*} $$

satisfies $0 \leq K_{\mathbf {U}} \leq \mathrm {Id}$ and is trace-class with

$$ \begin{align*}\mathrm{Tr}(K_{\mathbf{U}}) = \mathrm{Tr}(H^*_{\mathbf{U}} H_{\mathbf{U}}) = \mathrm{const.} \cdot \| \mathbf{U} \|_{\dot{H}^{\frac 1 2}}^2 \,. \end{align*} $$

Moreover, the Toeplitz operator $T_{\mathbf {U}}$ is Fredholm with index 0.

Proof. Let us consider the case $\mathcal {V}= \mathbb {C}^d$ , where we remark that the proof for $\mathcal {V} = M_d(\mathbb {C})$ is analogous. Suppose that $f \in L^2_+(\mathbb {R}; \mathbb {C}^d)$ is given. Using that $\mathbf {U}(x)^2 = \mathds {1}$ and $\mathbf {U}^*(x)=\mathbf {U}(x)$ hold for a. e. $x \in \mathbb {R}$ , we observe that

$$ \begin{align*} T_{\mathbf{U}} ( T_{\mathbf{U}} f) & = \Pi_+(\mathbf{U} \Pi_+(\mathbf{U} f)) = \Pi_+ ({\mathbf{U}} (\mathrm{Id} - \Pi_-) \mathbf{U} f) \\ & = \Pi_+ f - \Pi_+ ( {\mathbf{U}} (\Pi_- \mathbf{U} f) ) = f - H_{\mathbf{U}}^* H_{\mathbf{U}} f, \end{align*} $$

since we trivially have that $\Pi _+ f = f$ for $f \in L^2_+(\mathbb {R}; \mathbb {C}^d)$ . This proves the claimed identity.

Consider now the bounded and self-adjoint operator

$$ \begin{align*}K_{\mathbf{U}} := H_{\mathbf{U}}^* H_{\mathbf{U}} : L^2_+(\mathbb{R}; \mathbb{C}^d) \to L^2_+(\mathbb{R}; \mathbb{C}^d) \,. \end{align*} $$

Clearly, we have that $K_{\mathbf {U}} \geq 0$ . Also, we notice that $\| K_{\mathbf {U}} \| \leq \| H_{\mathbf {U}} \|^2 \leq \| \mathbf {U} \|_{L^\infty }^2 = 1$ , which shows that $K_{\mathbf {U}} \leq \mathrm {Id}$ holds in the sense of operators. Next, we observe that $K_{\mathbf {U}}$ is trace-class with

(4.1) $$ \begin{align} \mathrm{Tr}(K_{\mathbf{U}}) = \mathrm{Tr}(H_{\mathbf{U}}^* H_{\mathbf{U}}) = \| H_{\mathbf{U}} \|_{HS}^2 = c \cdot \| \mathbf{U} \|_{\dot{H}^{\frac 1 2}}^2 \, , \end{align} $$

where $c>0$ is some numerical constant. Here $\| A \|_{HS}$ denotes the Hilbert–Schmidt norm of a bounded operator $A : H_1 \to H_2$ with separable Hilbert spaces $H_1, H_2$ , that is, we have

$$ \begin{align*}\| A \|_{HS}^2 = \sum_{n=1}^\infty \langle A e_n, A e_n \rangle_{H_2} \, , \end{align*} $$

where $(e_n)_{n \in \mathbb {N}}$ is an arbitrary orthonormal basis of $H_1$ . For the last equation in (4.1), we give an elementary proof taken from [Reference Gérard and Lenzmann14]. Using the orthogonal decomposition $L^2(\mathbb {R};\mathbb {C}^d) = L^2_+(\mathbb {R}; \mathbb {C}^d) \oplus L^2_-(\mathbb {R}; \mathbb {C}^d)$ , we consider the commutator of $\mathbf {U}$ , viewed as a multiplication operator on $L^2(\mathbb {R}; \mathbb {C}^d)$ , with the Hilbert transform $\mathsf {H}$ . This can be written as a $2 \times 2$ -matrix of operators such that

$$ \begin{align*}[\mathsf{H}, \mathbf{U}] = \left ( \begin{array}{cc} 0 & -\mathrm{i} \Pi_+ \mathbf{U} \Pi_- \\ \mathrm{i} \Pi_- \mathbf{U} \Pi_+ & 0 \end{array} \right ) : L^2_+(\mathbb{R}; \mathbb{C}^d) \oplus L^2_-(\mathbb{R}; \mathbb{C}^d) \to L^2_+(\mathbb{R}; \mathbb{C}^d) \oplus L^2_-(\mathbb{R};\mathbb{C}^d). \end{align*} $$

On the other hand, from the singular integral formula for $\mathsf {H}$ , we easily see that $[\mathsf {H}, \mathbf {U}]$ has the integral kernel $h_{\mathbf {U}}(x,y) = \frac {1}{\pi } \frac {\mathbf {U}(x)-\mathbf {U}(y)}{x-y} \in L^2(\mathbb {R} \times \mathbb {R}; M_d(\mathbb {C}))$ . Hence its Hilbert-Schmidt norm as an operator acting on $L^2(\mathbb {R}; \mathcal {V})$ can be directly computed as

$$ \begin{align*}\| [\mathsf{H}, \mathbf{U}] \|_{HS}^2 = \| h_{\mathbf{U}} \|_{L^2(\mathbb{R} \times \mathbb{R}; M_d(\mathbb{C}))}^2 = \frac{1}{\pi^2} \int_{\mathbb{R}} \! \int_{\mathbb{R}} \frac{|\mathbf{U}(x)- \mathbf{U}(y)|_F^2}{|x-y|^2} \, dx \,dy \, , \end{align*} $$

where $| \cdot |_F$ denotes the Frobenius norm of matrices in $M_d(\mathbb {C})$ . Next, by using that $\mathbf {U}= \mathbf {U}^*$ holds, we see that $\| h_{\mathbf {U}} \|_{HS}^2 = \| \Pi _+ \mathbf {U} \Pi _- \|_{HS}^2 + \| \Pi _- \mathbf {U} \Pi _+ \|_{HS}^2 =2 \| H_{\mathbf {U}} \|_{HS}^2$ . Recalling the formula (2.1), we deduce that the last equation in (4.1) holds.

It remains to prove that $T_{\mathbf {U}}$ is Fredholm with index 0. Indeed, we readily see that $T_{\mathbf {U}}$ is Fredholm since $T_{\mathbf {U}}$ is invertible modulo compact operators, which directly follows from the identity $T_{\mathbf {U}} T_{\mathbf {U}} = T_{\mathbf {U}}^2 = \mathrm {Id} - K_{\mathbf {U}}$ . Since $T_{\mathbf {U}}$ is self-adjoint, its Fredholm index must be 0.

In view of the general identity established in Lemma 4.1, it is natural to introduce the following closed subspaces

(4.2) $$ \begin{align} \boxed {\mathfrak{H}_0 := \mathrm{ker}(K_{\mathbf{U}}) \quad \text{and} \quad \mathfrak{H}_1 := \overline{\mathrm{ran}(K_{\mathbf{U}})} } \end{align} $$

which yields the orthogonal decomposition

(4.3) $$ \begin{align} \boxed{L^2_+(\mathbb{R}; \mathcal{V}) = \mathfrak{H}_0 \oplus \mathfrak{H}_1 } \end{align} $$

As a direct consequence, we obtain a decomposition of the Toeplitz operator $T_{\mathbf {U}}$ into invariant subspaces in the spirit of the celebrated Sz.–Nagy–Foiaș decomposition for contractions on Hilbert spaces [Reference Sz.-Nagy, Foias, Bercovici and Kérchy34] (also referred to as Langer’s lemma in [Reference Langer24]), which in turn is a generalization of the well-known Wold decomposition for isometries on Hilbert spaces. Note that $T_{\mathbf {U}}$ is a contraction, because its operator norm satisfies $\|T_{\mathbf {U}}\| \leq \| \mathbf {U} \|_{L^\infty } = 1$ , where in fact we have equality in view of the identity in Lemma 4.1 above.

Proposition 4.1. The subspaces $\mathfrak {H}_0$ and $\mathfrak {H}_1$ are invariant under $T_{\mathbf {U}}$ . Moreover, the restriction $T_{\mathbf {U}} |_{\mathfrak {H}_0}$ is unitary, whereas the restriction $T_{\mathbf {U}} |_{\mathfrak {H}_1}$ is completely nonunitary (c.n.u.), that is, there is no nontrivial invariant subspace in $\mathfrak {H}_1$ on which $T_{\mathbf {U}}$ is unitary.

Proof. Since the operator $K_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ with $0 \leq K_{\mathbf {U}} \leq 1$ is compact and self-adjoint, we can write

$$ \begin{align*}K_{\mathbf{U}} = \sum_{j=1}^N \lambda_j \langle \cdot, \varphi_j\rangle \varphi_j \end{align*} $$

with $N = \mathrm {rank}(K_{\mathbf {U}}) \in \mathbb {N} \cup \{\infty \}$ , $\lambda _j \in (0, 1]$ for $j=1, \ldots , N$ , and the corresponding eigenvectors $(\varphi _j)_{j=1}^N$ form an orthonormal basis of $\mathfrak {H}_1 = \overline {\mathrm {ran}(K_{\mathbf {U}})}$ . By the identity $T_{\mathbf {U}}^2 = \mathrm {Id} - K_{\mathbf {U}}$ and from elementary spectral calculus for the self-adjoint restrictions $T_{\mathbf {U}} |_{E_{\lambda _j}}$ on the finite-dimensional subspaces $E_{\lambda _j}=\mathrm {ker}(K_{\mathbf {U}}-\lambda _j \mathrm {Id}) \subset \mathfrak {H}_1$ , we deduce that

(4.4) $$ \begin{align} T_{\mathbf{U}} = \sum_{j=1}^N \varepsilon_j \sqrt{1- \lambda_j} \langle \cdot, \varphi_j \rangle \varphi_j \quad \text{on } \mathfrak{H}_1 \end{align} $$

with some $\varepsilon _j \in \{ \pm 1 \}$ for $j=1, \ldots , N$ . Evidently, we have that $\mathfrak {H}_1$ is invariant under $T_{\mathbf {U}}$ and we see that $T_{\mathbf {U}} |_{\mathfrak {H}_1}$ is c.n.u., because otherwise $T_{\mathbf {U}} |_{\mathfrak {H}_1}$ would have an eigenvalue $\mu \in \{ \pm 1\}$ on some finite-dimensional subspace $E_{\lambda _j}$ , contradicting the above explicit formula since $\lambda _j> 0$ holds.

Since $\mathfrak {H}_0 = \mathrm {ker}(K_{\mathbf {U}}) = \mathfrak {H}_1^\perp $ and by self-adjointness of $T_{\mathbf {U}}$ , we see that $T_{\mathbf {U}}(\mathfrak {H}_0) \subset \mathfrak {H}_0$ . Furthermore from $T_{\mathbf {U}}^2 = \mathrm {Id} - K_{\mathbf {U}}$ , we readily find $T^2_{\mathbf {U}} |_{\mathfrak {H}_0} = \mathrm {Id} |_{\mathfrak {H}_0}$ , which implies that the self-adjoint operator $T_{\mathbf {U}} |_{\mathfrak {H}_0}$ is also unitary.

Thanks to the formula $T_{\mathbf {U}}^2 = \mathrm {Id} - K_{\mathbf {U}}$ and the decomposition obtained in Proposition 4.1, we deduce that the spectrum of $T_{\mathbf {U}}$ decomposes as

$$ \begin{align*}\sigma(T_{\mathbf{U}}) = \sigma_{\mathrm{e}}(T_{\mathbf{U}}) \sqcup \sigma_{\mathrm{d}}(T_{\mathbf{U}}) \, , \end{align*} $$

where the essential and discrete spectra of $T_{\mathbf {U}}$ are given by

$$ \begin{align*} \sigma_{\mathrm{e}}(T_{\mathbf{U}}) & = \sigma(T_{\mathbf{U}} |_{\mathfrak{H}_0}) \subset \{ \pm 1 \} \, , \\ \sigma_{\mathrm{d}}(T_{\mathbf{U}}) & = \sigma_{\mathrm{d}}( T_{\mathbf{U}} |_{\mathfrak{H}_1}) = \{ \varepsilon_j \sqrt{1-\lambda_j} \mid j=1,\ldots, \mathrm{rank}(K_{\mathbf{U}}) \} \, , \end{align*} $$

with the sequences $(\varepsilon _j)\subset \{ \pm 1 \}$ and $(\lambda _j) \subset (0, 1]$ taken from (4.4) above.

Remark. As an aside, we remark that the property of the Toeplitz operator $T_{\mathbf {U}}$ having nonempty discrete spectrum is due to the fact that its symbol $\mathbf {U}(x)$ is matrix-valued. By contrast, a classical result due to Widom [Reference Widom36] states that any Toeplitz operator $T_\varphi : L^2_+(\mathbb {R}; \mathbb {C}) \to L^2_+(\mathbb {R}; \mathbb {C})$ with a scalar-valued symbol $\varphi \in L^\infty (\mathbb {R}; \mathbb {C})$ has a connected spectrum $\sigma (T_\varphi ) \subset \mathbb {C}$ ; in particular, $\sigma _{\mathrm {d}}(T_\varphi ) = \emptyset $ in this case.

As a next step, we find some explicit elements in the invariant subspace $\mathfrak {H}_1$ . The use of this fact will become clear later on when proving our global well-posedness result for (HWM $_d$ ). Recall our assumption that

(4.5) $$ \begin{align} \mathbf{U} = \mathbf{U}_\infty + \mathbf{V} \in M_d(\mathbb{C}) \oplus H^{\frac 1 2}(\mathbb{R}; M_d(\mathbb{C})) \,. \end{align} $$

For later use, we make the following observation. For better readability, we use $B.A$ to denote the product $BA$ of two matrices $B, A\in M_d(\mathbb {C})$ .

Proposition 4.2. Let $\mathcal {V} = M_d(\mathbb {C})$ . For any constant matrix $A \in M_d(\mathbb {C})$ , it holds that $\Pi _+ \mathbf {V}.A \in \mathfrak {H}_1$ .

Proof. Since $K_{\mathbf {U}} = H^*_{\mathbf {U}} H_{\mathbf {U}}$ , we find that $\mathrm {ker}(K_{\mathbf {U}}) = \mathrm {ker}(H_{\mathbf {U}})$ , which yields that $\mathfrak {H}_1= (\mathfrak {H}_0)^\perp = \overline {\mathrm {ran}(H^*_{\mathbf {U}})}$ . Hence we have to show that $\Pi _+ \mathbf {V}.A \in \overline {\mathrm {ran}(H^*_{\mathbf {U}})}$ holds for any constant matrix $A \in M_d(\mathbb {C})$ . Indeed, by recalling $\chi _\varepsilon = \frac {1}{1-\mathrm {i} \varepsilon x} \in L^2_+(\mathbb {R}; \mathbb {C})$ for $\varepsilon> 0$ and hence $\overline {\chi }_\varepsilon \in L^2_-(\mathbb {R}; \mathbb {C})$ , we notice

$$ \begin{align*} \lim_{\varepsilon \to 0} H^*_{\mathbf{U}}(\overline{\chi}_\varepsilon A) & = \lim_{\varepsilon \to 0} \Pi_+( \mathbf{U}. \overline{\chi}_\varepsilon A) \\ & = \lim_{\varepsilon \to 0} \Pi_+ ( (\mathbf{U}_\infty + \Pi_+ \mathbf{V} + \Pi_- \mathbf{V}). \overline{\chi}_\varepsilon A ) \\ & = \lim_{\varepsilon \to 0} \Pi_+( (\Pi_+\mathbf{V}) \overline{\chi_\varepsilon}).A = \Pi_+ \mathbf{V}.A, \end{align*} $$

because of $\lim _{\varepsilon \to 0} \Pi _+ ( f \overline {\chi }_\varepsilon ) = f$ in $L^2_+(\mathbb {R}; \mathcal {V})$ by dominated convergence. This shows that $\Pi _+ \mathbf {V}.A$ belongs to $\mathfrak {H}_1=\overline {\mathrm {ran}(H^*_{\mathbf {U}})}$ .

Next, by using the well-known fact that kernels of Hankel operators are invariant under the Lax–Beurling semigroup $\{ S(\eta ) \}_{\eta \geq 0}$ , we obtain the following result.

Lemma 4.2. It holds that $S(\eta )\mathfrak {H}_0 \subset \mathfrak {H}_0$ and $S(\eta )^* \mathfrak {H}_1 \subset \mathfrak {H}_1$ for all $\eta \geq 0$ .

Proof. Let $f \in \mathfrak {H}_0 = \mathrm {ker}(K_{\mathbf {U}}) = \mathrm {ker}(H_{\mathbf {U}})$ . For any $\eta \geq 0$ , we immediately observe that

$$ \begin{align*} H_{\mathbf{U}}(S(\eta) f) & = \Pi_-(\mathbf{U} S(\eta) f) = \Pi_- (\mathrm{e}^{\mathrm{i} \eta x} \mathbf{U} f) \\ & = \Pi_- (\mathrm{e}^{\mathrm{i} \eta x} \Pi_-(\mathbf{U} f)) = \Pi_- (\mathrm{e}^{\mathrm{i} \eta x} H_{\mathbf{U}} f) = 0 \,. \end{align*} $$

Thus we find $S(\eta ) f \in \mathfrak {H}_0$ for any $f \in \mathfrak {H}_0$ . This proves that $S(\eta ) \mathfrak {H}_0 \subset \mathfrak {H}_0$ .

Since $\mathfrak {H}_1 = \mathfrak {H}_0^\perp $ and $S(\eta )$ leaves $\mathfrak {H}_0$ invariant, we directly see by duality that $S(\eta )^* f \in \mathfrak {H}_1$ for any $f \in \mathfrak {H}_1$ and $\eta \geq 0$ , with the adjoint Lax–Beurling semigroup $\{ S(\eta )^* \}_{\eta \geq 0}$ acting on $L^2_+(\mathbb {R}; \mathcal {V})$ .

Remark. As a direct consequence of the well-known Lax–Beurling theorem (see the version in [Reference Lax25] for a direct application to our setting) about invariant subspaces of $S(\eta )$ , we can deduce the following fact: If $\mathfrak {H}_0 = \mathrm {ker}(K_{\mathbf {U}}) \neq \{ 0 \}$ is nontrivial, there exist a subspace $\mathcal {V}' \subseteq \mathcal {V}$ and a function

$$ \begin{align*}\Theta \in L^\infty_+(\mathbb{R}; \mathrm{End}(\mathcal{V}'; \mathcal{V})) \quad \text{with} \quad \Theta(x)^* \Theta(x) = \mathrm{Id}_{\mathcal{V}'} \quad \text{for a. e. } x \in \mathbb{R} \end{align*} $$

such that

$$ \begin{align*}\mathfrak{H}_0 = \Theta L^2_+(\mathbb{R}; \mathcal{V}') \quad \text{and} \quad \mathfrak{H}_1 = (\Theta L^2_+(\mathbb{R}; \mathcal{V}'))^\perp \,. \end{align*} $$

The matrix-valued function $\Theta $ is called a (left) inner function and the subspace $\mathfrak {H}_1$ is thus the model space generated by $\Theta $ . However, we will not exploit this fact in the present paper.

Spectral properties for rational data

Recall that $\mathfrak {H}_1= \overline {\mathrm {ran}(K_{\mathbf {U}})}$ . We have the following characterization when the subspace $\mathfrak {H}_1$ is finite-dimensional, corresponding to the fact that the compact operator $K_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ has finite rank.

Lemma 4.3 (Kronecker-type theorem).

Let $\mathbf {U} \in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ be of the form (4.5) with $\mathbf {U}(x) = \mathbf {U}(x)^*$ for a.e. $x \in \mathbb {R}$ . Then $K_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R};\mathcal {V})$ has finite rank (i.e., we have $\dim \mathfrak {H}_1 < \infty $ ) if and only if $\mathbf {U}$ is a rational function.

Remark 4.1. Since $K_{\mathbf {U}}=H_{\mathbf {U}}^* H_{\mathbf {U}}= \mathrm {Id}-T_{\mathbf {U}}^2$ is a Lax operator for (HWM $_d$ ), we see that rationality is preserved along the flow. For (HWM) with target $\mathbb {S}^2 \cong \mathsf {Gr}_1(\mathbb {C}^2)$ , this feature was already observed in [Reference Gérard and Lenzmann14].
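To illustrate the statement in the simplest nontrivial case, suppose that $\mathbf{V}(x) = \frac{A}{x+\mathrm{i}} + \frac{A^*}{x-\mathrm{i}}$ for some constant matrix $A \in M_d(\mathbb{C})$ , so that $\mathbf{U} = \mathbf{U}_\infty + \mathbf{V}$ is Hermitian-valued with rational potential part having a single pair of poles. Then, for $f \in L^2_+(\mathbb{R}; \mathcal{V})$ , we find

$$ \begin{align*} H_{\mathbf{U}} f = H_{\mathbf{V}} f = \Pi_- \Big( \frac{A^*.f}{x - \mathrm{i}} \Big) = \frac{A^*.f(\mathrm{i})}{x-\mathrm{i}} \, , \end{align*} $$

where $f(\mathrm{i})$ denotes the value at $z = \mathrm{i}$ of the holomorphic extension of $f$ to $\mathbb{C}_+$ ; here we used that $\frac{f}{x+\mathrm{i}}$ and $\frac{f - f(\mathrm{i})}{x-\mathrm{i}}$ belong to $L^2_+(\mathbb{R}; \mathcal{V})$ , whereas $\frac{1}{x-\mathrm{i}} \in L^2_-(\mathbb{R}; \mathbb{C})$ . In particular, $\mathrm{rank}(H_{\mathbf{U}}) \leq \dim \mathcal{V} < \infty$ , in accordance with Lemma 4.3.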

Proof. Since $\dim \mathrm {ran}(H_{\mathbf {U}}^* H_{\mathbf {U}}) = \dim \mathrm {ran}(H_{\mathbf {U}})$ , it suffices to consider the Hankel operator $H_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_-(\mathbb {R}; \mathcal {V})$ . Furthermore, since $\mathbf {U}_\infty \in M_d(\mathbb {C})$ is constant, we see that $H_{\mathbf {U}_\infty }=0$ and hence $H_{\mathbf {U}}= H_{\mathbf {U}_\infty + \mathbf {V}} = H_{\mathbf {V}}$ . Thus it remains to discuss $H_{\mathbf {V}}$ with the matrix-valued symbol $\mathbf {V} \in (H^{\frac 1 2} \cap L^\infty )(\mathbb {R}; M_d(\mathbb {C}))$ for the rest of the proof.

We first recall the following general Kronecker-type theorem valid for Hankel operators acting on the Hardy space $L_+^2(\mathbb {T}; \mathcal {H})$ on the torus $\mathbb {T} \cong \partial \mathbb {D}$ , where $\mathcal {H}$ is a given separable complex Hilbert space (not necessarily finite-dimensional). Correspondingly, we use $\mathbb {P}_+$ and $\mathbb {P}_- = \mathrm {Id} - \mathbb {P}_+$ to denote the Cauchy–Szegő projections on $L^2(\mathbb {T}; \mathcal {H})$ ; see [Reference Peller32] for a general background. As usual, we use $\mathcal {B}(\mathcal {H}, \mathcal {K})$ to denote the Banach space of bounded linear operators from $\mathcal {H}$ to another complex Hilbert space $\mathcal {K}$ . From [Reference Peller32, Chapter 2, Theorem 5.3] we directly deduce the following result.

Theorem 4.1 (Kronecker’s theorem on $L^2_+(\mathbb {T}; \mathcal {H})$ ).

Let $\mathcal {H}, \mathcal {K}$ be separable complex Hilbert spaces and assume $\Phi \in L^\infty (\mathbb {T}; \mathcal {B}(\mathcal {H}, \mathcal {K}))$ . Define the Hankel operator $H_\Phi : L^2_+(\mathbb {T}; \mathcal {H}) \to L^2_-(\mathbb {T}; \mathcal {K})$ by $H_\Phi f = \mathbb {P}_-(\Phi f)$ . Then $\mathrm {rank} \, H_\Phi < \infty $ if and only if $\mathbb {P}_- \Phi : \mathbb {T} \to \mathcal {B}(\mathcal {H}, \mathcal {K})$ is a rational map of the form

$$ \begin{align*}\mathbb{P}_- \Phi = \sum_{\lambda \in \Lambda} \sum_{n=1}^{k(\lambda)} \frac{T_{\lambda, n}}{(z- \lambda)^n} \, , \end{align*} $$

where $\Lambda $ is a finite subset of $\mathbb {D}$ , the $k(\lambda )$ are positive integers, and $T_{\lambda , n} \in \mathcal {B}(\mathcal {H}, \mathcal {K}) \setminus \{ 0 \}$ are finite-rank operators for $\lambda \in \Lambda $ and $1 \leq n \leq k(\lambda )$ .

Let us now take the finite-dimensional spaces $\mathcal {H}=\mathcal {K} = \mathcal {V}$ in the previous result with either $\mathcal {V} = \mathbb {C}^d$ or $\mathcal {V} = M_d(\mathbb {C})$ . For any $\Phi \in L^\infty (\mathbb {T}; M_d(\mathbb {C}))$ given, we deduce the equivalence

$$ \begin{align*}H_{\Phi} = \mathbb{P}_- \Phi \mathbb{P}_+ \text{ has finite rank if and only if } \mathbb{P}_- \Phi \in L^\infty(\mathbb{T}; M_d(\mathbb{C})) \text{ is rational.} \end{align*} $$

Now using the standard conformal map $\omega : \mathbb {D} \to \mathbb {C}_+$ with $\omega (\zeta ) = \mathrm {i} \frac {1 + \zeta }{1-\zeta }$ , let us define the map

$$ \begin{align*}(\mathcal{U} f)(x) = \frac{1}{\sqrt{\pi}} \frac{(f \circ \omega^{-1})(x)}{x+\mathrm{i}} = \frac{1}{\sqrt{\pi}} \frac{1}{x+ \mathrm{i}} f \left ( \frac{x-\mathrm{i}}{x+\mathrm{i}} \right ) \quad \text{for } f \in L^2(\mathbb{T}; \mathcal{V}), \end{align*} $$

which is known to be a unitary operator from $L^2(\mathbb {T}; \mathcal {V})$ to $L^2(\mathbb {R}; \mathcal {V})$ with the property that $\mathcal {U}(L^2_+(\mathbb {T}; \mathcal {V})) = L^2_+(\mathbb {R}; \mathcal {V})$ ; see [Reference Peller32, Appendix 2.1]. We easily verify that

$$ \begin{align*}H_{\Phi} = \mathcal{U}^* H_{\mathbf{V}} \mathcal{U} \quad \text{with } \Phi = \mathbf{V} \circ \omega, \end{align*} $$

see, for example, [Reference Peller32, Chapter 1, Lemma 8.3]. Since compositions with $\omega $ and $\omega ^{-1}$ preserve rationality and in view of the identity $\Pi _- \mathbf {V} = (\mathbb {P}_- ( \mathbf {V} \circ \omega )) \circ \omega ^{-1}$ , we deduce

$$ \begin{align*}H_{\mathbf{V}} \text{ has finite rank if and only if } \Pi_- \mathbf{V} : \mathbb{R} \to M_d(\mathbb{C}) \text{ is rational.} \end{align*} $$

Finally, by recalling that $\Pi _+ \mathbf {V} = (\Pi _- \mathbf {V})^*$ , we conclude that $\mathbf {V} = (\Pi _- \mathbf {V})^* + \Pi _- \mathbf {V}$ is a rational function if and only if $H_{\mathbf {V}}$ has finite rank.

We now show that, for rational matrix-valued symbols $\mathbf {U}$ , the subspace $\mathfrak {H}_1$ is also an invariant subspace for the unbounded operator $X^*$ , which is the generator of the adjoint Lax–Beurling semigroup $\{ S(\eta )^* \}_{\eta \geq 0}=\{ \mathrm {e}^{-\mathrm {i} \eta X^*} \}_{\eta \geq 0}$ .

Proposition 4.3. If $\mathbf {U} \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ , then $\mathfrak {H}_1 \subset \mathrm {dom}(X^*)$ and $X^*(\mathfrak {H}_1) \subset \mathfrak {H}_1$ .

Proof. By Lemma 4.2, we recall that the adjoint Lax–Beurling semigroup $S(\eta )^*$ acts invariantly on $\mathfrak {H}_1$ . Moreover, by Lemma 4.3, we know that $\dim \mathfrak {H}_1 < \infty $ . By standard arguments from semigroup theory, it follows that the generator $X^*$ restricted to the finite-dimensional invariant subspace $\mathfrak {H}_1$ is bounded, and thus its domain $\mathrm {dom} (X^*|_{\mathfrak {H}_1})$ is all of $\mathfrak {H}_1$ . In particular, we have $\mathfrak {H}_1 \subset \mathrm {dom} (X^*)$ with $X^*(\mathfrak {H}_1) \subset \mathfrak {H}_1$ .

5. Local well-posedness and explicit flow formula

In this section, we derive the explicit flow formula valid for (HWM $_d$ ) for sufficiently smooth solutions. In fact, this formula will play an essential rôle for obtaining the main results of this paper. Let us also remark that similar explicit flow formulae have been recently derived for other completely integrable equations which feature a Lax pair structure on Hardy spaces such as the cubic Szegő equation [Reference Gérard and Grellier13], the Benjamin–Ono equation [Reference Gérard10] and the Calogero–Moser derivative NLS [Reference Gérard and Lenzmann15, Reference Badreddine2, Reference Killip, Laurens and Visan21].

Local well-posedness for sufficiently regular data

We start with a result on local well-posedness for the matrix-valued (HWM $_d$ ) for sufficiently regular initial data of the form

(5.1) $$ \begin{align} \mathbf{U}_0(x) = \mathbf{U}_\infty + \mathbf{V}_0(x) \in M_d(\mathbb{C}) \oplus H^s(\mathbb{R}; M_d(\mathbb{C})) \, , \end{align} $$

satisfying the constraints

(5.2) $$ \begin{align} \mathbf{U}_0(x) = \mathbf{U}_0(x)^* \quad \text{and} \quad \mathbf{U}_0(x)^2 = \mathds{1} \quad \text{for all } x \in \mathbb{R} \,. \end{align} $$

In what follows, we will always assume that

$$ \begin{align*}s> \frac{3}{2} \,. \end{align*} $$

In particular, the initial datum $\mathbf {U}_0 : \mathbb {R} \to M_d(\mathbb {C})$ is of class $C^1$ by Sobolev embeddings. In view of (5.2), we easily conclude that $\mathrm {Tr}(\mathbf {U}_0(x))$ can only attain integer values, whence it follows $\mathrm {Tr}(\mathbf {U}_0(x)) = \text {const}.$ on $\mathbb {R}$ by continuity.Footnote 4 As a consequence, we deduce that there exists some integer $0 \leq k \leq d$ such that

$$ \begin{align*}\mathbf{U}_0(x) \in \mathsf{Gr}_k(\mathbb{C}^d) \quad \text{for } x \in \mathbb{R}. \end{align*} $$

We have the following result.

Lemma 5.1. Let $s> \frac {3}{2}$ , $d \geq 2$ , and assume $\mathbf {U}_0 : \mathbb {R} \to M_d(\mathbb {C})$ satisfies (5.1) and (5.2). Then, for any $R> 0$ , there exists some $T=T(R)>0$ such that for every $\mathbf {U}_0=\mathbf {U}_\infty + \mathbf {V}_0$ as above with $\| \mathbf {V}_0 \|_{H^s} < R$ , there exists a unique solution of (HWM d ) of the form

$$ \begin{align*}\mathbf{U}(t) = \mathbf{U}_\infty + \mathbf{V}(t) \in M_d(\mathbb{C}) \oplus C([0,T]; H^s(\mathbb{R};M_d(\mathbb{C}))) \end{align*} $$

and we have $\mathbf {U}(t,x) \in \mathsf {Gr}_k(\mathbb {C}^d)$ for all $t \in [0,T]$ and $x \in \mathbb {R}$ with some integer $0 \leq k \leq d$ .

Furthermore, the $H^\sigma $ -regularity of $\mathbf {V}_0$ with $\sigma> s$ is propagated on the whole maximal time-interval of existence of $\mathbf {U}(t)$ , and the flow map $\mathbf {V}_0 \mapsto \mathbf {V}(t)$ is continuous in the $H^\sigma $ -topology.

Remark. For proving the above local well-posedness result, the Hermitian constraint in (5.2) is the relevant one. However, the second constraint in (5.2) will be essential to obtain a global well-posedness result below based on the Lax pair structure, which involves the use of both pointwise constraints stated in (5.2).

Proof. We postpone the detailed proof of Lemma 5.1 to Appendix D.

Explicit flow formula

Inspired by the very recent work [Reference Gérard10] on the Benjamin–Ono equation, we next derive an explicit flow formula for solutions of (HWM d ) based on its Lax pair structure acting on the Hardy space. Note that, in this formula, we choose the vector space $\mathcal {V} = M_d(\mathbb {C})$ for the Hardy space $L^2_+(\mathbb {R}; \mathcal {V})$ .

Lemma 5.2 (Explicit Flow Formula).

Let $s> \frac {3}{2}$ , $d \geq 2$ , and $\mathbf {U}(t) = \mathbf {U}_\infty + \mathbf {V}(t) \in M_d(\mathbb {C}) \oplus C([0,T]; H^s(\mathbb {R}, M_d(\mathbb {C})))$ be as in Lemma 5.1 above. Then it holds that

$$ \begin{align*}\Pi_+ \mathbf{V}(t,z) = \frac{1}{2 \pi \mathrm{i}} I_+ \left [ (X^*+tT_{\mathbf{U}_0} - z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}_0 \right ] \quad \text{for } z \in \mathbb{C}_+ \text{ and } t \in [0,T]. \end{align*} $$

Here $T_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ denotes the Toeplitz operator $T_{\mathbf {U}_0} f = \Pi _+(\mathbf {U}_0 f)$ with $\mathcal {V} = M_d(\mathbb {C})$ .
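Note that at $t=0$ , the formula reduces to identity (2.4) applied to $f = \Pi _+ \mathbf {V}_0$ , consistent with $\mathbf {V}(0) = \mathbf {V}_0$ .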

Before we turn to the proof of Lemma 5.2, we need some commutator identities as follows. Recall that

$$ \begin{align*}B_{\mathbf{U}} = -\frac{\mathrm{i}}{2} ( \mu_{\mathbf{U}} |D| + |D| \mu_{\mathbf{U}} ) + \frac{\mathrm{i}}{2} \mu_{|D| \mathbf{U}} \,. \end{align*} $$

In fact, since we can restrict to the Hardy space $L^2_+(\mathbb {R}; \mathcal {V})$ , it will be convenient to work with the compression of $B_{\mathbf {U}}$ to the Hardy space $L^2_+(\mathbb {R}; \mathcal {V})$ denoted by

$$ \begin{align*}B^+_{\mathbf{U}} := \Pi_+ B_{\mathbf{U}} \Pi_+ = -\frac{\mathrm{i}}{2} (T_{\mathbf{U}} D + D T_{\mathbf{U}}) + \frac{\mathrm{i}}{2} T_{|D| \mathbf{U}} \quad \text{with } D=-\mathrm{i} \partial_x \,. \end{align*} $$

Note that $D \geq 0$ on $L^2_+(\mathbb {R}; \mathcal {V})$ with its operator domain $\mathrm {dom}(D) = H^1_+(\mathbb {R}; \mathcal {V})$ . The Lax equation for $T_{\mathbf {U}(t)} : L^2_+(\mathbb {R}, \mathcal {V}) \to L^2_+(\mathbb {R}, \mathcal {V})$ can thus be written as

$$ \begin{align*}\frac{d}{dt} T_{\mathbf{U}(t)} = [B_{\mathbf{U}(t)}^+, T_{\mathbf{U}(t)} ] \,. \end{align*} $$

We have the following key commutator identity.

Proposition 5.1. For any $f \in \mathrm {dom}(X^*) \cap H^1_+(\mathbb {R}; \mathcal {V})$ , it holds that

$$ \begin{align*}[X^*, B^+_{\mathbf{U}}] f = T_{\mathbf{U}} f \,. \end{align*} $$

Proof. Using the fact that $[X^*, D] = \mathrm {i} \, \mathrm {Id}$ and by Lemma 3.2, we calculate

$$ \begin{align*} [X^*, T_{\mathbf{U}} D + D T_{\mathbf{U}}] f & = [X^*, T_{\mathbf{U}}] Df + T_{\mathbf{U}} [X^*,D] f + [X^*,D] T_{\mathbf{U}} f + D [X^*,T_{\mathbf{U}}] f \\ & = \frac{\mathrm{i}}{2 \pi} \Pi_+ \mathbf{V}.I_+(Df) + \mathrm{i} T_{\mathbf{U}} f + \mathrm{i} T_{\mathbf{U}} f + \frac{\mathrm{i}}{2 \pi} D (\Pi_+ \mathbf{V}.I_+(f)) \\ & = 2 \mathrm{i} T_{\mathbf{U}} f + \frac{\mathrm{i}}{2 \pi} \Pi_+(D \mathbf{V}).I_+(f) \, , \end{align*} $$

where we also used that $I_+(Df)=0$ holds. By applying Lemma 3.2 once again,

$$ \begin{align*}[X^*,T_{|D| \mathbf{U}}] f = \frac{\mathrm{i}}{2 \pi} \Pi_+(|D| \mathbf{V}).I_+(f) = \frac{\mathrm{i}}{2 \pi} \Pi_+(D \mathbf{V}).I_+(f) \,. \end{align*} $$

In view of these identities, we conclude that

$$ \begin{align*} [X^*, B^+_{\mathbf{U}}] f = -\frac{\mathrm{i}}{2} \Big( 2 \mathrm{i} \, T_{\mathbf{U}} f + \frac{\mathrm{i}}{2 \pi} \Pi_+(D \mathbf{V}).I_+(f) \Big) + \frac{\mathrm{i}}{2} \cdot \frac{\mathrm{i}}{2 \pi} \Pi_+(D \mathbf{V}).I_+(f) = T_{\mathbf{U}} f \, , \end{align*} $$

which is the claimed identity.

We are now ready to turn to the proof of Lemma 5.2.

Proof of Lemma 5.2 (Explicit Flow Formula).

We divide the proof into the following steps. We remind the reader that we take $\mathcal {V} = M_d(\mathbb {C})$ in the following.

Step 1. Recalling identity (2.4), we write

$$ \begin{align*}\Pi_+ \mathbf{V}(t,z) = \frac{1}{2 \pi \mathrm{i}} I_+((X^*-z \mathrm{Id})^{-1} \Pi_+\mathbf{V}(t)) \quad \text{for } z \in \mathbb{C}_+ \text{ and } t \in [0,T] \,. \end{align*} $$

Let $F \in M_d(\mathbb {C})$ and $z \in \mathbb {C}_+$ be fixed from now on. We find

$$ \begin{align*} \langle \Pi_+ \mathbf{V}(t,z), F \rangle_{\mathcal{V}} & = \frac{1}{2 \pi \mathrm{i}} \langle I_+((X^*-z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}(t)), F \rangle_{\mathcal{V}} \\ &= \frac{1}{2 \pi \mathrm{i}} \lim_{\varepsilon \to 0} \left \langle (X^*-z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}(t), F \chi_\varepsilon \right \rangle \\ & = \frac{1}{2 \pi \mathrm{i}} \lim_{\varepsilon \to 0} \left \langle \mathcal{U}(t)^* (X^*- z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}(t), \mathcal{U}(t)^* (F \chi_\varepsilon) \right \rangle \, , \end{align*} $$

where we also use that $\mathcal {U}(t)^* : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ is a unitary map for any $t \in [0,T]$ , which is given by the solution of the initial-value problem

$$ \begin{align*}\frac{d}{dt} \mathcal{U}(t) = B_{\mathbf{U}(t)}^+ \mathcal{U}(t) \quad \text{for } t \in [0,T], \quad \mathcal{U}(0) = \mathrm{Id} \,. \end{align*} $$

See Appendix D for details. Using the identity

$$ \begin{align*}\mathcal{U}(t)^* (X^*-z \mathrm{Id})^{-1} = (\mathcal{U}(t)^* X^* \mathcal{U}(t) - z \mathrm{Id})^{-1} \mathcal{U}(t)^* \, , \end{align*} $$

we conclude

$$ \begin{align*}\langle \Pi_+ \mathbf{V}(t,z), F \rangle_{\mathcal{V}} = \frac{1}{2 \pi \mathrm{i}} \lim_{\varepsilon \to 0} \left \langle (\mathcal{U}(t)^* X^* \mathcal{U}(t) - z \mathrm{Id})^{-1} \mathcal{U}(t)^* (\Pi_+ \mathbf{V}(t)), \mathcal{U}(t)^*(F \chi_\varepsilon) \right \rangle \end{align*} $$

for any $z \in \mathbb {C}_+$ , $t \in [0,T]$ and $F \in M_d(\mathbb {C})$ .

Step 2. We will now discuss the individual terms which appear in the expression derived in Step 1 above. First, we notice that

$$ \begin{align*}\frac{d}{dt} \mathcal{U}(t)^* X^* \mathcal{U}(t) = \mathcal{U}(t)^* [X^*, B_{\mathbf{U}(t)}^+] \mathcal{U}(t) = \mathcal{U}(t)^* T_{\mathbf{U}(t)} \mathcal{U}(t) = T_{\mathbf{U}_0} \, , \end{align*} $$

where we used Proposition 5.1 together with the fact that $T_{\mathbf {U}(t)} = \mathcal {U}(t) T_{\mathbf {U}_0} \mathcal {U}(t)^*$ holds thanks to the Lax evolution. By integration on the interval $[0,t]$ , we get

(5.3) $$ \begin{align} \mathcal{U}(t)^* X^* \mathcal{U}(t) = X^* + t T_{\mathbf{U}_0} \,. \end{align} $$

Next, we observe

$$ \begin{align*}\frac{d}{dt} ( \mathcal{U}(t)^* (F \chi_\varepsilon)) = -\mathcal{U}(t)^*(B_{\mathbf{U}(t)}^+ (F \chi_\varepsilon)) = o(1) \, , \end{align*} $$

where $o(1) \to 0$ in $L^2$ as $\varepsilon \to 0$ uniformly with respect to $t \in [0,T]$ . To see this, we remark

$$ \begin{align*} B_{\mathbf{U}}^+ (F \chi_\varepsilon) & = -\frac{\mathrm{i}}{2} T_{\mathbf{U}}(F D\chi_\varepsilon) - \frac{\mathrm{i}}{2} D(T_{\mathbf{U}} (F \chi_\varepsilon)) + \frac{\mathrm{i}}{2} T_{|D| \mathbf{U}}(F \chi_\varepsilon) \\ & \to -\frac{\mathrm{i}}{2} \Pi_+(D \mathbf{U}). F +\frac{\mathrm{i}}{2} \Pi_+(D\mathbf{U}). F = 0 \quad \text{as } \varepsilon \to 0 \end{align*} $$

in $L^2(\mathbb {R}; \mathcal {V})$ uniformly in $t \in [0,T]$ . Therefore, by integrating in t, we conclude

(5.4) $$ \begin{align} \mathcal{U}(t)^*(F \chi_\varepsilon) = F \chi_\varepsilon + o(1) \end{align} $$

with $o(1) \to 0$ in $L^2_+(\mathbb {R}, \mathcal {V})$ as $\varepsilon \to 0$ uniformly in $t \in [0,T]$ .

It remains to discuss the last term from Step 1. Here we claim that

(5.5) $$ \begin{align} \mathcal{U}(t)^* (\Pi_+ \mathbf{V}(t)) = \Pi_+ \mathbf{V}_0 \,. \end{align} $$

Since $\mathcal {U}(0)^* = \mathrm {Id}$ , we need to show that the time derivative of the left-hand side vanishes. Indeed, we note

$$ \begin{align*}\frac{d}{dt} \left ( \mathcal{U}(t)^*(\Pi_+ \mathbf{V}(t)) \right ) = \mathcal{U}(t)^* \left ( -B_{\mathbf{U}(t)}^+ \Pi_+ \mathbf{V}(t) + \partial_t \Pi_+ \mathbf{V}(t) \right ) \,. \end{align*} $$

Now, by the Lax equation $\frac {d}{dt} T_{\mathbf {U}(t)} = [B_{\mathbf {U}(t)}^+, T_{\mathbf {U}(t)}]$ and if we let $E$ denote the identity matrix in $M_d(\mathbb {C})$ , we find

$$ \begin{align*}\frac{d}{dt} T_{\mathbf{U}(t)} (E \chi_\varepsilon) = B_{\mathbf{U}(t)}^+ T_{\mathbf{U}(t)} (E \chi_\varepsilon) - T_{\mathbf{U}(t)} B_{\mathbf{U}(t)}^+ (E \chi_\varepsilon) \end{align*} $$

For the first term on the right-hand side, we observe

$$ \begin{align*}B_{\mathbf{U}(t)}^+ T_{\mathbf{U}(t)} (E \chi_\varepsilon) \to B_{\mathbf{U}(t)}^+ (\Pi_+ \mathbf{V}(t)) \quad \text{in } L^2_+ \text{ as } \varepsilon \to 0 \end{align*} $$

uniformly in $t \in [0,T]$ . Furthermore, in the same way as in the discussion showing that $B_{\mathbf {U}(t)}^+ (F \chi _\varepsilon ) \to 0$ as $\varepsilon \to 0$ for any constant matrix $F \in M_d(\mathbb {C})$ , we conclude

$$ \begin{align*}T_{\mathbf{U}(t)} B_{\mathbf{U}(t)}^+(E \chi_\varepsilon) \to 0 \quad \text{in } L^2_+ \text{ as } \varepsilon \to 0 \end{align*} $$

uniformly in t. On the other hand, we have

$$ \begin{align*}\frac{d}{dt} T_{\mathbf{U}(t)} (E \chi_\varepsilon) \to \partial_t \Pi_+ \mathbf{V}(t) \quad \text{in } L^2_+ \text{ as } \varepsilon \to 0 \end{align*} $$

uniformly in $t \in [0,T]$ . In summary, we infer that $\partial _t \Pi _+ \mathbf {V}(t) = B_{\mathbf {U}(t)}^+ \Pi _+ \mathbf {V}(t)$ holds, whence it follows

$$ \begin{align*}\frac{d}{dt} \left ( \mathcal{U}(t)^* (\Pi_+ \mathbf{V}(t)) \right ) = \mathcal{U}(t)^* (-B_{\mathbf{U}(t)}^+ \Pi_+ \mathbf{V}(t) + \partial_t \Pi_+ \mathbf{V}(t) ) = 0 \,. \end{align*} $$

This completes the proof of (5.5).

Step 3. Combining the results from Step 1 and Step 2 above, we conclude, for any $F \in M_d(\mathbb {C})$ and $z \in \mathbb {C}_+$ , that

$$ \begin{align*} \langle \Pi_+ \mathbf{V}(t,z), F \rangle_{\mathcal{V}} & = \frac{1}{2 \pi \mathrm{i}} \lim_{\varepsilon \to 0} \left \langle (\mathcal{U}(t)^* X^* \mathcal{U}(t) - z \mathrm{Id})^{-1} \mathcal{U}(t)^* (\Pi_+ \mathbf{V}(t)), \mathcal{U}(t)^*(F \chi_\varepsilon) \right \rangle \\ & = \frac{1}{2 \pi \mathrm{i}} \lim_{\varepsilon \to 0} \left \langle (X^*+ t T_{\mathbf{U}_0} - z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}_0, F \chi_\varepsilon \right \rangle \\ & = \frac{1}{2 \pi \mathrm{i}}\left \langle I_+[(X^*+tT_{\mathbf{U}_0} - z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}_0], F \right \rangle_{\mathcal{V}} \,. \end{align*} $$

Since $F \in M_d(\mathbb {C})$ is arbitrary, we deduce the claimed formula for $\Pi _+ \mathbf {V}(t,z) \in \mathcal {V}$ .

The proof of Lemma 5.2 is now complete.

6. Global well-posedness for rational data

We are now ready to prove global well-posedness for (HWM $_d$ ) with rational initial data

$$ \begin{align*}\mathbf{U}_0 \in \mathcal{R}at(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) \end{align*} $$

for any $d \geq 2$ and $0 \leq k \leq d$ . The main argument rests on exploiting the explicit flow formula derived above. First, we start with the following general result, which in fact does not require rational initial data.

Lemma 6.1. Let $d \geq 2$ be an integer. Suppose $\mathbf {W} \in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ has the following properties

$$ \begin{align*}\mathbf{W}(x) = \mathbf{W}(x)^* \ \ \text{a. e.}, \quad \mathbf{W}(x) = \mathbf{W}_\infty + \mathbf{V}_0(x) \in M_d(\mathbb{C}) \oplus L^2(\mathbb{R}; M_d(\mathbb{C})) \,. \end{align*} $$

Then $X^*+ T_{\mathbf {W}}$ acting on $L^2_+(\mathbb {R}; M_d(\mathbb {C}))$ has no real eigenvalues, that is, its point spectrum satisfies $\sigma _{\mathrm {p}}(X^*+ T_{\mathbf {W}}) \cap \mathbb {R} = \emptyset $ .

Proof. Let $x \in \mathbb {R}$ . Since $X^*$ is closed, we find that $\mathcal {E} := \mathrm {ker}(X^*+T_{\mathbf {W}} - x\mathrm {Id})$ is a closed subspace in $L^2_+(\mathbb {R}; M_d(\mathbb {C}))$ ; see also Section 2 for general properties of $X^*$ . Moreover, from the eigenvalue equation

$$ \begin{align*}(X^*+ T_{\mathbf{W}} - x) f = 0 \end{align*} $$

we see that $\mathcal {E} \subset \mathrm {dom}(X^*)$ . By taking the imaginary part of the inner product with $f$ and using that $T_{\mathbf {W}}^* = T_{\mathbf {W}}$ is self-adjoint and that $x$ is a real number, we conclude that $\mathrm {Im} \langle X^*f, f \rangle = 0$ . Recalling the identity (2.3), we deduce

$$ \begin{align*}I_+(f) = 0 \quad \text{for } f \in \mathcal{E} \,. \end{align*} $$

In view of Lemma 3.2, we also notice

$$ \begin{align*}[X^*, T_{\mathbf{W}}] f = \frac{\mathrm{i}}{2 \pi} \Pi_+ \mathbf{V}_0.I_+(f) = 0 \quad \text{for } f \in \mathcal{E} \, , \end{align*} $$

which shows that $X^* f \in \mathcal {E}$ for all $f \in \mathcal {E}$ . Thus $\mathcal {E}$ is an invariant subspace for $X^*$ . For the semigroup $\{ S(\eta )^* \}_{\eta \geq 0}$ generated by $X^*$ , we thus deduce

$$ \begin{align*}S(\eta)^* f = e^{-\mathrm{i} \eta X^*} f \in \mathcal{E} \quad \text{for all } f \in \mathcal{E} \text{ and all } \eta \geq 0 \,. \end{align*} $$

But this implies that, for every $f \in \mathcal {E}$ ,

$$ \begin{align*}0 = I_+(S(\eta)^* f) = \widehat{f}(\eta) \quad \text{for all } \eta \geq 0. \end{align*} $$
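Here we used that $\widehat{S(\eta )^* f}(\xi ) = \widehat{f}(\xi + \eta )$ for $\xi \geq 0$ , so that $I_+(S(\eta )^* f) = \widehat{f}(\eta )$ , together with the fact, established above, that $I_+$ vanishes on $\mathcal {E}$ .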

Hence we see that $f = 0$ for all $f \in \mathcal {E}$ . Therefore, the subspace

$$ \begin{align*}\mathcal{E} = \mathrm{ker}(X^*+T_{\mathbf{W}} - x\mathrm{Id} ) = \{ 0 \} \end{align*} $$

is trivial for any $x \in \mathbb {R}$ .

Proof of Theorem 1.5

We are now ready to prove global well-posedness for (HWM $_d$ ) for rational initial data

$$ \begin{align*}\mathbf{U}_0 \in \mathcal{R}at(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) \end{align*} $$

where $d \geq 2$ and $0 \leq k \leq d$ are given integers. We note that

$$ \begin{align*}\mathbf{U}_0 = \mathbf{U}_\infty + \mathbf{V}_0 \in \mathsf{Gr}_k(\mathbb{C}^d) \oplus H^\infty(\mathbb{R}; M_d(\mathbb{C})) \end{align*} $$

holds. Hence, by the local well-posedness result from Lemma 5.1, there exists a unique maximal solution

$$ \begin{align*}\mathbf{U}(t) = \mathbf{U}_\infty + \mathbf{V}(t) \in C([0,T_{\max}); \mathsf{Gr}_k(\mathbb{C}^d) \oplus H^\infty(\mathbb{R}; M_d(\mathbb{C}))) \end{align*} $$

of (HWM $_d$ ) with initial datum $\mathbf {U}(0) = \mathbf {U}_0$ and maximal (forward) time of existence $T_{\max } \in (0, +\infty ]$ such that the following implication holds:

(6.1) $$ \begin{align} T_{\max} < +\infty \quad \Rightarrow \quad \lim_{t \nearrow T_{\max}} \| \mathbf{V}(t) \|_{H^2} = +\infty \,. \end{align} $$

Thus, to show that $T_{\max } = +\infty $ holds true, we argue by contradiction and suppose that $T_{\max } < +\infty $ . We now claim that

(6.2) $$ \begin{align} \sup_{t \in [0,T_{\max})} \| \mathbf{V}(t) \|_{H^2} < +\infty \, , \end{align} $$

which implies that $T_{\max } = +\infty $ must hold by (6.1). To prove (6.2), we first note that $\mathbf {V}(t,x)=\mathbf {V}(t,x)^*$ for $t \in [0,T_{\max })$ and $x \in \mathbb {R}$ . Therefore $\mathbf {V}(t) = \Pi _+ \mathbf {V}(t) + (\Pi _+ \mathbf {V}(t))^*$ and hence it suffices to show that

(6.3) $$ \begin{align} \sup_{t \in [0,T_{\max})} \| \Pi_+ \mathbf{V}(t) \|_{H^2} < +\infty \,. \end{align} $$

In view of the explicit flow formula in Lemma 5.2, we define

(6.4) $$ \begin{align} \mathrm{EF}[\mathbf{U}_0](t,z) := \frac{1}{2 \pi \mathrm{i}} I_+ \left [( X^*+ t T_{\mathbf{U}_0} -z \mathrm{Id})^{-1} \Pi_+ \mathbf{V}_0 \right ] \quad \text{for } t \geq 0 \text{ and } z \in \overline{\mathbb{C}}_+ \, , \end{align} $$

where $T_{\mathbf{U}_0} : L^2_+(\mathbb{R}; M_d(\mathbb{C})) \to L^2_+(\mathbb{R}; M_d(\mathbb{C}))$ denotes the Toeplitz operator with symbol $\mathbf{U}_0$ acting on matrix-valued Hardy functions. Let us check that $\mathrm{EF}[\mathbf{U}_0]$ is indeed well-defined for all $t \geq 0$ and $z \in \overline{\mathbb{C}}_+$. By Lemma 4.3 (Kronecker-type theorem), the subspace $\mathfrak{H}_1 = \overline{\mathrm{ran}(K_{\mathbf{U}_0})} \subset \mathrm{dom}(X^*)$ is finite-dimensional. By Propositions 4.2 and 4.3, we deduce that $\Pi_+ \mathbf{V}_0 \in \mathfrak{H}_1$ and that

$$ \begin{align*}M(t) := X^* + t T_{\mathbf{U}_0} : \mathfrak{H}_1 \to \mathfrak{H}_1 \end{align*} $$

is an endomorphism on the finite-dimensional subspace $\mathfrak{H}_1$. Moreover, by Lemma 6.1 with $\mathbf{W} = t \mathbf{U}_0$, we see that the eigenvalues of $M(t)$ cannot be real, that is, $\sigma(M(t)) \cap \mathbb{R} = \emptyset$ holds. Since the eigenvalues of $M(t)$ depend continuously on $t \geq 0$ and since $\sigma(M(0)) = \sigma(X^*|_{\mathfrak{H}_1}) \subset \mathbb{C}_-$, this implies that

$$ \begin{align*}\sigma(M(t)) \subset \mathbb{C}_- \quad \text{for all } t \geq 0 \,. \end{align*} $$

Hence the resolvent $(X^* + t T_{\mathbf {U}_0}- z \mathrm {Id})^{-1} : \mathfrak {H}_1 \to \mathfrak {H}_1$ exists for all $t \geq 0$ and $z \in \overline {\mathbb {C}}_+$ . Moreover, by continuity of eigenvalues of $M(t)$ with respect to t, we deduce that, for any compact interval $I \subset [0,\infty )$ , it holds that

$$ \begin{align*}\| (X^* + t T_{\mathbf{U}_0}-z\mathrm{Id})^{-1} \|_{\mathfrak{H}_1 \to \mathfrak{H}_1} \leq C(I, \mathbf{U}_0) \quad \text{for all } t \in I \text{ and } z \in \overline{\mathbb{C}}_+ \, , \end{align*} $$

with some finite constant $C(I, \mathbf{U}_0)>0$. Since $I_+ : \mathfrak{H}_1 \subset \mathrm{dom}(X^*) \to M_d(\mathbb{C})$ is bounded (as a linear map defined on a finite-dimensional Hilbert space), we deduce that $z \mapsto \mathrm{EF}[\mathbf{U}_0](t,z)$ is a rational function for any $t \geq 0$, whose poles belong to a compact subset $K = K(I, \mathbf{U}_0) \subset \mathbb{C}_-$ whenever $t \in I$, for any given compact time interval $I \subset [0,\infty)$.

To summarize, we have shown that, for any given compact interval $I \subset [0,\infty )$ , there exists some constant $C = C(I, \mathbf {U}_0)> 0$ such that

$$ \begin{align*}|\alpha | + \frac{1}{|\mathrm{Im} \, \alpha|} \leq C \end{align*} $$

whenever $\alpha $ is a pole of the rational map $z \mapsto \mathrm {EF}[\mathbf {U}_0](t,z)$ with $t \in I$ . By possibly enlarging the constant $C>0$ , we obtain the $L^\infty $ -bound with

$$ \begin{align*}\sup_{x \in \mathbb{R}} | \mathrm{EF}[\mathbf{U}_0](t,x) |_F \leq C \quad \text{for } t \in I \,. \end{align*} $$
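
Indeed, one possible way to see this is to combine the uniform resolvent bound above with the boundedness of $I_+$ on the finite-dimensional space $\mathfrak{H}_1$, which gives the crude estimate

$$ \begin{align*}\sup_{x \in \mathbb{R}} | \mathrm{EF}[\mathbf{U}_0](t,x) |_F \leq \frac{1}{2 \pi} \, \| I_+ \|_{\mathfrak{H}_1 \to M_d(\mathbb{C})} \, \| (X^* + t T_{\mathbf{U}_0} - x \mathrm{Id})^{-1} \|_{\mathfrak{H}_1 \to \mathfrak{H}_1} \, \| \Pi_+ \mathbf{V}_0 \|_{L^2} \quad \text{for } t \in I \text{ and } x \in \mathbb{R} \,. \end{align*} $$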

Since $\mathfrak {H}_1$ has finite dimension, we easily deduce that the degree of the denominator of the rational functions $z \mapsto \mathrm {EF}[\mathbf {U}_0](t,z)$ can be uniformly bounded for $t \in I$ . Hence, by applying Lemma 6.2 below, we deduce

$$ \begin{align*}\sup_{t \in I} \| \mathrm{EF}[\mathbf{U}_0](t) \|_{H^2} \leq C(I, \mathbf{U}_0) \end{align*} $$

with some finite constant $C(I, \mathbf {U}_0)$ .

Since $\Pi _+ \mathbf {V}(t) = \mathrm {EF}[\mathbf {U}_0](t)$ for $t \in [0, T_{\max })$ and by taking a compact interval $I \subset [0,\infty )$ with $[0,T_{\max }) \subset I$ , we conclude that (6.3) holds true. This completes the proof that the maximal (forward) time of existence must be $T_{\max } = +\infty $ .

Finally, by the time reversal symmetry of (HWM $_d$ ) with

$$ \begin{align*}\mathbf{U}(t,x) \mapsto -\mathbf{U}(-t,-x) \, , \end{align*} $$

which maps solutions to solutions (and evidently preserves rationality in x), we deduce that solutions of (HWM $_d$ ) with rational initial data also uniquely extend to all negative times $t \in (-\infty , 0]$ .

This completes the proof of Theorem 1.5.

Proof of Theorem 1.2

This is a direct consequence of Theorem 1.5. Indeed, let $\mathbf {u}_0 \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ be given and define $\mathbf {U}_0 = \mathbf {u}_0 \cdot \boldsymbol {\sigma } \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_1(\mathbb {C}^2))$ . By Theorem 1.5, there exists a unique global solution $\mathbf {U} = \mathbf {U}(t,x)$ of (HWM $_2$ ) with initial datum $\mathbf {U}(0) = \mathbf {U}_0$ . Hence

$$ \begin{align*}\mathbf{u}(t,x) = \frac{1}{2} \mathrm{Tr}( \mathbf{U}(t,x) \boldsymbol{\sigma} )=\frac{1}{2} ( \mathrm{Tr}(\mathbf{U}(t,x) \sigma_1), \mathrm{Tr}(\mathbf{U}(t,x) \sigma_2), \mathrm{Tr}(\mathbf{U}(t,x) \sigma_3)) \end{align*} $$

is the claimed unique global-in-time solution of (HWM) with initial datum $\mathbf {u}(0)=\mathbf {u}_0$ .
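
For the reader's convenience, we recall that this inverts the identification $\mathbf{U} = \mathbf{u} \cdot \boldsymbol{\sigma}$, thanks to the elementary trace identities $\mathrm{Tr}(\sigma_j \sigma_k) = 2 \delta_{jk}$ for the Pauli matrices:

$$ \begin{align*}\frac{1}{2} \mathrm{Tr}\big( (\mathbf{u} \cdot \boldsymbol{\sigma}) \, \sigma_k \big) = \frac{1}{2} \sum_{j=1}^3 u_j \, \mathrm{Tr}(\sigma_j \sigma_k) = u_k \quad \text{for } k = 1,2,3 \,. \end{align*} $$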

We close this section with the following auxiliary result used above.

Lemma 6.2. Let $\mathcal R \subset \mathbb {C}(X)$ be a subset of rational functions. We assume that there exists $C>0$ such that the following properties hold.

  1. 1. If $\alpha $ is a pole of some $R\in \mathcal R$ , then

    $$ \begin{align*}|\alpha | +\frac{1}{|\mathrm{Im}(\alpha )|} \leq C\ .\end{align*} $$
  2. 2. For every $R\in \mathcal R$ , $R(x)\to 0$ as $x\to \infty $ and

    $$ \begin{align*}\| R \|_{L^\infty (\mathbb{R} )}\leq C\ .\end{align*} $$
  3. 3. There exists an integer N such that the degree of the denominator of every $R\in \mathcal R$ is at most N.

Then, for every integer $k\ge 0$ , it holds that

$$ \begin{align*}\sup_{R\in \mathcal R}\| R\|_{H^k(\mathbb{R} )}<\infty \ .\end{align*} $$

Proof. Given $R\in \mathcal R$ , write

$$ \begin{align*}R(x)=\frac{P(x)}{Q(x)}\ ,\ Q(x)=\prod_{j=1}^D (x-\alpha _j)\ ,\ P\in \mathbb{C}[X]\ ,\ \mathrm{deg}(P)<D\leq N\ .\end{align*} $$

Because of properties (1) and (2),

$$ \begin{align*}\max_{0\leq x\leq 1} |P(x)|\leq C\max_{0\leq x\leq 1}\left | \prod_{j=1}^D (x-\alpha _j)\ \right |\leq C(1+C)^N\ .\end{align*} $$

Consequently, since all norms on the finite-dimensional space of polynomials of degree at most $N-1$ are equivalent, all the coefficients $a_j$ of P satisfy

$$ \begin{align*}\sup_{j<D}|a_j|\leq B(N,C)\ ,\end{align*} $$

for some constant $B(N,C)$ depending only on N and C. Similarly, from property (1), all the coefficients of Q are uniformly bounded by a constant depending only on C and N. Moreover, from property (1), for every $x\in \mathbb {R}$ ,

$$ \begin{align*}|Q(x)|\geq \left ( (|x|-C)_+^2+C^{-2} \right )^{D/2}\ ,\end{align*} $$

where $(\,\cdot\,)_+$ denotes the positive part.

Notice that the k-th derivative $R^{(k)}$ is a sum of a finite number – depending only on k – of terms of the form

$$ \begin{align*}\frac{ P^{(m)}Q^{(m_1)}\dots Q^{(m_r)} }{Q^{r+1}}\end{align*} $$

where $ 0\leq r\leq k$ and $m+m_1+\dots +m_r =k$ . Notice that the degree of the numerator is at most $(r+1)D-k-1$ , and that its coefficients are all bounded by a constant depending only on k, N and C. Consequently,

$$ \begin{align*}\| R^{(k)}\|_{L^2}^2\leq A(k,N,C) \max _{r\leq k}\max _{\ell \leq (r+1)D-k-1}\int_{\mathbb{R}} \frac{x^{2\ell}}{( (|x|-C)_+^2+C^{-2} )^{D(r+1)}}\, dx\end{align*} $$

with some constant $A(k,N,C)>0$. Each of these integrals is finite, since the integrand is bounded on compact sets and, as $|x| \to \infty$, it decays like $|x|^{2\ell - 2D(r+1)} \leq |x|^{-2k-2}$. This completes the proof.

7. Soliton Resolution and Non-Turbulence

In this section we prove our next main result Theorem 1.7, which shows soliton resolution and non-turbulence for rational solutions of (HWM $_d$ ) under the spectral assumption that the Toeplitz operator $T_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathbb {C}^d) \to L^2_+(\mathbb {R};\mathbb {C}^d)$ has simple discrete spectrum.

Preliminaries

Let $d \geq 2$ and $0 \leq k \leq d$ be given integers. In what follows, we suppose that $\mathbf{U}_0 \in \mathcal{R}at(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d))$ holds, that is, the map $\mathbf{U}_0 : \mathbb{R} \to M_d(\mathbb{C})$ is a rational matrix-valued function satisfying the pointwise constraints defining $\mathsf{Gr}_k(\mathbb{C}^d)$; in particular, $\mathbf{U}_0(x) = \mathbf{U}_0(x)^*$ and $\mathbf{U}_0(x)^2 = \mathrm{Id}$ for every $x \in \mathbb{R}$.

In the trivial case of constant initial data $\mathbf {U}_0(x) \equiv \mathbf {U}_\infty $ , we directly obtain Theorem 1.7 with $N=0$ . Hence for the rest of the proof, we will assume that $\mathbf {U}_0$ is nonconstant.

For the following discussion, we need to clearly distinguish between the Toeplitz operators $T_{\mathbf{U}_0}$ acting on the Hardy spaces $L^2_+(\mathbb{R}; \mathcal{V})$ with $\mathcal{V}=\mathbb{C}^d$ and $\mathcal{V}=M_d(\mathbb{C})$, respectively.

From Lemma 4.1, we recall the general formula

(7.1) $$ \begin{align} T_{\mathbf{U}_0}^2 = \mathrm{Id} - K_{\mathbf{U}_0} \quad \text{on } L^2_+(\mathbb{R}; \mathcal{V}) \, , \end{align} $$

with the trace-class operator $K_{\mathbf {U}_0} = H_{\mathbf {U}_0}^* H_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ . Since $\mathbf {U}_0$ is rational, the operator $K_{\mathbf {U}_0}$ is finite-rank by Lemma 4.3 and we have the finite-dimensional invariant subspace for $T_{\mathbf {U}_0}$ given by

(7.2) $$ \begin{align} \mathfrak{H}_1(\mathcal{V}) := \mathrm{ran} ( K_{\mathbf{U}_0} : L^2_+(\mathbb{R}; \mathcal{V}) \to L^2_+(\mathbb{R}; \mathcal{V}) ) \, , \end{align} $$

where we use the notation $\mathfrak {H}_1(\mathcal {V})$ instead of $\mathfrak {H}_1$ to keep track of whether we choose $\mathcal {V} = \mathbb {C}^d$ or $\mathcal {V} = M_d(\mathbb {C})$ . We introduce the following short-hand notations

(7.3) $$ \begin{align} \mathsf{T}:= T_{\mathbf{U}_0} |_{\mathfrak{H}_1(M_d(\mathbb{C}))} \quad \text{and} \quad \widetilde{\mathsf{T}} := T_{\mathbf{U}_0} |_{\mathfrak{H}_1(\mathbb{C}^d)} \,. \end{align} $$

Note that $\mathsf {T} = \mathsf {T}^*$ and $\widetilde {\mathsf {T}} = \widetilde {\mathsf {T}}^*$ are self-adjoint endomorphisms on the finite-dimensional spaces $\mathfrak {H}_1(M_d(\mathbb {C}))$ and $\mathfrak {H}_1(\mathbb {C}^d)$ , respectively. From Proposition 4.3 we recall that the generator $X^*$ of the adjoint Lax–Beurling semigroup also acts invariantly on the finite-dimensional subspace $\mathfrak {H}_1(\mathcal {V})$ . Likewise, we use the following notation

(7.4) $$ \begin{align} \mathsf{G} := X^* |_{\mathfrak{H}_1(M_d(\mathbb{C}))} \quad \text{and} \quad \widetilde{\mathsf{G}} := X^* |_{\mathfrak{H}_1(\mathbb{C}^d)} \end{align} $$

for the generator $X^*$ of the adjoint Lax–Beurling semigroup restricted to the invariant subspaces $\mathfrak{H}_1(M_d(\mathbb{C}))$ and $\mathfrak{H}_1(\mathbb{C}^d)$, respectively.

Let us now assume $\widetilde {\mathsf {T}}$ has simple spectrum, that is, we have

(7.5) $$ \begin{align} \sigma(\widetilde{\mathsf{T}}) = \{ v_1, \ldots, v_N \} \quad \text{with} \quad N = \dim \mathfrak{H}_1(\mathbb{C}^d) \,. \end{align} $$

Note that $v_n \in (-1,1)$ for $n = 1, \ldots , N$ . Let $\varphi _n \in \mathfrak {H}_1(\mathbb {C}^d) \subset L^2_+(\mathbb {R}; \mathbb {C}^d)$ be a choice of the corresponding normalized eigenfunctions of $\widetilde {\mathsf {T}}$ such that

$$ \begin{align*}\widetilde{\mathsf{T}} \varphi_n = v_n \varphi_n \quad \text{with} \quad \| \varphi_n \|_{L^2} = 1 \end{align*} $$

for $n=1, \ldots , N$ . Clearly, the family $( \varphi _n )_{1 \leq n \leq N}$ forms an orthonormal basis for $\mathfrak {H}_1(\mathbb {C}^d)$ .

We can easily construct an orthonormal basis of eigenfunctions for $\mathsf{T}$ acting on the matrix-valued finite-dimensional Hilbert space $\mathfrak{H}_1(M_d(\mathbb{C}))$ as follows. For $1 \leq n \leq N$ and $1 \leq j \leq d$, we define the matrix-valued functions $\Phi_{n,j} \in L^2_+(\mathbb{R}; M_d(\mathbb{C}))$ by setting

(7.6) $$ \begin{align} \Phi_{n,j}:= \left (0, \ldots, \underbrace{\varphi_n}_{j\text{-th column}}, \ldots, 0 \right ) \,. \end{align} $$

We readily check that

(7.7) $$ \begin{align} \mathsf{T} \Phi_{n,j} = v_n \Phi_{n,j} \quad \text{for } n=1, \ldots, N \text{ and } j=1, \ldots, d \,. \end{align} $$

Thus each eigenvalue $v_n$ of $\mathsf{T}$ is d-fold degenerate in a trivial manner, the degeneracy being realized by placing $\varphi_n$ in the different columns of the matrix-valued functions $\Phi_{n,j}$.

We have the following fact, whose elementary proof we omit.

Proposition 7.1. The functions $\{ \Phi _{n,j} \}_{1 \leq n \leq N, 1 \leq j \leq d}$ form an orthonormal basis of eigenfunctions for $\mathsf {T} : \mathfrak {H}_1(M_d(\mathbb {C})) \to \mathfrak {H}_1(M_d(\mathbb {C}))$ .
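
Let us only briefly indicate the two observations behind this, as a sketch under the conventions used here: since the Toeplitz and Hankel operators with symbol $\mathbf{U}_0$ act column by column on matrix-valued functions, one has $\mathfrak{H}_1(M_d(\mathbb{C})) \cong \mathfrak{H}_1(\mathbb{C}^d)^{\oplus d}$ and hence $\dim \mathfrak{H}_1(M_d(\mathbb{C})) = dN$; moreover, with the natural trace inner product on $L^2_+(\mathbb{R}; M_d(\mathbb{C}))$, orthonormality follows from the elementary computation

$$ \begin{align*}\langle \Phi_{n,j}, \Phi_{m,k} \rangle = \int_{\mathbb{R}} \mathrm{Tr}\big( \Phi_{m,k}(x)^* \Phi_{n,j}(x) \big) \, dx = \delta_{jk} \int_{\mathbb{R}} \varphi_m(x)^* \varphi_n(x) \, dx = \delta_{jk} \, \delta_{nm} \,. \end{align*} $$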

Perturbation analysis as $|t| \to \infty $

From Theorem 1.5 we know that the corresponding solution of (HWM $_d$ ) with rational initial datum $\mathbf {U}_0$ is global in time and satisfies

(7.8) $$ \begin{align} \mathbf{U}(t,x) = \mathbf{U}_\infty + \Pi \mathbf{V}(t,x) + (\Pi \mathbf{V}(t,x))^* \, , \end{align} $$

where here and in the following we write $\Pi \equiv \Pi _+$ for the Cauchy–Szegő projection for notational simplicity. By the following explicit flow formula from Lemma 5.2, we have

(7.9) $$ \begin{align} \Pi \mathbf{V}(t,x) = \frac{1}{2 \pi \mathrm{i}} I_+ \left [ (\mathsf{G}+ t \mathsf{T} - x \mathrm{Id})^{-1} \Pi \mathbf{V}_0 \right ] \quad \text{for } (t,x) \in \mathbb{R} \times \mathbb{R} \, , \end{align} $$

using our definitions of $\mathsf {G}$ and $\mathsf {T}$ acting on the finite-dimensional subspace $\mathfrak {H}_1(M_d(\mathbb {C}))$ . Note that we can take $x \in \mathbb {R}$ here, since we have already shown that the rational function $\Pi \mathbf {V}(t,z)$ for $z \in \mathbb {C}_+$ has no poles on the real axis for all $t \in \mathbb {R}$ . Recall also that $\Pi \mathbf {V}_0 \in \mathfrak {H}_1(M_d(\mathbb {C}))$ holds thanks to Proposition 4.2 above.

In order to study the large time limit $t \to \pm \infty $ , it will be convenient to define

(7.10) $$ \begin{align} \mathsf{M}(\varepsilon) := \varepsilon \mathsf{G} + \mathsf{T} \quad \text{with} \quad \varepsilon := \frac{1}{t} \end{align} $$

for $t \neq 0$ . In terms of this definition, we can write the explicit flow formula as

(7.11) $$ \begin{align} \Pi \mathbf{V}(\varepsilon^{-1}, x) = \frac{\varepsilon}{2 \pi \mathrm{i}} I_+ \left [ (\mathsf{M}(\varepsilon)- \varepsilon x \mathrm{Id})^{-1} \Pi \mathbf{V}_0 \right ] \,. \end{align} $$
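
This follows from (7.9) by the elementary rescaling of the resolvent: for $t \neq 0$ and $\varepsilon = t^{-1}$, we may write

$$ \begin{align*}\mathsf{G} + t \mathsf{T} - x \mathrm{Id} = \frac{1}{\varepsilon} \left( \varepsilon \mathsf{G} + \mathsf{T} - \varepsilon x \mathrm{Id} \right) = \frac{1}{\varepsilon} \left( \mathsf{M}(\varepsilon) - \varepsilon x \mathrm{Id} \right) , \qquad \text{so that} \qquad (\mathsf{G} + t \mathsf{T} - x \mathrm{Id})^{-1} = \varepsilon \, (\mathsf{M}(\varepsilon) - \varepsilon x \mathrm{Id})^{-1} \,. \end{align*} $$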

Inspired by the analysis in [Reference Gérard and Lenzmann15] for the study of N-solitons for the Calogero–Moser derivative NLS, we carry out a perturbation analysis of the non-self-adjoint endomorphisms $\mathsf {M}(\varepsilon ) : \mathfrak {H}_1(M_d(\mathbb {C})) \to \mathfrak {H}_1(M_d(\mathbb {C}))$ in the limit $\varepsilon \to 0$ . We have the following facts, where we recall that we always suppose that the nondegeneracy assumption (7.5) for $\widetilde {\mathsf {T}} : \mathfrak {H}_1(\mathbb {C}^d) \to \mathfrak {H}_1(\mathbb {C}^d)$ holds true.

Lemma 7.1. There exists some $\varepsilon _0> 0$ sufficiently small such that the following holds.

  1. (i) For $1 \leq n \leq N$ and $1 \leq j \leq d$ , there exist analytic functions $\varepsilon \mapsto v_n(\varepsilon ) \in \mathbb {C}$ and $\varepsilon \mapsto \Phi _{n,j}(\varepsilon ) \in \mathfrak {H}_1(M_d(\mathbb {C}))$ for $|\varepsilon | \leq \varepsilon _0$ with

    $$ \begin{align*}v_n(\varepsilon) = v_n + \varepsilon w_n + O(\varepsilon^2), \quad \Phi_{n,j}(\varepsilon) = \Phi_{n,j} + O(\varepsilon) \, , \end{align*} $$
    and such that
    $$ \begin{align*}\mathsf{M}(\varepsilon) \Phi_{n,j}(\varepsilon)= v_n(\varepsilon) \Phi_{n,j}(\varepsilon) \,. \end{align*} $$

    The functions $\{ \Phi _{n,j}(\varepsilon ) \}_{1 \leq n \leq N, 1 \leq j \leq d}$ form a basis for $\mathfrak {H}_1(M_d(\mathbb {C}))$ for $|\varepsilon | \leq \varepsilon _0$ .

  2. (ii) For $1 \leq n \leq N$ , we have

    $$ \begin{align*}w_n = \langle \widetilde{\mathsf{G}} \varphi_n, \varphi_n \rangle \quad \text{and} \quad \mathrm{Im} \, w_n < 0 \,. \end{align*} $$

Remark. The fact that all complex numbers $w_n$ have nonvanishing imaginary part will play a fundamental role in obtaining a priori bounds on all higher Sobolev norms for $\mathbf{U}(t,x)$; that is, it rules out the phenomenon of turbulence in the limit $t \to \pm \infty$. This is in striking contrast to the analysis of N-soliton solutions for the Calogero–Moser derivative NLS studied in [Reference Gérard and Lenzmann15], where the corresponding perturbative analysis yields the vanishing of the imaginary parts in the limit $t \to \pm \infty$ (which corresponds to the limit $\varepsilon \to 0$).

Proof. We divide the proof of Lemma 7.1 into the following steps.

Step 1. Let $\varepsilon _0> 0$ be a constant chosen later. For $|\varepsilon | \leq \varepsilon _0$ , we define the endomorphisms

(7.12) $$ \begin{align} \widetilde{\mathsf{M}}(\varepsilon) := \widetilde{\mathsf{T}} + \varepsilon \widetilde{\mathsf{G}} : \mathfrak{H}_1(\mathbb{C}^d) \to \mathfrak{H}_1(\mathbb{C}^d) \,. \end{align} $$

Note that $\widetilde {\mathsf {M}}(0) = \widetilde {\mathsf {T}} = \widetilde {\mathsf {T}}^*$ is self-adjoint with simple spectrum $\sigma (\widetilde {\mathsf {T}}) = \{ v_1, \ldots , v_N \}$ with a corresponding orthonormal basis of eigenfunctions $(\varphi _n)_{1 \leq n \leq N}$ . By standard analytic perturbation theory, there exist analytic functions $\varepsilon \mapsto v_n(\varepsilon ) \in \mathbb {C}$ and $\varepsilon \mapsto \varphi _n(\varepsilon ) \in \mathfrak {H}_1(\mathbb {C}^d)$ for $1 \leq n \leq N$ such that

(7.13) $$ \begin{align} \widetilde{\mathsf{M}}(\varepsilon) \varphi_n(\varepsilon) = v_n(\varepsilon) \varphi_n(\varepsilon) \end{align} $$

for $|\varepsilon | \leq \varepsilon _0$ , where $\varepsilon _0> 0$ is some sufficiently small constant. We have

(7.14) $$ \begin{align} v_n(\varepsilon) = v_n + \varepsilon w_n + O(\varepsilon^2), \quad \varphi_n(\varepsilon) = \varphi_n + O(\varepsilon) \, , \end{align} $$
(7.15) $$ \begin{align} w_n = \langle \widetilde{\mathsf{G}} \varphi_n, \varphi_n \rangle \,. \end{align} $$

Since $(\varphi _n)_{1 \leq n \leq N}$ forms an orthonormal basis for $\mathfrak {H}_1(\mathbb {C}^d)$ and by continuity with respect to $\varepsilon $ , we readily see that the perturbed eigenvectors $(\varphi _n(\varepsilon ))_{1 \leq n \leq N}$ also form a (not necessarily orthonormal) basis of $\mathfrak {H}_1(\mathbb {C}^d)$ , provided that $\varepsilon _0> 0$ is sufficiently small. By defining

$$ \begin{align*}\Phi_{n,j}(\varepsilon) := \left (0, \ldots, \underbrace{\varphi_n(\varepsilon)}_{j\text{-th column}}, \ldots, 0 \right ) \, , \end{align*} $$

we easily verify that (i) holds true.

Step 2. It remains to prove item (ii). Thus we claim, for any $1 \leq n \leq N$ ,

(7.16) $$ \begin{align} \mathrm{Im} \, w_n = \mathrm{Im} \, \langle \widetilde{\mathsf{G}} \varphi_n, \varphi_n \rangle < 0 \,. \end{align} $$

Indeed, let $1 \leq n \leq N$ be given. From the general identity (2.3), we recall that

(7.17) $$ \begin{align} \mathrm{Im} \, \langle \widetilde{\mathsf{G}} \varphi_n, \varphi_n \rangle = - \frac{1}{4 \pi} |I_+(\varphi_n)|^2 \leq 0 \, , \end{align} $$

with $I_+(f) = \lim _{\xi \to 0^+} \widehat {f}(\xi )$ and $f \in \mathrm {dom}(X^*)$ . [Note that $\varphi _n \in \mathrm {dom}(X^*)$ , since $\varphi _n$ is a rational function.] To prove (7.16), we argue by contradiction as follows. Let us assume that

(7.18) $$ \begin{align} I_+(\varphi_n) = 0 \,. \end{align} $$

By the commutator formula in Lemma 3.2, we deduce

(7.19) $$ \begin{align} [\widetilde{\mathsf{G}}, \widetilde{\mathsf{T}}] \varphi_n = \frac{\mathrm{i}}{2 \pi} \Pi \mathbf{V}_0.I_+(\varphi_n) = 0 \,. \end{align} $$

Thus from $\widetilde {\mathsf {T}} \varphi _n = v_n \varphi _n$ we see that $\widetilde {\mathsf {T}} \widetilde {\mathsf {G}} \varphi _n = v_n \widetilde {\mathsf {G}} \varphi _n$ . But since $\widetilde {\mathsf {T}}$ has simple spectrum by assumption, we conclude $\widetilde {\mathsf {G}} \varphi _n = \alpha \varphi _n$ for some constant $\alpha \in \mathbb {C}$ . By taking the Fourier transform, this yields

(7.20) $$ \begin{align} \mathrm{i} \frac{d}{d \xi} \widehat{\varphi}_n(\xi) = \alpha \widehat{\varphi}_n(\xi) \quad \text{for } \xi \geq 0 \,. \end{align} $$

Thus we find $\widehat{\varphi}_n(\xi) = A \mathrm{e}^{-\mathrm{i} \alpha \xi}$ for $\xi \geq 0$ with some constant $A \neq 0$ (since $\varphi_n \not \equiv 0$ ). Moreover, we infer that $\mathrm{Im} \, \alpha < 0$ since $\widehat{\varphi}_n \in L^2(\mathbb{R}_+)$ . But this implies that

$$ \begin{align*}I_+(\varphi_n) = \lim_{\xi \to 0^+} \widehat{\varphi}_n(\xi) = A \neq 0 \, , \end{align*} $$

contradicting our assumption that $I_+(\varphi _n)=0$ holds. This shows (7.16) and completes the proof of Lemma 7.1.

Proof of Theorem 1.7

We are now ready to give the proof of Theorem 1.7. Adapting the notation from above, we proceed as follows.

Asymptotic behavior as $t \to \pm \infty $

Recall that $\varepsilon = t^{-1}$ for $t \neq 0$ . In what follows, we shall always assume that $|\varepsilon| \leq \varepsilon_0$ with the constant $\varepsilon_0> 0$ from Lemma 7.1 above, which amounts to considering times t with $|t| \geq T_0$ where $T_0 = \varepsilon_0^{-1}$ .

We expand $\Pi \mathbf {V}_0 \in \mathfrak {H}_1(M_d(\mathbb {C}))$ in terms of the basis $(\Phi _{n,j}(\varepsilon ))_{1 \leq n \leq N, 1 \leq j \leq d}$ from Lemma 7.1, that is, we write

(7.21) $$ \begin{align} \Pi \mathbf{V}_0 = \sum_{n=1}^N \sum_{j=1}^d \alpha_{n,j}(\varepsilon) \Phi_{n,j}(\varepsilon) \end{align} $$

with some coefficients $\alpha_{n,j}(\varepsilon) \in \mathbb{C}$ . From Lemma 7.1 and the fact that $\varepsilon x \in \mathbb{R}$ does not belong to the spectrum $\sigma(\mathsf{M}(\varepsilon)) = \{ v_1(\varepsilon), \ldots, v_N(\varepsilon) \}$ , which contains no real points for $0 < |\varepsilon| \leq \varepsilon_0$ (after possibly shrinking $\varepsilon_0 > 0$, since $\mathrm{Im} \, v_n(\varepsilon) = \varepsilon \, \mathrm{Im} \, w_n + O(\varepsilon^2)$ with $\mathrm{Im} \, w_n \neq 0$), we conclude that

(7.22) $$ \begin{align} (\mathsf{M}(\varepsilon) - \varepsilon x \mathrm{Id})^{-1} \Pi \mathbf{V}_0 = \sum_{n=1}^N \sum_{j=1}^d \frac{\alpha_{n,j}(\varepsilon)}{v_n(\varepsilon) - \varepsilon x} \Phi_{n,j}(\varepsilon) \quad \text{for } |\varepsilon| \leq \varepsilon_0 \text{ and } x \in \mathbb{R} \,. \end{align} $$

In view of the explicit formula (7.11) together with $\varepsilon = t^{-1}$ and the properties stated in Lemma 7.1, we obtain that

(7.23) $$ \begin{align} \Pi \mathbf{V}(t,x) = \frac{\varepsilon}{2 \pi \mathrm{i}} I_+ \left [ \sum_{n=1}^N \sum_{j=1}^d \frac{\alpha_{n,j}(\varepsilon)}{v_n(\varepsilon) - \varepsilon x} \Phi_{n,j}(\varepsilon) \right ] = \sum_{n=1}^N \frac{A_n(t)}{x-z_n(t)} \,. \end{align} $$

Here we set

(7.24) $$ \begin{align} z_n(t) := t v_n(t^{-1}) = t v_n + w_n + O(t^{-1}) \in \mathbb{C}_- \quad \text{for } |t| \geq T_0 \, , \end{align} $$

and $A_n(t) \in M_d(\mathbb {C})$ are the matrix-valued functions defined as

(7.25) $$ \begin{align} A_n(t) := - \frac{1}{2 \pi \mathrm{i}} \sum_{j=1}^d \alpha_{n,j}(t^{-1}) I_+[\Phi_{n,j}(t^{-1})] \quad \text{for } |t| \geq T_0 \,. \end{align} $$
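
Indeed, the scalar factors in (7.23) arise from the elementary identity

$$ \begin{align*}\frac{\varepsilon}{v_n(\varepsilon) - \varepsilon x} = \frac{1}{t \, v_n(t^{-1}) - x} = - \frac{1}{x - z_n(t)} \quad \text{with } \varepsilon = t^{-1} \, , \end{align*} $$

so that collecting the matrix parts yields the residues $A_n(t)$ in (7.25).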

Note that, by choosing $T_0> 0$ possibly larger, we can henceforth ensure that

(7.26) $$ \begin{align} |z_n(t) - z_m(t)| \geq \frac{|t|}{2} \cdot \min_{n \neq m} |v_n - v_m|> 0 \quad \text{for } |t| \geq T_0 \text{ and } n \neq m, \end{align} $$

implying that the poles $z_1(t), \ldots , z_N(t) \in \mathbb {C}_-$ are pairwise distinct whenever $|t| \geq T_0$ .

Next we show, after discarding possibly trivial zero terms, that all the matrices $A_n(t) \in M_d(\mathbb{C})$ are nonzero and nilpotent of degree 2. Moreover, their limits as $t \to \pm \infty$ both exist, coincide, and are nonzero as well.

Proposition 7.2. There exists an integer $1 \leq M \leq N$ such that, after possibly relabelling $ \{ A_n(t), z_n(t) \}_{n=1}^N$ , it holds that

$$ \begin{align*}\Pi_+ \mathbf{V}(t,x) = \sum_{n=1}^M \frac{A_n(t)}{x-z_n(t)} \quad \text{for } |t| \geq T_0 \text{ and } x \in \mathbb{R} \,. \end{align*} $$

Here the matrices $A_n(t) \in M_d(\mathbb {C})$ satisfy $A_n(t) \neq 0$ and $A_n(t)^2 = 0$ for $|t| \geq T_0$ .

In addition, it holds

$$ \begin{align*}A_n(t) = A_n + O(t^{-1}) \end{align*} $$

with some nonzero limits $A_n \neq 0$ satisfying $A_n^2 = 0$ for $1 \leq n \leq M$ .

Remark. In the special case of (HWM) with target $\mathbb {S}^2$ , corresponding to the target $\mathsf {Gr}_1(\mathbb {C}^2)$ in the matrix-valued case, we will see below that actually $M=N$ must always hold. This observation is based on the simple algebraic fact that nonzero matrices $A \in M_2(\mathbb {C})$ with $A^2=0$ must have $\mathrm {rank}(A) = 1$ . See below for more details.

Proof. Step 1. We first show that $A_n(t)^2 = 0$ holds for $1 \leq n \leq N$ and $|t| \geq T_0$ . Indeed, we know that

(7.27) $$ \begin{align} \mathbf{U}(t,x) = \mathbf{U}_\infty + \sum_{n=1}^N \frac{A_n(t)}{x-z_n(t)} + \sum_{n=1}^N \frac{A_n(t)^*}{x-\overline{z}_n(t)} \quad \text{for } |t| \geq T_0 \end{align} $$

with the pairwise distinct poles $z_1(t), \ldots, z_N(t) \in \mathbb{C}_-$ and some constant matrix $\mathbf{U}_\infty \in \mathsf{Gr}_k(\mathbb{C}^d)$ . From the algebraic constraint $\mathbf{U}(t,x)^2 = \mathrm{Id}$ and by equating the terms proportional to $(x-z_n(t))^{-2}$ to zero, we conclude that

$$ \begin{align*}A_n(t)^2 = 0 \end{align*} $$

for $|t| \geq T_0$ and $1 \leq n \leq N$ .
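
Here the relevant algebraic constraint is that maps into $\mathsf{Gr}_k(\mathbb{C}^d)$ satisfy $\mathbf{U}(t,x)^2 = \mathrm{Id}$ pointwise (as in the model case $\mathbf{u} \cdot \boldsymbol{\sigma}$ for $d=2$). Expanding the square of (7.27), the only contribution with a double pole at $z_n(t)$ is

$$ \begin{align*}\frac{A_n(t)^2}{(x - z_n(t))^2} \, , \end{align*} $$

since the poles $z_1(t), \ldots, z_N(t) \in \mathbb{C}_-$ are pairwise distinct and distinct from their complex conjugates; as $\mathbf{U}(t,x)^2 = \mathrm{Id}$ has no poles, this coefficient must vanish.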

Furthermore, we readily see that we have existence and equality of the limits

$$ \begin{align*}\lim_{t \to -\infty} A_n(t) = \lim_{t \to +\infty} A_n(t) =: A_n \in M_d(\mathbb{C}) \,. \end{align*} $$

This directly follows from the properties in Lemma 7.1, which yield that

$$ \begin{align*}\lim_{|t| \to \infty} A_n(t) = -\frac{1}{2 \pi \mathrm{i}} \lim_{\varepsilon \to 0} \sum_{j=1}^d \alpha_{n,j}(t^{-1}) I_+[\Phi_{n,j}(t^{-1})] = -\frac{1}{2 \pi \mathrm{i}} \sum_{j=1}^d \alpha_{n,j} I_+[\Phi_{n,j}] = A_n \end{align*} $$

with the coefficients $\alpha _{n,j} = \langle \Pi \mathbf {V}_0, \Phi _{n,j} \rangle $ . Moreover, since $\Phi _{n,j}(t^{-1}) = \Phi _{n,j} + O(t^{-1})$ , we readily deduce that

$$ \begin{align*}A_n(t) = A_n + O(t^{-1}) \,. \end{align*} $$

Moreover, from $A_n(t)^2 = 0$ for $|t| \geq T_0$ , we readily deduce that the limits satisfy $A_n^2=0$ as well.

Step 2. By plugging (7.27) into (HWM $_d$ ), we obtain the following differential equations for the matrix-valued functions $A_n(t)$ :

(7.28) $$ \begin{align} \dot{A}_n(t) = \mathrm{i} \sum_{m \neq n}^N \frac{[A_n(t), A_m(t)]}{(z_n(t)-z_m(t))^2} \quad \text{for } |t| \geq T_0 \text{ and } 1 \leq n \leq N \, , \end{align} $$

where $[X,Y]$ denotes the commutator of matrices in $M_d(\mathbb{C})$ . For details of the calculation that derives (7.28), we refer to the proof of [Reference Berntson, Klabbers and Langmann3, Theorem 2.1]; the generalization to (HWM $_d$ ) is straightforward. We also note that the expression on the right-hand side in (7.28) is nonsingular for $|t| \geq T_0$ thanks to (7.26).

We now claim that

(7.29) $$ \begin{align} A_n(T_0) \neq 0 \quad \Rightarrow \quad A_n(t) \neq 0 \text{ for } t \geq T_0 \text{ and } \displaystyle \lim_{t \to +\infty} A_n(t) \neq 0 \,. \end{align} $$

Indeed, let $\| A \| = (\mathrm {Tr}(A A^*))^{1/2}$ denote the Frobenius norm of a matrix $A \in M_d(\mathbb {C})$ . Since $\| A_m(t) \| \leq C$ for $t \geq T_0$ and $1 \leq m \leq N$ with some constant $C>0$ (by existence of limits shown in Step 1) and from (7.26), we obtain from (7.28) the estimate

(7.30) $$ \begin{align} \| \frac{d}{dt} A_n(t) \| \lesssim \frac{1}{t^2} \| A_n(t) \| \quad \text{for } t \geq T_0 \,. \end{align} $$

Suppose now that $A_n(T_0) \neq 0$ and let $T \in (T_0,+\infty]$ . Then, as long as $A_n(t) \neq 0$, we may integrate the estimate above for the logarithmic derivative and conclude that

$$ \begin{align*}\left| \log(\| A_n(T) \| ) - \log(\| A_n(T_0) \|) \right| \leq \int_{T_0}^T \left| \frac{d}{dt} \log \|A_n(t)\| \right| \, dt \lesssim \int_{T_0}^T \frac{dt}{t^2} \lesssim \frac{1}{T_0} < +\infty \, ,\end{align*} $$

which yields a uniform positive lower bound for $\| A_n(t) \|$ on $(T_0, +\infty)$ and thus rules out $A_n(T) = 0$ for every $T \in (T_0, +\infty]$ . This proves the implication (7.29).

Step 3. Define the integer $0 \leq K \leq N$ by setting

$$ \begin{align*}K := \# \{ 1 \leq n \leq N : A_n(T_0) = 0\} \end{align*} $$

and we let $M := N-K$ . Now if $M=0$ , then $\mathbf{U}(T_0,x) = \mathbf{U}_\infty$ and hence, by uniqueness of solutions, $\mathbf{U}(t,x) \equiv \mathbf{U}_\infty$ is a constant solution to (HWM $_d$ ); in particular, $\mathbf{U}_0 = \mathbf{U}_\infty$ . But this implies that $K_{\mathbf{U}_0} \equiv 0$ and hence $\mathfrak{H}_1(\mathbb{C}^d) = \{ 0 \}$ is trivial, which contradicts our assumption that $N = \dim \mathfrak{H}_1(\mathbb{C}^d) \geq 1$ . Thus we see that $M \geq 1$ holds.

Thus, after relabelling $\{ A_n(T_0), z_n(T_0) \}_{n=1}^N$ if necessary, we see that

$$ \begin{align*}\Pi \mathbf{V}(T_0, x) = \sum_{n=1}^M \frac{A_n(T_0)}{x-z_n(T_0)} \end{align*} $$

with $A_n(T_0) \neq 0$ for $1 \leq n \leq M$ . By (7.29), we deduce that $A_n(t) \neq 0$ for all $t \geq T_0$ and $1 \leq n \leq M$ , as well as $\lim_{t \to +\infty} A_n(t) = A_n \neq 0$ for $1 \leq n \leq M$ . This proves the statement of Proposition 7.2 for positive times $t \geq T_0$ .

Finally, since $\lim_{t \to -\infty} A_n(t) = \lim_{t \to +\infty} A_n(t)$ for all $1 \leq n \leq N$ by Step 1, the corresponding statement for negative times $t \leq -T_0$ follows in the same way. This completes the proof of Proposition 7.2.

Completing the proof of Theorem 1.7

We are now ready to complete the proof of Theorem 1.7, which we divide into the following steps.

Step 1. In view of Proposition 7.2 above, we define

(7.31) $$ \begin{align} \mathbf{U}^{\pm}(t,x) := \sum_{n=1}^M \mathbf{Q}_{v_n} (x-v_n t) - (M-1) \mathbf{U}_\infty \end{align} $$

with the rational functions

(7.32) $$ \begin{align} \mathbf{Q}_{v_n}(x) := \mathbf{U}_\infty + \frac{A_n}{x-y_n + \mathrm{i} \delta_n} + \frac{A_n^*}{x-y_n - \mathrm{i} \delta_n} \, , \end{align} $$

where $A_n = \lim_{|t| \to \infty} A_n(t) \in M_d(\mathbb{C})$ denote the nonzero limit matrices provided by Proposition 7.2 and where we set

(7.33) $$ \begin{align} y_n := \mathrm{Re} \, w_n, \quad \delta_n := -\mathrm{Im} \, w_n> 0 \quad \text{for} \quad n=1, \ldots, M \,. \end{align} $$

For the difference

$$ \begin{align*}\mathbf{R}(t) := \mathbf{U}(t) - \mathbf{U}^{\pm}(t) \in H^\infty(\mathbb{R}; M_d(\mathbb{C})) \end{align*} $$

we claim that

(7.34) $$ \begin{align} \lim_{t \to \pm\infty} \|\mathbf{R}(t) \|_{H^s} = 0 \quad \text{for any } s \geq 0 \,. \end{align} $$

Indeed, since $\mathbf {R}(t,x)^* = \mathbf {R}(t,x)$ and thus $\mathbf {R} = \Pi \mathbf {R} + (\Pi \mathbf {R})^*$ , it suffices to consider $\Pi \mathbf {R}$ . We note that

$$ \begin{align*} \Pi \mathbf{R}(t,x) & = \sum_{n=1}^M \left ( \frac{A_n(t)}{x-z_n(t)} - \frac{A_n}{x-y_n - v_n t + \mathrm{i} \delta_n} \right ) \\ & = \sum_{n=1}^M \frac{A_n(t)-A_n}{x-z_n(t)} + \sum_{n=1}^M \left ( \frac{A_n}{x-z_n(t)} - \frac{A_n}{x-y_n - v_n t + \mathrm{i} \delta_n} \right ) \\ & =: r_1(t) + r_2(t) \,. \end{align*} $$

By recalling that $A_n(t)-A_n = O(t^{-1})$ and taking the Fourier transform, we see that

$$ \begin{align*}\| r_1(t) \|_{H^s} \leq O(t^{-1}) \sum_{n=1}^M \left ( \int_0^{\infty} \langle \xi \rangle^{2s} \mathrm{e}^{-2\delta \xi} \, d\xi \right)^{1/2} \to 0 \quad \text{as} \quad t \to \pm \infty \, , \end{align*} $$

where we also used that $\mathrm {Im} \, z_n(t) \leq -\delta < 0$ for $|t| \geq T_0$ and $1 \leq n \leq M$ with some constant $\delta> 0$ . Furthermore, we find

$$ \begin{align*} \| r_2(t) \|_{H^s} & \leq C \sum_{n=1}^M \left ( \int_0^{\infty} \langle \xi \rangle^{2s} | \mathrm{e}^{-\mathrm{i} z_n(t) \xi} - \mathrm{e}^{-\mathrm{i} (y_n+v_n t - \mathrm{i} \delta_n) \xi}|^2 \, d \xi \right)^{1/2} \\ & \leq C \sum_{n=1}^M \left ( \int_0^{\infty} \langle \xi \rangle^{2s} \mathrm{e}^{-2 \delta \xi} | \mathrm{e}^{O(t^{-1}) \xi}-1|^2 \, d \xi \right)^{1/2} \to 0 \quad \text{as} \quad t \to \pm \infty, \end{align*} $$

by dominated convergence and by making use of the fact that $z_n(t) = y_n + v_n t - \mathrm {i} \delta _n + O(t^{-1})$ and $\delta _n \geq \delta> 0$ for $n=1, \ldots , M$ . This completes the proof of (7.34).

Step 2. Next, we show that each rational function $\mathbf{Q}_{v_n}$ yields a profile for a traveling solitary wave for (HWM $_d$ ) with velocity $v_n$ .

First, we verify that $\mathbf{Q}_{v_n} : \mathbb{R} \to \mathsf{Gr}_k(\mathbb{C}^d)$ holds. Indeed, for any $1 \leq n \leq M$ and $x \in \mathbb{R}$ fixed, we observe that

$$ \begin{align*} \mathbf{U}(t,x + v_n t) & = \mathbf{Q}_{v_n}(x) + \sum_{j \neq n} \left ( \frac{A_j}{x-y_j-(v_j-v_n)t + \mathrm{i} \delta_j} + \frac{A_j^*}{x-y_j - (v_j -v_n)t - \mathrm{i} \delta_j} \right ) + \mathbf{R}(t,x+v_n t) \\ & \to \mathbf{Q}_{v_n}(x) \quad \text{as} \quad |t| \to +\infty \, , \end{align*} $$

which follows from (7.34) and the fact that $v_j \neq v_n$ for $j \neq n$ . From this we easily conclude that $\mathbf {Q}_{v_n}(x) \in \mathsf {Gr}_k(\mathbb {C}^d)$ for all $x \in \mathbb {R}$ .

Next, we prove that each $\mathbf {Q}_{v_n} \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ is a traveling solitary wave profile for the velocity $v_n$ . By taking the limit $\varepsilon = t^{-1} \to 0$ in (7.21) and (7.25), we obtain (using the notation in the proof of Proposition 7.2 above) that

$$ \begin{align*}\Pi \mathbf{V}_0=\sum_{n=1}^N \sum_{j=1}^d \alpha_{n,j}\Phi_{n,j}\, , \quad A_n = -\frac{1}{2\pi \mathrm{i}}\sum_{j=1}^d \alpha_{n,j}I_+(\Phi_{n,j})\,. \end{align*} $$

Let $(e_1,\dots ,e_d)$ be the canonical basis of $\mathbb{C}^d$ . Then $\Phi_{n,j}=\varphi_n e_j^T$ and therefore

$$ \begin{align*}A_n =-\frac{1}{2\pi \mathrm{i}}\, I_+(\varphi_n) \left (\sum_{j=1}^d \alpha_{n,j}e_j\right )^T ,\end{align*} $$

or, equivalently, we can write

$$ \begin{align*}A_n=\langle .,\eta_n\rangle_{\mathbb{C}^d} I_+(\varphi_n) \quad \text{with } \eta_n:=\displaystyle \frac{1}{2\mathrm{i} \pi}\sum_{j=1}^d \overline \alpha_{n,j}e_j \in \mathbb{C}^d \text{ for } n=1, \ldots, M \,. \end{align*} $$

Note that $A_n \neq 0$ with $A_n^2 = 0$ . Hence $\eta _n \in \mathbb {C}^d$ and $I_+(\varphi _n) \in \mathbb {C}^d$ are nonzero vectors with $\langle \eta _n, I_+(\varphi _n) \rangle _{\mathbb {C}^d} = 0$ . In particular, we see that $\mathrm {rank}(A_n) = 1$ for $1 \leq n \leq M$ .

Now we reformulate the eigenfunction identity

$$ \begin{align*}T_{\mathbf{U}_0} \varphi_n =v_n \varphi_n \end{align*} $$

for the Toeplitz operator $T_{\mathbf {U}_0} : L^2_+(\mathbb {R}; \mathbb {C}^d) \to L^2_+(\mathbb {R}; \mathbb {C}^d)$ . Indeed, let us apply $I_+$ to both sides while using the following elementary lemma.

Lemma 7.2. Let $f,g\in L^2_+$ be rational functions. Then

$$ \begin{align*}I_+(fg)=0 \quad \text{and} \quad I_+(\Pi (f\overline g))= \int_{\mathbb{R}} f \overline{g} \, dx \, .\end{align*} $$

Proof. This simply follows by using the following fact: For all $h \in \mathrm{dom}(X^*)$ , we have $I_+(h)=\lim_{\varepsilon \to 0^+} \langle h, \chi_\varepsilon \rangle_{L^2}$ with $\chi_\varepsilon(x):=\frac{1}{1-\mathrm{i} \varepsilon x}$ .

From Lemma 7.2, we infer

$$ \begin{align*}I_+(T_{\mathbf{U}_0}\varphi_n)=\mathbf{U}_\infty I_+(\varphi_n)+\int_{\mathbb{R}} (\Pi \mathbf{V}_0)^* \varphi_n\, dx = \mathbf{U}_\infty I_+(\varphi_n)+2\mathrm{i} \pi \eta_n\ ,\end{align*} $$

because, using that $(\varphi_p)_{1 \leq p \leq N}$ is an orthonormal family in $L^2$ ,

$$ \begin{align*}\int_{\mathbb{R}} \Phi_{p,j}^*\varphi_n \, dx = e_j \delta_{n,p}\ .\end{align*} $$

The eigenfunction identity $T_{\mathbf {U}_0} \varphi _n =v_n \varphi _n$ therefore implies

(7.35) $$ \begin{align} \mathbf{U}_\infty I_+(\varphi_n)=v_n I_+(\varphi _n) -2\pi \mathrm{i} \eta_n. \end{align} $$

Applying the matrix $\mathbf {U}_\infty $ to both sides of the above identity, we get

(7.36) $$ \begin{align} \mathbf{U}_\infty \eta_n =-v_n\eta _n -\frac{1}{2 \pi \mathrm{i}}(1-v_n^2)I_+(\varphi_n). \end{align} $$
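
In more detail, multiplying (7.35) from the left by $\mathbf{U}_\infty$ and using the pointwise constraint $\mathbf{U}_\infty^2 = \mathrm{Id}$ (which holds since $\mathbf{U}_\infty \in \mathsf{Gr}_k(\mathbb{C}^d)$ with the convention used here), we find

$$ \begin{align*}I_+(\varphi_n) = \mathbf{U}_\infty^2 I_+(\varphi_n) = v_n \mathbf{U}_\infty I_+(\varphi_n) - 2 \pi \mathrm{i} \, \mathbf{U}_\infty \eta_n = v_n^2 I_+(\varphi_n) - 2 \pi \mathrm{i} v_n \eta_n - 2 \pi \mathrm{i} \, \mathbf{U}_\infty \eta_n \, , \end{align*} $$

and solving for $\mathbf{U}_\infty \eta_n$ gives (7.36).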

Recall that $I_+(\varphi _n)$ and $\eta _n$ are nonzero vectors in $\mathbb {C}^d$ with $\langle \eta _n, I_+(\varphi _n) \rangle _{\mathbb {C}^d} = 0$ . We denote by $P_n = \mathrm {span} \{ \eta _n, I_+(\varphi _n) \}$ the two-dimensional plane in $\mathbb {C}^d$ generated by these two vectors. We notice that $\mathbf {U}_\infty $ preserves $P_n$ and hence it preserves $P_n^\perp $ , since $\mathbf {U}_\infty ^*=\mathbf {U}_\infty $ . It is now easy to check that the kernel of $H_{\mathbf {Q}_{v_n}}$ is given by

$$ \begin{align*}\mathrm{ker}( H_{\mathbf{Q}_{v_n}})= \frac{x-y_n-\mathrm{i}\delta_n}{x-y_n+\mathrm{i}\delta _n}L^2_+(\mathbb{R} )I_+(\varphi_n) \oplus L^2_+(\mathbb{R} )\eta_n \oplus (L^2_+(\mathbb{R} )\otimes P_n^\perp )\end{align*} $$

and that its orthogonal subspace in $L^2_+(\mathbb {R}; \mathbb {C}^d)$ is generated by

$$ \begin{align*}\psi _n(x):=\frac{1}{x-y_n+\mathrm{i} \delta_n}I_+(\varphi_n)\ .\end{align*} $$

Furthermore, from (7.35), (7.36) and the identity

$$ \begin{align*}\mathbf{U}_\infty A_n +A_n \mathbf{U}_\infty =\frac{A_nA_n^*+A_n^*A_n}{2\mathrm{i} \delta _n} \, ,\end{align*} $$

we get $\Vert I_+(\varphi _n)\Vert _{\mathbb {C}^d}^2=4\pi \delta _n$ and

$$ \begin{align*}T_{\mathbf{Q}_{v_n}}\psi_n =v_n \psi_n\ .\end{align*} $$
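
We also note that the matrix identity used above can be obtained by squaring the profile: since $\mathbf{Q}_{v_n}(x) \in \mathsf{Gr}_k(\mathbb{C}^d)$ and hence $\mathbf{Q}_{v_n}(x)^2 = \mathrm{Id}$ pointwise (with the convention used throughout), and since $A_n^2 = 0$, the residue of $\mathbf{Q}_{v_n}(x)^2$ at the pole $x = y_n - \mathrm{i} \delta_n$ must vanish, which reads

$$ \begin{align*}\mathbf{U}_\infty A_n + A_n \mathbf{U}_\infty + \frac{A_n A_n^* + A_n^* A_n}{(y_n - \mathrm{i} \delta_n) - (y_n + \mathrm{i} \delta_n)} = 0 \, , \end{align*} $$

that is, $\mathbf{U}_\infty A_n + A_n \mathbf{U}_\infty = \frac{A_n A_n^* + A_n^* A_n}{2 \mathrm{i} \delta_n}$.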

Finally, a direct calculation using again (7.35) and (7.36) leads to

$$ \begin{align*}-2\mathrm{i} v_n\mathbf{Q}_{v_n}'(x)=[\mathbf{Q}_{v_n},|D| \mathbf{Q}_{v_n}](x)\ ,\end{align*} $$

which precisely means that $\mathbf{Q}_{v_n} (x-v_nt)$ is a traveling solitary wave for (HWM $_d$ ) with velocity $v_n$ .

Step 3. We next show that the integer $1 \leq M \leq N$ given in Proposition 7.2 must satisfy

$$ \begin{align*}M=N \end{align*} $$

where $\sigma _{\mathrm {d}}(T_{\mathbf {U}_0}) = \{v_1, \ldots , v_N\}$ . To see this, we recall from Proposition 7.2 that

$$ \begin{align*}\mathbf{U}(t,x) = \mathbf{U}_\infty + \sum_{n=1}^M \frac{A_n(t)}{x-z_n(t)} + \sum_{n=1}^M \frac{A_n(t)^*}{x-\overline{z}_n(t)} \quad \text{for } |t| \geq T_0 \end{align*} $$

with nonzero matrices $A_1(t), \ldots , A_M(t) \in M_d(\mathbb {C})$ such that $A_n(t)^2 =0$ and pairwise distinct poles $z_1(t), \ldots , z_M(t) \in \mathbb {C}_-$ . Furthermore, from (7.25) and the arguments in the beginning of Step 2 above, we deduce that

$$ \begin{align*}A_n(t) = \langle \cdot, \eta_n(t) \rangle_{\mathbb{C}^d} I_+(\varphi_n(t)) \quad \text{for } |t| \geq T_0 \end{align*} $$

with nonzero vectors $\eta _n(t),I_+(\varphi _n(t)) \in \mathbb {C}^d$ such that $\langle \eta _n(t), I_+(\varphi _n(t)) \rangle _{\mathbb {C}^d} = 0$ . In particular, we conclude that $\mathrm {rank}(A_n(t)) = 1$ for $|t| \geq T_0$ . Hence we can apply Lemma B.1 (see also the remark there) to deduce that $\mathrm {rank}(K_{\mathbf {U}(T_0)}) = M$ .

On the other hand, thanks to the Lax evolution, we get that $K_{\mathbf {U}(T_0)}= \mathcal {U}(T_0) K_{\mathbf {U}_0} \mathcal {U}(T_0)^*$ with some unitary map $\mathcal {U}(T_0) : L^2_+(\mathbb {R}; \mathbb {C}^d) \to L^2_+(\mathbb {R}; \mathbb {C}^d)$ . This implies $\mathrm {rank}(K_{\mathbf {U}(T_0)}) = \mathrm {rank}(K_{\mathbf {U}_0}) = N$ , whence it follows that $M=N$ .

Step 4. Finally, we observe that $\| \mathbf{U}^{\pm}(t) \|_{\dot{H}^s} \leq C$ for all $t \in \mathbb{R}$ with some constant $C>0$ depending on $s>0$ . Furthermore, in view of (7.34) and $\mathbf{U} \in C(\mathbb{R}; \dot{H}^\infty)$ , we readily deduce the a priori bounds

$$ \begin{align*}\sup_{t \in \mathbb{R}} \| \mathbf{U}(t) \|_{\dot{H}^s} \leq C(\mathbf{U}_0,s) < \infty \end{align*} $$

for any $s>0$ .

The proof of Theorem 1.7 is now complete.

8. Refined analysis for target $\mathbb {S}^2$

We now consider (HWM) with target $\mathbb{S}^2$ . The goal of this section is to refine the general Theorem 1.7 on soliton resolution for the target $\mathbb{S}^2 \cong \mathsf{Gr}_1(\mathbb{C}^2)$ , leading to Theorem 1.3. Moreover, we will establish that the spectral condition of simplicity of the discrete spectrum $\sigma_{\mathrm{d}}(T_{\mathbf{U}_0})$ holds for a dense subset of rational initial data in the case of the target $\mathbb{S}^2$ , as formulated in Theorem 1.4. The proof of this density result will make essential use of the stereographic projection $\mathbb{S}^2 \to \mathbb{C} \cup \{ \infty \}$ to find a suitable parametrization of rational maps $\mathbf{u} : \mathbb{R} \to \mathbb{S}^2$ and the corresponding Toeplitz operators $T_{\mathbf{U}}$ with rational matrix-valued symbol $\mathbf{U} = \mathbf{u} \cdot \boldsymbol{\sigma}$ . Our arguments will ultimately be based on analyticity properties in order to conclude Theorem 1.4. We expect that the density result stated in Theorem 1.4 can be generalized to (HWM $_d$ ) with target $\mathsf{Gr}_k(\mathbb{C}^d)$ . However, the algebraic and analytic challenges would require a vast extension of the following analysis, which we have chosen not to pursue here.

For the reader’s convenience, we recall that (HWM) with target $\mathbb{S}^2$ is equivalent to (HWM $_d$ ) with $d=2$ for matrix-valued maps of the form

$$ \begin{align*}\mathbf{U}(x) = \mathbf{u}(x) \cdot \boldsymbol{\sigma} = \left ( \begin{array}{cc} u_3(x) & u_1(x) - \mathrm{i} u_2(x) \\ u_1(x) + \mathrm{i} u_2(x) & -u_3(x) \end{array} \right ) \in \mathsf{Gr}_1(\mathbb{C}^2) \end{align*} $$

where $\mathbf {u} = (u_1,u_2,u_3) : \mathbb {R} \to \mathbb {S}^2$ .

Parametrization by stereographic projection

Let $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ be a map and, as usual, we set $\mathbf {U} = \mathbf {u} \cdot \boldsymbol {\sigma }$ . For the rest of this subsection, we will consider the case $\mathcal {V}=\mathbb {C}^2$ , that is, we consider the Toeplitz operator

$$ \begin{align*}T_{\mathbf{U}} : L^2_+(\mathbb{R}; \mathbb{C}^2) \to L^2_+(\mathbb{R}; \mathbb{C}^2) \end{align*} $$

acting on $\mathbb {C}^2$ -valued functions in the Hardy space $L^2_+$ . Likewise, the operators $H_{\mathbf {U}}$ and $K_{\mathbf {U}} = H_{\mathbf {U}}^* H_{\mathbf {U}}$ act on $L^2_+(\mathbb {R}; \mathbb {C}^2)$ throughout the following. By using the (inverse) stereographic projection

$$ \begin{align*}\hat{\mathbb{C}} = \mathbb{C} \cup \{ \infty \} \to \mathbb{S}^2, \quad z \mapsto \left ( \frac{2 \, \mathrm{Re} \, z}{z \overline{z} + 1}, \frac{2 \, \mathrm{Im} \, z}{z \overline{z} + 1}, \frac{z \overline{z}-1}{z \overline{z} + 1} \right ) \, , \end{align*} $$

we obtain the following explicit description in the case of rational maps from $\mathbb {R}$ to $\mathbb {S}^2$ .

Theorem 8.1. Let $\mathbf {u} =(u_1, u_2, u_3) : \mathbb {R} \to \mathbb {S}^2$ be a rational map. Given an integer $N \geq 1$ , the following statements are equivalent.

  1. (i) $\dim \mathfrak {H}_1 = \mathrm {rank}(K_{\mathbf {U}}) = N$ .

  2. (ii) The least common denominator of $u_1, u_2, u_3$ has degree $2N$ .

  3. (iii) There exists a rational function $R \in \mathbb {C}(X)$ of the form

    $$ \begin{align*}R(x) = \frac{P(x)}{Q(x)} \, , \end{align*} $$
    where $P \in \mathbb{C}[X]$ is a polynomial of degree N and $Q \in \mathbb{C}[X]$ is a nonzero polynomial of degree at most $N-1$ such that P and Q have no common factors, and such that, up to a rotation on the sphere $\mathbb{S}^2$ , we have
    $$ \begin{align*}u_1(x) + \mathrm{i} u_2(x) = \frac{2 R(x)}{R(x) \overline{R}(x)+1} \, , \quad u_3(x) = \frac{R(x) \overline{R}(x) -1}{R(x) \overline{R}(x) + 1} \,. \end{align*} $$

Remarks. 1) We use $\mathbb{C}(X)$ to denote the field of rational functions in one variable with coefficients in $\mathbb{C}$ . Likewise, we use $\mathbb{C}[X]$ to denote the ring of complex polynomials over $\mathbb{C}$ . The variable X either represents an element $x \in \mathbb{R}$ or $z \in \mathbb{C}$ .

2) For a polynomial $T \in \mathbb {C}[X]$ with $T(x) = \sum _{j=0}^N t_j x^j$ , we denote its complex conjugate by $\overline {T}(x) = \sum _{j=0}^N \overline {t}_j x^j$ obtained by complex conjugation of its coefficients. Likewise, for a rational function $R = P/Q \in \mathbb {C}(X)$ , we denote its complex conjugate by $\overline {R}= \overline {P}/\overline {Q}$ .

Proof. The proof of Theorem 8.1 is given in Appendix B.
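
As a simple illustration of Theorem 8.1, the choice $R(x) = x$ (that is, $P(x) = x$ and $Q(x) \equiv 1$) corresponds, via the formulas in (iii), to the rational map

$$ \begin{align*}\mathbf{u}(x) = \left( \frac{2x}{x^2+1}, \, 0, \, \frac{x^2-1}{x^2+1} \right) \, , \end{align*} $$

whose components have the least common denominator $x^2+1$ of degree 2, in accordance with $\mathrm{rank}(K_{\mathbf{U}}) = N = 1$.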

In view of Theorem 8.1 we introduce, for an integer $N \geq 1$ , the following subsets of rational functions

$$ \begin{align*}\mathcal{R}_N := \left \{ \frac{P(x)}{Q(x)} \in \mathbb{C}(X) \mid \deg P = N, \deg Q \leq N-1, Q \not \equiv 0, \mathrm{gcd}(P, Q) = 1 \right \} \,. \end{align*} $$

For $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ , we can henceforth assume that $\mathbf {u}(\infty ) = \mathbf {e}_3$ by rotational symmetry on $\mathbb {S}^2$ . By Theorem 8.1, we have the canonical equivalence of sets

$$ \begin{align*}\mathcal{K}_N := \{ \mathbf{u} \in \mathcal{R}at(\mathbb{R}; \mathbb{S}^2) \mid \mathbf{u}(\infty) = \mathbf{e}_3, \; \mathrm{rank}(K_{\mathbf{U}}) = N \} \cong \mathcal{R}_N \end{align*} $$

by means of the (inverse) stereographic projection in Theorem 8.1 (iii) above.

Next, we analyze the topological properties of $\mathcal {R}_N$ more closely. For $P/Q \in \mathcal {R}_N$ , we can assume without loss of generality that P is a monic polynomial, that is, we denote

$$ \begin{align*}P(x) = x^N + p_1 x^{N-1} + \ldots + p_N \quad \text{with } p_k \in \mathbb{C} \text{ for } k=1, \ldots, N \,. \end{align*} $$

The polynomials $Q \in \mathbb {C}[X]$ will be written as

$$ \begin{align*}Q(x) = q_1 x^{N-1} + \ldots + q_N \quad \text{with } q_k \in \mathbb{C} \text{ for } k=1, \ldots, N \, , \end{align*} $$

where $(q_1, \ldots, q_N) \neq (0,\ldots, 0)$ . Evidently, we can identify the pair of polynomials $(P, Q) \in \mathbb{C}[X] \times \mathbb{C}[X]$ above uniquely with elements of $\mathbb{C}^N \times (\mathbb{C}^N \setminus \{ 0 \})$ . In particular, the set $\mathcal{R}_N$ can be naturally regarded as a subset in $\mathbb{C}^{2N}$ . We have the following result.

Lemma 8.1. The set $\mathcal {R}_N \subset \mathbb {C}(X)$ can be canonically identified with a nonempty, open and connected subset $\mathcal {A}_N$ in $\mathbb {C}^{2N}$ .

Proof. We divide the proof into the following steps.

Step 1. Elements $P/Q \in \mathcal {R}_N$ can be canonically identified with pairs $(P,Q) \in \mathbb {C}^{2N}$ of the form

$$ \begin{align*}P = (p_1, \ldots, p_N) \in \mathbb{C}^N, \quad Q = (q_1, \ldots, q_N) \in \mathbb{C}^N \setminus \{ 0 \} \, , \end{align*} $$

such that P and Q have no common factor as polynomials. Let $\mathcal {A}_N \subset \mathbb {C}^{2N}$ denote the set of such pairs $(P,Q)$ . By the fundamental theorem of algebra, we can write $P(x) = \prod _{j=1}^N (x-\xi _j)$ where $\xi _1, \ldots , \xi _N \in \mathbb {C}$ denote the roots of P counted with their multiplicity. In order to take into account possible permutations of the roots, we introduce the quotient space

$$ \begin{align*}\mathbb{C}^N_{\mathrm{sym}} = \mathbb{C}^N / \sim\end{align*} $$

with the equivalence relation $(\xi _1, \ldots , \xi _N) \sim (\xi _{\sigma (1)}, \ldots , \xi _{\sigma (N)})$ for all permutations $\sigma \in S_N$ . We use $[\xi _1, \ldots , \xi _N]$ to denote elements in $\mathbb {C}^N_{\mathrm {sym}}$ . It is a classical fact that the map which assigns to any polynomial P of degree N its roots modulo permutations,

$$ \begin{align*}\tau : \mathbb{C}^N \to \mathbb{C}^N_{\mathrm{sym}}, \quad P \mapsto [\xi_1, \ldots, \xi_N], \end{align*} $$

is continuous. Let us define the map

$$ \begin{align*}F : \mathbb{C}^N \times (\mathbb{C}^{N} \setminus \{ 0 \} ) \to \mathbb{C}, \quad (P,Q) \mapsto \prod_{j=1}^N Q(\xi_j(P)), \end{align*} $$

where $[\xi _1(P), \ldots , \xi _N(P)] \in \mathbb {C}^N_{\mathrm {sym}}$ denote the roots (modulo permutations) of the polynomial P. Clearly, we have

$$ \begin{align*}F(P,Q) \neq 0 \quad \Leftrightarrow \quad P \text{ and } Q \text{ have no common factor}. \end{align*} $$

By continuity of the map F, we deduce that the set

$$ \begin{align*}\mathcal{A}_N = \{ (P,Q) \in \mathbb{C}^N \times \mathbb{C}^N \setminus \{ 0 \} : F(P,Q) \neq 0 \} \end{align*} $$

is an open subset in $\mathbb {C}^{2N}$ . Moreover, it is evident that $\mathcal {A}_N$ is nonempty.
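
We remark in passing that, since P is monic, the function F is (up to the usual normalization conventions) nothing but the resultant of the pair $(P,Q)$,

$$ \begin{align*}F(P,Q) = \prod_{j=1}^N Q(\xi_j(P)) = \mathrm{Res}(P,Q) \, , \end{align*} $$

which is a polynomial expression in the coefficients $(p_1, \ldots, p_N, q_1, \ldots, q_N)$; this gives another way to see that F is continuous and that the sets $V_P$ appearing in Step 2 below are algebraic.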

Step 2. Next, we prove that $\mathcal {A}_N \subset \mathbb {C}^{2N}$ is connected. Since $\mathcal {A}_N$ is open, this is equivalent to being pathwise connected. For $(P,Q) \in \mathcal {A}_N$ , we define the set

$$ \begin{align*}V_P = \{ Q \in \mathbb{C}^N : F(P,Q)=0 \} = \{ Q \in \mathbb{C}^N \mid \prod_{j=1}^N Q(\xi_j(P))=0 \} \,. \end{align*} $$

As a zero set of a nontrivial polynomial in $Q=(q_1, \ldots, q_N) \in \mathbb{C}^N$ , we see that $V_P$ is an algebraic set in $\mathbb{C}^N$ with $0 \in V_P$ . Regarding its complement, we claim that

(8.1) $$ \begin{align} \mathbb{C}^N \setminus V_P \text{ is connected} \,. \end{align} $$

Since $\mathbb {C}^N \setminus V_P$ is open, this claim is equivalent to pathwise connectedness of this set. Let $Q, \tilde {Q} \in \mathbb {C}^N \setminus V_P$ with $Q \neq \tilde {Q}$ be given and consider the set

$$ \begin{align*}L = \{ Q + \zeta (\tilde{Q}-Q) \mid \zeta \in \mathbb{C} \} \, , \end{align*} $$

which corresponds to the complex line in $\mathbb{C}^N$ that connects Q and $\tilde{Q}$ . Since we have $L \not \subseteq V_P$ and $V_P$ is the zero set of a polynomial in $Q \in \mathbb{C}^N$ , there are only finitely many points of intersection of L with $V_P$ , that is,

$$ \begin{align*}L \cap V_P = \{ z_1, \ldots, z_K \} \end{align*} $$

for some $z_1, \ldots , z_K \in \mathbb {C}^N$ . However, the set $L \setminus \{ z_1, \ldots , z_K \} \simeq \mathbb {R}^2 \setminus \{ p_1, \ldots , p_K \}$ with finitely many points $p_1, \ldots , p_K \in \mathbb {R}^2$ is pathwise connected. Thus there exists a continuous map $\gamma : [0,1] \to \mathbb {C}^N \setminus V_P$ with $\gamma (0)= Q$ and $\gamma (1) = \tilde {Q}$ . This proves (8.1).

Next, we suppose $(P, Q) \in \mathcal {A}_N$ and $(\tilde {P}, \tilde {Q}) \in \mathcal {A}_N$ are given. We prove that $(P,Q)$ and $(\tilde {P}, \tilde {Q})$ can be connected by a continuous path in $\mathcal {A}_N$ as follows. We consider the sets

$$ \begin{align*}W = \{ P \} \times (\mathbb{C}^N \setminus V_P) \quad \text{and} \quad \tilde{W} = \{\tilde{P} \} \times (\mathbb{C}^N \setminus V_{\tilde{P}}) \,. \end{align*} $$

Evidently, we have that $(P,Q) \in W$ and $(\tilde{P}, \tilde{Q}) \in \tilde{W}$ . Let $Q_*=(0, \ldots, 0, 1) \in \mathbb{C}^N$ corresponding to the constant polynomial $Q_*(x) \equiv 1$ . By (8.1) and the evident fact that $Q_* \in (\mathbb{C}^N \setminus V_P) \cap (\mathbb{C}^N \setminus V_{\tilde{P}})$ , we can find two continuous paths in $\mathcal{A}_N$ that connect $(P,Q)$ with $(P, Q_*)$ and $(\tilde{P}, \tilde{Q})$ with $(\tilde{P}, Q_*)$ , respectively. Furthermore, we easily construct a continuous path in $\mathcal{A}_N$ which connects $(P,Q_*)$ and $(\tilde{P},Q_*)$ .

This completes the proof of Lemma 8.1.

With the results derived above, we are now ready to give the proofs of Theorems 1.3 and 1.4 for (HWM) with target $\mathbb {S}^2$ .

Proof of Theorem 1.3 (soliton resolution for target $\mathbb {S}^2$ )

Suppose $\mathbf{u}_0 \in \mathcal{R}at(\mathbb{R};\mathbb{S}^2)$ satisfies the assumptions of Theorem 1.3 and let $\mathbf{U}_0 = \mathbf{u}_0 \cdot \boldsymbol{\sigma} \in \mathcal{R}at(\mathbb{R}; \mathsf{Gr}_1(\mathbb{C}^2))$ be the corresponding initial datum for (HWM $_d$ ) with $d=2$ .

By applying Theorem 1.7 and using the identification $\mathsf {Gr}_1(\mathbb {C}^2) \cong \mathbb {S}^2$ via the use of the Pauli matrices $\boldsymbol {\sigma }=(\sigma _1, \sigma _2, \sigma _3)$ , we obtain that

$$ \begin{align*}\lim_{t \to \pm \infty} \| \mathbf{u}(t) - \mathbf{u}^{\pm}(t) \|_{\dot{H}^s} = 0 \quad \text{for any } s> 0 \, , \end{align*} $$

with

$$ \begin{align*}\mathbf{u}^{\pm}(t,x) = \sum_{j=1}^N \mathbf{q}_{v_j}(x-v_jt) - (N-1) \mathbf{u}_\infty \,. \end{align*} $$

Here each $\mathbf{q}_{v_j} \in \mathcal{R}at(\mathbb{R}; \mathbb{S}^2)$ is a profile of a ground state traveling solitary wave for (HWM) with velocity $v_j$ and it is obtained from the matrix-valued profiles constructed in the proof of Theorem 1.7 by

$$ \begin{align*}\mathbf{q}_{v_j}(x) = \frac{1}{2} \mathrm{Tr}\big( \mathbf{Q}_{v_j}(x) \, \boldsymbol{\sigma} \big) \quad \text{with} \quad \mathbf{Q}_{v_j}(x) = \mathbf{U}_\infty + \frac{A_j}{x-y_j+ \mathrm{i} \delta_j} + \frac{A_j^*}{x-y_j - \mathrm{i} \delta_j} \, , \end{align*} $$

where $\mathbf{U}_\infty = \mathbf{u}_\infty \cdot \boldsymbol{\sigma}$ .

The proof of Theorem 1.3 is now complete.

Proof of Theorem 1.4 (density of rational data with simple discrete spectrum)

Let $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ be given and set $\mathbf {U} = \mathbf {u} \cdot \boldsymbol {\sigma } \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_1(\mathbb {C}^2))$ as usual. We recall that the discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ of the Toeplitz operator $T_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathbb {C}^2) \to L^2_+(\mathbb {R}; \mathbb {C}^2)$ is found to be

$$ \begin{align*}\sigma_{\mathrm{d}}(T_{\mathbf{U}}) = \sigma(T_{\mathbf{U}} |_{\mathfrak{H}_1}) \end{align*} $$

with the finite-dimensional subspace $\mathfrak {H}_1 = \mathrm {ran}(K_{\mathbf {U}})=\mathrm {ran}(\mathrm {Id}-T_{\mathbf {U}}^2)$ . We are interested in the case when $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ is simple and therefore we define the set

$$ \begin{align*}\mathcal{R}at_{\mathrm{s}}(\mathbb{R}; \mathbb{S}^2) := \{ \mathbf{u} \in \mathcal{R}at(\mathbb{R}; \mathbb{S}^2) \mid \sigma_{\mathrm{d}}(T_{\mathbf{U}}) \text{ is simple} \} \,. \end{align*} $$

We have the following result (stated as Theorem 1.4 in the introduction).

Theorem 8.2. The subset $\mathcal {R}at_{\mathrm {s}}(\mathbb {R}; \mathbb {S}^2)$ is dense in $\dot {H}^{\frac 1 2}(\mathbb {R}, \mathbb {S}^2)$ .

Proof. We divide the proof into the following steps.

Step 1. For a given integer $N \geq 1$ , we define the set

$$ \begin{align*}\mathcal{K}_N := \left \{ K_{\mathbf{U}} : \mathrm{rank}(K_{\mathbf{U}}) = N \text{ with } \mathbf{u} \in \mathcal{R}at(\mathbb{R}; \mathbb{S}^2) \text{ and } \mathbf{u}(\pm \infty) = \mathbf{e}_3 \right \} \,. \end{align*} $$

From Theorem 8.1 part (iii), we recall that $\mathcal{K}_N$ is canonically identified with the set of rational functions $\mathcal{R}_N \subset \mathbb{C}(X)$ via the (inverse) stereographic projection. By Lemma 8.1, we can canonically identify $\mathcal{R}_N$ with a nonempty, open and connected subset $\mathcal{A}_N \subset \mathbb{C}^{2N}$ . Let us write $R=P/Q \equiv (P,Q) \in \mathcal{A}_N$ in what follows.

Next, we define the map $\mathsf {u} : \mathcal {A}_N \to L^\infty (\mathbb {R}; \mathbb {R}^3)$ with

$$ \begin{align*} (\mathsf{u}(P,Q))(x) & :=\left ( \frac{ 2 \mathrm{Re} (P(x) \overline{Q}(x))}{P(x) \overline{P}(x) + Q(x) \overline{Q}(x)}, \frac{2 \mathrm{Im}( P(x) \overline{Q}(x))}{ \overline{P}(x) P(x) + \overline{Q}(x) Q(x)}, \right. \\ & \qquad \left. \frac{P(x) \overline{P}(x) - Q(x) \overline{Q}(x)}{P(x) \overline{P}(x) + Q(x) \overline{Q}(x)} \right ) \quad \text{with } x \in \mathbb{R} \,. \end{align*} $$
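
For completeness, we record the elementary identity showing that these expressions indeed define a map into the unit sphere $\mathbb{S}^2$:

$$ \begin{align*}\big( 2 \, \mathrm{Re}(P \overline{Q}) \big)^2 + \big( 2 \, \mathrm{Im}(P \overline{Q}) \big)^2 + \big( P \overline{P} - Q \overline{Q} \big)^2 = 4 |P|^2 |Q|^2 + \big( |P|^2 - |Q|^2 \big)^2 = \big( |P|^2 + |Q|^2 \big)^2 \quad \text{on } \mathbb{R} \,. \end{align*} $$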

Note that, for any $(P,Q) \in \mathcal {A}_N$ , the map $x \mapsto (\mathsf {u}(P,Q))(x)$ belongs to $\mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ and it evidently satisfies $(\mathsf {u}(P,Q))(\pm \infty ) = \mathbf {e}_3$ . Correspondingly, we obtain a map $\mathsf {U} : \mathcal {A}_N \to L^\infty (\mathbb {R}; M_2(\mathbb {C}))$ by setting

(8.2) $$ \begin{align} \mathsf{U}(P,Q) := \mathsf{u}(P,Q) \cdot \boldsymbol{\sigma} \,. \end{align} $$

By Theorem 8.1, the map

(8.3) $$ \begin{align} \mathsf{K} : \mathcal{A}_N \to \mathcal{B}(L^2_+(\mathbb{R}; \mathbb{C}^2)), \quad (P,Q) \mapsto \mathsf{K}(P,Q) := H_{\mathsf{U}(P,Q)}^* H_{\mathsf{U}(P,Q)} \end{align} $$

is injective and its image satisfies $\mathsf{K}(\mathcal{A}_N) = \mathcal{K}_N$ .

Step 2. We claim that

$$ \begin{align*}\mathsf{K} : \mathcal{A}_N \to \mathcal{B}(L^2_+(\mathbb{R}; \mathbb{C}^2)) \text{ is real analytic} \end{align*} $$

with the usual identification that $\mathcal {A}_N \subset \mathbb {C}^{2N} \cong \mathbb {R}^{4N}$ . Indeed, since the expressions in (8.2) and (8.3) are linear and quadratic, respectively, this amounts to showing that

$$ \begin{align*}\mathsf{u} : \mathcal{A}_N \to L^\infty(\mathbb{R}; \mathbb{R}^3) \text{ is a real analytic map} \,. \end{align*} $$

Indeed, let $(P,Q) \in \mathcal{A}_N$ be given. We show that $\mathsf{u}$ is real analytic in an open neighborhood around $(P,Q)$ by showing that it is the restriction of a complex analytic mapping. For $\varepsilon> 0$ , we consider the open set

$$ \begin{align*}\Omega_{\varepsilon} := \{ (P_1, P_2, Q_1, Q_2) \in \mathbb{C}^{4N} \mid |(P_1,P_2, Q_1, Q_2) - (P,\overline{P}, Q, \overline{Q})| < \varepsilon \} \ \end{align*} $$

and the map $\tilde{\mathsf{u}} : \Omega_\varepsilon \to L^\infty(\mathbb{R}; \mathbb{C}^3)$ defined as

$$ \begin{align*} \tilde{\mathsf{u}}(P_1, P_2, Q_1, Q_2)(x) & := \left ( \frac{ P_1(x) Q_2(x) + P_2(x) Q_1(x)}{P_1(x) P_2(x) + Q_1(x) Q_2(x)}, \frac{1}{\mathrm{i}} \frac{P_1(x) Q_2(x)-P_2(x) Q_1(x)}{ P_1(x) P_2(x) + Q_1(x) Q_2(x)}, \right. \\ & \qquad \left. \frac{P_1(x) P_2(x) - Q_1(x) Q_2(x)}{P_1(x) P_2(x) + Q_1(x) Q_2(x)} \right ) \quad \text{with } x \in \mathbb{R} \,. \end{align*} $$

Note that if $\varepsilon> 0$ is sufficiently small, the denominator $P_1(x) P_2(x) + Q_1(x) Q_2(x) \neq 0$ for all $x \in \mathbb{R}$ and all $(P_1,P_2,Q_1, Q_2) \in \Omega_\varepsilon$ , and hence the map $\tilde{\mathsf{u}} : \Omega_\varepsilon \to L^\infty(\mathbb{R}; \mathbb{C}^3)$ is well-defined. Clearly, the map $\tilde{\mathsf{u}} : \Omega_\varepsilon \to L^\infty(\mathbb{R}; \mathbb{C}^3)$ is $C^1$ and satisfies the Cauchy–Riemann equations and hence it is complex analytic. In view of the fact that

$$ \begin{align*}\mathsf{u}(\eta,\zeta) = \tilde{\mathsf{u}}(\eta,\overline{\eta}, \zeta, \overline{\zeta}) \quad \text{for } (\eta, \overline{\eta}, \zeta, \overline{\zeta}) \in \Omega_\varepsilon \, , \end{align*} $$

we conclude that $\mathsf {u} : \mathcal {A}_N \to L^\infty (\mathbb {R}; \mathbb {R}^3)$ is real analytic.

Step 3. Since the image $\mathsf {K}(\mathcal {A}_N) = \mathcal {K}_N$ is contained in the subset $\mathcal {F}_N$ of bounded operators in $\mathcal {B}(L^2_+(\mathbb {R}; \mathbb {C}^2))$ with finite rank $N$, we see that the maps

$$ \begin{align*}\mathcal{A}_N \to \mathbb{R}, \quad (P,Q) \mapsto \mathrm{Tr}(\mathsf{K}(P,Q)^m) \end{align*} $$

are well-defined for any integer $m \geq 1$. In fact, these maps are real analytic, being compositions of real analytic maps.

Let $p_{\mathsf {K}(P,Q)}(\lambda )$ denote the characteristic polynomial of the endomorphism $\mathsf {K}(P,Q) : \mathfrak {H}_1 \to \mathfrak {H}_1$ on the N-dimensional subspace $\mathfrak {H}_1 = \mathrm {ran} (\mathsf {K}(P,Q))$ . Applying the Plemelj–Smithies formula (see, e.g., [Reference Gohberg, Goldberg and Krupnik19]) in the theory of Fredholm determinants, we obtain that

$$ \begin{align*}p_{\mathsf{K}(P,Q)}(\lambda) = \det(\lambda \mathrm{Id} - \mathsf{K}(P,Q)) = \sum_{k=0}^N (-1)^k C_k(\mathsf{K}(P,Q)) \lambda^{N-k} \, , \end{align*} $$

with the coefficients

$$ \begin{align*}C_k(\mathsf{A}) = \frac{1}{k!} \det \left ( \begin{array}{lllll} \mathrm{Tr} (\mathsf{A}) & k-1 & 0 & \cdots & 0 \\ \mathrm{Tr} (\mathsf{A}^2) & \mathrm{Tr} (\mathsf{A} ) & k-2 & \ddots & \vdots \\ \vdots & \vdots & \ddots & \ddots & 0 \\ \mathrm{Tr} ( \mathsf{A}^{k-1}) & \mathrm{Tr} ( \mathsf{A}^{k-2}) & \cdots & \mathrm{Tr}(\mathsf{A}) & 1 \\ \mathrm{Tr}(\mathsf{A}^k) & \mathrm{Tr}(\mathsf{A}^{k-1}) & \cdots & \mathrm{Tr}(\mathsf{A}^2) & \mathrm{Tr}(\mathsf{A}) \end{array} \right ) \, , \end{align*} $$

where $k=0, \ldots , N$ . This shows that the coefficients of $p_{\mathsf {K}(P,Q)}(\lambda )$ are real analytic functions of $(P,Q) \in \mathcal {A}_N$ . As a consequence, the discriminant function

$$ \begin{align*}\mathfrak{d} : \mathcal{A}_N \to \mathbb{R}, \quad (P,Q) \mapsto \mathfrak{d}(P,Q) := \mathrm{disc}(p_{\mathsf{K}(P,Q)}) \end{align*} $$

is also a real analytic function on the open and connected set $\mathcal {A}_N \subset \mathbb {C}^{2N} \cong \mathbb {R}^{4N}$. Moreover, we have $\mathfrak {d}(P,Q) \neq 0$ if and only if $\mathsf {K}(P,Q) : \mathfrak {H}_1 \to \mathfrak {H}_1$ has simple eigenvalues, which, by the identity in Lemma 4.1, is equivalent to the spectrum of $T_{\mathsf {U}(P,Q)}^2=\mathrm {Id} - \mathsf {K}(P,Q)$ on $\mathfrak {H}_1$ being simple. Thus we find

$$ \begin{align*}\mathfrak{d}(P,Q) \neq 0 \text{ if and only if the discrete spectrum } \sigma_{\mathrm{d}}(T_{\mathsf{U}(P,Q)}^2) \text{ is simple} \,. \end{align*} $$
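As a purely illustrative aside (not used in the argument), the Plemelj–Smithies formula for the coefficients $C_k$ above can be sanity-checked numerically, here on a randomly generated symmetric sample matrix in place of $\mathsf{K}(P,Q)$.

```python
# Illustrative check of the Plemelj-Smithies formula: the coefficients C_k
# built from the traces Tr(A^m) reproduce the characteristic polynomial
# det(lambda*Id - A) of a randomly generated sample symmetric matrix A.
import numpy as np
from math import factorial

rng = np.random.default_rng(0)
N = 5
A = rng.standard_normal((N, N))
A = A + A.T

traces = [np.trace(np.linalg.matrix_power(A, m)) for m in range(1, N + 1)]

def C(k):
    if k == 0:
        return 1.0
    M = np.zeros((k, k))
    for i in range(k):
        if i + 1 < k:
            M[i, i + 1] = k - 1 - i            # superdiagonal: k-1, k-2, ..., 1
        for j in range(i + 1):
            M[i, j] = traces[i - j]            # Tr(A^{i-j+1})
    return np.linalg.det(M) / factorial(k)

coeffs = [(-1) ** k * C(k) for k in range(N + 1)]
print(np.allclose(coeffs, np.poly(A)))         # True
```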

Defining the set

$$ \begin{align*}\widetilde{\mathcal{A}}_N := \{ (P,Q) \in \mathcal{A}_N \mid \mathfrak{d}(P,Q) \neq 0 \} \, , \end{align*} $$

we conclude from the real analyticity of the function $\mathfrak {d}$ on the connected set $\mathcal {A}_N$ that either

$$ \begin{align*}\widetilde{\mathcal{A}}_N \text{ is a dense and open subset in } \mathcal{A}_N \ , \end{align*} $$

or $\widetilde {\mathcal {A}}_N = \emptyset $ holds, in which case we must have $\mathfrak {d} \equiv 0$ on $\mathcal {A}_N$. However, by the explicit construction in Lemma C.2 below, we have $\mathfrak {d} \not \equiv 0$ on $\mathcal {A}_N$. Hence we have shown that

$$ \begin{align*}\sigma_{\mathrm{d}}(T_{\mathsf{U}(P,Q)}^2) \text{ is simple for all } (P,Q) \in \widetilde{\mathcal{A}}_N \end{align*} $$

with some dense and open subset $\widetilde {\mathcal {A}}_N \subset \mathcal {A}_N$ . Note that, by self-adjointness of $T_{\mathbf {U}}$ , the simplicity of $\sigma _{\mathrm {d}}(T_{\mathbf {U}}^2)$ implies that $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ is simple as well. Hence we deduce that

$$ \begin{align*}\sigma_{\mathrm{d}}(T_{\mathsf{U}(P,Q)}) \text{ is simple for all } (P,Q) \in \widetilde{\mathcal{A}}_N \,. \end{align*} $$

Step 4. We are now ready to finish the proof of Theorem 8.2. Let $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ be given. Note that $\lim _{x \to \pm \infty }\mathbf {u}(x) = \mathbf {p}$ for some unit vector $\mathbf {p} \in \mathbb {S}^2$ . By rotational symmetry, we can henceforth assume that

$$ \begin{align*}\mathbf{p} = \mathbf{e}_3 \,. \end{align*} $$

Let $N = \mathrm {Rank}(K_{\mathbf {U}})$ and $\mathfrak {H}_1 = \mathrm {ran}(K_{\mathbf {U}})$ . If $N=0$ (which corresponds to the constant map $\mathbf {u} \equiv \mathbf {e}_3$ ) then $\dim \mathfrak {H}_1 = 0$ and thus $\sigma _{\mathrm {d}}(T_{\mathbf {U}}) = \emptyset $ which is trivially simple. Also if $N=1$ , we have $\dim \mathfrak {H}_1 = 1$ and thus $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ is evidently simple.

Henceforth we assume that $N \geq 2$ holds. Note that there is a (unique) point $(P,Q) \in \mathcal {A}_N$ such that

$$ \begin{align*}\mathbf{U} = \mathsf{U}(P,Q) \quad \text{and} \quad K_{\mathbf{U}} = \mathsf{K}(P,Q) \in \mathcal{K}_N \,. \end{align*} $$

By the density of $\widetilde {\mathcal {A}}_N$ in $\mathcal {A}_N$, we can find a sequence $(P_k, Q_k) \in \widetilde {\mathcal {A}}_N$ such that $(P_k, Q_k) \to (P,Q)$ in $\mathbb {C}^{2N}$. Letting $\mathbf {U}_k = \mathsf {U}(P_k,Q_k)$, we conclude that

$$ \begin{align*}\sigma_{\mathrm{d}}(T_{\mathbf{U}_k}) \text{ is simple for all } k \in \mathbb{N} \,. \end{align*} $$

Moreover, from $(P_k,Q_k) \to (P,Q)$ in $\mathbb {C}^{2N}$ it is easy to see that $\| \mathbf {U}_k - \mathbf {U} \|_{\dot {H}^{\frac 1 2}} \to 0$ as $k \to \infty $. Equivalently, in terms of the rational functions $\mathbf {u}_k =( u_{k,1}, u_{k,2}, u_{k,3}) \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ with

$$ \begin{align*}u_{k,j} = \frac{1}{2} \mathrm{Tr}_{\mathbb{C}^2} (\mathbf{U}_k \sigma_j) \quad \text{for } j=1,2,3 \text{ and } k \in \mathbb{N}, \end{align*} $$

we deduce that $\| \mathbf {u}_k - \mathbf {u} \|_{\dot {H}^{\frac 1 2}} \to 0$ as $k \to \infty $ . This proves the density of $\mathcal {R}at_{\mathrm {s}}(\mathbb {R}; \mathbb {S}^2) \subset \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ as stated above. The proof of Theorem 8.2 is now complete.

A. Density of rational maps

Let $d \geq 2$ and $0 \leq k \leq d$ be given integers. Recall that

$$ \begin{align*}\mathcal{R}at(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) = \left \{ \mathbf{U} : \mathbb{R} \to \mathsf{Gr}_k(\mathbb{C}^d) \mid \mathbf{U}(x) \text{ is rational in } x\in \mathbb{R} \right \} \end{align*} $$

denotes the set of rational maps from $\mathbb {R}$ into the complex Grassmannian $\mathsf {Gr}_k(\mathbb {C}^d)$, which we identify with the set of matrices

$$ \begin{align*}\mathsf{Gr}_k(\mathbb{C}^d) \cong \left \{ \mathbf{U} \in \mathbb{C}^{d \times d} \mid \mathbf{U}^* = \mathbf{U}, \ \mathbf{U}^2 = \mathbf{1}_d, \ \mathrm{Tr}(\mathbf{U}) = 2k-d \right \} \,. \end{align*} $$

Furthermore, we recall the space

$$ \begin{align*}\dot{H}^{\frac 1 2}(\mathbb{R}; \mathsf{Gr}_k(\mathbb{C}^d)) = \left \{ \mathbf{U} \in \dot{H}^{\frac 1 2}(\mathbb{R}; \mathbb{C}^{d \times d}) \mid \mathbf{U}(x) \in \mathsf{Gr}_k(\mathbb{C}^d) \text{ for a. e. } x \in \mathbb{R} \right \} \, , \end{align*} $$

equipped with the Gagliardo semi-norm $\| \cdot \|_{\dot {H}^{\frac 1 2}}$ given through

$$ \begin{align*}\| \mathbf{U} \|_{\dot{H}^{\frac 1 2}}^2 = \| |D|^{\frac 1 2} \mathbf{U} \|_{L^2}^2 = \frac{1}{2 \pi} \int_{\mathbb{R}} \int_{\mathbb{R}} \frac{|\mathbf{U}(x)- \mathbf{U}(y)|_F^2}{|x-y|^2} \, dx \, dy \, , \end{align*} $$

where $|A|_F = (\mathrm {Tr}(A^* A))^{1/2}$ denotes the Frobenius norm of a matrix $A \in \mathbb {C}^{d \times d}$ .

Theorem A.1. $\mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ is dense in $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ . That is, for every $\mathbf {U} \in \dot {H}^{\frac 1 2}(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ , there exists a sequence $\mathbf {U}_n \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ such that $\| \mathbf {U}_n - \mathbf {U} \|_{\dot {H}^{\frac 1 2}} \to 0$ as $n \to \infty $ .

Before we give the proof of Theorem A.1 below, we obtain the following fact.

Corollary A.1. $\mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ is dense in $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathbb {S}^2)$ .

Proof. By Theorem A.1, the set $\mathcal {R}at(\mathbb {R}; \mathsf {Gr}_1(\mathbb {C}^2))$ is dense in $\dot {H}^{\frac 1 2}(\mathbb {R}; \mathsf {Gr}_1(\mathbb {C}^2))$ . Recall that, thanks to the linear relation $\mathbf {U} = \mathbf {u} \cdot \boldsymbol {\sigma }$ with $\boldsymbol {\sigma }=(\sigma _1, \sigma _2, \sigma _3)$ denoting the standard Pauli matrices, we can easily check the equivalence of norms $\| \mathbf {U} \|_{\dot {H}^{\frac 1 2}} \sim \| \mathbf {u} \|_{\dot {H}^{\frac 1 2}}$ and we thus conclude.

Next, we turn to the proof of Theorem A.1. Here it is convenient to first prove the corresponding result in the periodic setting as follows. Let $\mathbb {T} = \mathbb {R}/2 \pi \mathbb {Z}$ denote the one-dimensional torus. Correspondingly, we define the space

$$ \begin{align*}H^{\frac 1 2}(\mathbb{T}; \mathsf{Gr}_k(\mathbb{C}^d)) := \{ \mathbf{U} \in H^{\frac 1 2}(\mathbb{T}; \mathbb{C}^{d \times d}) \mid \mathbf{U}(t) \in \mathsf{Gr}_k(\mathbb{C}^d) \text{ for a. e. } t \in \mathbb{T} \} \, , \end{align*} $$

endowed with the $H^{\frac 1 2}$ -norm for maps from $\mathbb {T}$ into $\mathbb {C}^{d \times d}$ . Likewise, we also define

$$ \begin{align*}\mathcal{R}at(\mathbb{T}; \mathsf{Gr}_k(\mathbb{C}^d)) := \{ \mathbf{U} : \mathbb{T} \to \mathsf{Gr}_k(\mathbb{C}^d) \mid \mathbf{U}(t) \text{ is rational in } z = \mathrm{e}^{\mathrm{i} t} \text{ with } t \in \mathbb{T} \} \,. \end{align*} $$

It is easy to see that $\mathcal {R}at(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d)) \subset H^{\frac 1 2}(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ holds. In fact, we will show the following result.

Theorem A.2. $\mathcal {R}at(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ is dense in $H^{\frac 1 2}(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ .

Proof of Theorem A.2.

First, we recall the following general result due to Brezis–Nirenberg [Reference Brezis and Nirenberg6] for Sobolev spaces of functions with values in smooth and closed (i.e., compact with no boundary) manifolds. Indeed, we have that $\mathsf {Gr}_k(\mathbb {C}^d)$ is a smooth and closed manifold of real dimension $2k(d-k)$ . Now from [Reference Brezis and Nirenberg6][Lemma A.12] we obtain the following result; see also [Reference Mazowiecka and Schikorra30][Section 2] for a recent and detailed discussion of density of smooth maps in Sobolev spaces in the setting of manifolds.

Lemma A.1. $C^\infty (\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ is dense in $H^{\frac 1 2}(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ .

To complete the proof of Theorem A.2, it remains to establish the following result.

Lemma A.2. For every $\mathbf {U} \in C^\infty (\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ , there exists a sequence

$$ \begin{align*}\mathbf{U}_N \in \mathcal{R}at(\mathbb{T}; \mathsf{Gr}_k(\mathbb{C}^d))\end{align*} $$

such that $\| \mathbf {U}_N - \mathbf {U} \|_{H^{\frac 1 2}} \to 0$ as $N \to \infty $ .

Remark. The proof below can actually be used to prove density with respect to the $\| \cdot \|_{H^s}$ -norm for all $s \geq 0$ .

Proof of Lemma A.2.

Let $\mathbf {U} \in C^\infty (\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ be given. We define the associated projection-valued map $\mathbf {P} \in C^\infty (\mathbb {T}; \mathbb {C}^{d \times d})$ by setting $\mathbf {P}(t) := \frac{1}{2} \left ( \mathbf {1}_d + \mathbf {U}(t) \right )$ for $t \in \mathbb {T}$. We have

$$ \begin{align*}\mathbf{P}(t) = \mathbf{P}(t)^* = \mathbf{P}(t)^2 \quad \text{and} \quad \mathrm{rank}(\mathbf{P}(t)) = k \quad \text{for all } t \in \mathbb{T}. \end{align*} $$

We claim that there exists a smooth map $\mathbf {G} \in C^\infty (\mathbb {T}; \mathbb {C}^{d \times k})$ such that

(A.1) $$ \begin{align} \mathbf{P}(t) \mathbf{G}(t) = \mathbf{G}(t) \quad \text{and} \quad \mathrm{rank}(\mathbf{G}(t)) = k \quad \text{for } t \in \mathbb{T} \,. \end{align} $$

To prove this claim, we use [Reference Sibuya33][Theorem 6], which yields the following result (up to trivial modifications of notation and after changing the period from $1$ to $2\pi $).

Proposition A.1. Let $\mathbf {A} \in C^\infty (\mathbb {R}; \mathbb {C}^{d \times d})$ with $\mathbf {A}(t+2\pi ) = \mathbf {A}(t)$ for all $t \in \mathbb {R}$ and assume that

$$ \begin{align*}\mathrm{rank}(\mathbf{A}(t)) = m \quad \text{for all } t \in \mathbb{R} \end{align*} $$

with some constant $m \leq d$ . Then there exists $\mathbf {B} \in C^\infty (\mathbb {R}; \mathbb {C}^{d \times (d-m)})$ such that

$$ \begin{align*}\mathbf{B}(t+2 \pi) = \mathbf{B}(t), \quad \mathbf{A}(t) \mathbf{B}(t) = 0, \quad \mathrm{rank}(\mathbf{B}(t))=d-m \quad \text{for } t \in \mathbb{R} \,. \end{align*} $$

By applying Proposition A.1 to $\mathbf {A}(t) := \mathbf {1}_d - \mathbf {P}(t)$, which has constant rank $m=d-k$, we complete the proof of claim (A.1) by setting $\mathbf {G}(t) := \mathbf {B}(t)$.

Let us now return to the proof of Lemma A.2. We claim that

(A.2) $$ \begin{align} \mathbf{P}(t) = \mathbf{G}(t) [\mathbf{G}(t)^* \mathbf{G}(t)]^{-1} \mathbf{G}(t)^* \quad \text{for } t \in \mathbb{T} \,. \end{align} $$

Note that, since $\mathrm {rank}(\mathbf {G}(t)) = k$ for $\mathbf {G}(t) \in \mathbb {C}^{d \times k}$ , we obtain that $\mathbf {G}(t)^* \mathbf {G}(t) \in \mathbb {C}^{k \times k}$ is invertible for any $t \in \mathbb {T}$ .

To show (A.2), let $\tilde {\mathbf {P}}(t)$ denote its right-hand side. Evidently, we have $\tilde {\mathbf {P}}(t)^* = \tilde {\mathbf {P}}(t)$ and $\tilde {\mathbf {P}}(t) = \tilde {\mathbf {P}}(t)^2$ .

Notice that $v \in \mathrm {ker}(\tilde {\mathbf {P}}(t))$ if and only if $(\mathbf {G}(t)^* \mathbf {G}(t))^{-1}(\mathbf {G}(t)^* v) \in \mathrm {ker}(\mathbf {G}(t))$ . Hence $\mathrm {ker}(\tilde {\mathbf {P}}(t))=\mathrm {ker}(\mathbf {G}(t)^*)$ and by orthogonal complements we find $\mathrm {ran}(\tilde {\mathbf {P}}(t)) = \mathrm {ran}(\mathbf {G}(t))$ .

On the other hand, we have $\mathrm {rank}(\mathbf {P}(t))=k=\mathrm {rank}(\mathbf {G}(t))$ and $\mathrm {ran}(\mathbf {G}(t)) \subset \mathrm {ran}(\mathbf {P}(t))$ since $\mathbf {P}(t) \mathbf {G}(t) = \mathbf {G}(t)$ . Hence $\mathrm {ran}(\mathbf {P}(t))=\mathrm {ran}(\mathbf {G}(t))$ .

We readily conclude that $\mathrm {ran} (\tilde {\mathbf {P}}(t)) = \mathrm {ran}(\mathbf {P}(t))$ . But this implies that the self-adjoint projections $\tilde {\mathbf {P}}(t)$ and $\mathbf {P}(t)$ must be identical. Hence (A.2) holds true.
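The elementary matrix identities used in this argument can also be illustrated numerically, as in the following minimal sketch (with a randomly generated full-rank sample matrix $G$; not part of the proof).

```python
# Illustrative sketch: for a randomly generated full-rank G in C^{d x k}, the
# matrix G (G* G)^{-1} G* is Hermitian, idempotent, has rank k and fixes ran(G),
# hence it equals the orthogonal projection onto ran(G).
import numpy as np

rng = np.random.default_rng(1)
d, k = 5, 2
G = rng.standard_normal((d, k)) + 1j * rng.standard_normal((d, k))

P = G @ np.linalg.inv(G.conj().T @ G) @ G.conj().T

print(np.allclose(P, P.conj().T))     # Hermitian
print(np.allclose(P, P @ P))          # idempotent
print(np.linalg.matrix_rank(P))       # k = 2
print(np.allclose(P @ G, G))          # P G = G
```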

For $N \in \mathbb {N}$ , we let $\mathbf {G}_N(t)$ be the truncated Fourier series of $\mathbf {G} \in C^\infty (\mathbb {T}; \mathbb {C}^{d \times k})$ , that is,

$$ \begin{align*}\mathbf{G}_N(t) = \sum_{|n| \leq N} \widehat{\mathbf{G}}_n \mathrm{e}^{\mathrm{i} n t} \end{align*} $$

with coefficients $\widehat {\mathbf {G}}_n = \frac {1}{2 \pi } \int _0^{2 \pi } \mathbf {G}(t) \mathrm {e}^{-\mathrm {i} n t} \, dt \in \mathbb {C}^{d \times k}$ for $n \in \mathbb {Z}$ . Clearly, we have $\mathbf {G}_N \in \mathcal {R}at(\mathbb {T}; \mathbb {C}^{d \times k})$ together with the fact that

(A.3) $$ \begin{align} \| \mathbf{G}_N - \mathbf{G} \|_{H^1} \to 0 \quad \text{as} \quad N \to \infty \,. \end{align} $$

By Sobolev embeddings, we have the uniform convergence $\| \mathbf {G}_N - \mathbf {G} \|_{L^\infty } \to 0$ as $N \to \infty $ . Recall that $\mathbf {G}(t)^* \mathbf {G}(t) \in \mathbb {C}^{k \times k}$ is invertible for all $t \in \mathbb {T}$ . Thus we deduce

$$ \begin{align*}\mathbf{G}_N(t)^* \mathbf{G}_N(t) \in \mathbb{C}^{k \times k} \text{ is invertible for all } t \in \mathbb{T} \text{ and } N \geq N_0 \, , \end{align*} $$

with some sufficiently large constant $N_0 \geq 1$ . Also, this shows that $\mathrm {rank}(\mathbf {G}_N(t)) = k$ for all $t \in \mathbb {T}$ and $N \geq N_0$ .

For $N \geq N_0$ , we now define the sequence $\mathbf {P}_N : \mathbb {T} \to \mathbb {C}^{d \times d}$ by

$$ \begin{align*}\mathbf{P}_N(t) := \mathbf{G}_N(t) [ \mathbf{G}_N(t)^* \mathbf{G}_N(t) ]^{-1} \mathbf{G}_N(t)^* \,. \end{align*} $$

Evidently, we have $\mathbf {P}_N(t) = \mathbf {P}_N(t)^* = \mathbf {P}_N(t)^2$ for any $t \in \mathbb {T}$ . Moreover, we find that $\mathrm {rank} (\mathbf {P}_N(t)) = k$ for $t \in \mathbb {T}$ and $N \geq N_0$ . Thus $\mathbf {P}_N : \mathbb {T} \to \mathsf {Gr}_k(\mathbb {C}^d)$ for all $N \geq N_0$ .

Now, recall that $\mathbf {G}_N \in \mathcal {R}at(\mathbb {T}; \mathbb {C}^{d \times k})$ . But this implies that the right-hand side in the definition of the maps $\mathbf {P}_N(t)$ is also rational in $z= \mathrm {e}^{\mathrm {i} t} \in \mathbb {S}$ , that is, we have

$$ \begin{align*}\mathbf{P}_N \in \mathcal{R}at(\mathbb{T}; \mathsf{Gr}_k(\mathbb{C}^d)) \quad \text{for all } N \geq N_0 \,. \end{align*} $$

Now, from the convergence (A.3) together with the fact that the Sobolev space $H^1(\mathbb {T})$ is an algebra, it is straightforward to derive

$$ \begin{align*} \mathbf{P}_N(t) & = \mathbf{G}_N(t) [ \mathbf{G}_N(t)^* \mathbf{G}_N(t) ]^{-1} \mathbf{G}_N(t)^* \\ & \quad \to \mathbf{G}(t) [\mathbf{G}(t)^* \mathbf{G}(t)]^{-1} \mathbf{G}(t)^* = \mathbf{P}(t) \quad \text{in } H^1(\mathbb{T}; \mathbb{C}^{d \times d}) \,. \end{align*} $$

Thanks to the elementary embedding $H^{1} \subset H^{\frac 1 2}$ this implies that $\| \mathbf {P}_N - \mathbf {P} \|_{H^{\frac 1 2}} \to 0$ as $N \to \infty $ .

Finally, we see that the sequence $\mathbf {U}_N := 2 \mathbf {P}_N - \mathbf {1}_d \in \mathcal {R}at(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ satisfies $\| \mathbf {U}_N - \mathbf {U} \|_{H^{\frac 1 2}} = 2 \| \mathbf {P}_N - \mathbf {P} \|_{H^{\frac 1 2}} \to 0$ as $N \to \infty $. The proof of Lemma A.2 is now complete.

The proof of Theorem A.2 now follows immediately from Lemmas A.1 and A.2.

With the help of Theorem A.2, we are now ready to prove Theorem A.1.

Proof of Theorem A.1.

We will make use of the known conformal invariance of the Gagliardo semi-norm $\| \cdot \|_{\dot {H}^{\frac 1 2}}$ . In what follows, we will identify maps defined on $\mathbb {T}$ as maps defined on $\mathbb {S}^1$ by means of $z= \mathrm {e}^{\mathrm {i} t} \in \mathbb {S}^1$ with $t \in \mathbb {T}$ .

Let

$$ \begin{align*}\mathcal{S} : \mathbb{R} \to \mathbb{S}^1 \setminus \{ \mathrm{i} \} ,\quad x \mapsto \mathrm{i} \frac{x-\mathrm{i}}{x+\mathrm{i}} \,. \end{align*} $$

denote the inverse stereographic projection from $\mathbb {R}$ to $\mathbb {S}^1 \setminus \{ \mathrm {i} \}$. Assume that $\mathbf {U} : \mathbb {R} \to \mathsf {Gr}_k(\mathbb {C}^d)$ and $\tilde {\mathbf {U}} : \mathbb {S}^1 \to \mathsf {Gr}_k(\mathbb {C}^d)$ are related by $\mathbf {U} = \tilde {\mathbf {U}} \circ \mathcal {S}$. A well-known calculation shows that

$$ \begin{align*} \| \mathbf{U} \|_{\dot{H}^{\frac 1 2}(\mathbb{R})}^2 & = \frac{1}{2 \pi} \int_{\mathbb{R}} \int_{\mathbb{R}} \frac{|\mathbf{U}(x) - \mathbf{U}(y)|_F^2}{|x-y|^2 } \, dx \, dy \\ & = \frac{1}{2 \pi} \int_{\mathbb{T}} \int_{\mathbb{T}} \frac{|\tilde{\mathbf{U}}(t)- \tilde{\mathbf{U}}(s)|_F^2}{2-2 \cos(t-s) } \, dt \, ds = \| \tilde{\mathbf{U}} \|_{\dot{H}^{\frac 1 2}(\mathbb{T})}^2 \end{align*} $$
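The identity behind this change of variables is the pointwise relation $|x-y|^{-2} = |\mathcal{S}'(x)|\, |\mathcal{S}'(y)|\, |\mathcal{S}(x)-\mathcal{S}(y)|^{-2}$ for $x, y \in \mathbb{R}$, which can be checked numerically as in the following illustrative sketch (not part of the proof).

```python
# Illustrative check of the pointwise identity
#   1/|x-y|^2 = |S'(x)| |S'(y)| / |S(x)-S(y)|^2,   x, y in R,
# which converts the Gagliardo kernel on R into the kernel on the circle.
import numpy as np

def S(x):
    return 1j * (x - 1j) / (x + 1j)

def dS(x):
    return -2.0 / (x + 1j) ** 2       # S'(x)

rng = np.random.default_rng(2)
x = rng.standard_normal(1000)
y = rng.standard_normal(1000)
lhs = 1.0 / np.abs(x - y) ** 2
rhs = np.abs(dS(x)) * np.abs(dS(y)) / np.abs(S(x) - S(y)) ** 2
print(np.allclose(lhs, rhs))          # True
```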

Thus for a given map $\mathbf {U} \in \dot {H}^{\frac {1}{2}}(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ , we set $\tilde {\mathbf {U}}(z) = (\mathbf {U} \circ \mathcal {S}^{-1})(z)$ which is defined for almost every $z \in \mathbb {S}$ . Then $\tilde {\mathbf {U}} \in H^{\frac 1 2}(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ by the above integral identity. By Theorem A.2, there exists a sequence $\tilde {\mathbf {U}}_N \in \mathcal {R}at(\mathbb {T}; \mathsf {Gr}_k(\mathbb {C}^d))$ with

$$ \begin{align*}0 \leq \| \tilde{\mathbf{U}}_N - \tilde{\mathbf{U}} \|_{\dot{H}^{\frac 1 2}(\mathbb{T})} \leq \| \tilde{\mathbf{U}}_N - \tilde{\mathbf{U}} \|_{H^{\frac 1 2}(\mathbb{T})} \to 0 \quad \text{as} \quad N \to \infty \,. \end{align*} $$

Note that the functions

$$ \begin{align*}\mathbf{U}_N := \tilde{\mathbf{U}}_N \circ \mathcal{S} \end{align*} $$

belong to $\mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$, since $\mathcal {S}(x)$ is a rational function of $x$ and hence composition with $\mathcal {S}$ preserves rationality. Finally, we deduce that

$$ \begin{align*}\| \mathbf{U}_N - \mathbf{U} \|_{\dot{H}^{\frac 1 2}(\mathbb{R})} = \| \tilde{\mathbf{U}}_N - \tilde{\mathbf{U}} \|_{\dot{H}^{\frac 1 2}(\mathbb{T})} \to 0 \quad \text{as} \quad N \to \infty \,. \end{align*} $$

This completes the proof of Theorem A.1.

B. Stereographic parametrization

In this section, we give the proof of Theorem 8.1. Hence we always assume that $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ is a rational map and, as usual, we denote $\mathbf {U} = \mathbf {u} \cdot \boldsymbol {\sigma } : \mathbb {R} \to \mathsf {Gr}_1(\mathbb {C}^2)$ for the corresponding rational matrix-valued map. Note that here we always consider $T_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathbb {C}^2) \to L^2_+(\mathbb {R}; \mathbb {C}^2)$ , that is, we take $\mathcal {V} = \mathbb {C}^2$ . Also, the operators $H_{\mathbf {U}}$ and $K_{\mathbf {U}} = H_{\mathbf {U}}^* H_{\mathbf {U}}$ are always understood as acting on $L^2_+(\mathbb {R}; \mathbb {C}^2)$ in what follows.

We first collect some auxiliary results as follows.

Lemma B.1. Assume $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ is a rational function of the form

$$ \begin{align*}\mathbf{u}(x) = \mathbf{u}_\infty + \sum_{j=1}^N \left ( \frac{\mathbf{s}_j}{x-z_j} + \frac{\overline{\mathbf{s}}_j}{x-\overline{z}_j} \right ) \end{align*} $$

with some integer $N \geq 1$ , $\mathbf {u}_\infty \in \mathbb {S}^2$ , $\mathbf {s}_1, \ldots , \mathbf {s}_N \in \mathbb {C}^3 \setminus \{ 0 \}$ , and pairwise distinct poles $z_1, \ldots , z_N \in \mathbb {C}_-$ . Then it holds $\mathrm {Rank} (K_{\mathbf {U}}) = N$ .

Remark. By a straightforward extension of the proof below, we obtain the following result: Let $\mathbf {U} \in \mathcal {R}at(\mathbb {R}; \mathsf {Gr}_k(\mathbb {C}^d))$ be of the form

$$ \begin{align*}\mathbf{U}(x) = \mathbf{U}_\infty + \sum_{j=1}^N \frac{A_j}{x-z_j} + \sum_{j=1}^N \frac{A_j^*}{x-\overline{z}_j} \end{align*} $$

with some integer $N \geq 1$ , $\mathbf {U}_\infty \in \mathsf {Gr}_k(\mathbb {C}^d)$ , nonzero matrices $A_1, \ldots , A_N \in M_d(\mathbb {C})$ with $A_j^2=0$ and $\mathrm {rank}(A_j)=1$ , and pairwise distinct poles $z_1, \ldots , z_N \in \mathbb {C}_-$ . Then we have $\mathrm {rank}(K_{\mathbf {U}}) = N$ for the operator $K_{\mathbf {U}} = H_{\mathbf {U}}^* H_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathbb {C}^d) \to L^2_+(\mathbb {R}; \mathbb {C}^d)$ .

Proof. Since $K_{\mathbf {U}} = H_{\mathbf {U}}^* H_{\mathbf {U}}$ , we have $\dim \mathrm {ran}(H_{\mathbf {U}}^*) = \dim \mathrm {ran} (K_{\mathbf {U}})$ . Therefore, we need to determine the rank of the adjoint Hankel operator $H_{\mathbf {U}}^* : L^2_-(\mathbb {R}; \mathbb {C}^2) \to L^2_+(\mathbb {R}; \mathbb {C}^2)$ with

$$ \begin{align*}H_{\mathbf{U}}^*f = \Pi_+ (\mathbf{U} f) = \Pi_+ \left ( \sum_{j=1}^N \frac{A_j}{x-z_j} f \right ) \quad \text{for } f \in L^2_-(\mathbb{R}; \mathbb{C}^2) \end{align*} $$

with the matrices $A_j = \mathbf {s}_j \cdot \boldsymbol {\sigma } \in M_2(\mathbb {C})$ . From the constraint $\mathbf {u}(x) \cdot \mathbf {u}(x) = 1$ , we readily deduce that the nonzero vectors $\mathbf {s}_j \in \mathbb {C}^3 \setminus \{ 0 \}$ satisfy $\mathbf {s}_j \cdot \mathbf {s}_j = 0$ for all $j =1, \ldots , N$ . To see this, we recall $\mathbf {u}_\infty \cdot \mathbf {u}_\infty =1$ and that the poles $\{ z_j \}_{j=1}^N$ are pairwise distinct, so that an elementary expansion in partial fractions yields

$$ \begin{align*} 1 & = \mathbf{u}(x) \cdot \mathbf{u}(x) = \left ( \mathbf{u}_\infty + \sum_{j=1}^N \left ( \frac{\mathbf{s}_j}{x-z_j} + \frac{\overline{\mathbf{s}}_j}{x-\overline{z}_j} \right ) \right ) \cdot \left ( \mathbf{u}_\infty + \sum_{k=1}^N \left ( \frac{\mathbf{s}_k}{x-z_k} + \frac{\overline{\mathbf{s}}_k}{x-\overline{z}_k} \right ) \right ) \\ & = 1 + \sum_{j=1}^N \frac{\mathbf{s}_j \cdot \mathbf{s}_j}{(x-z_j)^2} + \text{rational terms not containing } \displaystyle \frac{1}{(x-z_j)^{2}} \text{ for any } j=1, \ldots, N. \end{align*} $$

Hence we conclude that $\mathbf {s}_j \cdot \mathbf {s}_j = 0$ for all $j=1, \ldots , N$. Next, by elementary algebra for the Pauli matrices, we find $A_j^2 = (\mathbf {s}_j \cdot \mathbf {s}_j) \mathbf {1}_2 = 0$ and hence each nonzero matrix $A_j \in M_2(\mathbb {C})$ has exactly rank one. On the other hand, we easily verify that

$$ \begin{align*}\Pi_+ \left ( \frac{1}{x-\zeta} f \right ) = \frac{f(\zeta)}{x-\zeta} \quad \text{for } f \in L^2_-(\mathbb{R};\mathbb{C}^2) \text{ and } \zeta \in \mathbb{C}_- \,. \end{align*} $$

In particular, we see that $f \mapsto \Pi _+((x-\zeta )^{-1} f)$ has rank one for $\zeta \in \mathbb {C}_-$. Since each matrix $A_j$ has rank one and in view of

$$ \begin{align*}H^*_{\mathbf{U}} f = \Pi_+ (\mathbf{U} f) = \sum_{j=1}^N A_j \Pi_+\left ( \frac{1}{(x-z_j)} f \right ) \, , \end{align*} $$

we deduce the upper bound $\mathrm {rank}(H^*_{\mathbf {U}}) \leq N$ .

It remains to show that $\mathrm {rank}(H^*_{\mathbf {U}}) \geq N$ holds. Take vectors $v_j \in \mathbb {C}^2$ with $A_j v_j \neq 0$ for $j=1, \ldots , N$ . Now we consider the functions $f_1, \ldots , f_N \in L^2_-(\mathbb {R};\mathbb {C}^2)$ given by

$$ \begin{align*}f_j(x) = \prod_{k=1, \, k \neq j}^N \frac{x-z_k}{x-\overline{z}_k} \frac{v_j}{x-\overline{z}_j} \,. \end{align*} $$

An explicit calculation shows that

$$ \begin{align*}H_{\mathbf{U}}^* f_j = \frac{A_j f_j(z_j)}{x-z_j} \,. \end{align*} $$

Since $A_j f_j(z_j) \neq 0$ and $z_1, \ldots , z_N \in \mathbb {C}_-$ are pairwise distinct, we see that $\mathrm {rank}(H_{\mathbf {U}}^*) \geq N$ . This completes the proof.

The next lemma addresses the case of nonsimple poles occurring in the rational map $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ and we derive a lower bound for $\mathrm {rank}(K_{\mathbf {U}})$ .

Lemma B.2. Suppose that $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ is of the form

$$ \begin{align*}\mathbf{u}(x) = \mathbf{u}_\infty + \sum_{j=1}^p \sum_{k=1}^{m_j} \left ( \frac{\mathbf{s}_{j,k}}{(x-z_j)^k} + \frac{\overline{\mathbf{s}}_{j,k}}{(x-\overline{z}_j)^k} \right ) \end{align*} $$

with some integers $N \geq 1$, $1 \leq p,m_j \leq N$, vectors $\mathbf {s}_{j,k} \in \mathbb {C}^3 \setminus \{ 0 \}$, and pairwise distinct poles $z_1, \ldots , z_p \in \mathbb {C}_-$. Then it holds that

$$ \begin{align*}\mathrm{Rank}(K_{\mathbf{U}}) \geq N = \sum_{j=1}^p m_j \,. \end{align*} $$

Proof. As before, we need to bound $\mathrm {rank}(H^*_{\mathbf {U}})$ . We adapt the second part of the proof of Lemma B.1 as follows. For any $\zeta \in \mathbb {C}_-$ and any integer $k \geq 1$ , we obtain by Taylor’s formula that

$$ \begin{align*}\Pi_+\left ( \frac{1}{(x-\zeta)^k} f \right ) = \sum_{\ell=0}^{k-1} \frac{f^{(\ell)}(\zeta)}{\ell! (x-\zeta)^{k-\ell}} \end{align*} $$

for any $f \in L^2_-(\mathbb {R}; \mathbb {C}^2)$ . Now we choose $j \in \{ 1, \ldots , p \}$ and $k \in \{ 1, \ldots , m_j \}$ . We claim that there exists $f_{j,k} \in L^2_-(\mathbb {R}; \mathbb {C}^2)$ such that

$$ \begin{align*}f_{j,k}^{(\ell)} (z_i) = 0 \quad \text{for } i \neq j \text{ and } \ell \in \{0, \ldots, m_i-1 \} \, , \end{align*} $$
$$ \begin{align*}f_{j,k}^{(\ell)}(z_j) = 0 \quad \text{for } \ell \in \{0, \ldots, m_j-1 \} \text{ with } \ell \neq k-1, \quad \text{and} \quad A_{j,m_j} f^{(k-1)}_{j,k}(z_j) \neq 0 \, , \end{align*} $$

with the rank-one matrices $A_{j,k} = \mathbf {s}_{j,k} \cdot \boldsymbol {\sigma } \in M_2(\mathbb {C})$ . Indeed, just choose

$$ \begin{align*}f_{j,k}(x) = \prod_{i=1, \, i \neq j}^N \left ( \frac{x-z_i}{x-\overline{z}_i} \right ) \sum_{r=k}^{m_j} \frac{(x-z_j)^{r-1}}{(x-\overline{z}_j)^{r}} v_{j,k,r} \end{align*} $$

with nonzero vectors $v_{j,k,r} \in \mathbb {C}^2$ such that $A_{j,m_j} v_{j,k,k} \neq 0$ and with the other $v_{j,k,r}$ determined by induction on r. Then

$$ \begin{align*}H^*_{\mathbf{U}}(f_{j,k}) = \frac{1}{(k-1)!} \sum_{r=k}^{m_j} \frac{A_{j,r} \, f^{(k-1)}_{j,k}(z_j)}{(x-z_j)^{r-k+1}} \,. \end{align*} $$

It remains to observe that these rational functions are linearly independent as $j \in \{1, \ldots , p \}$ and $k \in \{1, \ldots , m_j \}$ , which is elementary in view of the leading singularity in $(x-z_j)$ .
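The Taylor-type formula for $\Pi_+$ used above can be checked on the simplest nontrivial example $f(x) = (x-w)^{-1}$ with $w \in \mathbb{C}_+$: after removing the displayed principal part at $\zeta \in \mathbb{C}_-$, the remainder has its only pole at $w$ and thus lies in $L^2_-$ and is annihilated by $\Pi_+$. The following sketch (with hypothetical sample values of $\zeta$, $w$ and $k$; illustrative only) verifies this numerically.

```python
# Illustrative check for f(x) = 1/(x - w), w in C_+, zeta in C_-: subtracting
# the principal part at zeta from (x - zeta)^{-k} f(x) leaves a function whose
# only pole is at w.
import numpy as np
from math import factorial

zeta, w, k = 0.3 - 1.2j, -0.8 + 0.7j, 3   # hypothetical sample values

def f(x):
    return 1.0 / (x - w)

def f_deriv(m, x):                        # m-th derivative of f
    return (-1) ** m * factorial(m) / (x - w) ** (m + 1)

xs = np.linspace(-5.0, 5.0, 11)
principal = sum(f_deriv(m, zeta) / (factorial(m) * (xs - zeta) ** (k - m)) for m in range(k))
remainder = f(xs) / (xs - zeta) ** k - principal
print(np.allclose(remainder, 1.0 / ((w - zeta) ** k * (xs - w))))   # True
```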

We are now ready to give the proof of Theorem 8.1. For the reader’s convenience, we recall the statement of Theorem 8.1, which is labeled as Theorem B.1 here.

Theorem B.1. Let $\mathbf {u} =(u_1, u_2, u_3) : \mathbb {R} \to \mathbb {S}^2$ be a rational map. Given an integer $N \geq 1$ , the following statements are equivalent.

  1. (i) $\dim \mathfrak {H}_1 = \mathrm {rank}(K_{\mathbf {U}}) = N$ .

  2. (ii) The least common denominator of $u_1, u_2, u_3$ has degree $2N$ .

  3. (iii) There exists a rational function $R \in \mathbb {C}(X)$ of the form

    $$ \begin{align*}R(x) = \frac{P(x)}{Q(x)} \, , \end{align*} $$
where $P \in \mathbb {C}[X]$ is a polynomial of degree $N$ and $Q \in \mathbb {C}[X]$ is a nonzero polynomial of degree at most $N-1$ such that $P$ and $Q$ have no common factor, and such that, up to a rotation on the sphere $\mathbb {S}^2$, we have
    $$ \begin{align*}u_1(x) + \mathrm{i} u_2(x) = \frac{2 R(x)}{R(x) \overline{R}(x)+1} \, , \quad u_3(x) = \frac{R(x) \overline{R}(x) -1}{R(x) \overline{R}(x) + 1} \,. \end{align*} $$

Proof of Theorem B.1.

We divide the proof into the following steps.

$\mathbf {(ii) \Rightarrow (iii)}$ . Assume $\mathbf {u} = (u_1, u_2, u_3) : \mathbb {R} \to \mathbb {S}^2$ is a rational map with the least common denominator given by a polynomial $D \in \mathbb {R}[X]$ of degree $2N$ . (Note that D must have even degree, since $u_1, u_2, u_3$ are real-valued rational functions with no poles in $\mathbb {R}$ .) Moreover, up to a rotation on $\mathbb {S}^2$ , we may assume that $u_3(x) \to 1$ as $|x| \to \infty $ , so that there exist polynomials $Q_j \in \mathbb {R}[X]$ such that

$$ \begin{align*}u_j(x) = \frac{Q_j(x)}{D(x)} \quad \text{for } j=1,2,3, \end{align*} $$

where $Q_1, Q_2$ have degree at most $2N-1$ and $Q_3$ has degree $2N$ , with the same leading coefficient as D. Now the condition $u_1^2 + u_2^2 + u_3^2 = 1$ means that

$$ \begin{align*}Q_1^2 + Q_2^2 + Q_3^2 = D^2 \, , \end{align*} $$

or equivalently

$$ \begin{align*}(Q_1 + \mathrm{i} Q_2)(Q_1 - \mathrm{i} Q_2) = (D-Q_3)(D+Q_3) \,. \end{align*} $$

Since $Q_3$ and D have the same leading coefficient, the degree of $D+Q_3$ is $2N$ and the degree $\delta $ of $D-Q_3$ is at most $2N-1$ . Denote by d the degree of $Q_1 + \mathrm {i} Q_2$ . Since $Q_1$ and $Q_2$ are real polynomials, d is also the degree of $Q_1 - \mathrm {i} Q_2$ and hence

$$ \begin{align*}2d = \delta + 2N \,. \end{align*} $$

This implies

$$ \begin{align*}N \leq d \leq 2N-1 \,. \end{align*} $$

Furthermore, we recall that D is the least common denominator of $u_1, u_2, u_3$ which means that $Q_1, Q_2, Q_3, D$ have no common factor, or equivalently the polynomials

$$ \begin{align*}Q_1+ \mathrm{i} Q_2, Q_1 - \mathrm{i} Q_2, D-Q_3, D + Q_3 \end{align*} $$

have no common factor.

Now, we claim that $Q_1 + \mathrm {i} Q_2$ and $D-Q_3$ have at least $d-N$ common zeros – counted with multiplicities. Indeed, assume that $\alpha \in \mathbb {C}$ is a zero of $D-Q_3$ of multiplicity $m \geq 1$ . We distinguish the following cases depending whether $\alpha \in \mathbb {R}$ or $\alpha \not \in \mathbb {R}$ .

If $\alpha $ is real, then $\alpha $ is in fact a zero of $Q_1$ and $Q_2$ , hence it is a zero of $Q_1+\mathrm {i} Q_2$ and $Q_1 -\mathrm {i} Q_2$ with the same multiplicity $\mu $ . Since $\alpha $ cannot be a zero of $D+Q_3$ (otherwise $Q_1+\mathrm {i} Q_2, Q_1 - \mathrm {i} Q_2, D-Q_3, D+ Q_3$ would have common factor), we infer that $2 \mu = m$ .

If $\alpha $ is not real, then $\alpha $ is a zero of $Q_1 + \mathrm {i} Q_2$ or $Q_1 - \mathrm {i} Q_2$ . Since $\overline {\alpha }$ is also a zero of the real polynomial $D-Q_3$ , we can choose the zero $\beta \in \{ \alpha , \overline {\alpha } \}$ having the maximal multiplicity $\mu $ as zero of $Q_1 + \mathrm {i} Q_2$ . This shows $\mu \geq \frac {m}{2}$ .

Summing up, we have found a common factor of $D-Q_3$ and $Q_1 + \mathrm {i} Q_2$ with degree at least equal to half of the degree of $D-Q_3$ , namely $d-N$ . Therefore we can write

(B.1) $$ \begin{align} \frac{Q_1 + \mathrm{i} Q_2}{D-Q_3} = \frac{P}{Q} \end{align} $$

where P and Q are polynomials in $\mathbb {C}[X]$ with no common factor, and P has degree $d-r$ , Q has degree $2(d-N)-r$ for some $r \geq d-N$ . Notice that $\deg P> \deg Q$ .

Next, we prove that equality $r=d-N$ holds. Indeed, from (B.1), we conclude

$$ \begin{align*}\frac{Q_1}{D} = \frac{P \overline{Q} + \overline{P} Q}{P \overline{P}+ Q \overline{Q}}, \quad \frac{Q_2}{D} = \frac{P \overline{Q}- \overline{P}Q}{\mathrm{i}(P \overline{P} + Q \overline{Q})}, \quad \frac{Q_3}{D} = \frac{P \overline{P} - Q \overline{Q}}{P \overline{P}+ Q \overline{Q}} \,. \end{align*} $$

This implies that $u_1, u_2, u_3$ have a common denominator of degree equal to $2 \deg P$ . Hence

$$ \begin{align*}2(d-r) \geq 2N \, , \end{align*} $$

which implies $r \leq d-N$ , leading to the desired equality $r=d-N$ . By defining the rational function

$$ \begin{align*}R(x) = \frac{P(x)}{Q(x)} \in \mathbb{C}(X), \end{align*} $$

we conclude that (iii) holds. This completes the proof of the implication $(ii) \Rightarrow (iii)$ .

$\mathbf {(iii) \Rightarrow (ii)}$ . Suppose we are given a rational function

$$ \begin{align*}R(x) = \frac{P(x)}{Q(x)} \, , \end{align*} $$

where P is a polynomial of degree N, Q is a polynomial of degree at most $N-1$ , and $P, Q$ have no common factor. The formulae

$$ \begin{align*}u_1 = \frac{R + \overline{R}}{R \overline{R}+1}, \quad u_2 = \frac{R - \overline{R}}{\mathrm{i} (R \overline{R}+1)}, \quad u_3 = \frac{R \overline{R}-1}{R \overline{R}+1} \end{align*} $$

clearly define a rational map $\mathbf {u}=(u_1,u_2, u_3)$ from $\mathbb {R}$ with values in $\mathbb {S}^2$ . Furthermore, we see that $|P|^2 + |Q|^2$ is a common denominator of $u_1,u_2,u_3$ and its degree is $2N$ . Let us prove that $|P|^2 + |Q|^2$ is the least common denominator of $u_1, u_2, u_3$ . We argue by contradiction. Suppose there is a common factor of the polynomials

$$ \begin{align*}P \overline{Q} + \overline{P} Q, P \overline{Q}- \overline{P} Q, P \overline{P} - Q \overline{Q}, P \overline{P} + Q \overline{Q} \end{align*} $$

or equivalently of the polynomials

$$ \begin{align*}P \overline{Q}, \overline{P} Q, P \overline{P}, Q \overline {Q} \,. \end{align*} $$

Since $P, Q$ have no common factor, there exist polynomials $U, V$ such that

$$ \begin{align*}UP + VQ = 1 \,. \end{align*} $$

Therefore the polynomials

$$ \begin{align*}\overline{U} (P \overline{P} )+ \overline{V}(P \overline{Q}) = P, \quad \overline{U} (\overline{P} Q) + \overline{V} ( Q \overline{Q} ) = Q \end{align*} $$

would have a common factor, which yields a contradiction. This proves that (iii) implies (ii).

$\mathbf {(ii) \Rightarrow (i).}$ Let us assume that $\mathbf {u} = (u_1, u_2, u_3) : \mathbb {R} \to \mathbb {S}^2$ has a least common denominator of degree $2N$ . We claim that

(B.2) $$ \begin{align} \mathrm{Rank} (K_{\mathbf{U}}) = N \,. \end{align} $$

Indeed, if $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ has only simple poles (i.e., the assumptions of Lemma B.1 are satisfied), we can directly apply Lemma B.1 to conclude that (B.2) holds.

To deal with the case of multiple poles occurring in $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ , we need the following approximation result.

Lemma B.3. Let $P \in \mathbb {C}[X]$ be a polynomial of degree $N \geq 1$, $Q \in \mathbb {C}[X]$ be a nonzero polynomial of degree at most $N-1$, and assume that $P,Q$ have no common factor. Then there exist sequences $P_n, Q_n \in \mathbb {C}[X]$ such that $P_n, Q_n$ have no common factor and

$$ \begin{align*}\deg P_n = N, \quad \deg Q_n \leq N-1, \quad P_n \to P, \quad Q_n \to Q \quad \text{in } \mathbb{C}[X] \,. \end{align*} $$

Furthermore, the zeros of $|P_n|^2 + |Q_n|^2$ are simple for every $n \in \mathbb {N}$ .

Proof of Lemma B.3.

Consider the set $\mathcal {A}$ of pairs of polynomials $(P,Q) \in \mathbb {C}[X] \times \mathbb {C}[X]$ such that $\deg P = N$, $\deg Q \leq N-1$, $P,Q$ have no common factor and $P$ is monic. By Lemma 8.1, we can identify $\mathcal {A}$ with a connected open subset in $\mathbb {C}^{2N}$. On $\mathcal {A}$, the set of pairs $(P,Q)$ for which the discriminant of $|P|^2 + |Q|^2$ is different from $0$ is open and dense, since this discriminant is a real analytic function that does not vanish identically on the connected set $\mathcal {A}$; for instance, the pair $(P,Q)=(X^N, 1)$ gives $|P|^2+|Q|^2 = X^{2N}+1$, which has only simple zeros. This completes the proof.

Suppose now that $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ has multiple poles and the least common denominator of $u_1, u_2, u_3$ has degree $2N$ . By Lemma B.2, we must have

$$ \begin{align*}\mathrm{Rank} \, (K_{\mathbf{U}}) \geq N \,. \end{align*} $$

On the other hand, by the proven implication $(ii) \Rightarrow (iii)$ , there exists a rational function

$$ \begin{align*}R(x) = \frac{P(x)}{Q(x)} \in \mathbb{C}(X) \end{align*} $$

with $\deg P = N$ , $\deg Q \leq N-1$ with $Q \not \equiv 0$ and $Q,P$ have no common factor, such that (up to rotation on $\mathbb {S}^2$ ) the rational function $\mathbf {u}=(u_1, u_2, u_3)$ is given by the inverse stereographic projection applied to $R(x)$ . Now, let us take sequences $P_n, Q_n \in \mathbb {C}[X]$ as provided in Lemma B.3. Define $R_n(x) = P_n(x)/Q_n(x) \in \mathbb {C}(X)$ and consider the sequence of rational maps $\mathbf {u}^{(n)} : \mathbb {R} \to \mathbb {S}^2$ with components

$$ \begin{align*}u_1^{(n)}(x) + \mathrm{i} u_2^{(n)}(x)= \frac{2 R_n(x)}{|R_n(x)|^2 +1} , \quad u_3^{(n)}(x) = \frac{|R_n(x)|^2-1}{|R_n(x)|^2 +1} \,. \end{align*} $$

Since $|P_n|^2 + |Q_n|^2$ has only simple zeros, we see that each rational map $\mathbf {u}^{(n)}$ has only simple poles. Applying the known implication $(iii) \Rightarrow (ii)$, we conclude that, for every $n$, the components $u_1^{(n)}, u_2^{(n)}, u_3^{(n)}$ have a least common denominator of degree $2N$. Thus for every rational map $\mathbf {u}^{(n)} : \mathbb {R} \to \mathbb {S}^2$ we can apply Lemma B.1 to conclude that

$$ \begin{align*}\mathrm{Rank} \, (K_{\mathbf{U}_n}) = N \quad \text{for all } n \in \mathbb{N} \,. \end{align*} $$

On the other hand, since $\mathbf {u}^{(n)}(x) \to \mathbf {u}(x)$ pointwise and $|\mathbf {u}^{(n)}(x)| = 1$ , we see that $K_{\mathbf {U}_n} f \to K_{\mathbf {U}} f$ in $L^2(\mathbb {R}, \mathbb {C}^2)$ for every $f \in L^2_+(\mathbb {R}, \mathbb {C}^2)$ by dominated convergence. From this we easily deduce that

$$ \begin{align*}N=\liminf_{n \to \infty} \mathrm{Rank} (K_{\mathbf{U}_n}) \geq \mathrm{Rank}(K_{\mathbf{U}}) \,. \end{align*} $$

This completes the proof that (B.2) holds whenever $u_1, u_2, u_3$ have a least common denominator of degree $2N$ .

$\mathbf {(i) \Rightarrow (ii)}$ . Suppose now that $\mathrm {Rank}(K_{\mathbf {U}}) = N$ holds for some integer $N \geq 1$ . Let $D \in \mathbb {R}[X]$ denote the least common denominator of $u_1, u_2, u_3$ . Since D has no zeros in $\mathbb {R}$ , we must have that $\deg D = 2 m$ for some integer $m \geq 1$ . We claim that

$$ \begin{align*}m=N \,. \end{align*} $$

Indeed, if $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ has simple poles (in the sense of Lemma B.1), we can use Lemma B.1 directly to deduce that $m=N$ must hold.

If $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ has multiple poles, then $\deg D = 2m$ where $m \geq 1$ is the number of poles of $\mathbf {u}$ counted with multiplicity. By the same argument using approximation with simple pole rational functions $\mathbf {u}^{(n)} : \mathbb {R} \to \mathbb {S}^2$ as in the previous step, we conclude that $\mathrm {Rank}(K_{\mathbf {U}}) = m$ . Hence $m=N$ is also true in this case.

The proof of Theorem B.1 is now complete.

C. Construction of $T_{\mathbf {U}}$ with simple discrete spectrum

The aim of this section is to construct, for given $N \geq 1$ , rational maps $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ such that the corresponding Toeplitz operator $T_{\mathbf {U}} : L^2_+(\mathbb {R}; \mathbb {C}^2) \to L^2_+(\mathbb {R}; \mathbb {C}^2)$ has simple discrete spectrum

$$ \begin{align*}\sigma_{\mathrm{d}}(T_{\mathbf{U}}) = \{ v_1, \ldots, v_N \} \, , \end{align*} $$

where $v_j \in (-1, 1)$ for $j=1, \ldots , N$ are arbitrarily given simple eigenvalues. To achieve this, we will use a perturbative construction by using N simple traveling solitary waves for (HWM) with different velocities $v_j \in (-1,1)$ that are sufficiently far separated from each other.

For a rational map $\mathbf {u} \in \mathcal {R}at(\mathbb {R}; \mathbb {S}^2)$ , we henceforth assume without loss of generality that

$$ \begin{align*}\mathbf{u}_\infty := \lim_{|x| \to \infty} \mathbf{u}(x) = \mathbf{e}_3=(0,0,1) \in \mathbb{S}^2 \end{align*} $$

by rotational symmetry on the sphere $\mathbb {S}^2$ . For a given velocity $v \in (-1,1)$ , we define the unit vector $\mathbf {n}_v \in \mathbb {S}^2$ by setting

(C.1) $$ \begin{align} \mathbf{n}_v := (0,\sqrt{1-v^2}, v ) \quad \text{so that } v = \mathbf{n}_v \cdot \mathbf{u}_\infty \,. \end{align} $$

For later use, we also define the unit vectors $\mathbf {n}_{v,1}, \mathbf {n}_{v,2} \in \mathbb {S}^2$ with

(C.2) $$ \begin{align} \mathbf{n}_{v,1} := \mathbf{e}_1 = (1,0,0) \quad \text{and} \quad \mathbf{n}_{v,2} := \mathbf{n}_v \times \mathbf{n}_{v,1} = (0,v, -\sqrt{1-v^2}) \,. \end{align} $$

Thus $(\mathbf {n}_v, \mathbf {n}_{v,1}, \mathbf {n}_{v,2})$ forms a (positively oriented) orthonormal basis of unit vectors in $\mathbb {R}^3$ whose use will become clear below.

Furthermore, it will be convenient to consider poles $z \in \mathbb {C}_-$ of the form

(C.3) $$ \begin{align} z = y - \mathrm{i} \in \mathbb{C}_- \quad \text{with } y \in \mathbb{R} \,. \end{align} $$

Next, we construct a rational function $\mathbf {q}_{v,z} : \mathbb {R} \to \mathbb {S}^2$ of the form

$$ \begin{align*}\mathbf{q}_{v,z}(x) := \mathbf{e}_3 + \frac{\mathbf{s}_{v}}{x-z} + \frac{\overline{\mathbf{s}}_{v}}{x-\overline{z}} \end{align*} $$

with some complex vector $\mathbf {s}_v \in \mathbb {C}^3 \setminus \{ 0 \}$ . By plugging this ansatz into the pointwise constraint $|\mathbf {q}_{v,z}(x)|^2 = 1$ for $x \in \mathbb {R}$ and equating all terms proportional to $(x-z)^{-1}$ and $(x-z)^{-2}$ to zero, we easily find the following constraints equivalent to the condition $|\mathbf {q}_{v,z}(x)|^2=1$ :

(C.4) $$ \begin{align} \mathbf{s}_v \cdot \mathbf{s}_v = 0 \quad \text{and} \quad \mathbf{s}_v \cdot \left ( \mathbf{e}_3 + \frac{\overline{\mathbf{s}}_v}{z - \overline{z}} \right ) = 0 \, , \end{align} $$

where $\mathbf {a} \cdot \mathbf {b} = a_1 b_1 + a_2 b_2 + a_3 b_3$ for $\mathbf {a}, \mathbf {b} \in \mathbb {C}^3$ . In view of [Reference Berntson, Klabbers and Langmann3][Lemma B.1], we make the ansatz

$$ \begin{align*}\mathbf{s}_v = s_{v} ( \mathbf{n}_{v,1} + \mathrm{i} \mathbf{n}_{v,2} ) \end{align*} $$

with some complex number $s_{v} \in \mathbb {C}^*$ and with the real unit vectors $\mathbf {n}_{v,1}$ and $\mathbf {n}_{v,2}$ from (C.2) above. This automatically ensures that the first constraint in (C.4) holds. Next, by recalling that $z-\overline {z} = -2 \mathrm {i}$ for the pole $z \in \mathbb {C}_-$ , the second equation in (C.4) becomes

$$ \begin{align*}s_v ( \mathbf{n}_{v,1} + \mathrm{i} \mathbf{n}_{v,2}) \cdot \left ( \mathbf{e}_3 - \frac{\overline{s}_v(\mathbf{n}_{v,1} -\mathrm{i} \mathbf{n}_{v,2})}{2 \mathrm{i}} \right ) = 0 \,. \end{align*} $$

Since $(\mathbf {n}_{v,1}+\mathrm {i} \mathbf {n}_{v,2})\cdot (\mathbf {n}_{v,1}-\mathrm {i} \mathbf {n}_{v,2}) = 2$ , we readily find that $s_v \in \mathbb {C}^*$ is given by

(C.5) $$ \begin{align} s_v = -\mathrm{i} ( \mathbf{n}_{v,1} -\mathrm{i} \mathbf{n}_{v,2}) \cdot \mathbf{e}_3 = \sqrt{1-v^2} \,. \end{align} $$

In summary, we find that

$$ \begin{align*}\mathbf{q}_{v,z}(x) = \mathbf{e}_3 + \frac{\mathbf{s}_v}{x-z} + \frac{\overline{\mathbf{s}}_v}{x-\overline{z}} \end{align*} $$

with

(C.6) $$ \begin{align} \mathbf{s}_v = \sqrt{1-v^2} \, ( \mathbf{n}_{v,1} + \mathrm{i} \mathbf{n}_{v,2} ) = \sqrt{1-v^2} \ \left ( \begin{array}{c} 1 \\ \mathrm{i} \, v \\ - \mathrm{i} \sqrt{1-v^2} \end{array} \right ) \,. \end{align} $$

We remark that the simple pole rational function $\mathbf {q}_{v,z} : \mathbb {R} \to \mathbb {S}^2$ yields a traveling solitary wave solution

$$ \begin{align*}\mathbf{u}(t,x) = \mathbf{q}_{v,z}(x-vt) \end{align*} $$

of (HWM) with velocity $v$ and $\lim _{|x| \to \infty } \mathbf {u}(t,x) = \mathbf {e}_3$, as follows from a direct calculation that we omit here.
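The constraints (C.4) and the explicit coefficient (C.6) can also be checked numerically; the following minimal sketch (with hypothetical sample values of $v$ and $y$; illustrative only) verifies that $\mathbf{q}_{v,z}$ indeed takes values in $\mathbb{S}^2$.

```python
# Illustrative check that the one-soliton profile q_{v,z} takes values in S^2,
# using s_v from (C.6); the sample velocity v and pole z = y - i are hypothetical.
import numpy as np

v, y = 0.4, 1.7
z = y - 1j
s = np.sqrt(1.0 - v**2) * np.array([1.0, 1j * v, -1j * np.sqrt(1.0 - v**2)])

def q(x):
    w = np.array([0.0, 0.0, 1.0]) + s / (x - z) + np.conj(s) / (x - np.conj(z))
    return w.real                          # q_{v,z}(x) is real for real x

xs = np.linspace(-30.0, 30.0, 3001)
print(max(abs(q(x) @ q(x) - 1.0) for x in xs))   # ~ 1e-15
```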

We have the following main result.

Lemma C.1. Let $N \geq 1$ be an integer and let $v_1, \ldots , v_N \in (-1,1)$ be given. Then there is a sufficiently small constant $\varepsilon _0=\varepsilon _0(N)> 0$ such that the following holds.

Let $z_1, \ldots , z_N \in \mathbb {C}_-$ be pairwise distinct poles of the form (C.3) and define

$$ \begin{align*}\varepsilon := \frac{1}{\min_{j \neq k} |z_k -z_j|}> 0 \quad \text{and} \quad \vec{z} = (z_1, \ldots, z_N). \end{align*} $$

Then if $\varepsilon < \varepsilon _0$ , there exists a rational map $\mathbf {u}_{\vec {z}} : \mathbb {R} \to \mathbb {S}^2$ of the form

$$ \begin{align*}\mathbf{u}_{\vec{z}}(x) = \mathbf{e}_3 + \sum_{j=1}^N \frac{\mathbf{s}_{j,\vec{z}}}{x-z_j} + \sum_{j=1}^N \frac{\overline{\mathbf{s}}_{j,\vec{z}}}{x- \overline{z}_j} \, \end{align*} $$

with some $\mathbf {s}_{j,\vec {z}} \in \mathbb {C}^3 \setminus \{ 0 \}$ . Moreover, we have that

$$ \begin{align*}\mathbf{s}_{j,\vec{z}} = \mathbf{s}_{v_j} + O(\varepsilon) \quad \text{for } j=1, \ldots, N \, , \end{align*} $$

where $\mathbf {s}_{v_j} \in \mathbb {C}^3 \setminus \{0\}$ is given by (C.6) with $v=v_j$ .

Proof. We arrange the proof into the following steps.

Step 1. Let $z_1, \ldots , z_N \in \mathbb {C}_-$ be pairwise distinct with $\mathrm {Im} \, z_j = -1$ for $j=1, \ldots , N$ and set $\varepsilon = 1/{\min _{j \neq k}|z_k-z_j|}> 0$. We denote $\vec {z} = (z_1, \ldots , z_N)$. For $\varepsilon \in (0, \varepsilon _0)$ with $\varepsilon _0=\varepsilon _0(N)> 0$ chosen below, we need to find $\mathbf {s}_{1, \vec {z}}, \ldots , \mathbf {s}_{N, \vec {z}} \in \mathbb {C}^3 \setminus \{ 0 \}$ such that the following nonlinear constraints are satisfied:

(C.7) $$ \begin{align} \mathbf{s}_{j,\vec{z}} \cdot \mathbf{s}_{j,\vec{z}} = 0 \quad \text{for } j=1, \ldots, N \, , \end{align} $$
(C.8) $$ \begin{align} \mathbf{s}_{j,\vec{z}} \cdot \left ( \mathbf{e}_3 + \sum_{k \neq j}^N \frac{\mathbf{s}_{k,\vec{z}}}{z_j - z_k} + \sum_{k=1}^N \frac{\overline{\mathbf{s}}_{k,\vec{z}}}{z_j-\overline{z}_k} \right ) = 0 \quad \text{for } j=1, \ldots, N \,. \end{align} $$

In fact, these conditions follow simply by a partial fraction expansion for the constraint $\mathbf {u}_{\vec {z}}(x) \cdot \mathbf {u}_{\vec {z}}(x) = 1$ with our ansatz for $\mathbf {u}_{\vec {z}}(x)$ stated above. As for (C.7), we recall from [Reference Berntson, Klabbers and Langmann3][Lemma B.1] the algebraic fact that any $\mathbf {s} \in \mathbb {C}^3 \setminus \{ 0 \}$ with $\mathbf {s} \cdot \mathbf {s} = 0$ can be written as

$$ \begin{align*}\mathbf{s} = s ( \mathbf{n}_1 + \mathrm{i} \mathbf{n}_2) \end{align*} $$

with a complex number $s \in \mathbb {C}^*$ and real unit vectors $\mathbf {n}_1, \mathbf {n}_2 \in \mathbb {S}^2$ such that $\mathbf {n}_1 \cdot \mathbf {n}_2 = 0$. In fact, this representation is unique modulo $U(1)$-rotations in the plane spanned by $\mathbf {n}_1$ and $\mathbf {n}_2$ with a corresponding phase rotation of $s$.

Next, we define the vectors

$$ \begin{align*}\mathbf{s}_j:= \mathbf{s}_{v_j} \in \mathbb{C}^3 \setminus \{0 \} \text{ given by (C.6) with } v=v_j \end{align*} $$

and we fix corresponding real unit vectors $\mathbf {n}_{j,1}, \mathbf {n}_{j,2} \in \mathbb {S}^2$ as defined in (C.2) with $v=v_j \in (-1,1)$ . Thus we have

$$ \begin{align*}\mathbf{s}_{j} = s_j ( \mathbf{n}_{j,1} + \mathrm{i} \mathbf{n}_{j,2}) \end{align*} $$

with some complex numbers $s_j \in \mathbb {C}^*$ to be determined for $j=1, \ldots , N$ .

For the vectors $\mathbf {s}_{j,\vec {z}}$ to be found, we make the ansatz

$$ \begin{align*}\mathbf{s}_{j,\vec{z}} = s_{j,\vec{z}} ( \mathbf{n}_{j,1} + \mathrm{i} \mathbf{n}_{j,2}) \quad \text{with } s_{j,\vec{z}} \in \mathbb{C}^* \,. \end{align*} $$

Note that the vectors $\mathbf {n}_{j,1}$ and $\mathbf {n}_{j,2}$ are fixed and only depend on $v_j$ but not on the poles $(z_1, \ldots , z_N)$ . Clearly, the first set of constraints (C.7) is automatically satisfied by our ansatz for $\mathbf {s}_{j,\vec {z}}$ . Thus we only need to show how to solve (C.8) in the rest of the proof, provided that the constant $\varepsilon _0 =\varepsilon _0(N) \ll 1$ is sufficiently small.

Step 2. In order to solve (C.8), we devise an iteration scheme, inspired by the discussion in [Reference Berntson, Klabbers and Langmann3], as follows. First, let us write (C.8) as

$$ \begin{align*}\mathbf{s}_{j,\vec{z}} \cdot \left ( \mathbf{m}_{j,\vec{z}} + \frac{\overline{\mathbf{s}}_{j,\vec{z}}}{z_j-\overline{z}_j} \right ) = 0 \quad \text{with} \quad \mathbf{m}_{j,\vec{z}}:= \mathbf{e}_3 + \sum_{k \neq j}^N \left ( \frac{\overline{\mathbf{s}}_{k,\vec{z}}}{z_j - \overline{z}_k} + \frac{\mathbf{s}_{k,\vec{z}}}{z_j - z_k} \right ) \,. \end{align*} $$

If we recall that $\mathbf {s}_{j,\vec {z}} = s_{j,\vec {z}}(\mathbf {n}_{j,1} + \mathrm {i} \mathbf {n}_{j,2})$ and $z_j-\overline {z}_j=-2 \mathrm {i}$ , we find the equation

$$ \begin{align*}s_{j,\vec{z}} (\mathbf{n}_{j,1} + \mathrm{i} \mathbf{n}_{j,2}) \cdot \left ( \mathbf{m}_{j,\vec{z}} - \frac{\overline{s}_{j,\vec{z}}(\mathbf{n}_{j,1} - \mathrm{i} \mathbf{n}_{j,2})}{2 \mathrm{i}} \right ) = s_{j,\vec{z}} \left ( (\mathbf{n}_{j,1} + \mathrm{i} \mathbf{n}_{j,2}) \cdot \mathbf{m}_{j,\vec{z}} + \mathrm{i} \overline{s}_{j,\vec{z}} \right ) = 0, \end{align*} $$

which has the unique nontrivial solution

$$ \begin{align*}s_{j,\vec{z}} = -\mathrm{i} (\mathbf{n}_{j,1} - \mathrm{i} \mathbf{n}_{j,2}) \cdot \overline{\mathbf{m}}_{j,\vec{z}} \,. \end{align*} $$

Since $\mathbf {m}_{j,\vec {z}}$ does not depend on $\mathbf {s}_{j,\vec {z}}$ , this suggests the following iteration scheme: If $s_{j,\vec {z}}^{(n)}$ is given, we define the next iterate $s_{j,\vec {z}}^{(n+1)}$ by

$$ \begin{align*}s_{j,\vec{z}}^{(n+1)} := -\mathrm{i} (\mathbf{n}_{j,1}- \mathrm{i} \mathbf{n}_{j,2}) \cdot \left ( \mathbf{e}_3 + \sum_{k \neq j}^N \left ( \frac{s_{k,\vec{z}}^{(n)}(\mathbf{n}_{k,1} + \mathrm{i} \mathbf{n}_{k,2})}{\overline{z}_j-z_k} + \frac{\overline{s}_{k,\vec{z}}^{(n)}(\mathbf{n}_{k,1} - \mathrm{i} \mathbf{n}_{k,2})}{\overline{z}_j - \overline{z}_k} \right ) \right ) \,. \end{align*} $$

Thus we need to solve the fixed point equation

$$ \begin{align*}\vec{s}_{\vec{z}} = F_{\vec{z}}(\vec{s}_{\vec{z}}) \end{align*} $$

with the variable $\vec {s}_{\vec {z}} = (s_{1,\vec {z}}, \ldots , s_{N,\vec {z}})$ and the given parameters $\vec {z} = (z_1, \ldots , z_N)$ , where the map $F_{\vec {z}} : \mathbb {C}^{N} \to \mathbb {C}^N$ is defined by the right-hand side of the iteration scheme above. Recalling that $s_j = -\mathrm {i} (\mathbf {n}_{j,1} - \mathrm {i} \mathbf {n}_{j,2}) \cdot \mathbf {e}_3$ from (C.5), we find

$$ \begin{align*}F_{\vec{z}}(\vec{s}_{\vec{z}}) = \vec{s} + A_{\vec{z}} (\vec{s}_{\vec{z}}) + B_{\vec{z}} (\overline{\vec{s}}_{\vec{z}}) \, , \end{align*} $$

where $\vec {s}=(s_1, \ldots , s_N) \in \mathbb {C}^N$ and $A_{\vec {z}}, B_{\vec {z}} : \mathbb {C}^N \to \mathbb {C}^N$ are linear maps with operator norms

(C.9) $$ \begin{align} \| A_{\vec{z}} \|_{\mathbb{C}^N \to \mathbb{C}^N} + \| B_{\vec{z}} \|_{\mathbb{C}^N \to \mathbb{C}^N} \leq C \varepsilon \leq C \varepsilon_0 \end{align} $$

with some constant $C> 0$ depending only on $N$. Hence, by taking $\varepsilon _0:=1/(2C)$, we see that, for any $\varepsilon \in (0,\varepsilon _0)$, the ($\mathbb {R}$-linear) map $G:=\mathrm {Id} - A_{\vec {z}} - B_{\vec {z}}(\overline {\cdot }) : \mathbb {C}^N \to \mathbb {C}^N$ is invertible by a Neumann series argument (viewing $\mathbb {C}^N \cong \mathbb {R}^{2N}$ as a real vector space). Hence $\vec {s}_{\vec {z}} = G^{-1} (\vec {s})$ is the unique solution of the fixed point equation $\vec {s}_{\vec {z}} = F_{\vec {z}}(\vec {s}_{\vec {z}})$, provided that $\varepsilon \in (0, \varepsilon _0)$ holds.
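The following minimal numerical sketch (with hypothetical sample velocities and well-separated poles; illustrative only) implements the fixed-point iteration from Step 2 and verifies that the resulting map $\mathbf{u}_{\vec{z}}$ takes values in $\mathbb{S}^2$.

```python
# Illustrative sketch of the fixed-point iteration for the coefficients s_j.
# Sample velocities v_j and well-separated poles z_j = y_j - i (epsilon small).
import numpy as np

vs = np.array([-0.5, 0.0, 0.6])                  # sample velocities
ys = np.array([-40.0, 0.0, 50.0])                # sample pole locations
zs = ys - 1j
N = len(vs)
e3 = np.array([0.0, 0.0, 1.0])
n1 = np.array([[1.0, 0.0, 0.0]] * N)                           # n_{j,1}
n2 = np.array([[0.0, v, -np.sqrt(1.0 - v**2)] for v in vs])    # n_{j,2}

# start from the one-soliton values s_j = -i (n_{j,1} - i n_{j,2}) . e3
s = np.array([-1j * ((n1[j] - 1j * n2[j]) @ e3) for j in range(N)])
for _ in range(50):
    new = np.empty(N, dtype=complex)
    for j in range(N):
        m_bar = e3.astype(complex)
        for k in range(N):
            if k != j:
                m_bar = m_bar + s[k] * (n1[k] + 1j * n2[k]) / (np.conj(zs[j]) - zs[k]) \
                              + np.conj(s[k]) * (n1[k] - 1j * n2[k]) / (np.conj(zs[j]) - np.conj(zs[k]))
        new[j] = -1j * ((n1[j] - 1j * n2[j]) @ m_bar)
    s = new

# assemble u_z and check that it takes values in S^2
svec = [s[j] * (n1[j] + 1j * n2[j]) for j in range(N)]
def u(x):
    w = e3.astype(complex) + sum(svec[j] / (x - zs[j]) + np.conj(svec[j]) / (x - np.conj(zs[j]))
                                 for j in range(N))
    return w.real

xs = np.linspace(-200.0, 200.0, 4001)
print(max(abs(u(x) @ u(x) - 1.0) for x in xs))   # close to machine precision
```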

Step 3. It remains to show that

$$ \begin{align*}\mathbf{s}_{j,\vec{z}} = \mathbf{s}_{j} + O(\varepsilon) \quad \text{for } j=1, \ldots, N. \end{align*} $$

Since $\mathbf {s}_{j,\vec {z}} = s_{j,\vec {z}} ( \mathbf {n}_{j,1}+\mathrm {i} \mathbf {n}_{j,2})$ with vectors $\mathbf {n}_{j,1}, \mathbf {n}_{j,2}$ independent of $\varepsilon $ , this claim is equivalent to proving that

$$ \begin{align*}\vec{s}_{\vec{z}} = \vec{s} + O(\varepsilon) \end{align*} $$

with the notation from Step 2. But from the fixed point equation and estimate (C.9) we readily find

$$ \begin{align*}\| \vec{s}_{\vec{z}} - \vec{s} \|_{\mathbb{C}^N} = \| F_{\vec{z}}(\vec{s}_{\vec{z}}) - \vec{s} \|_{\mathbb{C}^N} \leq C \varepsilon \,. \end{align*} $$

with some constant $C=C(N)> 0$ . Furthermore, since $s_j \neq 0$ for all $j =1, \ldots N$ , we conclude that $\mathbf {s}_{j,\vec {z}} \neq 0$ for all $j=1, \ldots , N$ , provided that $\varepsilon \in (0, \varepsilon _0)$ with $\varepsilon _0=\varepsilon _0(N)> 0$ sufficiently small.

The proof of Lemma C.1 is now complete.

With the help of Lemma C.1 we are now able to prove the following result. Recall that $\mathbf {U} = \mathbf {u} \cdot \boldsymbol {\sigma }$ for a map $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ .

Lemma C.2. For any integer $N \geq 0$ , there exists a rational map $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ with exactly N simple poles such that the discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}}^2)$ is simple.

Remark. Recall that, by self-adjointness of $T_{\mathbf {U}}$ , the simplicity of $\sigma _{\mathrm {d}}(T_{\mathbf {U}}^2)$ implies that $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ is simple.

Proof. For $N=0$ , this is trivially true by taking the constant function $\mathbf {u}(x) \equiv \mathbf {e}_3$ and noticing that $\mathrm {Rank}(K_{\mathbf {U}})=0$ and hence $\sigma _{\mathrm {d}}(T_{\mathbf {U}}) = \emptyset $ . If $N=1$ , we take the stationary solution (i.e., a half-harmonic map)

$$ \begin{align*}\mathbf{u}(x) = \left (0, \frac{2x}{x^2+1}, \frac{x^2-1}{x^2+1} \right ) \in \mathcal{R}at(\mathbb{R}; \mathbb{S}^2) \, , \end{align*} $$

which has $\mathrm {Rank}(K_{\mathbf {U}}) = 1$ with simple discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}}) = \{0 \}$ . Hence it remains to discuss the case $N \geq 2$ , which will be proved in the following steps.

Step 1. Assume $N \geq 2$ in what follows. Let $z_1, \ldots , z_N \in \mathbb {C}_-$ and $v_1, \ldots , v_N \in (-1,1)$ be as in Lemma C.1 with the additional assumption that

$$ \begin{align*}v_j \neq v_k \quad \text{for } j \neq k \,. \end{align*} $$

Consider the rational map $\mathbf {u}_{\vec {z}} : \mathbb {R} \to \mathbb {S}^2$ given by Lemma C.1 with $\varepsilon = 1/\min _{j \neq k} |z_j-z_k| \in (0, \varepsilon _0)$ , where $\varepsilon _0=\varepsilon _0(N)> 0$ denotes the small constant from Lemma C.1. In particular, the rational map $\mathbf {u}_{\vec {z}} : \mathbb {R} \to \mathbb {S}^2$ has exactly N simple poles.

Note that the rational matrix-valued function $\mathbf {U}_{\vec {z}} = \mathbf {u}_{\vec {z}} \cdot \boldsymbol {\sigma }$ is given by

$$ \begin{align*}\mathbf{U}_{\vec{z}}(x) = \sigma_3 + \sum_{j=1}^N \frac{A_{j,\vec{z}}}{x-z_j} + \sum_{j=1}^N \frac{A_{j,\vec{z}}^*}{x-\overline{z}_j} \, , \end{align*} $$

with the nonzero matrices $A_{j,\vec {z}} \in \mathbb {C}^{2 \times 2}$ given by $A_{j,\vec {z}} := \mathbf {s}_{j,\vec {z}} \cdot \boldsymbol {\sigma }$ . Note that $A_{j,\vec {z}}^2 = 0$ which follows from $\mathbf {s}_{j,\vec {z}} \cdot \mathbf {s}_{j,\vec {z}}=0$ . Thus the nilpotent matrices $A_{j,\vec {z}} \in M_2(\mathbb {C})$ have rank one and we can write

$$ \begin{align*}A_{j, \vec{z}} = e_{j,\vec{z}} \langle \cdot, \xi_{j,\vec{z}} \rangle_{\mathbb{C}^2} \end{align*} $$

with some nonzero vectors $e_{j,\vec {z}}, \xi _{j,\vec {z}} \in \mathbb {C}^2 \setminus \{ 0 \}$ such that

$$ \begin{align*}\|e_{j,\vec{z}} \|_{\mathbb{C}^2} = 1 \quad \text{and} \quad \langle e_{j,\vec{z}}, \xi_{j, \vec{z}} \rangle_{\mathbb{C}^2} = 0 \,. \end{align*} $$

Note that $\mathrm {span} \{ e_{j,\vec {z}} \} = \mathrm {ran}(A_{j, \vec {z}})$ and we readily check that

$$ \begin{align*}\mathfrak{H}_1 = \mathrm{ran}(K_{\mathbf{U}_{\vec{z}}}) = \mathrm{ran}(H^*_{\mathbf{U}_{\vec{z}}}) = \mathrm{span} \left \{ \frac{e_{j,\vec{z}}}{x-z_j} \mid j=1, \ldots, N \right \} \, \end{align*} $$

with the operator $K_{\mathbf {U}_{\vec {z}}} = H^*_{\mathbf {U}_{\vec {z}}} H_{\mathbf {U}_{\vec {z}}} : L^2_+(\mathbb {R}; \mathbb {C}^2) \to L^2_+(\mathbb {R}; \mathbb {C}^2)$ .

Step 2. For later use, we recall that the constraint equations (C.7) and (C.8) can be rephrased in terms of matrix-valued functions as follows:

(C.10) $$ \begin{align} A_{j,{\vec{z}}}^2 = 0, \quad B_{j,\vec{z}} A_{j,\vec{z}} + A_{j,\vec{z}} B_{j,\vec{z}}= 0 \end{align} $$

for all $j=1, \ldots , N$ , where we define the complex $2 \times 2$ -matrices

(C.11) $$ \begin{align} B_{j,\vec{z}} := \sigma_3 + \sum_{k\neq j}^N \frac{A_{k,\vec{z}}}{z_j - z_k} + \sum_{k =1}^N \frac{A_{k,\vec{z}}^*}{z_j - \overline{z}_k} \,. \end{align} $$

Because of $A_{j,\vec {z}} e_{j,\vec {z}}=0$ and by (C.10), we see that $A_{j,\vec {z}} B_{j,\vec {z}} e_{j,\vec {z}}=0$ . Since $\mathrm {ker}(A_{j,\vec {z}}) = \mathrm {span} \{ e_{j,\vec {z}} \}$ , we deduce

$$ \begin{align*}B_{j,\vec{z}} e_{j,\vec{z}} = b_{j,\vec{z}} e_{j,\vec{z}} \, \end{align*} $$

with some eigenvalue $b_{j,\vec {z}} \in \mathbb {C}$ . Since $\| e_{j,\vec {z}} \|_{\mathbb {C}^2}=1$ , the eigenvalue $b_{j,\vec {z}}$ is evidently given by

(C.12) $$ \begin{align} b_{j,\vec{z}} & = \langle B_{j,\vec{z}} e_{j,\vec{z}}, e_{j,\vec{z}} \rangle_{\mathbb{C}^2} \nonumber \\ & = \langle \sigma_3 e_{j,\vec{z}}, e_{j,\vec{z}} \rangle_{\mathbb{C}^2} + \sum_{k \neq j}^N \left ( \frac{\langle A_{k,\vec{z}} e_{j,\vec{z}}, e_{j,\vec{z}} \rangle_{\mathbb{C}^2}}{z_j - z_k} + \frac{\langle A_{k,\vec{z}}^* e_{j,\vec{z}}, e_{j,\vec{z}} \rangle_{\mathbb{C}^2}}{z_j - \overline{z}_k} \right ) , \end{align} $$

where we also used the simple fact that $\langle A_{j,\vec {z}}^* e_{j,\vec {z}}, e_{j,\vec {z}} \rangle _{\mathbb {C}^2} =0$ because of $A_{j,\vec {z}} e_{j,\vec {z}} =0$ . Next, by partial fraction decomposition and $A_{j,\vec {z}} e_{j,\vec {z}} = 0$ , we obtain

$$ \begin{align*} T_{\mathbf{U}_{\vec{z}}} \left ( \frac{e_{j, \vec{z}}}{x-z_j} \right ) & = \Pi_+ \left [ \left ( \sigma_3 + \sum_{k=1}^N \frac{A_{k, \vec{z}}}{x-z_k} + \sum_{k=1}^N \frac{A_{k,\vec{z}}^*}{x-\overline{z}_k} \right ) \frac{e_{j,\vec{z}}}{x-z_j} \right ]\\ &= \frac{\sigma_3 e_{j,\vec{z}}}{x-z_j} + \sum_{k \neq j}^N \frac{A_{k,\vec{z}} e_{j,\vec{z}}}{(x-z_k)(x-z_j)} + \sum_{k=1}^N \frac{A_{k,\vec{z}}^* e_{j,\vec{z}}}{(z_j-\overline{z}_k) (x-z_j)} \\ & = \left ( \sigma_3 + \sum_{k \neq j}^N \frac{A_{k,\vec{z}}}{z_j - z_k} + \sum_{k=1}^N \frac{A_{k,\vec{z}}^*}{z_j - \overline{z}_k} \right ) \frac{e_{j,\vec{z}}}{x-z_j} +\sum_{k \neq j}^N \frac{A_{k,\vec{z}} e_{j,\vec{z}}}{(z_k -z_j)(x-z_k)} \\ & = \frac{B_{j,\vec{z}} e_{j,\vec{z}}}{x-z_j} + \sum_{k \neq j}^N \frac{A_{k,\vec{z}} e_{j,\vec{z}}}{(z_k-z_j)(x-z_k)} = \frac{b_{j,\vec{z}} e_{j,\vec{z}}}{x-z_j} + \sum_{k \neq j}^N \frac{A_{k,\vec{z}} e_{j,\vec{z}}}{(z_k-z_j)(x-z_k)} \,. \end{align*} $$

for any $j=1, \ldots , N$ and with the eigenvalues $b_{j,\vec {z}}$ from above. Let $\mathsf {T} \in \mathbb {C}^{N \times N}$ denote the matrix of $T_{\mathbf {U}_{\vec {z}}} : \mathfrak {H}_1 \to \mathfrak {H}_1$ with respect to the basis $\mathcal {B} = \left ( \frac {e_{1,\vec {z}}}{x-z_1}, \ldots , \frac {e_{N,\vec {z}}}{x-z_N} \right )$ . Since $\| A_{j,\vec {z}} \|_{\mathbb {C}^2 \to \mathbb {C}^2} \lesssim \|\mathbf {s}_{j,\vec {z}}\|_{\mathbb {C}^3} \lesssim 1$ , we see that the matrix $\mathsf {T} \in \mathbb {C}^{N \times N}$ is of the form

$$ \begin{align*}\mathsf{T} = \mathrm{diag}(b_{1,\vec{z}}, \ldots, b_{N,\vec{z}}) + \mathsf{B} \end{align*} $$

with some matrix $\mathsf {B}=\mathsf {B}(z_1, \ldots , z_N, v_1, \ldots , v_N)$ such that

$$ \begin{align*}\| \mathsf{B} \|_{\mathbb{C}^N \to \mathbb{C}^N} = O(\varepsilon) \, , \end{align*} $$

where we recall that $\varepsilon = 1/\min _{j \neq k} |z_j - z_k|$ . Furthermore, from (C.12) we deduce that

$$ \begin{align*}b_{j,\vec{z}} = \langle \sigma_3 e_{j,\vec{z}}, e_{j,\vec{z}} \rangle_{\mathbb{C}^2} + O(\varepsilon) \,. \end{align*} $$

Next, we recall that $A_{j,\vec {z}} \to A_j = \mathbf {s}_j \cdot \boldsymbol {\sigma }$ as $\varepsilon \to 0$ by Lemma C.1. Notice that $\mathbf {s}_j$ is given by (C.6) with $v=v_j$ and an elementary calculation shows that $\mathrm {ran}(A_j) = \mathrm {span} \{ e_j \}$ with the unit vector

$$ \begin{align*}e_j=\frac{1}{\sqrt{2}} \left ( \begin{array}{c} \sqrt{1+v_j} \\ \mathrm{i} \sqrt{1-v_j} \end{array} \right ) \in \mathbb{C}^2 \,. \end{align*} $$

Thus we conclude that

$$ \begin{align*}\langle \sigma_3 e_{j,\vec{z}}, e_{j, \vec{z}} \rangle_{\mathbb{C}^2} \to \langle \sigma_3 e_j, e_j \rangle_{\mathbb{C}^2} = v_j \quad \text{as } \varepsilon \to 0 \, , \end{align*} $$

whence it follows that $b_{j,\vec {z}} \to v_j$ as $\varepsilon \to 0$ .
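For the reader's convenience, we note that the identity $\langle \sigma_3 e_j, e_j \rangle_{\mathbb{C}^2} = v_j$ used in the last display is immediate from the explicit formula for $e_j$ and $\sigma_3 = \mathrm{diag}(1,-1)$:

$$ \begin{align*}\langle \sigma_3 e_j, e_j \rangle_{\mathbb{C}^2} = \frac{1}{2} \left ( \big | \sqrt{1+v_j} \big |^2 - \big | \mathrm{i} \sqrt{1-v_j} \big |^2 \right ) = \frac{(1+v_j) - (1-v_j)}{2} = v_j \,. \end{align*} $$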

In summary, we have shown that the matrix $\mathsf {T} \in \mathbb {C}^{N \times N}$ for $T_{\mathbf {U}_{\vec {z}}} : \mathfrak {H}_1 \to \mathfrak {H}_1$ with respect to the basis $\mathcal {B} = \left ( \frac {e_{1,\vec {z}}}{x-z_1}, \ldots , \frac {e_{N,\vec {z}}}{x-z_N} \right )$ is of the form

$$ \begin{align*}\mathsf{T} = \mathrm{diag}(v_1, \ldots, v_N) + \mathsf{M} \end{align*} $$

with some matrix $\mathsf {M}=\mathsf {M}(z_1, \ldots , z_N, v_1, \ldots , v_N) \in \mathbb {C}^{N \times N}$ such that

$$ \begin{align*}\| \mathsf{M} \|_{\mathbb{C}^N \to \mathbb{C}^N} = O(\varepsilon) \to 0 \quad \text{as} \quad \varepsilon \to 0. \end{align*} $$

Step 3. Let

$$ \begin{align*}p_{\mathsf{T}}(z) = \det \left ( z \, \mathrm{Id} - \mathsf{T} \right ) = z^N + a_{N-1} z^{N-1} + \ldots + a_0 \end{align*} $$

denote the characteristic polynomial of $\mathsf {T} \in \mathbb {C}^{N \times N}$ . Since

$$ \begin{align*}\lim_{\varepsilon \to 0} \| \mathsf{T} - \mathrm{diag}(v_1, \ldots, v_N) \|_{\mathbb{C}^N \to \mathbb{C}^N} = 0 \, , \end{align*} $$

we deduce that $a_{k} \to c_k$ as $\varepsilon \to 0$ for all $k=0, \ldots , N-1$ , where

$$ \begin{align*}p(z) = z^N + c_{N-1} z^{N-1} + \ldots + c_0 = \prod_{j=1}^N (z-v_j) \end{align*} $$

is the characteristic polynomial of $\mathrm {diag}(v_1, \ldots , v_N)$ . Note that $p(z)$ has simple zeros due to $v_j \neq v_k$ for $j \neq k$ by assumption. Hence the eigenvalues $\{ \lambda _j \}_{j=1}^N$ of $\mathsf {T}$ are also simple, provided that $\varepsilon> 0$ is sufficiently small, and we have $\lambda _j \to v_j$ as $\varepsilon \to 0$ . Since $v_j^2 \neq v_k^2$ for $j \neq k$ by our assumption above, we also find that $\lambda _j^2 \neq \lambda _k^2$ for $j \neq k$ provided that $\varepsilon> 0$ is sufficiently small. This shows that $T_{\mathbf {U}_{\vec {z}}}^2 |_{\mathfrak {H}_1}$ has simple spectrum if $\varepsilon> 0$ is sufficiently small and, by self-adjointness of $T_{\mathbf {U}_{\vec {z}}}$ , this implies simple spectrum of $T_{\mathbf {U}_{\vec {z}}} |_{\mathfrak {H}_1}$ if $\varepsilon =1/\min _{j \neq k}|z_j-z_k|>0$ is sufficiently small. Since $\sigma _{\mathrm {d}}(T_{\mathbf {U}_{\vec {z}}}) = \sigma (T_{\mathbf {U}_{\vec {z}}}|_{\mathfrak {H}_1})$ , this completes the proof of Lemma C.2.

Remark. To conclude our discussion, let us remark that there exist rational data $\mathbf {u} : \mathbb {R} \to \mathbb {S}^2$ with nonsimple discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {U}})$ . For instance, take a solitary wave profile $\mathbf {q}_v : \mathbb {R} \to \mathbb {S}^2$ given by a Blaschke product of degree $m \geq 2$ and set $\mathbf {Q}_v = \mathbf {q}_v \cdot \boldsymbol {\sigma }$ . Then it is easy to see that the Toeplitz operator $T_{\mathbf {Q}_v} : L^2_+(\mathbb {R}; \mathbb {C}^2) \to L^2_+(\mathbb {R}; \mathbb {C}^2)$ has discrete spectrum $\sigma _{\mathrm {d}}(T_{\mathbf {Q}_v}) = \{ v \}$ , where the eigenvalue $v$ is $m$-fold degenerate.

D. Local well-posedness

In this section, we prove local well-posedness for (HWM$_d$) for sufficiently regular initial data as stated in Lemma 5.1. Also, we will show well-posedness for the initial-value problem formulated in (3.3) above.

Proof of Lemma 5.1

Let $s> \frac {3}{2}, d \geq 2$ and assume that $\mathbf {U}_0 : \mathbb {R} \to M_d(\mathbb {C})$ is of the form

$$ \begin{align*}\mathbf{U}_0(x) = \mathbf{U}_\infty + \mathbf{V}_0(x) \in M_d(\mathbb{C}) \oplus H^s(\mathbb{R}; M_d(\mathbb{C})) \equiv H^s_\bullet(\mathbb{R}; M_d(\mathbb{C})) \, , \end{align*} $$

satisfying the pointwise constraints

Note that $\mathbf {U}_\infty \in M_d(\mathbb {C})$ is a constant matrix with $\mathbf {U}_\infty =\mathbf {U}_\infty ^*$ .

Now, for $R> 0$ given and assuming that $\| \mathbf {V}_0 \|_{H^s} < R$ , we wish to prove existence and uniqueness of the solution

$$ \begin{align*}\mathbf{U}(t) = \mathbf{U}_\infty + \mathbf{V}(t) \in M_d(\mathbb{C}) \oplus C([0,T]; H^s(\mathbb{R}; M_d(\mathbb{C}))) \, , \end{align*} $$

of (HWM$_d$) with initial datum $\mathbf {U}(0) = \mathbf {U}_0$ , where $T=T(R)> 0$ is chosen sufficiently small. Once this solution is constructed, it is elementary to check that $\mathbf {U}(t,x)$ satisfies the pointwise constraints above for all $x \in \mathbb {R}$ and times $t \in [0,T]$ . Furthermore, as explained before Lemma 5.1 above, we deduce that $\mathbf {U}(t,x) \in \mathsf {Gr}_k(\mathbb {C}^d)$ for $(t,x) \in [0,T] \times \mathbb {R}$ with some integer $0 \leq k \leq d$ .

Step 1 (Setup). To deal with the quasilinear equation (HWM$_d$), we use the following iteration scheme. Suppose we are given an initial datum

(D.1) $$ \begin{align} \mathbf{U}_0 = \mathbf{U}_\infty + \mathbf{V}_0 \in M_d(\mathbb{C}) \oplus H^s(\mathbb{R}; M_d(\mathbb{C})) \end{align} $$

with values in the Hermitian $d \times d$ -matrices, that is, we assume

(D.2) $$ \begin{align} \mathbf{U}_0(x) = \mathbf{U}_0(x)^* \quad \text{for } x \in \mathbb{R} \, , \end{align} $$

and with some constant Hermitian matrix $\mathbf {U}_\infty = \mathbf {U}_\infty ^* \in M_d(\mathbb {C})$ . Note that $\mathbf {V}_0(x)=\mathbf {V}_0(x)^*$ must be Hermitian valued, too.

Now, let $R> 0$ be arbitrary and let $T=T(R)> 0$ to be chosen later. We construct the sequence

$$ \begin{align*}\mathbf{U}^{(n)}= \mathbf{U}_\infty + \mathbf{V}^{(n)} \in M_d(\mathbb{C}) \oplus C([0,T]; H^s(\mathbb{R}; M_d(\mathbb{C}))) \quad \text{with } n \in \mathbb{N} \end{align*} $$

by means of the iteration scheme

(D.3) $$ \begin{align} \partial_t \mathbf{U}^{(n+1)} = -\frac{\mathrm{i}}{2} [\mathbf{U}^{(n)}, |D| \mathbf{U}^{(n+1)}] \quad \text{for } t \in [0,T], \quad \mathbf{U}^{(n+1)}(0) = \mathbf{U}_0 \end{align} $$

and we take $\mathbf {U}^{(0)}(t) \equiv \mathbf {U}_0$ . It is straightforward to show that, given $\mathbf {U}^{(n)} \in M_d(\mathbb {C}) \oplus C([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C}))$ , there exists indeed a unique (Hermitian-valued) solution

$$ \begin{align*}\mathbf{U}^{(n+1)} = \mathbf{U}_\infty + \mathbf{V}^{(n+1)} \in M_d(\mathbb{C}) \oplus C([0,T]; H^s(\mathbb{R}; M_d(\mathbb{C}))) \end{align*} $$

of (D.3) with initial datum $\mathbf {U}^{(n+1)}(0) = \mathbf {U}_0$ ; see Lemma D.1 below and its proof for details. Also, since $\mathbf {U} = \mathbf {U}_\infty + \mathbf {V}$ with the constant matrix $\mathbf {U}_\infty \in M_d(\mathbb {C})$ , we have

$$ \begin{align*}\partial_t \mathbf{V}^{(n+1)} = -\frac{\mathrm{i}}{2} [\mathbf{U}^{(n)}, |D| \mathbf{V}^{(n+1)}] \quad \text{for } t \in [0,T], \quad \mathbf{V}^{(n+1)}(0) = \mathbf{V}_0 \,. \end{align*} $$
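For completeness, let us sketch why every iterate is indeed Hermitian-valued, as claimed. If $\mathbf{U}^{(n)}$ is Hermitian-valued, then, since $|D|$ commutes with taking pointwise Hermitian adjoints, we find

$$ \begin{align*}\partial_t \big ( \mathbf{U}^{(n+1)} \big )^* = \frac{\mathrm{i}}{2} \big [ \mathbf{U}^{(n)}, |D| \mathbf{U}^{(n+1)} \big ]^* = -\frac{\mathrm{i}}{2} \big [ \mathbf{U}^{(n)}, |D| \big ( \mathbf{U}^{(n+1)} \big )^* \big ] \,. \end{align*} $$

Hence $(\mathbf{U}^{(n+1)})^*$ solves the same linear initial-value problem (D.3) with the same Hermitian datum $\mathbf{U}_0$, so the uniqueness part of Lemma D.1 yields $(\mathbf{U}^{(n+1)})^* = \mathbf{U}^{(n+1)}$; the claim follows by induction starting from $\mathbf{U}^{(0)} \equiv \mathbf{U}_0$.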

Step 2 (Bounds). We assume that $\| \mathbf {V}_0 \|_{H^s} < R$ holds. We claim that the following a priori bound holds

(D.4) $$ \begin{align} \sup_{t \in [0,T]} \| \mathbf{V}^{(n)}(t) \|_{H^s} \leq 2 \| \mathbf{V}_0 \|_{H^s} \quad \text{for all } n \in \mathbb{N}\, , \end{align} $$

provided that $T=T(R)> 0$ is chosen sufficiently small.

We prove the bound (D.4) as follows. We use $\langle D \rangle ^s$ to denote the regularized fractional derivative of order s given by $\widehat {(\langle D \rangle ^s f)}(\xi ) = (1+|\xi |^2)^{s/2} \widehat {f}(\xi )$ . Omitting the dependence on t for notational convenience, we find (where the assumed regularity suffices to justify the following manipulations):

$$ \begin{align*} \frac{d}{dt} \big \| \langle D \rangle^s \mathbf{V}^{(n+1)} \big \|_{L^2}^2 & = 2 \mathrm{Re} \left \langle \langle D \rangle^s \partial_t \mathbf{V}^{(n+1)}, \langle D \rangle^s \mathbf{V}^{(n+1)} \right \rangle \\ & = \mathrm{Im} \left \langle \langle D \rangle^s [\mathbf{U}^{(n)}, |D| \mathbf{V}^{(n+1)}], \langle D \rangle^s \mathbf{V}^{(n+1)} \right \rangle \\ & = \mathrm{Im} \left \langle [\mathbf{U}^{(n)}, |D| \langle D \rangle^s \mathbf{V}^{(n+1)}], \langle D \rangle^s \mathbf{V}^{(n+1)} \right \rangle \\ & \quad + \mathrm{Im} \left \langle [\langle D \rangle^s, \mathbf{U}^{(n)}] |D| \mathbf{V}^{(n+1)}, \langle D \rangle^s \mathbf{V}^{(n+1)} \right \rangle =: I + II \,. \end{align*} $$

Here we also used the trivial fact that $|D|$ and $\langle D \rangle ^s$ commute. Next, we assert that the term I can be written in exact commutator form with

(D.5) $$ \begin{align} I = \frac{1}{2} \, \mathrm{Im} \left \langle \big [ [\mathbf{U}^{(n)}, \cdot], |D|] \langle D \rangle^s \mathbf{V}^{(n+1)}, \langle D \rangle^s \mathbf{V}^{(n+1)} \right \rangle \,. \end{align} $$

Here $[\mathbf {U}, \cdot ] \mathbf {F} \equiv \mathbf {U} \mathbf {F} - \mathbf {F} \mathbf {U}$ denotes the pointwise matrix-commutator for matrix-valued functions $\mathbf {U}, \mathbf {F} : \mathbb {R} \to M_d(\mathbb {C})$ . To see that (D.5) holds true, let us write $\mathbf {U} = \mathbf {U}^{(n)}$ and $\mathbf {W} = \langle D \rangle ^s \mathbf {V}^{(n+1)}$ for the moment. Then

$$ \begin{align*} I & = \mathrm{Im} \left \langle [\mathbf{U}, |D| \mathbf{W}], \mathbf{W} \right \rangle = \mathrm{Im} \left \langle \big [ [\mathbf{U}, \cdot], |D|] \mathbf{W}, \mathbf{W} \right \rangle + \mathrm{Im} \left \langle |D| [\mathbf{U}, \mathbf{W}], \mathbf{W} \right \rangle \\ & = \mathrm{Im} \left \langle \big [ [\mathbf{U}, \cdot], |D|] \mathbf{W}, \mathbf{W} \right \rangle + \mathrm{Im} \left \langle \mathbf{W}, [\mathbf{U}, |D| \mathbf{W}] \right \rangle = \mathrm{Im} \left \langle \big [ [\mathbf{U}, \cdot], |D|] \mathbf{W}, \mathbf{W} \right \rangle - I \, , \end{align*} $$

where in the second-to-last step we used that $|D| = |D|^*$ is symmetric together with the fact $\mathrm {Tr}([\mathbf {U}, \mathbf {A}] \mathbf {B}^*) = \mathrm {Tr}(\mathbf {A} [\mathbf {U}, \mathbf {B}]^*)$ for matrix-valued functions $\mathbf {U}, \mathbf {A}, \mathbf {B} : \mathbb {R} \to M_d(\mathbb {C})$ provided that $\mathbf {U}= \mathbf {U}^*$ is Hermitian. This proves (D.5).
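For the reader's convenience, the trace identity invoked above is readily checked using cyclicity of the trace and the assumption $\mathbf{U} = \mathbf{U}^*$:

$$ \begin{align*}\mathrm{Tr}([\mathbf{U}, \mathbf{A}] \mathbf{B}^*) = \mathrm{Tr}(\mathbf{A} \mathbf{B}^* \mathbf{U}) - \mathrm{Tr}(\mathbf{A} \mathbf{U} \mathbf{B}^*) = \mathrm{Tr} \big ( \mathbf{A} \, ( \mathbf{B}^* \mathbf{U}^* - \mathbf{U}^* \mathbf{B}^* ) \big ) = \mathrm{Tr}(\mathbf{A} [\mathbf{U}, \mathbf{B}]^*) \,. \end{align*} $$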

Next, by a classical commutator estimate due to Calderón applied to (D.5) and recalling that $\partial _x \mathbf {U}^{(n)} = \partial _x \mathbf {V}^{(n)}$ , we deduce

$$ \begin{align*}|I| \leq C \| \partial_x \mathbf{V}^{(n)} \|_{L^\infty} \| \langle D \rangle^s \mathbf{V}^{(n+1)} \|_{L^2}^2 \leq C \| \langle D \rangle^s \mathbf{V}^{(n)} \|_{L^2} \| \langle D \rangle^s \mathbf{V}^{(n+1)} \|_{L^2}^2 \, , \end{align*} $$

where in the last step we used the Sobolev inequality $\| \partial _x f \|_{L^\infty } \leq C \| \langle D \rangle ^{s-1} \partial _x f \|_{L^2} \leq C \| \langle D \rangle ^{s} f \|_{L^2}$ , since $H^{s-1}(\mathbb {R}) \subset L^\infty (\mathbb {R})$ thanks to $s> \frac {3}{2}$ .

To estimate the second term $II$ above, we use Cauchy–Schwarz and apply the classical Kato–Ponce commutator estimate to $[\langle D \rangle ^s, \mathbf {U}^{(n)}] = [\langle D \rangle ^s, \mathbf {V}^{(n)}]$ . This yields

$$ \begin{align*} |II| & \leq \| [\langle D \rangle^s, \mathbf{V}^{(n)}] |D| \mathbf{V}^{(n+1)} \|_{L^2} \| \langle D \rangle^s \mathbf{V}^{(n+1)} \|_{L^2} \\ & \leq C ( \| \langle D \rangle^s \mathbf{V}^{(n)} \|_{L^2} \| |D| \mathbf{V}^{(n+1)} \|_{L^\infty} + \| \partial_x \mathbf{V}^{(n)} \|_{L^\infty} \| \langle D \rangle^{s-1} |D| \mathbf{V}^{(n+1)} \|_{L^2} ) \| \langle D \rangle^s \mathbf{V}^{(n+1)} \|_{L^2} \\ & \leq C \| \langle D \rangle^s \mathbf{V}^{(n)} \|_{L^2} \| \langle D \rangle^s \mathbf{V}^{(n+1)} \|_{L^2}^2 \, , \end{align*} $$

where in the last step we used again the Sobolev inequalities $\| \partial _x \mathbf {V}^{(n)} \|_{L^\infty } \leq C \| \langle D \rangle ^s \mathbf {V}^{(n)} \|_{L^2}$ and $\| |D| \mathbf {V}^{(n+1)} \|_{L^\infty } \leq C \| \langle D \rangle ^s \mathbf {V}^{(n+1)} \|_{L^2}$ in view of $s> \frac {3}{2}$ .

Combining the estimates for I and $II$ , we obtain the differential inequality

(D.6) $$ \begin{align} \frac{d}{dt} \big \| \langle D \rangle^s \mathbf{V}^{(n+1)}(t) \big \|_{L^2}^2 \leq C \| \langle D \rangle^s \mathbf{V}^{(n)}(t) \|_{L^2} \| \langle D \rangle^s \mathbf{V}^{(n+1)}(t) \|_{L^2}^2 \,. \end{align} $$

Next we define the quantities

$$ \begin{align*}M_n(T) = \sup_{t \in [0,T]} \big \| \langle D \rangle^s \mathbf{V}^{(n)}(t) \big \|_{L^2}^2 \quad \text{with } n \in \mathbb{N} \,. \end{align*} $$

From (D.6) and Grönwall’s inequality we obtain

(D.7) $$ \begin{align} M_{n+1}(T) \leq M_0 \cdot \mathrm{e}^{C T \sqrt{M_n(T)} } \end{align} $$

since $M_0:=M_{k}(0) = \| \langle D \rangle ^s \mathbf {V}_0\|_{L^2}^2$ for all $k \in \mathbb {N}$ . Clearly, we have the bound

(D.8) $$ \begin{align} M_0 \cdot \mathrm{e}^{2CT R} \leq 4 M_0 \end{align} $$

for some sufficiently small time $T=T(R)> 0$ . From $M_0(T) = M_0 < R^2$ and (D.7)–(D.8), it follows by induction that

$$ \begin{align*}M_n(T) \leq 4M_0 \quad \text{for all } n \in \mathbb{N} \,. \end{align*} $$
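Indeed, the induction step is immediate from (D.7) and (D.8): if $M_n(T) \leq 4 M_0 < 4 R^2$ , then

$$ \begin{align*}M_{n+1}(T) \leq M_0 \cdot \mathrm{e}^{C T \sqrt{M_n(T)}} \leq M_0 \cdot \mathrm{e}^{2 C T R} \leq 4 M_0 \,. \end{align*} $$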

Since $M_0 = \| \langle D \rangle ^s \mathbf {V}_0 \|_{L^2}^2 = \| \mathbf {V}_0 \|_{H^s}^2$ , we obtain the claimed a priori bound (D.4).

Step 3 (Cauchy Property in $L^2$ ). We demonstrate that the sequence $(\mathbf {V}^{(n)})_{n \in \mathbb {N}}$ is Cauchy in $C([0,T]; L^2(\mathbb {R}; M_d(\mathbb {C})))$ , provided that $T=T(R)> 0$ is small enough. Indeed, let $n \geq 1$ be given. We find

$$ \begin{align*} \partial_t \left ( \mathbf{V}^{(n+1)} - \mathbf{V}^{(n)} \right ) & = \frac{1}{2 \mathrm{i}} \left ( [\mathbf{U}^{(n)}, |D| \mathbf{V}^{(n+1)}] - [\mathbf{U}^{(n-1)}, |D| \mathbf{V}^{(n)}] \right ) \\ & = \frac{1}{2 \mathrm{i}} \left ( [\mathbf{U}^{(n)}, |D| (\mathbf{V}^{(n+1)}- \mathbf{V}^{(n)})] + [ \mathbf{V}^{(n)}-\mathbf{V}^{(n-1)}, |D| \mathbf{V}^{(n)}] \right ) \, , \end{align*} $$

where we used the simple fact that $\mathbf {U}^{(n)}-\mathbf {U}^{(n-1)} = \mathbf {V}^{(n)}-\mathbf {V}^{(n-1)}$ . Hence we get

$$ \begin{align*} & \frac{d}{dt} \left \| \mathbf{V}^{(n+1)}- \mathbf{V}^{(n)} \right \|_{L^2}^2 = 2 \mathrm{Re} \left \langle \partial_t(\mathbf{V}^{(n+1)}- \mathbf{V}^{(n)}), \mathbf{V}^{(n+1)}- \mathbf{V}^{(n)} \right \rangle \\ &= \mathrm{Im} \left \langle [\mathbf{U}^{(n)}, |D| (\mathbf{V}^{(n+1)}-\mathbf{V}^{(n)})], \mathbf{V}^{(n+1)}-\mathbf{V}^{(n)} \right \rangle \\ & \quad + \mathrm{Im} \left \langle [\mathbf{V}^{(n)}-\mathbf{V}^{(n-1)}, |D| \mathbf{V}^{(n)}], \mathbf{V}^{(n+1)}-\mathbf{V}^{(n)} \right \rangle \\ & \leq C \| \partial_x \mathbf{V}^{(n)} \|_{L^\infty} \| \mathbf{V}^{(n+1)}- \mathbf{V}^{(n)} \|_{L^2}^2 \\ & \quad + C \| \mathbf{V}^{(n)}-\mathbf{V}^{(n-1)} \|_{L^2} \| |D| \mathbf{V}^{(n)} \|_{L^\infty} \| \mathbf{V}^{(n+1)}-\mathbf{V}^{(n)} \|_{L^2} \\ & \leq C \sqrt{K} \left ( \| \mathbf{V}^{(n+1)} - \mathbf{V}^{(n)} \|_{L^2} + \| \mathbf{V}^{(n)}-\mathbf{V}^{(n-1)} \|_{L^2} \right ) \| \mathbf{V}^{(n+1)}-\mathbf{V}^{(n)} \|_{L^2} \end{align*} $$

with the constant $K>0$ from the a priori bound (D.4) above. Since $\mathbf {V}^{(n+1)}(0)-\mathbf {V}^{(n)}(0) =0$ , we learn from Grönwall’s inequality that

$$ \begin{align*}\sup_{t \in [0,T]} \| \mathbf{V}^{(n+1)}(t)- \mathbf{V}^{(n)}(t) \|_{L^2} \leq C T \sqrt{K} \sup_{t \in [0,T]} \| \mathbf{V}^{(n)}(t) - \mathbf{V}^{(n-1)}(t) \|_{L^2} \,. \end{align*} $$
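To spell out this application of Grönwall's inequality, set $\delta_{n+1}(t) := \| \mathbf{V}^{(n+1)}(t) - \mathbf{V}^{(n)}(t) \|_{L^2}$. Dividing the previous differential inequality by $2 \delta_{n+1}$ (with a standard approximation argument at the points where $\delta_{n+1}$ vanishes) gives $\frac{d}{dt} \delta_{n+1} \leq C \sqrt{K} ( \delta_{n+1} + \delta_n )$, and since $\delta_{n+1}(0) = 0$ this yields

$$ \begin{align*}\delta_{n+1}(t) \leq C \sqrt{K} \int_0^t \mathrm{e}^{C \sqrt{K} (t-\tau)} \, \delta_n(\tau) \, d\tau \leq C \sqrt{K} \, T \, \mathrm{e}^{C \sqrt{K} T} \sup_{\tau \in [0,T]} \delta_n(\tau) \quad \text{for } t \in [0,T] \, , \end{align*} $$

which gives the stated bound after absorbing the harmless factor $\mathrm{e}^{C \sqrt{K} T}$ into the constant, recalling that $T = T(R)> 0$ is taken small.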

By choosing $T = T(R)> 0$ even smaller to ensure that $CT \sqrt {K} \leq \frac {1}{2}$ , we deduce that the series

$$ \begin{align*}\sum_{n=0}^\infty \sup_{t \in [0,T]} \| \mathbf{V}^{(n+1)}(t) - \mathbf{V}^{(n)}(t) \|_{L^2} < +\infty \end{align*} $$

is geometrically convergent. In particular, this implies that the sequence $(\mathbf {V}^{(n)})_{n \in \mathbb {N}}$ is Cauchy in $C([0,T]; L^2(\mathbb {R}; M_d(\mathbb {C})))$ .

Thanks to the a priori bound (D.4) and interpolation, this yields that $(\mathbf {V}^{(n)})_{n \in \mathbb {N}}$ forms a Cauchy sequence in $C([0,T]; H^{\tilde {s}}(\mathbb {R}; M_d(\mathbb {C})))$ for $0 \leq \tilde {s} <s$ . Moreover, we readily check that its limit

$$ \begin{align*}\mathbf{U} := \mathbf{U}_\infty + \lim_{n \to \infty} \mathbf{V}^{(n)} \in M_d(\mathbb{C}) \oplus C([0,T]; H^{\tilde{s}}(\mathbb{R}; M_d(\mathbb{C}))) \end{align*} $$

solves (HWM$_d$) with initial datum $\mathbf {U}(0)=\mathbf {U}_0$ .

Step 4 (Continuity of Flow in $H^s$ ). It remains to show that

$$ \begin{align*}\mathbf{V} \in C([0,T]; H^s(\mathbb{R}; M_d(\mathbb{C}))) \,. \end{align*} $$

Note that, by the previous discussion, we can only deduce that $\mathbf {V} \in C_{w}([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C})))$ holds, that is, for $t_n \to t$ we only have that $\mathbf {V}(t_n) \rightharpoonup \mathbf {V}(t)$ in $H^s$ . To extend this to strong continuity, we can make use of the idea of frequency envelopes, which was recently generalized as an abstract interpolation result in [Reference Alazard, Burq, Ifrim, Tataru and Zuily1].

Indeed, for real $t \geq 0$ , we introduce the Sobolev spaces $H^t_{\mathrm {H}}$ of matrix-valued maps with Hermitian values by setting

$$ \begin{align*}H^t_{\mathrm{H}} := \{ \mathbf{F} \in H^t(\mathbb{R}; M_d(\mathbb{C})) \mid \mathbf{F}(x) = \mathbf{F}(x)^* \text{ for a. e. } x \in \mathbb{R} \} \, , \end{align*} $$

equipped with the norm $\| \cdot \|_{H^t}$ . Let $B_R = \{ \mathbf {F} \in H^s_{\mathrm {H}} \mid \| \mathbf {F} \|_{H^s} < R \}$ . From Step 2 and Step 3, we obtain the map

$$ \begin{align*}\Phi : B_R \to C([0,T]; H^{0}_{\mathrm{H}}), \quad \mathbf{W}_0 \mapsto \mathbf{W} := \lim_{n \to \infty} \mathbf{W}^{(n)} \end{align*} $$

using the iteration scheme with initial data $\mathbf {U}_0 = \mathbf {U}_\infty + \mathbf {W}_0$ . Moreover, from the previous discussion, we deduce the following bounds

  1. ($B_1$) $\| \Phi (\mathbf {W}_0) - \Phi (\widetilde {\mathbf {W}}_0) \|_{C_T H^0} \leq C_0 \| \mathbf {W}_0 - \widetilde {\mathbf {W}}_0 \|_{H^0}$ for all $\mathbf {W}_0, \widetilde {\mathbf {W}}_0 \in B_R$ ,

  2. ($B_2$) $\| \Phi (\mathbf {W}_0) \|_{C_T H^{s+1}} \leq 2 \| \mathbf {W}_0 \|_{H^{s+1}}$ for all $\mathbf {W}_0 \in B_R \cap H^{s+1}_{\mathrm {H}}$ ,

with some constant $C_0> 0$ . Indeed, the weak Lipschitz estimate $(B_1)$ follows from the arguments in Step 3, whereas the bound $(B_2)$ simply follows from repeating Step 2 with $s>\frac {3}{2}$ replaced by $s+1$ and by choosing $T=T(R)> 0$ possibly even smaller. From [Reference Alazard, Burq, Ifrim, Tataru and Zuily1] we now conclude that

$$ \begin{align*}\Phi(\mathbf{V}_0) \in C([0,T]; H^s_{\mathrm{H}}) \end{align*} $$

and that we have continuous dependence of the map $\mathbf {V}_0 \mapsto \Phi (\mathbf {V}_0)$ on the initial data in $B_R$ .

Step 5 (Conclusion). Thus far we have proved local-in-time existence of solutions for (HWM$_d$) for initial data in $H^s$ with $s> \frac {3}{2}$ and satisfying the Hermitian condition (D.2). Moreover, by a direct calculation and using the regularity of the solutions, we readily check by a Grönwall-type argument that uniqueness holds in $C([0,T]; H^s)$ for a given initial datum $\mathbf {U}(0) = \mathbf {U}_0$ .

Moreover, a direct calculation (which we omit) shows that the pointwise constraints are also preserved by the flow.

Finally, the claimed propagation of higher Sobolev regularity also follows from the previous estimates. Indeed, let $\sigma> s > \frac {3}{2}$ and suppose that $\mathbf {V}_0 \in H^\sigma _{\mathrm {H}}$ . Inspecting the arguments in Step 2, we deduce that

$$ \begin{align*} \frac{d}{dt} \| \langle D \rangle^\sigma \mathbf{V}(t) \|_{L^2}^2 & \leq C \left ( \| \partial_x \mathbf{V}(t)\|_{L^\infty} + \| |D| \mathbf{V}(t) \|_{L^\infty} \right ) \| \langle D \rangle^\sigma \mathbf{V}(t) \|_{L^2}^2 \\ & \leq C \| \mathbf{V}(t) \|_{H^{s}} \|\langle D \rangle^\sigma \mathbf{V}(t) \|_{L^2}^2 \, , \end{align*} $$

where we used the Sobolev embedding $H^{s-1}(\mathbb {R}) \subset L^\infty (\mathbb {R})$ for $s> \frac {3}{2}$ . By Grönwall’s inequality, we readily deduce that the maximal times of existence of the $H^\sigma $ - and $H^s$ -solutions with $\sigma> s > \frac {3}{2}$ coincide.
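In quantitative form, Grönwall's inequality gives, on any interval $[0,T']$ on which the $H^s$-solution exists,

$$ \begin{align*}\sup_{t \in [0,T']} \| \langle D \rangle^\sigma \mathbf{V}(t) \|_{L^2}^2 \leq \| \langle D \rangle^\sigma \mathbf{V}_0 \|_{L^2}^2 \, \exp \left ( C \int_0^{T'} \| \mathbf{V}(\tau) \|_{H^s} \, d\tau \right ) \, , \end{align*} $$

so the $H^\sigma$-norm cannot blow up before the $H^s$-norm does.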

This completes the proof of Lemma 5.1.

In the proof above, we need the following auxiliary result.

Lemma D.1. Let $s> \frac {3}{2}, d \geq 2$ , and $\mathbf {U} = \mathbf {U}_\infty + \mathbf {V} \in C([0,T]; M_d(\mathbb {C}) \oplus H^s(\mathbb {R}; M_d(\mathbb {C})))$ . Then, for every $\widetilde {\mathbf {V}}_0 \in H^s(\mathbb {R}; M_d(\mathbb {C}))$ , there exists a unique solution $\widetilde {\mathbf {U}} = \mathbf {U}_\infty + \widetilde {\mathbf {V}} \in M_d(\mathbb {C}) \oplus C([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C})))$ of

$$ \begin{align*}\partial_t \widetilde{\mathbf{U}} = -\frac{\mathrm{i}}{2} [ \mathbf{U}, |D| \widetilde{\mathbf{U}}] \quad \text{on } [0,T] \text{ and } \widetilde{\mathbf{U}}(0) = \mathbf{U}_\infty + \widetilde{\mathbf{V}}_0 \,. \end{align*} $$

Moreover, if $\widetilde {\mathbf {U}}(0,x)=\widetilde {\mathbf {U}}(0,x)^*$ for all $x \in \mathbb {R}$ , then $\widetilde {\mathbf {U}}(t,x) = \widetilde {\mathbf {U}}(t,x)^*$ for all $(t,x) \in [0,T] \times \mathbb {R}$ .

Proof. Since $\mathbf {U}_\infty \in M_d(\mathbb {C})$ is a constant matrix, it suffices to show existence and uniqueness of $\widetilde {\mathbf {V}} \in C([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C})))$ solving

(D.9) $$ \begin{align} \partial_t \widetilde{\mathbf{V}} = -\frac{\mathrm{i}}{2} [\mathbf{U}, |D| \widetilde{\mathbf{V}}] \quad \text{with } \widetilde{\mathbf{V}}(0) = \widetilde{\mathbf{V}}_0 \,. \end{align} $$

We construct approximate solutions of this linear equation by the following scheme. For $\varepsilon> 0$ , we introduce the smoothing operator

$$ \begin{align*}J_\varepsilon := (1+\varepsilon |D|)^{-1} \quad \text{with } \|J_\varepsilon \|_{L^2 \to L^2} \leq 1 \text{ and } \| J_\varepsilon \|_{H^s \to H^{s+1}} \leq \varepsilon^{-1} \,. \end{align*} $$

By standard arguments, we obtain a unique solution $\widetilde {\mathbf {V}}_\varepsilon \in C([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C})))$ of the initial-value problem

$$ \begin{align*}\partial_t \widetilde{\mathbf{V}}_\varepsilon = -\frac{\mathrm{i}}{2} [\mathbf{U}, |D| J_\varepsilon \widetilde{\mathbf{V}}_\varepsilon] \quad \text{with } \widetilde{\mathbf{V}}_\varepsilon(0) = \widetilde{\mathbf{V}}_0 \, , \end{align*} $$

using that $|D| J_\varepsilon : H^s \to H^s$ is a bounded map together with the fact that $H^s(\mathbb {R})$ is an algebra for $s> \frac {3}{2}$ . Now, by adapting the arguments from the proof of Lemma 5.1 above, we derive the estimate

$$ \begin{align*}\frac{d}{dt} \| \langle D \rangle^s \widetilde{\mathbf{V}}_\varepsilon(t) \|_{L^2}^2 \leq C \left ( \big \| \big [ [\mathbf{U}, \cdot], |D|J_\varepsilon \big ] \big \|_{L^2 \to L^2} + \| \langle D \rangle^s \mathbf{V}\|_{L^2} \right ) \| \langle D \rangle^s \widetilde{\mathbf{V}}_\varepsilon \|_{L^2}^2 \end{align*} $$

using once more the Kato–Ponce estimate together with the fact that $\|\langle D \rangle ^{s-1} |D| J_\varepsilon \widetilde {\mathbf {V}}_\varepsilon \|_{L^2} \leq \| \langle D \rangle ^s \widetilde {\mathbf {V}}_\varepsilon \|_{L^2}$ . To bound the commutator term, we note that if $a=a(x)$ denotes multiplication by a Lipschitz function, then

$$ \begin{align*} [a, |D| J_\varepsilon] & = |D| [a, (1+\varepsilon |D|)^{-1}] + [a,|D|] (1+\varepsilon |D|)^{-1} \\ & = -\varepsilon |D| (1+\varepsilon |D|)^{-1} [a,|D|] (1+\varepsilon |D|)^{-1} + [a,|D|] (1+\varepsilon |D|)^{-1} \,. \end{align*} $$

Thus by Calderón’s commutator estimate and the facts $\| \varepsilon |D|(1+\varepsilon |D|)^{-1} \|_{L^2 \to L^2} \leq 1$ and $\| (1+\varepsilon |D|)^{-1} \|_{L^2 \to L^2} \leq 1$ , we deduce

$$ \begin{align*}\big \| \big [ [\mathbf{U}, \cdot], |D|J_\varepsilon \big ] \big \|_{L^2 \to L^2} \leq C \| \partial_x \mathbf{V} \|_{L^\infty} \leq C \| \langle D \rangle^s \mathbf{V} \|_{L^2} \end{align*} $$

since $s> \frac{3}{2}$ . Because of $\sup _{t \in [0,T]} \| \langle D \rangle ^s \mathbf {V}(t) \|_{L^2} < +\infty $ , integrating the previous differential inequality yields the bound

(D.10) $$ \begin{align} \sup_{t \in [0,T]} \| \langle D \rangle^s \widetilde{\mathbf{V}}_\varepsilon(t) \|_{L^2} \leq \mathrm{e}^{CT} \| \langle D \rangle^s \widetilde{\mathbf{V}}_0 \|_{L^2} \end{align} $$

which is independent of $\varepsilon>0$ . Moreover, this bound and the equation for $\widetilde {\mathbf {V}}_\varepsilon $ imply that

$$ \begin{align*}\| \partial_t \widetilde{\mathbf{V}}_\varepsilon(t) \|_{L^2} \leq C \| \mathbf{U}(t) \|_{L^\infty} \| |D| J_\varepsilon \widetilde{\mathbf{V}}_\varepsilon \|_{L^2} \leq C \left ( \| \mathbf{U}_\infty \|_{L^\infty} + \| \langle D \rangle^s \mathbf{V}(t) \|_{L^2} \right ) \| \langle D \rangle^s \widetilde{\mathbf{V}}_\varepsilon(t) \|_{L^2} \,. \end{align*} $$

Hence it follows that

$$ \begin{align*}\sup_{t \in [0,T]} \| \partial_t \widetilde{\mathbf{V}}_\varepsilon(t) \|_{L^2} \leq C \| \langle D \rangle^s \widetilde{\mathbf{V}}_0 \|_{L^2} \end{align*} $$

independent of $\varepsilon> 0$ . Thus, by standard compactness arguments (see, e.g., [Reference Cazenave7, Proposition 1.1.2]), we deduce that $(\widetilde {\mathbf {V}}_{\varepsilon _n})$ converges for some sequence $\varepsilon _n \to 0$ to some limit $\widetilde {\mathbf {V}} \in C_w([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C})))$ solving

$$ \begin{align*}\partial_t \widetilde{\mathbf{V}} = -\frac{\mathrm{i}}{2} [ \mathbf{U}, |D| \widetilde{\mathbf{V}} ] \quad \text{with } \widetilde{\mathbf{V}}(0) = \widetilde{\mathbf{V}}_0 \,. \end{align*} $$

By mimicking the arguments in the previous proof based on the abstract interpolation result of [Reference Alazard, Burq, Ifrim, Tataru and Zuily1], we actually deduce strong continuity, that is, we have $\widetilde {\mathbf {V}} \in C([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C})))$ . Uniqueness of the solution follows from a simple Grönwall argument in the same fashion as when deriving (D.10).

Finally, we remark that the conservation of the pointwise constraints follows by a direct calculation, which we omit. This completes the proof of Lemma D.1.

We conclude this section by showing existence and uniqueness for the operator-valued initial-value problem (3.3) that appears in the discussion of the Lax structure which reads

(D.11) $$ \begin{align} \partial_t \mathcal{U}(t) = B_{\mathbf{U}(t)}^+ \mathcal{U}(t) \quad \text{for } t \in [0,T], \quad \mathcal{U}(0) = \mathrm{Id} \,. \end{align} $$

for the operator-valued map $\mathcal {U} : [0,T] \to \mathcal {B}(L^2_+(\mathbb {R}; \mathcal {V}))$ . As usual, we use $\mathcal {B}(H)$ to denote the Banach space of bounded linear maps $H \to H$ with a given Hilbert space H. Recall that

(D.12) $$ \begin{align} B_{\mathbf{U}}^+ = \frac{\mathrm{i}}{2} \left ( T_{\mathbf{U}} \circ D + D \circ T_{\mathbf{U}} \right ) - \frac{\mathrm{i}}{2} T_{|D| \mathbf{U}} \end{align} $$

with $D = -\mathrm {i} \partial _x$ ; this operator is the compression of $B_{\mathbf {U}}$ to the Hardy space $L^2_+(\mathbb {R}; \mathcal {V})$ . Recall that for solutions $\mathbf {U} \in M_d(\mathbb {C}) \oplus H^s(\mathbb {R}; M_d(\mathbb {C}))$ with $s> \frac {3}{2}$ as given by Lemma 5.1, the operators $\{ B^+_{\mathbf {U}(t)} \}_{t \in [0,T]}$ are a family of (essentially) skew-adjoint operators on $L^2_+(\mathbb {R}; \mathcal {V})$ with operator domain $H^1_+(\mathbb {R};\mathcal {V}) = L^2_+(\mathbb {R}; \mathcal {V}) \cap H^1(\mathbb {R}; \mathcal {V})$ ; see also the remark below. Recall that we either take $\mathcal {V}= \mathbb {C}^d$ or $\mathcal {V} = M_d(\mathbb {C})$ equipped with their natural scalar products.

Lemma D.2. Let $s> \frac {3}{2}$ and $d \geq 2$ . Assume $\mathbf {U} = \mathbf {U}_\infty + \mathbf {V} \in M_d(\mathbb {C}) \oplus C([0,T]; H^s(\mathbb {R}; M_d(\mathbb {C})))$ is a solution given by Lemma 5.1. Then there exists a unique solution $\mathcal {U} : [0,T] \to \mathcal {B}(L^2_+(\mathbb {R}; \mathcal {V}))$ of (D.11) with the following properties.

  1. (i) The map $[0,T] \to L^2_+(\mathbb {R}; \mathcal {V})$ with $t \mapsto \mathcal {U}(t) \varphi $ is continuous for every $\varphi \in L^2_+(\mathbb {R}; \mathcal {V})$ .

  2. (ii) The equation $\partial _t \mathcal {U}(t) = B_{\mathbf {U}(t)}^+ \mathcal {U}(t)$ holds in $H^{-1}_+(\mathbb {R};\mathcal {V})$ for any $t \in [0,T]$ .

  3. (iii) $\mathcal {U}(t) : L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ is unitary for all $t \in [0,T]$ .

  4. (iv) For $\varphi \in H^1_+(\mathbb {R}; \mathcal {V}) \cap \mathrm {dom}(X^*)$ , we have $\mathcal {U}(t) \varphi \in H^1_+(\mathbb {R};\mathcal {V}) \cap \mathrm {dom}(X^*)$ for $t \in [0,T]$ .

Remark. In particular, the proof below shows that, given $\mathbf {U} = \mathbf {U}_\infty + \mathbf {V} \in M_d(\mathbb {C}) \oplus H^s(\mathbb {R}; M_d(\mathbb {C}))$ with some $s> 3/2$ satisfying $\mathbf {U}(x) = \mathbf {U}(x)^*$ for all $x \in \mathbb {R}$ , the operator $B_{\mathbf {U}}^+ : H^1_+(\mathbb {R}; \mathcal {V}) \subset L^2_+(\mathbb {R}; \mathcal {V}) \to L^2_+(\mathbb {R}; \mathcal {V})$ is essentially skew-adjoint, that is, its closure is its unique skew-adjoint extension with $(\overline{B_{\mathbf {U}}^+})^* = -\overline{B_{\mathbf {U}}^+}$ , since $B_{\mathbf {U}}^+$ is seen to generate a strongly continuous one-parameter unitary group on $L^2_+(\mathbb {R}; \mathcal {V})$ .

Proof. For notational convenience, we shall write $L^2_+, H^1_+$ and $H^{-1}_+$ for $L^2_+(\mathbb {R}; \mathcal {V})$ , $H^1_+(\mathbb {R}; \mathcal {V})$ and $H^{-1}_+(\mathbb {R}; \mathcal {V})$ , respectively.

Step 1. We first show that, for every $F_0 \in L^2_+$ , the initial-value problem

(D.13) $$ \begin{align} \partial_t F = B_{\mathbf{U}}^+ F, \quad F(0) = F_0 \end{align} $$

has a unique solution $F \in C([0,T]; L^2_+(\mathbb {R}; \mathcal {V}))$ and we have $\| F(t) \|_{L^2} = \| F_0 \|_{L^2}$ for $t \in [0,T]$ .

For $\varepsilon> 0$ , we introduce the smoothing operators

$$ \begin{align*}J_\varepsilon := (1+\varepsilon D)^{-1} : L^2_+ \to H^1_+ \quad \text{with } \| J_\varepsilon \|_{L^2 \to L^2} \leq 1 \text{ and } \| J_\varepsilon \|_{L^2 \to H^1} \leq \varepsilon^{-1} \,. \end{align*} $$

Consider the approximate initial-value problem

(D.14) $$ \begin{align} \partial_t F_\varepsilon = J_\varepsilon B_{\mathbf{U}}^+ J_\varepsilon F_\varepsilon, \quad F_\varepsilon(0) = F_0 \, , \end{align} $$

which has a unique solution $F_\varepsilon \in C^1([0,T]; L^2_+)$ by standard arguments. Since $J_\varepsilon B_{\mathbf {U}(t)} J_\varepsilon $ is a bounded skew-adjoint operator for every $t \in [0,T]$ , we readily find

$$ \begin{align*}\| F_\varepsilon(t) \|_{L^2} = \| F_0 \|_{L^2} \,. \end{align*} $$
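Indeed, since $F_\varepsilon \in C^1([0,T]; L^2_+)$ and $J_\varepsilon B_{\mathbf{U}(t)}^+ J_\varepsilon$ is bounded and skew-adjoint, we may compute

$$ \begin{align*}\frac{d}{dt} \| F_\varepsilon(t) \|_{L^2}^2 = 2 \, \mathrm{Re} \left \langle J_\varepsilon B_{\mathbf{U}(t)}^+ J_\varepsilon F_\varepsilon(t), F_\varepsilon(t) \right \rangle = 0 \,. \end{align*} $$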

By the equation, this implies that the family $(\partial _t F_\varepsilon)_{\varepsilon > 0}$ is bounded in $C([0,T]; H^{-1}_+)$ , uniformly in $\varepsilon>0$ . Hence the family $\{ F_\varepsilon \}_{\varepsilon> 0}$ is uniformly equicontinuous in $C([0,T]; H^{-1}_+)$ and uniformly bounded in $C([0,T]; L^2_+)$ . By a standard compactness argument (see, e.g., [Reference Cazenave7, Proposition 1.1.2]), we can find a suitable sequence $\varepsilon _n \to 0$ with the limit $F := \lim _{n \to \infty } F_{\varepsilon _n} \in C([0,T]; H^{-1}_+) \cap C_w([0,T]; L^2_+)$ which satisfies

(D.15) $$ \begin{align} \partial_t F = B_{\mathbf{U}} F, \quad F(0) = F_0 \,. \end{align} $$

We now claim that

(D.16) $$ \begin{align} \| F(t) \|_{L^2} = \| F_0 \|_{L^2} \quad \text{for } t \in [0,T] \,. \end{align} $$

Indeed, we calculate

(D.17) $$ \begin{align} \frac{d}{dt} \langle J_\varepsilon F(t), F(t) \rangle = 2 \mathrm{Re} \left \langle [J_\varepsilon, B_{\mathbf{U}(t)}^+] F(t), F(t) \right \rangle \,. \end{align} $$

Using that $[J_\varepsilon , D] = 0$ , $[J_\varepsilon , AB] = A[J_\varepsilon , B] + [J_\varepsilon , A]B$ and $[J_\varepsilon ,A] = -J_\varepsilon [\varepsilon D, A]J_\varepsilon $ , we find

$$ \begin{align*}[J_\varepsilon, B_{\mathbf{U}}] = -\frac{\mathrm{i}}{2} \left ( J_\varepsilon [\varepsilon D, T_{\mathbf{U}}] D J_\varepsilon + D J_\varepsilon [\varepsilon D, T_{\mathbf{U}}] J_\varepsilon \right ) - \frac{\mathrm{i}}{2} [ J_\varepsilon, T_{|D| \mathbf{U}}] =: I_\varepsilon + II_\varepsilon \,. \end{align*} $$

Next, we claim that

(D.18) $$ \begin{align} I_\varepsilon \varphi \to 0 \quad \text{for every } \varphi \in L^2_+ \text{ as } \varepsilon \to 0 \,. \end{align} $$

By Leibniz’ formula, we find

$$ \begin{align*}\| I_\varepsilon \|_{L^2 \to L^2} \leq C \varepsilon \| \partial_x \mathbf{U} \|_{L^\infty} \| J_\varepsilon \|_{L^2 \to L^2} \| D J_\varepsilon \|_{L^2 \to L^2} \leq C \varepsilon \varepsilon^{-1} = C \end{align*} $$

independent of $\varepsilon> 0$ . Furthermore, it is easy to see by dominated convergence (and taking adjoints) that $I_\varepsilon \varphi \to 0$ in $L^2_+$ as $\varepsilon \to 0$ for every $\varphi \in H^{1}_+$ . By density of $H^1_+ \subset L^2_+$ and the uniform bound $\| I_\varepsilon \|_{L^2 \to L^2} \leq C$ , we readily deduce that (D.18) holds.
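For completeness, we record the Leibniz-type identity behind the bound on $I_\varepsilon$ above. With the convention $T_{\mathbf{U}} F = \Pi_+(\mathbf{U} F)$ and recalling that $D$ commutes with $\Pi_+$, we have for every $F \in H^1_+$

$$ \begin{align*}[\varepsilon D, T_{\mathbf{U}}] F = \varepsilon \big ( \Pi_+ D (\mathbf{U} F) - \Pi_+ (\mathbf{U} D F) \big ) = \varepsilon \, \Pi_+ \big ( (D \mathbf{U}) F \big ) = \varepsilon \, T_{D \mathbf{U}} F \, , \end{align*} $$

whence $\| [\varepsilon D, T_{\mathbf{U}}] \|_{L^2 \to L^2} \leq \varepsilon \| D \mathbf{U} \|_{L^\infty} = \varepsilon \| \partial_x \mathbf{U} \|_{L^\infty}$ by density.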

$$ \begin{align*}\|II_\varepsilon \|_{L^2 \to L^2} \leq C \| J_\varepsilon \|_{L^2 \to L^2} \| |D| \mathbf{U} \|_{L^\infty} \leq C \end{align*} $$

independent of $\varepsilon> 0$ . Also, by dominated convergence (and taking adjoints) we see that $II_\varepsilon \varphi \to 0$ in $L^2_+$ as $\varepsilon \to 0$ for every $\varphi \in H^1_+$ . Again, we conclude

(D.19) $$ \begin{align} II_\varepsilon \varphi \to 0 \quad \text{for every } \varphi \in L^2_+ \text{ as } \varepsilon \to 0 \,. \end{align} $$

Going back to (D.17) and using (D.18) and (D.19), we find by integration

$$ \begin{align*}\langle J_\varepsilon F(t), F(t) \rangle = \langle J_\varepsilon F_0, F_0 \rangle + \int_0^t g_\varepsilon(\tau) \, d\tau \end{align*} $$

with $g_\varepsilon (t) \to 0$ as $\varepsilon \to 0$ for every $t \in [0,T]$ . Since also $| g_\varepsilon (t) | \leq C$ uniformly in $t$ and $\varepsilon $ , we can use dominated convergence when passing to the limit $\varepsilon \to 0$ to find

$$ \begin{align*}\langle F(t), F(t) \rangle = \langle F_0, F_0 \rangle \quad \text{for all } t \in [0,T] \, , \end{align*} $$

which is the desired identity (D.16). Finally, we also remark that conservation of the $L^2$ -norm, combined with the weak continuity $F \in C_w([0,T]; L^2_+)$ , implies the strong continuity $F \in C([0,T]; L^2_+)$ . Uniqueness of solutions in this class for the linear equation $\partial _t F = B_{\mathbf {U}} F$ directly follows from $L^2$ -conservation as well.

Step 2. We define the map $\mathcal {U} : [0,T] \to \mathcal {B}(L^2_+)$ by setting $\mathcal {U}(t) F_0 := F(t)$ for $F_0 \in L^2_+$ , where $F \in C([0,T]; L^2_+)$ is the unique solution of $\partial _t F = B_{\mathbf {U}}^+ F$ with $F(0)=F_0$ . By $L^2$ -conservation, we see that $\| \mathcal {U}(t) F_0 \|_{L^2} = \| F_0 \|_{L^2}$ and hence $\mathcal {U}(t)$ is an isometry on $L^2_+$ for any $t \in [0,T]$ . Furthermore, by a time reversal argument for the Schrödinger-type equation (D.13), we see that $\mathcal {U}(t)$ is also surjective on $L^2_+$ . Thus $\mathcal {U}(t)$ is a unitary map on $L^2_+$ for any $t \in [0,T]$ . This proves (iii), whereas the items (i) and (ii) are directly verified.

Step 3. It remains to show property (iv). For $\varphi \in H^1_+(\mathbb {R};\mathcal {V}) \cap \mathrm {dom}(X^*)$ , we can show, by using an approximation argument (whose details we omit) with the family of operators $J_\varepsilon =(1+\varepsilon D)^{-1}$ and $R_\varepsilon =(\varepsilon X^* - \mathrm {i})^{-1}$ with $\varepsilon> 0$ , that the solution $F \in C([0,T]; L^2_+)$ of $\partial _t F = B_{\mathbf {U}} F$ with $F(0)=\varphi $ satisfies $F(t) \in H^1_+(\mathbb {R}; \mathcal {V}) \cap \mathrm {dom}(X^*)$ for $t \in [0,T]$ . Since $F(t)=\mathcal {U}(t) \varphi $ this shows that (iv) holds true.

This completes the proof of Lemma D.2.

Acknowledgements

E. L. thanks Herbert Koch for valuable discussions and he thanks Yi Zhang for the kind hospitality and the opportunity to present this work in a series of talks at the Chinese Academy of Sciences, Beijing, in September 2024. Finally, we are grateful to the anonymous referee for valuable comments and suggestions that have helped improve this paper.

Competing interests

The authors have no competing interests to declare.

Funding statement

P. G. was partially supported by the French Agence Nationale de la Recherche under the ANR project ISAAC–ANR-23–CE40-0015-01. E. L. acknowledges financial support from the Swiss National Science Foundation (SNSF) under Grant No. 204121.

Footnotes

1 A priori this geometric rewriting of (HWM$_d$) would involve using the projection $P_{\mathbf {U}}$ onto the tangent space $T_{\mathbf {U}} \mathsf {Gr}_k(\mathbb {C}^d)$ , that is, we have $\partial _t \mathbf {U} = J_{\mathbf {U}} P_{\mathbf {U}} |D| \mathbf {U}$ . However, it can readily be checked that $[\mathbf {U}, (\mathrm {Id}-P_{\mathbf {U}}) B] = 0$ for Hermitian matrices $B \in M_d(\mathbb {C})$ .

2 Further below, we shall omit this distinction between $\mathbf {F} \in L^\infty (\mathbb {R}; M_d(\mathbb {C}))$ and its corresponding multiplication operator $\mu _{\mathbf {F}}$ acting on $L^2(\mathbb {R}; \mathcal {V})$ .

3 For $0 < p < 1$ , we only have that $\| \cdot \|_{\dot {B}^{1/p}_p}$ is a quasi-semi-norm, since the triangle inequality fails in this case. In our analysis, we only need the case $p=2$ .

4 The fact that $\mathrm {Tr}(\mathbf {U}_0(x)) = \text {const}.$ almost everywhere is even true for $s=1/2$ , since any integer-valued map $\mathrm {Tr}(\mathbf {U}_0) \in \dot {H}^{\frac {1}{2}}(\mathbb {R};\mathbb {R})$ necessarily satisfies $\mathrm {Tr}(\mathbf {U}_0(x)) = \text {const}.$ almost everywhere; see, for example, [Reference Brezis5].

5 In fact, we verify that $V_P= \bigcup _{j=1}^N V_j$ with the linear subspaces $V_j = \mathrm {ker} \, \ell _j$ with the linear forms $\ell _j : \mathbb {C}^N \to \mathbb {C}$ given by $\ell _j(Q) = Q_1 \xi _j(P)^{N-1} + \ldots + Q_{N-1} \xi _j(P) + Q_N$ .

6 Recall also that, by identifying a subspace $V \in \mathsf {Gr}_k(\mathbb {C}^d)$ with the orthogonal projection of $\mathbb {C}^d$ onto $V$ , we have the canonical equivalence $\mathsf {Gr}_k(\mathbb {C}^d) \cong \{ P \in \mathbb {C}^{d \times d} \mid P^* = P = P^2 \text { and } \mathrm {Tr}(P) = k \}$ in terms of self-adjoint projections $P$ on $\mathbb {C}^d$ with $\mathrm {rank}(P)=k$ .

7 This can be traced back to J. Douglas’ seminal work on the Plateau problem [Reference Douglas9].

8 In [Reference Berntson, Klabbers and Langmann3], a different sign convention for the poles $z_j$ and spin vectors $\mathbf {s}_j$ is used. The reader should be aware of this when comparing with our formulae here.

References

Alazard, T., Burq, N., Ifrim, M., Tataru, D., and Zuily, C., ‘Nonlinear interpolation and the flow map for quasilinear equations’, Preprint, 2024, arXiv:2410.06909.
Badreddine, R., ‘On the global well-posedness of the Calogero-Sutherland derivative nonlinear Schrödinger equation’, Pure Appl. Anal. 6 (2024), 379–414.
Berntson, B. K., Klabbers, R., and Langmann, E., ‘Multi-solitons of the half-wave maps equation and Calogero-Moser spin-pole dynamics’, J. Phys. A 53 (2020), 505702, 32 pp.
Berntson, B. K., Langmann, E., and Lenells, J., ‘Spin generalizations of the Benjamin-Ono equation’, Lett. Math. Phys. 112 (2022), Paper No. 50, 45 pp.
Brezis, H., ‘How to recognize constant functions. Connections with Sobolev spaces’, Russ. Math. Surv. 57 (2002), 693–708.
Brezis, H. and Nirenberg, L., ‘Degree theory and BMO. I. Compact manifolds without boundaries’, Selecta Math. (N.S.) 1 (1995), 197–263.
Cazenave, T., Semilinear Schrödinger Equations, Courant Lecture Notes in Mathematics vol. 10 (New York University, Courant Institute of Mathematical Sciences, New York; American Mathematical Society, Providence, RI, 2003).
Da Lio, F. and Rivière, T., ‘Three-term commutator estimates and the regularity of $\frac{1}{2}$-harmonic maps into spheres’, Anal. PDE 4 (2011), 149–190.
Douglas, J., ‘Solution of the problem of Plateau’, Trans. Amer. Math. Soc. 33 (1931), 263–321.
Gérard, P., ‘An explicit formula for the Benjamin-Ono equation’, Tunis. J. Math. 5 (2023), 593–603.
Gérard, P., ‘The Lax pair structure for the spin Benjamin-Ono equation’, Adv. Contin. Discrete Models (2023), Paper No. 21, 6 pp.
Gérard, P., ‘The zero dispersion limit for the Benjamin-Ono equation on the line’, C. R. Math. Acad. Sci. Paris 362 (2024), 619–634.
Gérard, P. and Grellier, S., ‘An explicit formula for the cubic Szegő equation’, Trans. Amer. Math. Soc. 367 (2015), 2979–2995.
Gérard, P. and Lenzmann, E., ‘A Lax pair structure for the half-wave maps equation’, Lett. Math. Phys. 108 (2018), 1635–1648.
Gérard, P. and Lenzmann, E., ‘The Calogero-Moser derivative nonlinear Schrödinger equation’, Comm. Pure Appl. Math. 77 (2024), 4008–4062.
Gérard, P. and Lenzmann, E., ‘On global solutions to the half-wave maps equation on the torus’, Work in preparation, 2024.
Gérard, P. and Pushnitski, A., ‘The cubic Szegő equation on the real line: explicit formula and well-posedness on the Hardy class’, Comm. Math. Phys. 405 (2024), Paper No. 167, 31 pp.
Gérard, P. and Pushnitski, A., ‘An inverse problem for Hankel operators and turbulent solutions of the cubic Szegő equation on the line’, J. Eur. Math. Soc. (JEMS) 27(11) (2025), 4591–4648. https://doi.org/10.4171/JEMS/1457.
Gohberg, I., Goldberg, S., and Krupnik, N., Traces and Determinants of Linear Operators, Operator Theory: Advances and Applications vol. 116 (Birkhäuser Verlag, Basel, 2000).
Kiesenhofer, A. and Krieger, J., ‘Small data global regularity for half-wave maps in $n=4$ dimensions’, Comm. Partial Differential Equations 46 (2021), 2305–2324.
Killip, R., Laurens, T., and Visan, M., ‘Scaling-critical well-posedness for continuum Calogero-Moser models’, Preprint, 2023, arXiv:2311.12334.
Koch, H., Tataru, D., and Visan, M., Dispersive Equations and Nonlinear Waves: Generalized Korteweg-de Vries, Nonlinear Schrödinger, Wave and Schrödinger Maps, Oberwolfach Seminars vol. 45 (Birkhäuser/Springer, Basel, 2014).
Krieger, J. and Sire, Y., ‘Small data global regularity for half-wave maps’, Anal. PDE 11 (2018), 661–682.
Langer, H., ‘Ein Zerspaltungssatz für Operatoren im Hilbertraum’, Acta Math. Acad. Sci. Hungar. 12 (1961), 441–445.
Lax, P. D., ‘Translation invariant spaces’, Acta Math. 101 (1959), 163–178.
Lenzmann, E. and Schikorra, A., ‘On energy-critical half-wave maps into $\mathbb{S}^2$’, Invent. Math. 213 (2018), 1–82.
Lenzmann, E. and Sok, J., ‘Derivation of the half-wave maps equation from Calogero-Moser spin systems’, Pure Appl. Math. Q. 20 (2024), 1825–1858.
Liu, Y., ‘Global well-posedness for half-wave maps with $\mathbb{S}^2$ and $\mathbb{H}^2$ targets for small smooth initial data’, Commun. Pure Appl. Anal. 22 (2023), 127–166.
Matsuno, Y., ‘Integrability, conservation laws and solitons of a many-body dynamical system associated with the half-wave maps equation’, Phys. D 430 (2022), Paper No. 133080, 12 pp.
Mazowiecka, K. and Schikorra, A., ‘Minimal $W^{s,\frac{n}{s}}$-harmonic maps in homotopy classes’, J. Lond. Math. Soc. (2) 108 (2023), 742–836.
Millot, V. and Sire, Y., ‘On a fractional Ginzburg-Landau equation and 1/2-harmonic maps into spheres’, Arch. Ration. Mech. Anal. 215 (2015), 125–210.
Peller, V. V., Hankel Operators and Their Applications, Springer Monographs in Mathematics (Springer-Verlag, New York, 2003).
Sibuya, Y., ‘Some global properties of matrices of functions of one variable’, Math. Ann. 161 (1965), 67–77.
Sz.-Nagy, B., Foias, C., Bercovici, H., and Kérchy, L., Harmonic Analysis of Operators on Hilbert Space, Universitext, second edn. (Springer, New York, 2010).
Tao, T., Nonlinear Dispersive Equations: Local and Global Analysis, CBMS Regional Conference Series in Mathematics vol. 106 (Conference Board of the Mathematical Sciences, Washington, DC; American Mathematical Society, Providence, RI, 2006).
Widom, H., ‘On the spectrum of a Toeplitz operator’, Pacific J. Math. 14 (1964), 365–375.
Zhou, T. and Stone, M., ‘Solitons in a continuous classical Haldane-Shastry spin chain’, Phys. Lett. A 379 (2015), 2817–2825.