1 Introduction
1.1 Exchange interactions and high-Tc superconductivity
Exchange interactions in Mathematical Physics. Off-diagonal interaction terms of the form

with
$A,B$
being two monomials of annihilation operators of two species (a) and (b) of quantum particles, play a pivotal rôle in the rigorous understanding of quantum many-body systems at low temperatures. Such terms are also named ‘exchange’ terms because they encode (quantum) processes destroying a set of particles of one specie to create another kind of particles.
For instance, for the Bogoliubov model, an off-diagonal term of the form

exchanging two bosons (
$a=b_{0}$
) having zero momentum (
$k=0$
) with a pair of boson having nonzero momentum of opposite sign (
$b_{k\neq 0}$
), is shown in [Reference Bru and Zagrebnov1] to imply a nonconventional Bose condensation. Made of dressed bound pairs of (zero-momentum) bosons, the nonconventional condensate is structurally different from the Bose-Einstein condensate of the ideal Bose gas. In particular, it must be depleted to take advantage of the effective attraction induced by the exchange interaction (2). See [Reference Bru and Zagrebnov2]. This is reminiscent of liquid helium physics, where 100% superfluid helium occurs at zero temperature with only 9% Bose condensate [Reference Aleksandrov, Zagrebnov, Zh. A. Kozlov and Priezzhev3, Reference Dokukin, Kozlov, Parfenov and Puchkev4, Reference Blagoveshchenskii, Bogoyavlenskii, Karnatsevich, Kolobrodov, Kozlov, Priezzhev, Puchkov, Skomorokhov and Yarunin5, Reference Griffin, Snoke and Stringari6, Reference Griffin7]. Off-diagonal interaction terms (2) are conjectured in [Reference Bru8] to be relevant to explain the macroscopic behavior of weakly interacting Bose gases.
Another example from quantum statistical mechanics is given by the spin-boson model within the so-called ‘rotating wave approximation’. In this approximation, the model has terms of the form

with
$\sigma _{\pm }=\sigma _{x}\pm \sigma _{y}$
(
$\sigma _{x},\sigma _{y}$
being Pauli matrices) and
$b_{k}$
being the annihilation operator of a boson. Note that
$\sigma _{-}$
(
$\sigma _{+}$
) can be related to the annihilation (creation) operator a (
$a^{\ast }$
) of a fermion, via a so-called Jordan-Wigner transformation. Such off-diagonal terms make impossible the diagonalization of the quantum Hamiltonian with usual methods. In particular, the impact of these interaction terms on the properties of the model is expected to be major. For a general presentation of spin-boson models, see, for example, (Reference Lonigro9, Introduction and Section 2.3).
More recently, using the Hubbard model with nearest neighbor interaction near its Hartree-Fock ground state, Bach and Rauch demonstrate [Reference Bach and Rauch10] that interaction terms of the form

exchanging fermions inside (a) and outside (b) of the Fermi surface are the only ones that can prevent from getting uniform Footnote 1 relative bounds of the effective interaction with respect to the effective kinetic energy. See [Reference Bach and Rauch10, Theorems III.1, III.2 and III.3] for more details. In other words, (3) should again have a drastic impact on the corresponding quantum many-body system.
Three-body fermion-boson exchange interactions. In the present paper, for a fairly general function
$\upsilon :\mathbb {Z} ^{2}\rightarrow \mathbb {R}$
, we study the effect of the off-diagonal interaction term

where
$b_{x}$
is the annihilation operator of a spinless boson on the site
$ x $
of the two-dimensional (square) lattice
$\mathbb {Z}^{2}$
, while
$c_{y}$
represents the annihilation of a fermion pair of zero total spin, the two components of which are spread around the lattice position
$y\in \mathbb {Z} ^{2}$
. See Figure 1.

Figure 1 Illustration of fermion-boson exchange interactions in the form of two Feynman diagrams. In theoretical physics, a Feynman diagram visually represents the mathematical expressions that describe the behavior and interactions of quantum particles. In the example on the left, the two arrows indicate that two fermions, named
$(f)$
, ‘collide’ to create a new particle, the boson
$(b)$
. The oscillating line is generally used to describe an interaction with a mediator, which can be seen by combining the two diagrams: two fermions
$(f)$
interact to produce a boson, which annihilates again to produce two fermions
$(f)$
. This can lead to an effective interaction between fermions. In particular, this process could produce a pair of fermions (
$f-f$
) bonded by the exchange of a bosonic field (b), according to the coupling function
$\upsilon $
. This is typically what we are going to show. Note that the opposite combination can also be made: a boson
$(b)$
is destroyed to create two fermions
$(f)$
, which annihilate to recreate a boson
$(b) $
. This does not really create an interaction as such, but a kinetic term, or seen another way, a self-interaction on the boson
$(b)$
. The combination of two diagrams refers to a perturbative approach of second order, but we can also combine several of the same diagrams (perturbative approach of order n). Note, however, that no such perturbative argument is used here.
Note that the opposite combination can also be made: a boson b is destroyed to create two fermions f, which annihilate to recreate a boson
$ b $
. This does not really create an interaction as such, but a kinetic term, or seen another way, a self-interaction. The combination of two diagrams refers to a perturbative approach of second order, but we can also combine several of the same diagrams (perturbative approach of order n).
The purely fermionic part of the considered model corresponds to the extended Hubbard Hamiltonian, as used in the context of ultracold atoms, ions and molecules [Reference Dutta, Gajda, Hauke, Lewenstein, Lühmann, Malomed, Sowiński and Zakrzewski11], while the purely bosonic component refers to an ideal gas; that is, it has only a kinetic part (or ‘hopping term’), without interbosonic interactions. Because of the fermionic part, which is not exactly diagonalizable, the behavior of the full quantum many-body system, outside perturbative regimes, is almost inaccessible with the mathematical tools at our disposal.
We thus consider only a three-body problem, by restricting the model to the sector of one boson and two fermions of opposite spins. In fact, the system restricted to this particular sector is very interesting, both mathematically and physically. Note that such sector restrictions in Fock spaces are also performed for the study of the Pauli-Fierz and Nelson models [Reference Galtbayar, Jensen and Yajima26, Reference Olivieri, Correggi and Falconi28, Reference Dayantsolmon and Galtbayar27] in nonrelativistic Quantum Field Theory (QFT).
Physical context: High-
$T_{c}$
superconductivity of cuprates. Physically, the model is related to cuprate superconductors,Footnote
2
like, for instance,
$\mathrm { La}_{2-x}\mathrm {Sr}_{x}\mathrm {CuO}_{4}$
(LaSr 214 or LSCO) and
$\mathrm {La} _{2-x}\mathrm {Ba}_{x}\mathrm {CuO}_{4}$
. It is known [Reference Saxena12, Reference Plakida13, Reference Wesche14] that in such crystals, charge transport occurs within two-dimensional isotropic layers of copper oxides. This is why we consider here quantum particles on
$2$
-dimensional lattices
$\mathbb {Z}^{2}$
.
A convincing microscopic mechanism behind superconductivity at high critical temperature is still lacking even after almost four decades of intensive theoretical and experimental studies. See Section A.1 for more details. Many physicists believe that the celebrated Hubbard model could be pivotal, one way or another, in order to get a microscopic theory of high-temperature superconductivity, but many alternative explanations or research directions have also been considered in theoretical physics. For some of the more popular models for cuprate superconductors, see, for example, [Reference Plakida13, Chap. 7].
In many theoretical approaches to this problem, the existence of polaronic quasiparticles in relation with the very strong Jahn-Teller (JT) effect associated with copper ions is neglected, as stressed in (Reference Köppel, Yarkony and Barentzen15, Part VII). The role of polarons is, however, highlighted in [Reference Müller and Bednorz16], since the JT effect actually led to the discovery of superconductivity in cuprates in 1986. See [Reference Müller17, p. 2] or [Reference Keller, Bussmann–Holder and Müller18, Reference Alexandrov and Zhao19].
Our theoretical approach differs from most popular ones, being based on the existence of JT bipolarons in copper oxides, as is discussed in the literature [Reference Stoneham and Smith20] at least as early as 1990. The physics behind this approach is explained in detail in [Reference Bru, de Siqueira Pedra and de Pasquale21], where a simplified version of the model studied here is considered. In our microscopic model for cuprate superconductors, as presented in [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21], the bosonic operator
$b_{x}$
(
$b_{x}^{\ast }$
) in (4) refers to the annihilation (creation) of a JT bipolaron, whereas the fermionic one
$ c_{y}$
(
$c_{y}^{\ast }$
) annihilates (creates) a fermion pair, which is reminiscent of Cooper pairs in conventional superconductivity.
Bipolaronic pairing mechanisms and cuprate superconductivity. As in the present paper, no ad hoc assumptions, in particular concerning anisotropy, are made in [Reference Bru, de Siqueira Pedra and de Pasquale21]. In fact, [Reference Bru, de Siqueira Pedra and de Pasquale21] proves that unconventional pairing may occur, breaking spontaneously discrete symmetries of the model, like the d-wave pairing, whose wave function is antisymmetric with respect to
$90^{\circ }$
-rotations. It turns out that electrostatic (screened Coulomb) repulsion is crucial for such unconventional pairings, which are meanwhile shown to be concomitant with a strong depletion of superconducting pairs.
Notice that the results of [Reference Bru, de Siqueira Pedra and de Pasquale21] are coherent with experimental observations on the cuprate LaSr 214: The coherence length at optimal doping and the d-wave pair formation in the pseudogap regime (i.e., at temperatures much higher than the superconducting transition temperature) are predicted in good accordance with experimental data. In addition to the
$ d$
-wave pairing and the high-temperature pseudogap regime, the model considered here also captures another very special feature of high-Tc cuprate superconductors – namely, the density waves [Reference Arpaia, Martinelli, Moretti Sala, Caprara, Nag, Brookes, Camisa, Li, Gao, Zhou, Garcia-Fernandez, Zhou, Schierle, Bauch, Peng, Di Castro, Grilli, Lombardi, Braicovich and Ghiringhelli23]. For more details, see also Section A.1.
In fact, it is shown in [Reference Bru, de Siqueira Pedra and de Pasquale22, Section 4.1] that three-body fermion-boson exchange interactions, like the one studied in this paper, imply an effective fermion-fermion interaction. Then, by considering the mean-field limit of it, which corresponds to taking couplings (4) that are very localized in momentum space (Reference Bru, de Siqueira Pedra and de Pasquale22, Section 4.2), it was rigorously proven [Reference de Pasquale24] that, below the critical temperature, the equilibrium states of the (purely fermionic) associated many-body Hamiltonian exhibit periodic modulation in space of the charge density, even incommensurate with respect to the lattice spacing.
1.2 Mathematical results
Previous results. To our knowledge, the model considered here has not been studied mathematically, apart from our own articles [Reference Bru, de Siqueira Pedra and de Pasquale21, Reference Bru, de Siqueira Pedra and de Pasquale22] published in recent years. See also the Ph.D. thesis [Reference de Pasquale24]. Mathematical studies for explicit exchange interaction terms are mainly those presented above. As far as we know, concerning its physical interpretation regarding cuprate superconductivity, our approach has also never been considered by other physicists, and we therefore doubt that any theoretical results in this direction exist in the literature. For more details, see the introductory discussions in [Reference Bru, de Siqueira Pedra and de Pasquale21], which give a concise overview of theories of high-temperature superconductivity.
Mathematically, the present paper improves [Reference Bru, de Siqueira Pedra and de Pasquale21, Reference Bru, de Siqueira Pedra and de Pasquale22] to get more complete and general rigorous results, including, among other things, extended Hubbard interactions and scattering properties. While [Reference Bru, de Siqueira Pedra and de Pasquale21, Reference Bru, de Siqueira Pedra and de Pasquale22] focus only on the ground state energy and the unconventional pairings in the limit of large Hubbard interactions, here we provide the full spectral properties of the corresponding Hamiltonian. In particular, we study in depth the effective dispersion relation associated with dressed bound fermion pairs. It confirms that off-diagonal interactions of the form (1) produce bounded states by reducing the energy of the system, similar to [Reference Bru and Zagrebnov2], possibly with a spectral gap.
This was already done in [Reference Bru, de Siqueira Pedra and de Pasquale21, Reference Bru, de Siqueira Pedra and de Pasquale22], but only for usual ( non-extended) Hubbard interactions and one-range creation / annihilation operators of fermion pairs. Even in this specific case, the dispersion relation of dressed bound fermion pairs was analyzed only to a level of detail enough to deduce unconventional pairings near the ground state. By contrast, in the present paper, other important properties of the dispersion relation, like its regularity, are studied for the first time and in a more general framework.
Last but not least, the localization of dressed bound fermion pairs or the scattering properties of the model have not been studied before.
Localized dressed bound fermion pairs. Using Combes-Thomas estimates, we show, among other things, that the dressed bound fermion pairs are localized, in the sense that the fermion-fermion correlation decays very fast in space. Group velocities and tensor masses of dressed bound fermion pairs are also shown to exist under very natural conditions on the (absolutely summable function)
$\upsilon :\mathbb {Z} ^{2}\rightarrow \mathbb {R}$
appearing in (4).
In fact, our analysis allows one to accurately understand which features of the exchange strength function
$\upsilon $
can strengthen the stability of the dressed bound fermion pairs. For instance,
$\upsilon $
has to be sufficiently strong and localized in Fourier space in order to get a sufficiently strong ‘gluing effect’. Additionally, the boson should be heavier than two fermions.
Notice that this second condition is consistent with the physical interpretation that the boson is a bipolaron, which is known to be (effectively) much heavier than the fermions (electrons or holes), in superconducting cuprates. Observe additionally that the very large mass of bipolarons (and polarons, in general) is one of the main arguments used to discredit theoretical approaches based on bipolarons because it is known from experiments that the charge carriers in superconducting cuprates have an effective mass comparable to that of electrons and holes.
In fact, we prove that the effective mass of bound pairs mainly depends on the properties of the function
$\upsilon $
, that encodes the fermion-boson exchange processes, but not much on the mass of the boson itself. This issue is discussed in [Reference Bru, de Siqueira Pedra and de Pasquale21], in detail. See also the discussion at the end of Section 3.2. That is why we are interested in results concerning the mass tensor for bound pairs and we think we provide here a convincing solution for the ‘mass paradox’ related to bipolaronic pairing mechanisms in the microscopic theory of cuprate superconductors.
Relationship with the enhanced binding of QFT. The formation of dressed bonded fermion pairs as described above is reminiscent of what is known as enhanced binding in Quantum Field Theory (QFT). For more details on this phenomenon, we recommend the lecture notes [Reference Hiroshima, Sasaki, Spohna and Suzuki25], where it is well explained in the context of nonrelativistic QFT. See also the references therein.
For example, the Pauli-Fierz model, which refers to nonrelativistic quantum charge particles interacting with a massless quantized radiation field (photons), can have at low energies a dressed particle with an effective mass bigger than the noninteracting one, leading to the existence of a ground state for the model. A similar fact occurs in the Nelson model, in which N quantum particles interact linearly with a field of photons (or mesons). The formation of such dressed particles is a direct consequence of the bosonic field acting as mediator of a force.
Indeed, in both cases, the model involves a sum of interaction terms of the form
$\psi _{k}\otimes b_{k}+\bar {\psi }_{k}\otimes b_{k}^{\ast }$
, coupling the N-body quantum system with a spinless boson field of momentum k via annihilation/creation operators
$b_{k},b_{k}^{\ast }$
. Note that in this case, there is no transformation of particles of one type into another, as in the exchange interactions described above, but both cases are still similar, especially as we are carrying out our analysis in the sector with only two fermions and one boson. This makes the comparison quite relevant, even if the model and mathematical methods considered here have essential differences as compared to the previous ones.
Scattering properties of the model. We also study here scattering properties of the three-body model in two channels, the unbound and bound pair channels:
-
• The unbound pair (scattering) channel corresponds to the wave and scattering operators with respect to fermionic part, respectively defined via the strong limits
$$ \begin{align*} W^{\pm }\doteq s-{\lim\limits_{t\rightarrow \pm \infty }}\mathrm{e}^{it \mathrm{H}}\mathrm{e}^{-it\mathrm{H}_{f}}P_{\mathrm{ac}}\left( \mathrm{H} _{f}\right) {\qquad \mathrm{and}\qquad }S\doteq \left( W^{+}\right) ^{\ast }W^{-}, \end{align*} $$
$\mathrm {H}_{f}$ is a generic, purely fermionic Hamiltonian representing free fermions that do not interact with any bosonic field,
$ \mathrm {H}$ is the Hamiltonian of the full model and
$P_{\mathrm {ac}}( \mathrm {H}_{f})$ is the orthogonal projection onto the absolutely continuous space of
$\mathrm {H}_{f}$ . It refers to the case in which two fermions start far apart from each other and only experience a very weak repulsion force due to the extended Hubbard interaction, while the probability that they bind together to form a boson is very small. In this situation, we show that two (almost) freely propagating fermions in the distant past can come together and interact with one another, either via the repulsive electrostatic force or by exchanging a boson, and then propagate away, again freely in the distant future. In this channel, the scattering matrix can be explicitly computed via convergent (Dyson) series, making in particular the study of the scattering effect of the fermion-boson-exchange interaction ( 1) technically uncomplicated.
-
• The bound pair (scattering) channel corresponds to the time evolution
$ \mathrm {e}^{it\mathrm {H}}\mathfrak {P}$ ,
$t\in \mathbb {R}$ , where
$\mathrm {H}$ is again the Hamiltonian of the full model and
$\mathfrak {P}$ is an isometry from the
$L^{2}$ -functions on the Brillouin zone to the subspace associated with the fiber bound states of
$\mathrm {H}$ . We show in particular that
$$ \begin{align*} \mathrm{e}^{it\mathrm{H}}\mathfrak{P}=\mathfrak{P}\mathrm{e}^{itM_{\mathrm{E} \left( \cdot \right) }},\qquad t\in \mathbb{R}, \end{align*} $$
$M_{\mathrm {E}\left ( \cdot \right ) }$ being some multiplication operator given by the dispersion relation
$k\mapsto \mathrm {E}\left ( k\right ) $ characterizing the (fiber) bound states at fixed quasi-momentum in the normalized Brillouin zone
$\mathbb {T}^{2}\doteq \lbrack -\pi ,\pi )^{2}$ . In terms of wave operators, it follows that
$$ \begin{align*} W^{\pm }\doteq s-{\lim\limits_{t\rightarrow \pm \infty }}\mathrm{e}^{it \mathrm{H}}\mathfrak{P}\mathrm{e}^{-itM_{\mathrm{E}\left( \cdot \right) }}P_{ \mathrm{ac}}\left( M_{\mathrm{E}\left( \cdot \right) }\right) =\mathfrak{P} P_{\mathrm{a}\mathrm{c}}\left( M_{\mathrm{E}\left( \cdot \right) }\right) , \end{align*} $$
$$ \begin{align*} S\doteq \left( W^{+}\right) ^{\ast }W^{-}=P_{\mathrm{a}\mathrm{c}}\left( M_{ \mathrm{E}\left( \cdot \right) }\right) . \end{align*} $$
It refers to the case in which dressed bound fermion pairs are formed. In contrast with the first channel, now there is a non-negligible bosonic component related with the exchanged boson that ‘glues’ the two fermions together. We prove that those (spatially localized) dressed bound fermion pairs effectively move like a free (quantum spinless) particle. In this case, strictly speaking in the physical sense, there is no scattering, and the pairs evolve freely in space, governed by an effective dispersion relation, the Fourier transform of which is the effective hopping strength for the (spatially localized) dressed bound pairs.
Composite system at strong on-site Hubbard repulsions. We additionally prove that all these properties hold also true in the limit of large on-site fermionic repulsions, provided that two fermions on two different lattice sites can interact via the fermion-boson exchange interaction. It refers to a hard core limit, preventing two fermions from occupying the same lattice site.
For cuprate superconductors, it is an important issue addressed and answered here, because of the undeniable experimental evidence of very strong on-site Coulomb repulsions in cuprates, leading to the universally observed Mott transition at zero doping [Reference Imada, Fujimori and Tokura29, Reference Keimer, Kivelson, Norman, Uchida and Zaanen30].
1.3 Concluding remarks and structure of the paper
To conclude, the mathematical properties of the model studied in the present work are well understood, and as a consequence, the model can serve as a prototypical example of a quantum system including exchange interaction terms of the form (1). From a physics viewpoint, it is also interesting, since dressed bound fermion pairs are good candidates for superconducting charge carriers in cuprate superconductors, as advocated in [Reference Bru, de Siqueira Pedra and de Pasquale21].
More specifically, our main results are Theorems 3.1, 3.5, 3.6, 3.9, 3.11 and 3.14. The paper is organized as follows: Section 2 explains in detail the model, while Section 3 gives the main results. Technical outcomes, along with all their proofs, are gathered in Section 4. Section A is an appendix that gathers important standard mathematical results used here, an overview of cuprate physics for non-physicists, as well as the Fock-space formalism, in order to make the article self-contained and accessible to a wide audience.
Remark 1.1 (d-dimensional lattices).
Our study focuses on two-dimensional lattice systems because of their application to the superconductivity of cuprates and, in particular, their
$ d $
-wave symmetry. However, it can also be done at arbitrary dimension
$ d\geq 1 $
provided the coupling functions used (i.e.,
$\mathrm {u},\mathfrak {p }_{1},\mathfrak {p}_{2},\upsilon :\mathbb {Z}^{d}\rightarrow \mathbb {R}_{0}^{+} $
below) stays absolutely summable. It is also important that the Fourier transforms
$\hat {\upsilon }$
,
$\mathfrak {\hat {p}}_{1}$
and
$\mathfrak {\hat {p}} _{2}$
of the functions
$\upsilon $
,
$\mathfrak {p}_{1}$
and
$\mathfrak {p}_{2}$
remain real-valuedFootnote
3
continuous functions on the d-dimensional torus
$ \mathbb {T}^{d}$
.
Remark 1.2 (Notation).
For any normed vector space
$\mathcal {X}$
over
$\mathbb {C}$
, we omit the subscript
$\mathcal {X}$
to denote its norm
$\Vert \cdot \Vert \equiv \Vert \cdot \Vert _{\mathcal {X}}$
, unless there is any risk of confusion. Mutatis mutandis for the scalar product
$\langle \cdot ,\cdot \rangle \equiv \langle \cdot ,\cdot \rangle _{\mathcal {X}}$
in Hilbert spaces. As is usual,
$ \mathcal {B}(\mathcal {X},\mathcal {Y})$
denotes the set of bounded (linear) operators
$\mathcal {X}\rightarrow \mathcal {Y}$
between two normed spaces
$ \mathcal {X}$
to
$\mathcal {Y}$
. If
$\mathcal {X}=\mathcal {Y}$
,
$\mathcal {B}( \mathcal {X})\equiv \mathcal {B}(\mathcal {X},\mathcal {X})$
and its (operator) norm and its identity are respectively denoted by
$\Vert \cdot \Vert _{ \mathrm {op}}\equiv \Vert \cdot \Vert _{\mathcal {B}(\mathcal {X})}$
and
$ \mathbf {1}_{\mathcal {X}}\equiv \mathbf {1}$
.
$\mathbb {R}_{0}^{+}$
denotes the set of positive real numbers including zero, whereas
$\mathbb {R} ^{+}\doteq \mathbb {R}_{0}^{+}\backslash \{0\}$
is the set of strictly positive real numbers.
2 Setup of the problem
2.1 Background Lattice
Copper oxide superconductors have a relatively complex three-dimensional lattice structure. However, they always contain parallel two-dimensional layers of copper (
$\mathrm {Cu}^{++}$
) and oxygen (
$\mathrm {O}^{--}$
) ions. These
$\mathrm {CuO}_{2}$
layers are essential to understanding low-temperature superconducting properties because the (superconducting) charge transport takes place within the layers. This is explained in [Reference Saxena12, Reference Plakida13, Reference Wesche14]. Considering a weak inter-layer interaction might also help to increase prediction accuracy, but charge transport between each
$\mathrm {CuO}_{2}$
layer or, more generally, in the direction orthogonal to each layer remains negligible.Footnote
4
Each
$\mathrm {CuO}_{2}$
layer generally has the symmetries of the square. In other words, it is invariant under the group
$\{0,\pi /2,\pi ,3\pi /2\}$
generated by
$90^{\circ }$
-degree rotations. See, for example, (Reference Tsuei, Kirtley, Bennemann and Ketterson31, Section 9.1.2), [Reference Saxena12, Section 2.3] and (Reference Wesche14, Section 6.3.1). This is an important symmetry property that we keep in mind throughout our study.
Having in mind these physical observations on cuprates, we consider here quantum particles on lattices
$\mathbb {Z}^{2}$
. It means in particular that (disregarding internal degrees of freedom of the quantum particles, like their spin) the (separable) Hilbert space
$\ell ^{2}(\mathbb {Z}^{2})$
is the ‘one-particle space’ associated with the physical system we are interested in. Its canonical orthonormal basis is
$\{ \mathfrak {e}_{x}\}_{x\in \mathbb {Z}^{2}}$
:

where
$\delta _{\mathfrak {i},\mathfrak {j}}$
is the Kronecker delta.
2.2 Composite of two fermions and one boson
We consider a system of two fermions (electrons or holes in cuprates) with opposite spins interacting via the exchange of one boson in a two-dimensional square lattice. Physically, the boson that we have in mind in cuprate superconductors is a spinless bipolaron, since the very strong Jahn-Teller (JT) effect associated with copper ions is an important property of such cuprates [Reference Müller and Bednorz16, Reference Köppel, Yarkony and Barentzen15]. See Section A.1 for more details. However, the exchanged spinless boson could be of any type, like a phonon or a spin wave, depending on the physical system and mechanism one has in mind.
Hilbert Spaces. All quantum particles possess an intrinsic form of angular momentum known as spin, which is characterized by a quantum number
$\mathfrak {s}\in \mathbb {N}/2$
and a finite spin setFootnote
5
$ \mathrm {S}\doteq \{-\mathfrak {s},-\mathfrak {s}+1,\ldots \mathfrak {s}-1, \mathfrak {s\}}\subseteq \mathbb {N}$
. If
$\mathfrak {s}\notin \mathbb {N}$
is half-integer, then the corresponding particles are named fermions while
$\mathfrak {s}\in \mathbb {N}$
means by definition that we have bosons. For example, photons or spinless bipolarons (
$\mathfrak {s}=0$
) are bosons, while electrons (
$\mathfrak {s}=1/2$
) are fermions. In the latter case,
$\mathrm {S}\doteq \{-1/2,1/2\}$
, and in physics, the spin set is always written as
$\mathrm {S}\equiv \{\uparrow ,\downarrow \}$
, and we thus use this completely standard notation. By the celebrated spin-statistics theorem, fermionic wave functions are antisymmetric with respect to permutations of particles, whereas the bosonic ones are symmetric.
Therefore, the one-particle Hilbert space for the fermions is
$\ell ^{2}( \mathbb {Z}^{2}\times \{\uparrow ,\downarrow \})$
,
$\{\uparrow ,\downarrow \}$
being the usual spin set for electrons or holes, and for two fermions, we hence use the Hilbert space

of antisymmetric functions,Footnote
6
which is a subspace of the fermionic (
$-$
) Fock spaceFootnote
7
$\mathfrak {F}_{-}$
associated with the one-particle Hilbert space
$ \ell ^{2}(\mathbb {Z}^{2}\times \{\uparrow ,\downarrow \})$
. See Equation (A.4) below for the precise definition of
$\mathfrak {F}_{-}$
. The one-particle Hilbert space of the spinless boson is
$\ell ^{2}(\mathbb {Z} ^{2})$
, which can also be seen as a subspace of the bosonic (
$+$
) Fock space

associated with
$\ell ^{2}(\mathbb {Z}^{2})$
. See Equation (A.5) below for the precise definition of
$\mathfrak {F}_{+}$
. For a concise review of bosonic and fermionic Fock spaces, as well as the corresponding annihilation and creation operators, see Section A.2.
We study here the effect of processes of annihilation of two fermions of opposite spins to create a boson, which can conversely be annihilated to create two new fermions. The Hilbert space associated with this composite system, made of two fermions and one boson, is the direct sum
$\mathfrak {h} _{f}\oplus \ell ^{2}(\mathbb {Z}^{2})$
, and not the tensor product
$\mathfrak { h}_{f}\otimes \ell ^{2}(\mathbb {Z}^{2})$
. Note indeed that
$\mathfrak {h} _{f}\oplus \ell ^{2}(\mathbb {Z}^{2})$
can naturally be identifiedFootnote
8
with a subspace of
$\mathfrak {F} _{-}\otimes \mathfrak {F}_{+}$
. This fact already unveils the strong interdependence of the bosonic and fermionic parts. For this reason, from now on, we rather use the term ‘composite of two fermions and one boson instead of ‘three-body system’, in order to avoid any misinterpretation.
Fermionic Hamiltonian. The fermionic part of the (infinite volume) Hamiltonian of the composite is defined to be the restriction
$H_{f}\in \mathcal {B}(\mathfrak {h}_{f})$
of the formal expression

to the Hilbert space
$\mathfrak {h}_{f}$
. Here,
$a_{x,s}$
(
$a_{x,s}^{\ast }$
) denotes the annihilation (creation) operator acting on the fermionic Fock space
$\mathfrak {F}_{-}$
of a fermion at lattice position
$x\in \mathbb {Z} ^{2}$
, the spin of which is
$s\in \{\uparrow ,\downarrow \}$
. As is usual,
$ n_{x,s}\doteq a_{x,s}^{\ast }a_{x,s}$
stands for the number operator of fermions at lattice position
$x\in \mathbb {Z}^{2}$
and spin
$s\in \{\uparrow ,\downarrow \}$
.
The parameter
$\epsilon \in \mathbb {R}_{0}^{+}$
quantifies the hopping amplitude of fermions. In high-
$T_{c}$
superconductors [Reference Imada, Fujimori and Tokura29, Reference Keimer, Kivelson, Norman, Uchida and Zaanen30],
$\epsilon $
is expected to be much smaller than the fermion-fermion interaction energy – more precisely the on-site repulsion strength
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. The function
$\mathrm {u}: \mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
, which represents the fermion-fermion repulsion at all distances, is absolutely summable and invariant with respect to
$90^{\circ }$
-rotations, that is,

Clearly, one could set
$\mathrm {U}=0$
, by redefining the coupling function
$ \mathrm {u}:\mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
. It is, however, convenient to have a separate parameter
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
for the on-site repulsion because we shall later on consider the ‘hard-core limit’
$\mathrm {U}\rightarrow \infty $
for some fixed coupling function
$\mathrm {u}$
.
Extended Hubbard interactions. The above fermion-fermion interactions have been extensively studied in condensed matter physics during the last decade, in particular for two-dimensional systems. For nonzero functions
$\mathrm {u}$
, they are named extended Hubbard interactions and they can drastically change the behavior of the system, as compared to the zero-range case (usual Hubbard interaction,
$\mathrm {u}=0$
). As one example, they are used in the context of ultracold atoms, ions and molecules [Reference Dutta, Gajda, Hauke, Lewenstein, Lühmann, Malomed, Sowiński and Zakrzewski11]. Its bosonic version is also experimentally investigated. See, for example, [Reference Lagoin, Bhattacharya, Grass, Chhajlany, Salamon, Baldwin, Pfeiffer, Lewenstein, Holzmann and Dubin32] published in 2022.
In theoretical studies, frequently, only nearest-neighbor interactions added to the on-site (zero-range) Hubbard interactions are considered. Here, we do not need the restriction to one-range (nearest-neighbor) interactions. We only assume that
$\mathrm {u}$
is absolutely summable (see (7)), which is physically a very mild restriction, since the effective two-particle repulsive electrostatic potential in crystals is expected to decay exponentially fast in space, because of screening effects.
The rotation invariance in Equation (7) refers to the isotropy of the system under consideration. However, as shown in [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21], the system has low energy states that spontaneously break the isotropy. This refers to unconventional parings, typically of d -wave type, of electrons one experimentally observes in many high-
$T_{c}$
superconductors [Reference Tsuei and Kirtley33, Reference Keimer, Kivelson, Norman, Uchida and Zaanen30, Reference Plakida13]. In fact, to derive the existence of d- and p-wave pairings starting from a physically sound microscopic model was the aim of [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21]. Here, instead, we keep a broader perspective and do not study this particular question.
Bosonic Hamiltonian. Similar to the fermionic part, the bosonic part of the (infinite volume) Hamiltonian of the system is defined to be the restriction
$H_{b}\in \mathcal {B}(\ell ^{2}(\mathbb {Z}^{2}))$
to the one-boson Hilbert space
$\ell ^{2}(\mathbb {Z}^{2})$
of the formal expression

Here,
$b_{x}$
(
$b_{x}^{\ast }$
) denotes the annihilation (creation) operator acting on the bosonic Fock space
$\mathfrak {F}_{+}$
of a boson at lattice position
$x\in \mathbb {Z}^{2}$
. Observe that the bosonic part only contains a kinetic term. The parameter
$h_{b}\in \mathbb {R}_{0}^{+}$
quantifies the ratio of the effective masses of fermions and bosons: Taking
$h_{b}$
smaller than one physically means that the bosons are heavier than the fermions. As experimentally found [Reference Dzhumanov34, Reference Dzhumanov35, Reference Dzhumanov, Karimboev and Djumanov36, Reference Reagor37] for cuprate superconductors, bipolarons should be much more massive than electrons or holes, and, thus, in the physically relevant regime,
$h_{b}$
is to be taken very small (or even zero, in an idealized situation). See (Reference Bru, de Siqueira Pedra and de Pasquale21, Section 3.1). In the sequel, we take
$h_{b}\in \lbrack 0,1/2]$
, meaning that the boson mass is at least as big as the mass of two fermions, as discussed in Section 3.
Exchange interactions. The term of the Hamiltonian that encodes the decay of a boson into two fermions (i.e., one of the two-electron(hole)-bipolaron-exchange interaction of the Hamiltonian) refers to the bounded operator

which is defined to be the restriction of the formal expression

to
$\ell ^{2}(\mathbb {Z}^{2})$
, where

for some fixed functions
$\mathfrak {p}_{1},\mathfrak {p}_{2}:\mathbb {Z} ^{2}\rightarrow \mathbb {R}$
that are invariant under
$90^{\circ }$
-rotations and exponentially decay in space, that is,

for some
$\alpha _{0}>0$
. In particular, the functions
$\mathfrak {p}_{1}, \mathfrak {p}_{2}$
are absolutely summable in space.
By definition, we take
$\mathfrak {p}_{2}\left ( z\right ) \doteq 0$
if
$z\in \mathbb {Z}^{2}\backslash (2\mathbb {Z})^{2}$
and we also assume that

The condition
$\mathfrak {p}_{1}+\mathfrak {p}_{2}\neq 0$
only ensures the nontriviality of the exchange interaction, while the second condition avoids the singular case of a quasi-momentum
$k_{0}\in \lbrack -\pi ,\pi )^{2}$
at which the exchange interaction trivially vanishes; see below (36). This case can easily be analyzed, but it makes the argumentation cumbersome. So, we omit it here, as it is a highly unusual and irrelevant situation. For example, (13) is already satisfied as soon as
$\mathfrak {p}_{1}\left ( z\right ) \neq 0$
for some
$z\in \mathbb {Z}^{2}\backslash (2\mathbb {Z})^{2}$
, since
$\mathfrak {p}_{2}\left ( z\right ) \doteq 0$
for any
$z\notin (2\mathbb {Z})^{2}$
. In (Reference Bru, de Siqueira Pedra and de Pasquale22, Eq. (6)),
$\mathfrak {p}_{2}=0$
and, given
$\kappa>0$
,
$\mathfrak {p} _{1}(z)=\mathrm {e}^{-\kappa |z|}$
for
$|z|\leq 1$
and
$\mathfrak {p}_{1}(z)=0$
otherwise, while in [Reference Bru, de Siqueira Pedra and de Pasquale21, Eq. (4)],
$\mathfrak {p}_{2}\left ( 2z\right ) =\mathfrak {p}_{1}\left ( z\right ) =1$
when
$|z|\leq 1$
and
$ \mathfrak {p}_{1}\left ( z\right ) =\mathfrak {p}_{2}\left ( z\right ) =0$
otherwise. This are the typical examples we have in mind, the point here being the fact two fermions on different lattice sites can interact by exchanging a boson. See also Section A.1.
Physically,
$c_{y}^{\ast }$
represents the creation of a fermion pair of zero total spin, the two components of which are slightly spread around the lattice position
$y\in \mathbb {Z}^{2}$
. Such pairs have finite size, because of (12). In fact,

where, for any
$\sharp \in \{1,2\}$
,
$r_{\mathfrak {p}_{\sharp }}\doteq 0$
if
$\mathfrak {p}_{\sharp }=0$
, otherwise it is equal to

is naturally seen as being the actual size of such pairs. Note that the last inequality is a consequence of the Cauchy-Schwarz inequality, along with the bound

Allowing two fermions in different lattice site to interact by exchanging a boson simply means that
$r_{\mathfrak {p}}>0$
. For the physical significance of this property for cuprates, see [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21].
Recall that the exchange strength function
$\upsilon :\mathbb {Z} ^{2}\rightarrow \mathbb {R}$
is only absolutely summable, not necessarily exponentially decaying as
$\mathfrak {p}_{1}$
and
$\mathfrak {p}_{2}$
, and invariant under
$90^{\circ }$
-rotations, that is,

Note that the Fourier transforms
$\hat {\upsilon }$
,
$\mathfrak {\hat {p}}_{1}$
and
$\mathfrak {\hat {p}}_{2}$
of
$\upsilon $
,
$\mathfrak {p}_{1}$
and
$ \mathfrak {p}_{2}$
are real-valued continuous functions (on the two-dimensional torus
$\mathbb {T}^{2}$
) that are again invariant under
$ 90^{\circ }$
-rotations. Additionally,
$\mathfrak {\hat {p}}_{1}$
and
$ \mathfrak {\hat {p}}_{2}$
are real analytic, for
$\mathfrak {p}_{1}$
and
$ \mathfrak {p}_{2}$
are exponentially decaying. The reverse process – that is, the annihilation of two unbound fermions to form a boson – is represented by the adjoint operator

Mathematical remarks. The infinite sums (6), (8) and (10) defining formally
$H_{f}\in \mathcal {B}\left ( \mathfrak {h}_{f}\right ) $
,
$H_{b}\in \mathcal {B}(\ell ^{2}(\mathbb {Z}^{2}))$
and
$W_{\mathrm {b\rightarrow f}}\in \mathcal {B}(\ell ^{2}(\mathbb {Z}^{2}),\mathfrak {h}_{f})$
(9) are to be understood as follows: If
$\psi \in \mathfrak {h}_{f}$
or
$\psi \in \ell ^{2}(\mathbb {Z}^{2})$
is a finitely supported function, then the sum corresponding to
$H_{f}\psi $
,
$H_{b}\psi $
or
$W_{\mathrm {b\rightarrow f} }\psi $
is absolutely convergent. Thus,
$H_{f}$
,
$H_{b}$
and
$W_{\mathrm { b\rightarrow f}}$
are well-defined linear operators acting on the dense subspace of such functions. One checks that
$H_{f}$
,
$H_{b}$
and
$W_{ \mathrm {b\rightarrow f}}$
are all bounded on this subspace, and they thus have a unique bounded linear extension to the whole Hilbert space where they are defined – namely,
$\mathfrak {h}_{f}$
for
$H_{f}$
, and
$\ell ^{2}(\mathbb {Z} ^{2})$
for
$H_{b}$
and
$W_{\mathrm {b\rightarrow f}}$
. We denote the extensions again by
$H_{f}$
,
$H_{b}$
and
$W_{\mathrm {b\rightarrow f}}$
. Note that, being a bounded operator,
$W_{\mathrm {b\rightarrow f}}$
has an adjoint (17), while
$H_{f}$
and
$H_{b}$
are clearly symmetric and so, self-adjoint, for they are also bounded.
Full model. Finally, the full Hamiltonian for the fermion-boson composite is defined, in matrix notation for the direct sum
$ \mathfrak {h}_{f}\oplus \ell ^{2}(\mathbb {Z}^{2})$
, as follows:

Observe that this Hamiltonian is invariant under translations, as well as
$ 90^{\circ }$
-rotations.
Using the canonical orthonormal basisFootnote 9

to define the closed subspace

we remark that the zero-spin subspace

is invariant under the action of the (full) Hamiltonian (18). We can thus consider its restriction

to this particular subspace
$\mathfrak {H}\subseteq \mathfrak {h}_{f}\oplus \ell ^{2}(\mathbb {Z}^{2})$
.
In fact, as the boson is assumed to be spinless, by the conservation of angular momentum, we have that the total spin of the fermion pair resulting from a bosonic decay must be zero. In other words, the physically relevant (vector) states of the fermion-boson compound system always lie in
$ \mathfrak {H}$
. Note finally that H inherits the symmetries of the Hamiltonian (18) (i.e., H is invariant under translations and
$ 90^{\circ }$
-rotations). Note that this last symmetry (i.e., the rotation invariance) is mainly relevant for the study of unconventional pairings, which is not done here.
2.3 The model in spaces of quasi-momenta
We have a composite of two fermions and one boson whose Hamiltonian is translation invariant. In this case, it is a standard procedure (see, for example, [Reference Reed and Simon38, Chapter XIII.16]) to use the direct integral decomposition of the Hamiltonian in Fourier space in order to study its spectral properties.
For the two-dimensional lattice
$\mathbb {Z}^{2}$
, the (Fourier) space of quasi-momenta is nothing else than the torus

This set is endowed with the metric
$d_{\mathbb {T}^{2}}$
defined by

where
$|k-p-q|$
is the Euclidean distance between k and
$p+q$
in
$\mathbb {R }^{2}$
. This defines a compact metric space
$(\mathbb {T}^{2},d_{\mathbb {T} ^{2}})$
. Observe also that the usual group operation in
$\mathbb {T}^{2}$
(i.e., the sum in
$\mathbb {R}^{2}$
modulo
$(2\pi ,2\pi )$
) is a continuous operation, while any Borel set in
$\mathbb {T}^{2}$
is also a Borel set in
$ \mathbb {R}^{2}$
(endowed with the Euclidean metric).
We also need the normalized Haar measure
$\nu $
on
$\mathbb {T}^{2}$
defined for any Borel set
$B\subseteq \mathbb {T}^{2}$
by

where
$\boldsymbol {\lambda }$
is the Lebesgue measure in
$\mathbb {R}^{2}$
. This measure appears in relation with direct integrals of constant Hilbert spaces on the two-dimensional torus
$\mathbb {T}^{2}$
, like the Hilbert space

of square-integrable, complex-valued functions on
$\mathbb {T}^{2}$
. Since the Haar measure
$\nu $
is used in all our direct integrals on
$\mathbb {T} ^{2}$
, for simplicity, we often remove the symbol
$\nu $
from the notation of
$L^{2}$
-spaces, unless this information is important to recall.
The Fourier transform can be applied in the fermionic and bosonic sectors. In the fermionic one, there is more than one natural way of implementing the transform, as the corresponding functions have two arguments in
$\mathbb {Z} ^{2}$
. It turns out that to be very useful to extract the total quasi-momentum of fermionic pairs. In fact, we consider the direct integral

of the (constant fiber) Hilbert space

over the torus
$\mathbb {T}^{2}$
, and choose a unitary transformation

in such a way that
$k\in \mathbb {T}^{2}$
, the fiber quasi-momentum, is exactly the total quasi-momentum of the fermion pair.
Recall that
$\mathfrak {H}$
defined in (20) is the Hilbert space on which H is originally defined. More precisely,

where

is the Fourier transform on
$\ell ^{2}(\mathbb {Z}^{2})$
, while the fermionic part

is the composition of two unitary (linear) transformations
$U_{1}$
and
$ U_{2} $
, whose exact definitions are given as follows:

and

Because
$\{\mathfrak {e}_{(x,\uparrow )}\wedge \mathfrak {e}_{(y,\downarrow )}\}_{x,y\in \mathbb {Z}^{2}}$
,
$\{\mathfrak {e}_{(x,y)}\}_{x,y\in \mathbb {Z} ^{2}}$
and
$\{\mathfrak {e}_{x}\otimes \mathfrak {e}_{y}\}_{x,y\in \mathbb {Z} ^{2}}$
are orthonormal bases and
$(x,y)\mapsto (x,x-y)$
is a bijection on
$ \mathbb {Z}^{2}\times \mathbb {Z}^{2}$
,
$U_{1}$
is well-defined as a composition of three unitary linear transformations. Note also that the last unitary linear transformation defining
$U_{2}$
is defined as in Proposition A.8, while the first one defining
$ U_{2}$
is the tensor product
$\mathcal {F}\otimes \mathcal {F}$
of the Fourier transform
$\mathcal {F}$
on
$\ell ^{2}(\mathbb {Z}^{2})$
, defined for any
$ f\in \ell ^{1}(\mathbb {Z}^{2})\subseteq \ell ^{2}(\mathbb {Z}^{2})$
by

$k\cdot x$
being the usual scalar product of
$k\in \mathbb {T}^{2}$
and
$x\in \mathbb {Z}^{2}$
, seen as vectors of
$\mathbb {R}^{2}$
. Here, we use the symbol
$\widehat {\left ( \cdot \right ) }$
to shorten the notation of the Fourier transform. For instance, for any
$x\in \mathbb {Z}^{2}$
, we write above
$\hat { \mathfrak {e}}_{x}$
to denote the function
$\mathrm {e}^{i(\cdot )\cdot x}$
on the torus
$\mathbb {T}^{2}$
. That is,
$\{\hat {\mathfrak {e}}_{x}\}_{x\in \mathbb {Z}^{2}}$
is the image under the Fourier transform of the canonical orthonormal basis
$\{\mathfrak {e}_{x}\}_{x\in \mathbb {Z}^{2}}$
(5) of
$\ell ^{2}(\mathbb {Z}^{2})$
.
For the reader’s convenience and completeness, in Section A.4, we gather key results from the theory of direct integrals with constant fiber Hilbert spaces. In the next subsection, we explain how the properties of the Hamiltonian
$H\in \mathcal {B}(\mathfrak {H} ) $
defined by (21) can be studied on the direct integral (24) over total quasi-momenta.
2.4 Fiber decomposition of the Hamiltonian
By explicit computations, exactly like in [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21], we show that the conjugation of the Hamiltonian
$H\in \mathcal {B}(\mathfrak {H})$
with the unitary transformation
$\mathbb {U}$
of Equation (26) is a decomposable operator on the direct integral
$L^{2}(\mathbb {T}^{2},\mathcal {H })$
. To state this result precisely, we need preliminary definitions allowing to define the so-called ‘fiber Hamiltonians’, or ‘fibers’ for short,
$A(k)\in \mathcal {B}(\mathcal {H})$
of
$ \mathbb {U}H\mathbb {U}^{\ast }$
at total quasi-momenta
$k\in \mathbb {T}^{2}$
. In fact, the mapping
$k\mapsto A(k)$
defines an element of the von Neumann algebraFootnote
10

of (equivalence classes of) strongly measurable functions
$\mathbb {T} ^{2}\rightarrow \mathcal {B}(\mathcal {H})$
. See Section A.4 for more details.
Given a total quasi-momentum
$k\in \mathbb {T}^{2}$
and the parameters
$ \epsilon ,h_{b}\in \mathbb {R}_{0}^{+}$
tuning the strengths of the two (fermionic and bosonic) kinetic parts of the model, we define continuous, real-valued functions
$\mathfrak {f}(k),\mathfrak {d}(k),\mathfrak {b}\in C\left ( \mathbb {T}^{2}\right ) $
on the torus
$\mathbb {T}^{2}$
by



for all
$p=(p_{1},p_{2})\in \mathbb {T}^{2}$
, where

Recall that (the
$(2\pi ,2\pi )$
-periodic function)
$\mathfrak {\hat {p}}_{1}$
and
$\mathfrak {\hat {p}}_{2}$
are the Fourier transform of
$\mathfrak {p}_{1}$
and
$\mathfrak {p}_{2}$
, which are the functions defining the operator
$ c_{y}^{\ast }$
in (11), representing the creation of fermion pairs in the model. Note also from (13) that
$ \mathfrak {d}(k)\neq 0$
for all
$k\in \mathbb {T}^{2}$
. Indeed, using
$ \mathfrak {p}_{2}(z)\doteq 0$
for
$z\notin 2\mathbb {Z}$
as well as (31) and (34),

where
$\mathrm {e}^{ik\cdot x}\mathfrak {p}_{\sharp }(x)$
stands for the function
$x\mapsto \mathrm {e}^{ik\cdot x}\mathfrak {p}_{\sharp }(x)$
with
$ \sharp \in \{1,2\}$
.
Then, at any quasi-momentum
$k\in \mathbb {T}^{2}$
and on-site repulsion strength
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, we define the bounded operators
$B_{1,1}\left ( k\right ) $
and
$A_{1,1}(\mathrm {U},k)$
acting on the Hilbert space
$L^{2}\left ( \mathbb {T}^{2}\right ) $
by


where
$M_{\mathfrak {f}(k)}$
stands for the multiplication operator by
$ \mathfrak {f}(k)\in C(\mathbb {T}^{2})$
and
$P_{x}$
is the orthogonal projection onto the one-dimensional subspace
$\mathbb {C}\mathfrak {\hat {e}} _{x}\subseteq L^{2}(\mathbb {T}^{2})$
. Note that the infinite sum defining the bounded operator
$B_{1,1}(k)$
is absolutely convergent, for the function
$\mathrm {u}:\mathbb {Z}^{2}\rightarrow \mathbb {R}$
is, by assumption, absolutely summable. See (7).
We define next


as well as

for any fixed
$k\in \mathbb {T}^{2}$
. By compactness of
$\mathbb {T}^{2}$
and continuity (in operator norm) of the mappings
$k\mapsto A_{i,j}(k)$
for all
$ i,j\in \{1,2\}$
, we have

(see Lemma 4.1), which is meanwhile the fiber decomposition of the operator
$\mathbb {U}H\mathbb {U}^{\ast }$
:
Proposition 2.1 (Fiber decomposition of the quantum model).
The conjugation of H by
$\mathbb {U}$
(26) is decomposable and has
$ A(\cdot )$
as its fibers; that is,

Proof. This is proven from explicit computations which are almost the same as those done in [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21]. We postpone the details of this calculation to Section 4.2.
The fiber decomposition given by Proposition 2.1 is useful because it gives access to spectral properties of H. In fact, for an operator that is decomposable on
$L^{2}\left ( \mathbb {T}^{2},\mathcal {H}\right ) $
, that is, an operator unitarily equivalent to an element of the von Neumann algebra
$L^{2}\left ( \mathbb {T}^{2},\mathcal {B}(\mathcal {H})\right ) $
, like the Hamiltonian H, the fibers
$A(k)$
of which are all self-adjoint, it is known that
$\lambda \in \sigma (H)$
if, and only if, for all
$\varepsilon>0$
,

See Theorem A.3. As is usual, here,
$\sigma (X)$
denotes the spectrum of any operator X acting on some Hilbert space.
3 Main results
In this section, we state our main results, starting with general spectral properties of the Hamiltonian H to finish with results related with scattering.
Recall that the model has parameters
$\epsilon ,\mathrm {U},h_{b}\in \mathbb {R }_{0}^{+}$
and
$\alpha _{0}\in \mathbb {R}^{+}$
, and it depends on the choice of functions

(with
$\mathfrak {p}_{2}(z)\doteq 0$
for
$z\notin 2\mathbb {Z}$
) that are absolutely summable and invariant with respect to
$90^{\circ }$
-rotations. Observe additionally that the functions
$\mathfrak {p}_{1}$
and
$\mathfrak {p} _{2}$
are required to be exponentially decaying; that is,
$\mathrm {e} ^{\alpha _{0}|\cdot |}\mathfrak {p}_{1}$
and
$\mathrm {e}^{\alpha _{0}|\cdot |} \mathfrak {p}_{2}$
are absolutely summable for some
$\alpha _{0}>0$
. See Equations (7), (12) and (16). All details of the Hamiltonian, like the precise choice of its parameters and functions, are not explicitly mentioned in our discussions or statements below, unless it is important for clearness. There is however one important condition to clarify:
While some of our results can be obtained without any other restriction, frequently we fix the parameter
$h_{b}$
in the interval
$[0,1/2]$
. This choice physically means that the boson is heavier than two fermions. As already discussed above, the assumption is perfectly justified when one views the two fermions and the boson of the model as being electrons or holes and a bipolaron, respectively, in a cuprate. In fact, polarons (and thus bipolarons) are charge carriers that are self-trapped inside a strong and local lattice deformation that surrounds them, caused by electrostatic interactions between the carriers and the lattice. A priori, such (strong and local) lattice deformations can barely move; that is, their effective mass is huge. See, for example, [Reference Dzhumanov34, Reference Dzhumanov35, Reference Dzhumanov, Karimboev and Djumanov36]. This is coherent with the assumption of a large mass of JT bipolarons in copper oxides [Reference Reagor37], similar to JT polarons [Reference Höck, Nickisch and Thomas39]. See also Section A.1 for more details.
We show that the condition
$h_{b}\in \lbrack 0,1/2]$
is crucial to obtain dressed bound fermion pairs, which are expected to represent the charge carriers below the pseudogap temperature [Reference Bru, de Siqueira Pedra and de Pasquale21].
3.1 Spectral properties
Having in mind Proposition 2.1 and Theorem A.3, we start with the spectral properties of fiber Hamiltonians (42) at any quasi-momentum
$k\in \mathbb {T }^{2}$
. This refers to the following theorem:
Theorem 3.1 (Spectral properties of fiber Hamiltonians).
Fix
$\epsilon ,\mathrm {U}\in \mathbb {R}_{0}^{+}$
,
$h_{b}\in \lbrack 0,1/2]$
and
$k\in \mathbb {T}^{2}$
.
-
i.) Essential spectrum
$\sigma _{\mathrm {ess}}\left ( \cdot \right ) $ of the fiber Hamiltonian:
$$ \begin{align*} \sigma _{\mathrm{ess}}\left( A\left( \mathrm{U},k\right) \right) =\, \mathfrak{f}\left( k\right) \left( \mathbb{T}^{2}\right) =2\epsilon \cos \left( k/2\right) \left[ -1,1\right] +4\epsilon . \end{align*} $$
-
ii.) Ground state energy: There is a unique nondegenerate eigenvalue
$\mathrm {E}(\mathrm {U},k)\leq \mathfrak {b}(k)$ of
$A(\mathrm {U} ,k) $ below the essential spectrum, with associated eigenvector
$$ \begin{align*} \Psi (\mathrm{U},k)\doteq (\hat{\psi}_{k}\left( \mathrm{U}\right) ,-1), \quad \text{where}\quad \hat{\psi}_{\mathrm{U},k}\doteq \hat{\upsilon}\left( k\right) \left( A_{1,1}\left( \mathrm{U},k\right) -\mathrm{E}\left( \mathrm{U },k\right) \mathbf{1}\right) ^{-1}\mathfrak{d}\left( k\right) \in L^{2}\left( \mathbb{T}^{2}\right) . \end{align*} $$
In addition,
$\mathrm {E}(\mathrm {U},k)=\mathfrak {b}(k)$ iff
$\hat {\upsilon } (k)=0$ . Recall that
$\mathfrak {b}(k)$ is defined by Equation (32).
-
iii.) Spectral gap and Anderson localization: If
$\hat {\upsilon } (0)\neq 0$ and
$r_{\mathfrak {p}}>0$ , then
$$ \begin{align*} \inf_{\mathrm{U}\in \mathbb{R}_{0}^{+}}\min_{k\in \mathbb{T}^{2}}\left\{ \min \sigma _{\mathrm{ess}}\left( A\left( \mathrm{U},k\right) \right) - \mathrm{E}\left( \mathrm{U},k\right) \right\}>0 \end{align*} $$
$C,\alpha \in \mathbb {R}^{+}$ such that, for all
$k\in \mathbb { T}^{2}$ and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$ ,
$$ \begin{align*} \left\vert \mathcal{F}^{-1}[\hat{\psi}_{\mathrm{U},k}]\left( x\right) \right\vert \leq C\mathrm{e}^{-\alpha |x|},\qquad x\in \mathbb{Z}^{2}. \end{align*} $$
-
iv.)
$\mathrm {E}\left ( \mathrm {U},\cdot \right ) :\mathbb {T} ^{2}\rightarrow \mathbb {R}$ is a continuous function, and if
$\hat {\upsilon }$ is of classFootnote 11
$C^{d}$ on
$(-\pi ,\pi )^{2}\backslash \{0\}\subseteq \mathbb {R}^{2}$ with
$d\in \mathbb {N}\cup \{\omega ,a\}$ , then so does
$\mathrm {E}\left ( \mathrm {U},\cdot \right ) $ on
$(-\pi ,\pi )^{2}\backslash \{0\}$ .
Proof. The theorem is a combination of Theorems 4.8, 4.9, 4.18 and 4.20 together with Propositions 4.2, 4.19 and Corollary 4.6 (see (130)).
Remark 3.2. Recall that if, for some natural number
$d\geq 1$
,

then the Fourier transform
$\hat {\upsilon }$
of the function
$\upsilon : \mathbb {Z}^{2}\rightarrow \mathbb {R}$
, as defined by (31), is of class
$C^{d}$
on the whole torus
$\mathbb {T}^{2}$
.
Remark 3.3. If
$\mathfrak {p}_{1}=\mathfrak {p}_{2}\in \mathbb {C}\mathfrak {e}_{0}$
,i.e.,
$ r_{\mathfrak {p}}=0$
, then Theorem 3.1 (iii) remains true, but not uniformly in
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. That is, in this case, one only has

and there are
$C_{\mathrm {U}},\alpha _{\mathrm {U}}\in \mathbb {R}^{+}$
such that, for all
$k\in \mathbb {T}^{2}$
,

Assertion (i) of Theorem 3.1 holds true for all
$h_{b}\in \mathbb {R}_{0}^{+}$
, but the other assertions need the restriction
$h_{b}\in \lbrack 0,1/2]$
to ensure that the eigenvalues are below the essential spectrum, as stated in Assertion (ii). In fact,
$h_{b}\in \lbrack 0,1/2]$
iff

for all
$k\in \mathbb {T}^{2}$
, with equality only at
$k=0$
. See Equation (32). Therefore, by Assertion (ii),
$\mathrm {E}(\mathrm {U},k)$
belongs to the essential spectrum iff
$k=0$
and
$\hat {\upsilon }(0)=0$
. Otherwise, we have a uniform spectral gap, as stated in Assertion (iii).
As is explained in [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21], the eigenvalue
$\mathrm {E} \left ( \mathrm {U},k\right ) $
given by Theorem 3.1 is associated with the formation of dressed bound fermion pairs with total quasi-momentum
$k\in \mathbb {T}^{2}$
. These pairs are generally exponentially localized, thanks to Theorem 3.1 (iii), which basically implies that the two fermions move together confined within some small ball; that is, they are tightly bound in space, provided
$\hat {\upsilon }(0)\neq 0$
. When
$\mathfrak {p}_{1}(z)\neq 0$
or
$\mathfrak {p}_{2}(z)\neq 0$
for some
$z\neq 0,$
or, equivalently,
$r_{\mathfrak {p}}>0$
, the size of the small does not depend upon the Hubbard coupling constant
$\mathrm {U}$
and a very large
$\mathrm {U}\gg 1$
only prevents two fermions from occupying the same lattice site. The condition
$\mathfrak {p}_{1}(z)\neq 0$
or
$\mathfrak {p} _{2}(z)\neq 0$
for some
$z\neq 0$
, or equivalently,
$r_{\mathfrak {p}}>0$
, is therefore pivotal to get (Cooper) fermion pairs, the natural candidates for superconducting charge carriers, in presence of strong on-site Coulomb repulsions, like in cuprates [Reference Imada, Fujimori and Tokura29, Reference Keimer, Kivelson, Norman, Uchida and Zaanen30].
For the usual (i.e., nonextended) Hubbard interaction (
$\mathrm {u}=0$
) and one-rangeFootnote
12
creation operators
$c_{y}^{\ast }$
of fermion pairs (in this case (12) holds true for all
$\alpha _{0}\in \mathbb {R}_{0}^{+}$
), note that a weak form of pair localization was previously shown in the ground state. See, for instance, [Reference Bru, de Siqueira Pedra and de Pasquale22, Theorem 3 and Proposition 13]. In this particular case, estimates of
$\mathrm {E}\left ( \mathrm {U},k\right ) $
and
$\Psi ( \mathrm {U},k)$
are known for large
$\mathrm {U}\gg 1$
. See, for example, [Reference Bru, de Siqueira Pedra and de Pasquale22, Theorem 4, Corollary 5, Theorem 16]. Recall that the aim in [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21] was to show the existence of d- and p-wave pairings in the ground state for some physically sound model, and not the systematic study of a general class of models. In [Reference Bru, de Siqueira Pedra and de Pasquale21], we conjecture that such dressed bound fermion pairs represent the charge carriers below the pseudogap temperature in cuprates.
Theorem 3.1 combined with Proposition 2.1 and the theory of direct integrals (cf. Theorem A.3) has direct consequences for the spectrum of the full Hamiltonian
$H\in \mathcal {B}(\mathfrak {H})$
, which is defined by Equation (21). Among other things, we obtain the following corollary:
Corollary 3.4 (Spectral properties of H).
Fix
$\epsilon ,\mathrm {U}\in \mathbb {R}_{0}^{+}$
and
$h_{b}\in \lbrack 0,1/2] $
. Then,

where
$\sigma \left ( H\right ) $
denotes, as is usual, the spectrum of H, and

If additionally
$\hat {\upsilon }(0)\neq 0$
and
$r_{\mathfrak {p}}>0$
, then

Proof. To prove the first assertion, it suffices to combine Proposition 2.1 and Theorem 3.1 with Theorem A.3. The second one can be proven like in [Reference Bru, de Siqueira Pedra and de Pasquale22] by using Kato’s perturbation theory [Reference Kato40]. In Proposition 4.11, we give an alternative and more direct proof of it. Finally, the last assertion is a consequence of the inequalities

and Theorem 3.1 (iii).
Physically speaking, the spectral values of H represent the energy levels that are available to the composite of two fermions and one boson – in particular, for a fermion pair exchanging a boson. As expected, the minimum energy E, also well-known as the ground state energy, is given by minimizing the eigenvalues
$\mathrm {E}(\mathrm {U},k)$
over the torus
$ \mathbb {T}^{2}$
.
We now study the model at very large on-site repulsion
$\mathrm {U}\gg 1$
. In fact, quoting [Reference Bru, de Siqueira Pedra and de Pasquale21], ‘in all cuprates, there is undeniable experimental evidence of strong on-site Coulomb repulsions, leading to the universally observed Mott transition at zero doping [Reference Imada, Fujimori and Tokura29, Reference Keimer, Kivelson, Norman, Uchida and Zaanen30]. This phase is characterized by a periodic distribution of fermions (electrons or holes) with exactly one particle per lattice site. Doping copper oxides with holes or electrons can prevent this situation. Instead, at sufficiently small temperatures a superconducting phase is achieved, as first discovered in 1986 for the copper oxide perovskite
$ \mathrm {La}_{2-x}\mathrm {Ba}_{x}\mathrm {CuO}_{4}$
[Reference Müller and Bednorz16]’. However, instead of the usual s-wave superconductivity, one experimentally observes d-wave superconductivity [Reference Tsuei and Kirtley33, Reference Keimer, Kivelson, Norman, Uchida and Zaanen30, Reference Plakida13]. The fact that only the s-wave pairing is suppressed also advocates for a very local (i.e., on-site) and strong effective repulsion of fermions. For this reason, we consider the limit
$\mathrm {U}\rightarrow \infty $
in our model. It corresponds to a hard core limit because it prevents two fermions from being on the same lattice site.
In the limit
$\mathrm {U}\rightarrow \infty $
, it is easy to see that the ground state energy
$E(\mathrm {U})$
of Corollary 3.4 defines an increasing function of
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, which is bounded from above by
$0$
. Hence,

For more details, see Lemma 4.14. The limit
$\mathrm {U}\rightarrow \infty $
of the eigenvalue and eigenvector of each fiber, given by Theorem 3.1, is less trivial to obtain and is the object of the next theorem:
Theorem 3.5 (Spectral properties of fiber Hamiltonians – Hard-core limit).
Fix
$\epsilon ,\mathrm {U}\in \mathbb {R}_{0}^{+}$
and
${h_{b}\in \lbrack 0,1/2] }$
. The following limits exist:

Assertion (iv) of Theorem 3.1 also holds true for
$\mathrm {U} =\infty $
. In addition, when
$r_{\mathfrak {p}}>0$
,
$\mathrm {E}(\infty ,k)= \mathfrak {b}(k)$
iff
$\hat {\upsilon }(k)=0$
. If
$r_{\mathfrak {p}}>0$
and
$ \hat {\upsilon }(0)\neq 0$
, then
$\Psi (\infty ,0)$
exists.
Note that the eigenvalues given by Theorem 3.1 are not explicitly known. The same is of course true in the hard-core limit
$\mathrm { U}\rightarrow \infty $
. For applications, it is important to have a sufficiently good control on these objects to be able to compute them, either analytically or numerically. This is done in [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21] for the special case of the usual Hubbard interaction (
$\mathrm {u}=0$
) and one-range creation operators
$c_{y}^{\ast }$
of fermion pairs, by providing estimates for
$\mathrm {E}\left ( \mathrm {U},k\right ) $
and
$\Psi \left ( \mathrm {U},k\right ) $
at large
$\mathrm {U}$
. See [Reference Bru, de Siqueira Pedra and de Pasquale22, Theorem 4, Corollary 5, Theorem 16].
Recall that
$\hat {\upsilon }$
is the Fourier transform of
$\upsilon $
, which is the function appearing in Equation (10), encoding the (exchange) interaction between fermion pairs and bosons. By Theorem 3.1, if
$\hat {\upsilon }(k)=0$
, then
$\mathrm {E}(\mathrm {U},k)$
is nothing else than the explicit function
$\mathfrak {b}(k)$
(32). Hence, we focus on the physically more relevant case
$\hat {\upsilon }(k)\neq 0$
. Using the Birman-Schwinger principle (Theorem A.10), we show in this case that the eigenvalue
$\mathrm {E}(\mathrm {U} ,k) $
is the unique solution to a relatively simple equation for real numbers, similar to the characteristic equation used to compute eigenvalues of matrices.
To this end, we define a function
$\mathfrak {T}:\mathcal {D}\rightarrow \mathbb {R}$
on the set

by

for any finite
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
,
$k\in \mathbb {T}^{2}$
and
$x\in \left ( 0,\mathfrak {z}\left ( k\right ) \right ) $
, while for the infinite on-site repulsion,
$k\in \mathbb {T}^{2}$
and
$x\in \left ( 0,\mathfrak {z} \left ( k\right ) \right ) $
,

the above limit existing by virtue of Corollary 4.13. Recall that
$\mathfrak {z}\left ( k\right ) $
is defined by Equation (75). In fact, for any
$k\in \mathbb {T}^{2}$
and
$x\in \left ( 0,\mathfrak {z}\left ( k\right ) \right ) $
,

where
$R_{\mathfrak {s},\mathfrak {s}}$
,
$R_{\mathfrak {s},\mathfrak {d}}$
,
$R_{ \mathfrak {d},\mathfrak {s}}$
and
$R_{\mathfrak {d},\mathfrak {d}}$
are four constants defined by Equations (99)–(102) with
$\lambda =x$
. When
$\mathrm {u}=0$
, these constants are given by explicit integrals on the torus
$\mathbb {T}^{2}$
[Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21]. Then, the eigenvalues, the existence of which is stated in Theorem 3.1, as well as their limits (Theorem 3.5), can be studied via the following characteristic equation:
Theorem 3.6 (Characteristic equation for the fiber ground states).
Fix
$\epsilon \in \mathbb {R}_{0}^{+}$
,
$h_{b}\in \lbrack 0,1/2]$
and
$k\in \mathbb {T}^{2}$
such that
$\hat {\upsilon }(k)\neq 0$
. Then, for any
$\mathrm {U }\in \left [ 0,\infty \right ] $
,
$\mathrm {E}(\mathrm {U},k)$
is the unique solution to the equation

Proof. For
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, combine Theorem 4.5 with Theorem 4.8, while for
$\mathrm {U}=\infty $
, use Theorem 4.15.
Notice that, more generally, for any fixed
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, the same characteristic equation determines all eigenvalues of the fiber lying in the resolvent set
$\rho (A_{1,1}(\mathrm {U},k))$
of the operator
$A_{1,1}(\mathrm {U},k)$
. See Theorem 4.5. Also the associated eigenspaces can be explicitly characterized, thanks to Corollary 4.6. In this context, Corollary 4.7 shows that, for any
$h_{b}\in \lbrack 0,1/2]$
and total quasi-momentum
$k\in \mathbb {T}^{2}$
, there is at most one eigenvalue of
$A( \mathrm {U},k)$
in each connected component of
$\rho (A_{1,1}(\mathrm {U} ,k))\cap \mathbb {R}$
.
3.2 Dispersion relation of dressed bound fermion pairs
By Theorem 3.1,
$\mathrm {E}:\mathbb {T}^{2}\rightarrow \mathbb { R}$
is a continuous family of nondegenerate eigenvalues, generally (at least for
$k\neq 0$
) associated with exponentially localized eigenvectors. Note that the case
$k=0$
is particular when
$\hat {\upsilon }(0)=0$
, since
$ \mathrm {E}(0)$
is not an isolated eigenvalue of
$A(\mathrm {U},0)$
. However, the family
$(\mathrm {E}(k))_{k\in \mathbb {T}^{2}}$
is still continuous. The peculiar behavior at
$k=0$
leads us to only consider total quasi-momenta in the subset

as, for instance, in Theorem 3.1 (iv).
Because of Proposition 2.1, the family
$(\mathrm {E}(k))_{k\in \mathbb {T}^{2}}$
can thus be seen as the effective dispersion relation of dressed bound fermion pairs. It is expected to determine transport properties of the quantum system at low temperatures. We now define in mathematical terms what a dispersion relation is.
First, a dispersion relation
$\varkappa :\mathbb {T}^{2}\rightarrow \mathbb {R} $
should be a functions mapping quasi-momenta
$k\in \mathbb {T}^{2}$
on the torus to spectral values of the corresponding fibers. More precisely,
$ \varkappa (k)$
should be an isolated eingenvalue of the fiber associated with the total quasi-momentum k. Recall that the dispersion relation of a (nonrelativistic) particle in the d-dimensional continuum (that is, the particle moves in the continuum d-dimensional space
$\mathbb {R}^{d}$
), whose (isotropic) mass is m, is
$\ k^{2}/2m$
and velocity
$v(k)=k/m$
,
$k\in \mathbb {R}^{d}$
. Having this standard example in mind, we would like also to derive from a dispersion relation a group velocity and a mass tensor, at any fixed quasi-momentum
$k\in \mathbb {T}^{2}$
, as is usual. These are key objects, for instance, in the study of transport properties. Notice that they require sufficient regularity of the dispersion relation to be defined.
Keeping in mind that all our quantities are parametrized by the on-site repulsion
$\mathrm {U}\in \left [ 0,\infty \right ] $
, we define a family of dispersion relations associated with the fiber Hamiltonians
$A(\mathrm {U},k)$
as follows:
Definition 3.7 (Family of dispersion relations).
A function
$\varkappa :\left [ 0,\infty \right ] \times \mathbb {T} ^{2}\rightarrow \mathbb {R}$
is said to be a family of dispersion relations
$ \varkappa (\mathrm {U},\cdot )$
if the following properties are satisfied for all
$\mathrm {U}\in \left [ 0,\infty \right ] $
:
-
i.) For any
$k\in \mathbb {T}^{2}$ and
$\mathrm {U}\in \mathbb {R} _{0}^{+}$ ,
$\varkappa (\mathrm {U},k)$ is an eigenvalue of
$A(\mathrm {U},k)$ and
$$ \begin{align*} \varkappa \left( \infty ,k\right) =\lim\limits_{\mathrm{U}\rightarrow \infty }\varkappa \left( \mathrm{U},k\right) . \end{align*} $$
-
ii.) For all
$\mathrm {U}\in \left [ 0,\infty \right ] $ ,
$\varkappa ( \mathrm {U},\cdot )\in C(\mathbb {T}^{2})$ and is of class
$C^{2}$ on the open set
$\mathbb {S}^{2}\subseteq \mathbb {R}^{2}$ .
The first property is a very natural property, having in mind Proposition 2.1 and the theory of direct integrals (Theorem A.3). The second property of Definition 3.7 is needed to define group velocities and mass tensors.
To explain these two concepts, we need the Hessian of functions
$f\in C^{2}( \mathbb {S}^{2})$
at fixed k, which is denoted by

where
$\mathsf {M}_{2}(\mathbb {R})$
is the set of
$2\times 2$
matrices with real coefficients. It is a straightforward consequence of the regularity of
$ f\in C^{2}(\mathbb {S}^{2})$
that

is continuous. For any
$f\in C^{2}(\mathbb {S}^{2})$
, we consider the set

with
$\mathsf {GL}_{2}\left ( \mathbb {R}\right ) \subseteq \mathsf {M}_{2}\left ( \mathbb {R}\right ) $
being the set of invertible
$2\times 2$
matrices with real coefficients. As
$\mathsf {GL}_{2}(\mathbb {R})\subseteq \mathsf {M}_{2}( \mathbb {R})$
is an open set (see [Reference Folland50, Theorem 1.4]), it then follows that

is also an open set.
We are now in a position to define group velocities and mass tensors of a family of dispersion relations.
Definition 3.8 (Group velocities and mass tensors).
At any
$\mathrm {U}\in \left [ 0,\infty \right ] $
, the group velocity
$\mathbf { v}_{\varkappa ,\mathrm {U}}:\mathbb {S}^{2}\rightarrow \mathbb {R}$
and the mass tensor
$\mathbf {m}_{\varkappa ,\mathrm {U}}:\mathfrak {M}_{\varkappa \left ( \mathrm {U},\cdot \right ) }\rightarrow \mathsf {M}_{2}\left ( \mathbb {R} \right ) $
associated with a family
$\varkappa :\left [ 0,\infty \right ] \times \mathbb {T}^{2}\rightarrow \mathbb {R}$
of dispersion relations are respectively defined by

We deduce from Theorem 4.9 that
$\mathrm {E}$
is a dispersion relation when the function
$\upsilon :\mathbb {Z}^{2}\rightarrow \mathbb {R}$
is at least
$2$
times continuously differentiable and, in this case, we can even compute the group velocity via the characteristic equation (Theorem 3.6).
Theorem 3.9 (Dispersion relations of dressed bound fermion pairs).
Fix
$\epsilon \in \mathbb {R}_{0}^{+}$
and
$h_{b}\in \lbrack 0,1/2]$
. Assume that
$\hat {\upsilon }\in C^{2}(\mathbb {S}^{2})$
.
-
i.) Then,
$\mathrm {E}:\left [ 0,\infty \right ] \times \mathbb {T} ^{2}\rightarrow \mathbb {R}$ given by Theorems 3.1 and 3.5 is a family of dispersion relations.
-
ii.) The associated group velocities are equal to
$$ \begin{align*}\mathbf{v}_{\mathrm{E},\mathrm{U}}\left( k\right) =\left. \left( \hat{\upsilon}\left( k\right) ^{2}\partial _{x}\mathfrak{T}\left( \mathrm{U},k,x\right) +1\right) ^{-1}\vec{\nabla}\left( \hat{\upsilon}\left( k\right)^{2}\mathfrak{T}\left( \mathrm{U},k,x\right) -\mathfrak{b}\left( k\right)\right) \right\vert _{x={\mathrm{E}\left( \mathrm{U},k\right) }} \end{align*} $$
$\mathrm {U}\in \left [ 0,\infty \right ] $ and
$k\in \mathbb {S}^{2}$ , with
$$ \begin{align*} \mathbf{v}_{\mathrm{E},\infty }\left( k\right) =\lim_{\mathrm{U}\rightarrow \infty }\mathbf{v}_{\mathrm{E},\mathrm{U}}\left( k\right) ,\qquad k\in \mathbb{S}^{2}. \end{align*} $$
-
iii.) If
$\hat {\upsilon }$ is real analytic on
$\mathbb {S}^{2}$ , then, for any
$\mathrm {U}\in \left [ 0,\infty \right ] $ , either
$\mathfrak {M}_{ \mathrm {E}\left ( \mathrm {U},\cdot \right ) }$ has full measure or
$\mathfrak {M }_{\mathrm {E}}=\emptyset $ . In particular, the tensor masses
$\mathbf {m}_{ \mathrm {E},\mathrm {U}}$ are either defined almost everywhere in
$\mathbb {S} ^{2}$ or not defined at all.
Similar to Remark 3.2, if for some strictly positive constant
$\gamma>0$
,

then the Fourier transform
$\hat {\upsilon }$
of the function
$\upsilon : \mathbb {Z}^{2}\rightarrow \mathbb {R}$
, as defined via (31), is real analytic on the whole torus
$\mathbb {T}^{2}$
. It is very natural to expect a local interaction between fermion pairs and bosons in (10), meaning here that the function
$\upsilon $
should even have finite support. In particular, all conditions of Theorem 3.9, including the ones of the third assertion, should hold true in the application to superconducting cuprates.
In fact, as shown in [Reference Bru, de Siqueira Pedra and de Pasquale21], the dispersion relation of Theorem 3.9 yields the formation of d-wave pairs when one adjusts the parameters of the model (with
$\mathrm {u}=0$
) to fit those of cuprate superconductors – in particular, the ones of the cuprate
$\mathrm {La}_{2-x}\mathrm {Sr}_{x}\mathrm {CuO}_{4}$
(LaSr 214) near optimal doping. When considering the usual Hubbard model – that is, the case where there is no other fermionic repulsion than the on-site one (i.e.,
$\mathrm {u}=0$
) and no fermion-boson exchange (i.e.,
$ \upsilon =0$
) –
$\mathrm {E}$
turns out to be the function
$\mathfrak {b}: \mathbb {T}^{2}\rightarrow \mathbb {R}$
, defined by (32), which is nothing else than the dispersion relation

of free bosons (bipolarons for cuprates).
By turning on the fermion-boson-exchange interaction, the dispersion relation of dressed bound fermion pairs with lowest energy can strongly deviate from
$\mathfrak {b}$
, the unperturbed one. Recall, for instance, that
$ \mathfrak {b}$
describes bosons with a very large mass as compared to the effective mass of electrons or holes in cuprates. However, as shown in [Reference Bru, de Siqueira Pedra and de Pasquale21], for typical parameters of the cuprate LaSr 214, the effective mass of the bound pair (with dispersion relation
$\mathrm {E}$
) is comparable to the mass of electrons or holes. This is a consequence of the mass of charge carriers calculated in [Reference Chen, Birgeneau, Kastner, Preyer and Thio52], and the fact that a large effective mass of dressed bound fermion pairs and a high fermion-pair depletion,Footnote
13
close to
$90\%$
as measured in [Reference Božović, He, Wu and Bollinger53], is not compatible with our model. This solves the so-called ‘large mass paradox’ of the microscopic theory of cuprate superconductors, based on some kind mechanism involving bipolarons. For more details, see [Reference Bru, de Siqueira Pedra and de Pasquale21] and references therein.
In fact, the effective mass of dressed bound fermion pairs, or more generally its (effective) mass tensor, depends strongly on the coupling function
$\hat {\upsilon }$
near its maximum. Bearing in mind Definitions 3.7 and 3.8, one can therefore provide via Theorems 3.6 and 3.8 not only qualitative but also quantitative information, which is important for describing the physical behavior of fermionic pairs formed in this way by means of a bosonic field. A natural question is then to study its scattering properties and this is precisely what we propose to do in the next section.
3.3 Quantum scattering
Scattering in quantum mechanics constitutes a well-established mathematical theory aiming at analyzing the behavior of quantum systems at large times. To this end, a reference (or free) Hamiltonian Y is chosen and the dynamics
$(\mathrm {e}^{itX})_{t\in \mathbb {R}}$
of the quantum system driven by the (full) Hamiltonian X is compared at large (negative and positive) times to
$(\mathrm {e}^{itY})_{t\in \mathbb {R}}$
. In fact, scattering theory can be viewed as a kind of perturbation theory for the absolutely continuous spectrum of X. See, for example, [Reference Kato40, Chapter X]. For standard textbooks explaining in detail the scattering theory, we recommend [Reference Reed and Simon41, Reference Newton42, Reference Yafaev43]. Below, for the reader’s convenience, we shortly recall definitions that are relevant here.
Take two bounded self-adjoint operators X and Y acting on two Hilbert spaces
$\mathcal {X}$
and
$\mathcal {Y}$
, respectively. Let
$P_{\mathrm {ac} }(Y) $
be the orthogonal projection onto the absolutely continuous space of
$ Y$
, which is defined as follows:

where
$\chi _{\Omega }$
is its characteristic functionFootnote
14
of any Borel set
$\Omega \subseteq \mathbb {R}$
. The so-called wave operators for the pair
$(X,Y)$
with identification operator
$J\in \mathcal {B}\left ( \mathcal {Y},\mathcal {X} \right ) $
is, by definition, the strong limit

when it exists. See, for instance, [Reference Yafaev43, Definition 1.3]. When
$ \mathcal {Y}=\mathcal {X}$
and
$J=\mathbf {1}$
, like in (Reference Yafaev43, Definition 1.1), we use the shorter notation

In case the above wave operators exist, they are partial isometries [Reference Reed and Simon41, Proposition 1, Sect. XI.3]. They are said to be complete when

See [Reference Reed and Simon41, p. 19, Sect. XI.3].
Similarly, in the general case,
$W^{\pm }\left ( X,Y;J\right ) $
are said to be complete whenever

See [Reference Reed and Simon41, p. 35, Sect. XI.3]. The corresponding scattering operator is equal to

It leads to the scattering matrix (or simply S-matrix) in a representation where Y is diagonal, because the scattering operator commutes with Y. See [Reference Yafaev43, Equation (1.12)].
Remark 3.10. For two bounded self-adjoint operators X and Y acting on two Hilbert spaces
$\mathcal {X}$
and
$\mathcal {Y}$
, noteFootnote
15
that

This is an elementary observation used to study the scattering channels in Section 4.6.
In our framework, the Hamiltonian X is the bounded self-adjoint operator

of Proposition 2.1, which acts on the Hilbert space
$L^{2}(\mathbb {T}^{2},\mathcal {H})$
. Below, two different (reference) Hamiltonians Y are taken into account, corresponding to two scattering channels: the unbound and bound pair channels. For cuprates, the first channel should be associated with the high temperature regime, while the second one is related to sufficiently low temperatures.
3.3.1 Unbound pair scattering channel
Far apart from each other, two fermions only experience a very weak repulsion force due to the extended Hubbard interaction while the probability that they bind together to form a boson is also very small. Thus, in this situation, one expects that the dynamics of such a pair is governed by the fermionic part, and even by the hopping term alone. During intermediate times, they could of course interact, as they may get close to each other, and they could even be bound together via the effective attraction caused by fermion-boson exchange processes. The lifetime of bound fermions should, however, be finite in this situation, and they are expected to be released at some point and behave again as two free fermions that go far apart from each other for large times. See Figure 2. We show below that this heuristics can be put in precise mathematical terms.

Figure 2 Illustration of the unbound pair scattering channel: Two free fermions of (quasi-) momentum
$k-p$
and q respectively (i.e., the full momentum of the fermionic pair is k) at time
$t=- \infty $
interact in finite time with the composite system – in particular with the bosonic field – to be asymptotically free again at time
$t=+ \infty $
, thanks to Theorem 3.11. Here,
$S_{k}=S\left ( A\left ( k\right ) ,\left ( M_{ \mathfrak {f}\left ( k\right ) }+R\left ( \mathrm {V},\mathrm {v}\right ) \right ) \oplus A_{2,2}\left ( k\right ) \right ) $
is the scattering operator of this process in each fiber k, which depends explicitly on
$ \hat { \upsilon }\left ( k\right ) $
. See Theorem 3.13 and the example given by Equations (63)–(64).
To this end, define the Hilbert space

as well as the Hamiltonian

for any
$\mathrm {V}\in \mathbb {R}_{0}^{+}$
and absolutely summable function
$ \mathrm {v}:\mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
, where

$M_{\mathfrak {f}\left ( k\right ) }$
being the fiber Hamiltonian defined as the multiplication operator by
$\mathfrak {f}(k)\in C(\mathbb {T}^{2})$
(see ( 33)) while
$P_{x}$
is the orthogonal projection onto the one-dimensional subspace
$\mathbb {C}\mathfrak {\hat {e}}_{x}\subseteq L^{2}( \mathbb {T}^{2})$
. Observe then that

Compare indeed (54) with Equations 42–37 and Proposition 2.1. By Lemma 4.21, note that
$P_{ \mathrm {ac}}(\mathrm {H}_{f})=\mathbf {1}$
.
Let us consider the identification operator
$\mathfrak {U}:\mathfrak {H} _{f}\rightarrow L^{2}\left ( \mathbb {T}^{2},\mathcal {H}\right ) $
defined for any purely fermionic state
$\psi \in \mathfrak {H}_{f}$
by

See Equation (25). Note that
$\mathfrak {U}$
is an isometry (i.e., a norm preserving linear transformation). In fact, it is the canonical fiberwise inclusion of
$\mathfrak {H}_{f}$
into
$L^{2}\left ( \mathbb {T}^{2}, \mathcal {H}\right ) $
. Recall from Proposition 2.1 that
$A\left ( \mathrm {U},\cdot \right ) $
, defined by (37)–(42), is the fiber decomposition of the operator
$\mathbb {U}H\mathbb {U}^{\ast }$
. Then, we obtain wave and scattering operators with respect to fermionic parts:
Theorem 3.11 (Unbound pair (scattering) channel).
Let
$\mathrm {V}\in \mathbb {R}_{0}^{+}$
and
$\mathrm {v}:\mathbb {Z} ^{2}\rightarrow \mathbb {R}_{0}^{+}$
be any absolutely summable function.
-
i.) The wave operators, as defined by (50) for
$X=\mathbb {U}H\mathbb {U}^{\ast }$ ,
$Y=\mathrm {H}_{f}$ and
$ J=\mathfrak {U}$ , satisfy
(57)with range equal to$$ \begin{align} W^{\pm }\left( \mathbb{U}H\mathbb{U}^{\ast },\mathrm{H}_{f};\mathfrak{U} \right) =\left( {\int_{\mathbb{T}^{2}}^{\oplus }}W^{\pm }\left( A\left( k\right) ,\left( M_{\mathfrak{f}\left( k\right) }+R\left( \mathrm{V},\mathrm{ v}\right) \right) \oplus A_{2,2}\left( k\right) \right) \nu \left( \mathrm{d} k\right) \right) \mathfrak{U} \end{align} $$
(58)$$ \begin{align} \mathrm{ran}\left( W^{\pm }\left( \mathbb{U}H\mathbb{U}^{\ast },\mathrm{H} _{f};\mathfrak{U}\right) \right) =\int_{\mathbb{T}^{2}}^{\oplus }L^{2}\left( \mathbb{T}^{2}\right) \oplus \{0\}\,\nu \left( \mathrm{d}k\right) . \end{align} $$
-
ii.) The scattering operator, as defined by (52) for
$X=\mathbb {U}H\mathbb {U}^{\ast }$ ,
$Y=\mathrm {H}_{f}$ and
$J= \mathfrak {U}$ , equals
$$ \begin{align*} S\left( \mathbb{U}H\mathbb{U}^{\ast },\mathrm{H}_{f};\mathfrak{U}\right) = \mathfrak{U}^{\ast }\left( {\int_{\mathbb{T}^{2}}^{\oplus }}S\left( A\left( k\right) ,\left( M_{\mathfrak{f}\left( k\right) }+R\left( \mathrm{V},\mathrm{ v}\right) \right) \oplus A_{2,2}\left( k\right) \right) \nu \left( \mathrm{d} k\right) \right) \mathfrak{U}. \end{align*} $$
Proof. Observe that the operator difference
$(\mathbb {U}H\mathbb {U}^{\ast }-\mathrm { H}_{f})$
is not trace-class (it is not even compact) and, thus, the existence of this scattering channel is not a direct consequence of the well-known Kato-Rosenblum theorem [Reference Reed and Simon41, Theorem XI.8]. In fact, one of the main steps of the proof is to show that this difference is the direct integral of a strongly measurable family of trace-class operators. By this means, we are then able to apply the Kato-Rosenblum theorem ‘fiberwise’ to deduce the first assertion. See Section 4.6.1 for more details – in particular, Theorem 4.23. Assertion (ii) is a direct consequence of Assertion (i) together with the theory of direct integrals.
Remark 3.12. If one would like to go back to the original Hilbert space
$\mathfrak {h}_{0}$
(19) for fermion pairs with opposite spins – that is, if one wishes to use space coordinates, instead of the quasi-momenta – then one employs Theorem 3.11, along with the observation that

where
$U_{f}$
is defined by (28). See also Equation (26) and Proposition 2.1.
Theorem 3.11 refers to the unbound pair (scattering) channel. The subspace
$\mathfrak {H}_{f}\subseteq \mathfrak {H}$
corresponds to the ‘incoming’ (
$+$
) and ‘outcoming’ (
$-$
) scattering states of the quantum system, in this particular scattering channel. Physically, this theorem shows, among other things, that the bosonic component of e
$^{itH}$
vanishes on this channel, as
$t\rightarrow \pm \infty $
. This is a direct consequence of Equation (58).
In addition, Equation (57) gives an explicit fiber decomposition of wave operators with respect to the purely fermionic Hamiltonian in terms of
$ k$
-dependent wave operators defined naturally from the fiber decomposition of the operator
$\mathbb {U}H\mathbb {U}^{\ast }$
. Mutatis mutandis for the scattering operator, thanks to Theorem 3.11 (ii). In other words, the knowledge of scattering properties of each fiber, almost everywhere, entirely determines the scattering properties of the composite system, made of two fermions and one boson. We can now use this property (i.e., Theorem 3.11) to obtain a more computable expression for the wave and scattering operators in each given fiber. This can be done via infinite series (perturbative expansions), thanks to Corollary A.2.
Below, we give an example of such a computation by taking
$\mathrm {U}=\mathrm { V}\in \mathbb {R}_{0}^{+}$
and
$\mathrm {v}=\mathrm {u}:\mathbb {Z} ^{2}\rightarrow \mathbb {R}_{0}^{+}$
in Theorem 3.11. With this particular choice, the unbound pair channel allows one to isolate the fermion-boson exchange mechanism, which in terms of Hamiltonians refers to the use of off-diagonal operators

in the fibers, where, for any
$t\in \mathbb {R}$
and
$k\in \mathbb {T}^{2}$
,

Recall that, for
$m,n\in \{1,2\}$
,
$A_{m,n}(k)$
is defined by (37)–( 41). Below,
$B_{m,n}(k)$
,
$m\neq n$
, stands for the norm-continuous family of operators
$(B_{m,n}^{(t)}(k))_{t\in \mathbb {R}}$
.
To shorten the notation, for any
$s,t\in \mathbb {R}$
, as well as two norm-continuous families
$X\equiv (X_{t})_{t\in \mathbb {R}}$
and
$Y\equiv (Y_{t})_{t\in \mathbb {R}}$
of bounded operators
$X_{t}:\mathcal {X} \rightarrow \mathcal {Y}$
and
$Y_{t}:\mathcal {Y}\rightarrow \mathcal {X}$
on two Hilbert spaces
$\mathcal {X}$
and
$\mathcal {Y}$
, respectively, we define the bounded operators:


The integrals above are Riemann ones, noting that
$(X_{t})_{t\in \mathbb {R}}$
and
$(Y_{t})_{t\in \mathbb {R}}$
are continuous families in Banach spaces – namely,
$\mathcal {B}\left ( \mathcal {X};\mathcal {Y}\right ) $
and
$\mathcal {B} \left ( \mathcal {Y};\mathcal {X}\right ) $
, respectively. Note that
$\cos _{\succ }\left ( XY;s,t\right ) \in \mathcal {B}\left ( \mathcal {Y}\right ) $
and
$\sin _{\succ }\left ( XY;s,t\right ) \in \mathcal {B}\left ( \mathcal {X}, \mathcal {Y}\right ) $
are always absolutely summable series in the operator norm. Then, we obtain the following results:
Theorem 3.13 (Scattering operators as pertubative series).
Let
$\varepsilon \in \mathbb {R}^{+}$
and
$\mathrm {H}_{f}\equiv \mathrm {H} _{f}\left ( \mathrm {U},\mathrm {u}\right ) $
. Then, for any
$\varphi \in \mathfrak {H}_{f}$
, there is
$T>0$
such that

Moreover, for any
$\varphi ,\psi \in \mathfrak {H}_{f}$
, there is
$T>0$
such that

where, for all
$s,t\in \mathbb {R}$
,

Proof. It suffices to combine Lemma 4.24 with Equation (141) and Theorem 3.11, similar to Corollary A.2.
Theorem 3.13 provides a way to approximate the scattering matrix associated with the fermion-boson-exchange interaction. Note for instance from (37)–(41) that the operator
$B_{1,2}^{(t)}\left ( k\right ) B_{2,1}^{(s)}\left ( k\right ) $
and
$B_{2,1}^{(t)}\left ( k\right ) B_{1,2}^{(s)}\left ( k\right ) $
have a relatively simple form for any
$s,t\in \mathbb {R}$
and
$k\in \mathbb {T }^{2}$
:


where
$P_{\mathfrak {d}\left ( k\right ) }$
is the orthogonal projection onto the one-dimensional subspace
$\mathbb {C}\mathfrak {d}\left ( k\right ) \subseteq L^{2}(\mathbb {T}^{2})$
. Similar computations can be done for other choices of
$\mathrm {V}\in \mathbb {R}_{0}^{+}$
and
$\mathrm {v}:\mathbb {Z} ^{2}\rightarrow \mathbb {R}_{0}^{+}$
in Theorem 3.11, like
$\mathrm {V}=0=\mathrm {v}$
(noninteracting fermion systems). See again Corollary A.2.
The understanding of this kind of scattering, regarding free fermion (electron) collisions, is relevant in physics, because it can allow the exchange function
$\upsilon $
of a real system to be studied. It is therefore important to have a model from which not only qualitative, but also quantitative, information can be obtained. This is the purpose of this section, in particular of Theorem 3.11, which gives the explicit dependency of scattering in terms of
$\upsilon $
.
3.3.2 Bound pair scattering channel
Similarly, we also prove the existence of a scattering channel for dressed bound pairs. As dressed bound pairs are space-localized objects (see, for example, Theorem 3.1 (iii)), the fermions forming the pair efficiently exchange a boson, at a non-negligible rate, via the terms
$W_{\mathrm { b\rightarrow f}}$
(9) and
$W_{\mathrm {f\rightarrow b}}$
(17) in the Hamiltonian H. In particular, such quantum states must have some non-negligible bosonic component representing the exchanged boson that ‘glues’ the two fermions together. This dressed bound pair is however expected to move like a free (quantum spinless) particle in the real space. See Figure 3. We translate this physical heuristics in precise mathematical terms by considering the effective dispersion relations

given by Theorems 3.1, 3.5 and 3.9.

Figure 3 Illustration of the bound pair scattering channel. Here, k is the full (quasi-)momentum of the (exponentially localized) dressed bound fermion pairs. The oscillating vertical lines between the two fermions (e.g., electrons) before the scattering process and afterwards characterize their bound via a bosonic (e.g., bipolaronic) particle transfer with coupling function
$\hat {\upsilon }\left ( k\right ) $
; see Figure 1. It illustrates the stability of these pairs of fermions in time, as expressed by Theorem 3.14, that is, the pairs cannot decay into an (even only asymptotically) unbound pair of fermions.
For any
$\mathrm {U}\in \mathbb {R}_{0}^{+}\cup \{\infty \}$
, we consider the identification operator

defined for any
$\varphi \in L^{2}\left ( \mathbb {T}^{2}\right ) $
byFootnote
16

where
$\Psi \left ( \mathrm {U},k\right ) $
is the eigenvector associated with the (nondegenerate) eigenvalue
$\mathrm {E}\left ( \mathrm {U},k\right ) $
, as given by Theorems 3.1, 3.5 and 3.9. Note from Theorem 3.1 that the mapping

is continuous on
$\mathbb {T}^{2}$
for any
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, and its pointwise limit

(cf. Theorem 3.5) is therefore measurable. In particular, the linear transformation
$\mathfrak {P}_{\mathrm {U}}$
is well-defined for any
$ \mathrm {U}\in \mathbb {R}_{0}^{+}\cup \{\infty \}$
. Moreover, one checks that it is norm-preserving.
Since
$\mathrm {E}\left ( \mathrm {U},\cdot \right ) \in C(\mathbb {T}^{2}; \mathbb {R})\subseteq L^{\infty }(\mathbb {T}^{2},\nu )$
(see Theorems 3.1 (iv) and 3.5), we can consider the multiplication operator by
$\mathrm {E}\left ( \mathrm {U},\cdot \right ) $
on
$ L^{2}(\mathbb {T}^{2})$
, which is denoted by

Remark also from Theorems 3.1 (iv) and 3.5 together with Corollary A.5 that
$ P_{\mathrm {ac}}(M_{\mathrm {E}\left ( \mathrm {U},\cdot \right ) })=\mathbf {1}$
whenever
$\hat {\upsilon }$
is real analytic on
$\mathbb {S}^{2}$
. We then study now the (dressed) bound pair scattering channel, which is much simpler than in the unbound pair channel:
Theorem 3.14 (Bound pair (scattering) channel).
Let
$h_{b}\in \lbrack 0,1/2]$
. Then the following assertions hold true:
-
i.) Dynamics and wave operators at finite
$\mathrm {U}\in \mathbb {R} _{0}^{+}$ :
$$ \begin{align*} \mathrm{e}^{it\mathbb{U}H\mathbb{U}^{\ast }}\mathfrak{P}_{\mathrm{U}}= \mathfrak{P}_{\mathrm{U}}\mathrm{e}^{itM_{\mathrm{E}\left( \mathrm{U},\cdot \right) }},\qquad t\in \mathbb{R}. \end{align*} $$
-
ii.) Dynamics in the hard-core limit
$\mathrm {U}\rightarrow \infty $ :
$$ \begin{align*} s-{\lim\limits_{\mathrm{U}\rightarrow \infty }}\mathfrak{P}_{\mathrm{U}}= \mathfrak{P}_{\infty }\qquad \text{and}\qquad s-{\lim\limits_{\mathrm{U} \rightarrow \infty }}\,\mathrm{e}^{it\mathbb{U}H\mathbb{U}^{\ast }}\mathfrak{ P}_{\mathrm{U}}=\mathfrak{P}_{\infty }\mathrm{e}^{itM_{\mathrm{E}\left( \mathrm{\infty },\cdot \right) }},\qquad t\in \mathbb{R}. \end{align*} $$
From Theorem 3.14, (i) the scattering channel is time independent. For instance, for any
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, one trivially checks that

and since
$\mathfrak {P}_{\mathrm {U}}^{\ast }\mathfrak {P}_{\mathrm {U}}= \mathbf {1}$
and
$P_{\mathrm {ac}}$
is a projection, its scattering operator is equal to

If
$\hat {\upsilon }$
is additionally real analytic on
$\mathbb {S}^{2}$
, then
$ P_{\mathrm {ac}}(M_{\mathrm {E}\left ( \mathrm {U},\cdot \right ) })=\mathbf {1}$
, thanks to Theorem 3.1 and Corollary A.5. In this case, the wave and scattering operators are given by

for any
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. Their hard-core limit are then also trivial, thanks to Theorem 3.14 (ii).
This scattering channel is therefore easy to study. In particular, similar to Remark 3.12, we can easily go back to the original Hilbert space
$\mathfrak {h}_{0}$
(19), referring to space coordinates instead of the quasi-momenta. With this aim, we first observe that, for any
$ \mathrm {U}\in \mathbb {R}_{0}^{+}$
,

where
$\mathcal {P}_{\mathrm {U}}\in \mathcal {B}\left ( \mathfrak {h}_{0}, \mathfrak {H}\right ) $
is the new identification operator

and
$U_{f}\doteq U_{2}U_{1}$
. See Equations (26)–(28) and Proposition 2.1.
On the one hand, Equation (67) together with Theorem 3.9 shows that
$ \mathrm {E}\left ( \mathrm {U},\cdot \right ) $
defines (a family of) dispersion relations, in the sense of Definition 3.7. The Fourier transform of
$\mathrm {E}\left ( \mathrm {U},\cdot \right ) $
is the (effective) hopping strength for the (spatially localized) dressed bound pairs. On the other hand, the new identification operator
$\mathcal {P}_{\mathrm {U}}$
is translation invariant, that is,

where, for any fixed
$x\in \mathbb {Z}^{2}$
,
$\theta _{x}\in \mathcal {B}( \mathfrak {h}_{0})$
and
$\Theta _{x}\in \mathcal {B}(\mathfrak {H})$
are the unique unitary operators respectively satisfying

(see Equations (19)) and

See Equations (20). In addition, when
$\hat {\upsilon }(0)\neq 0$
, Theorem 3.1 (iii) shows that the (dressed) fermion pair in the bound pair channel is exponentially localized in space; that is, the associate fermion-fermion correlation function decays exponentially fast with respect to the distance between the fermions, uniformly in time. Note that it is not required that the range of
$\mathcal {P}_{\mathrm {U}}$
has a vanishing bosonic component, because of the expected presence of ‘gluing bosons’ in the dressed bound fermionic pair.
As a consequence, the bound channel describes an effective system of free localized, spinless quasi-particles which minimize the energy at any fixed total quasi-momentum. In particular, such quasi-particles of lowest energy, or dressed fermion pairs, are stable in time; that is, they cannot decay into an (even only asymptotically) unbound pair of fermions. Conversely, we also show in Section 3.3.1 that a pair of fermions that is asymptotically unbound far in the past is not able to bind together to form a stable bound pair in the distant future.
Nevertheless, these quasi-particles should only be stable with respect to external perturbations as soon as their states are related to quasi-momenta k such that
$\mathrm {E}\left ( \mathrm {U},k\right ) <0$
. If the (dressed) quasi-particle is in a state whose support contains fibers k such that
$\mathrm {E}\left ( \mathrm {U},k\right ) \geq 0$
, it is not in the most energetically favorable state, since

(see Theorem 3.1). In fact, if the component corresponding to quasi-momenta k such that
$\mathrm {E}\left ( \mathrm {U},k\right ) \geq 0$
has nonvanishing Lebesgue measure, then the quasi-particle should be instable with respect to external perturbations, by possibly creating unbounded fermions with small quasi-momenta to decrease its total energy. This situation clearly appears for quasi-momenta
$k\in \mathbb {T}^{2}$
such that
$\hat {\upsilon }(k)=0$
or sufficiently small
$\left \vert \hat {\upsilon } (k)\right \vert \ll 1$
when
$k\neq 0$
, since in these two cases, either
$ \mathrm {E}\left ( \mathrm {U},k\right ) =\mathfrak {b}\left ( k\right ) $
(
$\hat { \upsilon }(k)=0$
) or
$\mathrm {E}\left ( \mathrm {U},k\right ) \simeq \mathfrak {b} \left ( k\right ) $
(
$\left \vert \hat {\upsilon }(k)\right \vert \ll 1$
) with
$ \mathfrak {b}\left ( k\right ) \doteq h_{b}\epsilon \left ( 2-\cos \left ( k\right ) \right ) $
(see (32)). If such a decay process really occurs, one should see critical quasi-momenta, like in physical superconductors.
To prevent from this situation, one needs sufficiently strong
$\left \vert \hat {\upsilon }\left ( k\right ) \right \vert \gg 1$
to have
$\mathrm {E}\left ( \mathrm {U},k\right ) <0$
for all
$k\in \mathbb {T}^{2}$
. In the position space, this means that the exchange strength between the two fermions and the boson, represented by the function
$\upsilon :\mathbb {Z}^{2}\rightarrow \mathbb {R}$
appearing in (9)–(17), has to be sufficiently strong and localized, like in Remark 3.2, in order to get a sufficiently strong ‘gluing effect’ for dressed pairs, at all quasi-momenta. Recall also that the boson should be heavier than the two fermions (i.e.,
$h_{b}\in \lbrack 0,1/2]$
).
Last but not least, all this discussion can be extended to the hard-core limit
$\mathrm {U}\rightarrow \infty $
, in view of Theorems 3.9 and 3.14 (ii).
4 Technical results
4.1 Notation
The purpose of this section is to fix (or recall) the notation and terminology that is used throughout the rest of the article. Let
$\mathcal {X} $
be any complex Hilbert space. We denote its scalar product by
$\langle \cdot ,\cdot \rangle _{\mathcal {X}}$
, with the convention that it is antilinear in the first argument and linear in the second one. The norm of
$ \mathcal {X}$
is thus

When there is no danger of confusion, as already said in Remark 1.2, we usually omit the subscript referring to the Hilbert space and write
$\Vert \cdot \Vert $
for
$\Vert \cdot \Vert _{\mathcal {X}}$
and
$ \langle \cdot ,\cdot \rangle $
for
$\langle \cdot ,\cdot \rangle _{\mathcal {X }}$
.
Recall that
$\mathcal {B}(\mathcal {X})$
denotes the set of bounded (linear) operators on
$\mathcal {X}$
.
$\mathbf {1}\equiv \mathbf {1}_{\mathcal {X} }\in \mathcal {B}(\mathcal {X})$
is the identity operator. Given
$T\in \mathcal {B}(\mathcal {X})$
,
$T^{\ast }$
denotes its adjoint operator. The (full) spectrum, essential spectrum and resolvent set of any
$T\in \mathcal {B }(\mathcal {X})$
are denoted by
$\sigma (T)$
,
$\sigma _{\mathrm {ess}}(T)$
and
$\rho (T)$
, respectively. The operator norm of
$\mathcal {B}(\mathcal {X})$
is

Given
$T\in \mathcal {B}(\mathcal {X})$
,
$\mathcal {E}_{T}(\lambda )$
stands for the eigenspace associated with an eigenvalue
$\lambda \in \sigma (T)$
of T.
In all the Section 4, we study properties of the Hamiltonian
$H\in \mathcal {B}(\mathfrak {H})$
defined by (21). As one can see from (18)–(21) combined with (6), (8)–(11) and (17), it depends on several parameters. More precisely,
$\epsilon ,\mathrm {U},h_{b}\in \mathbb {R} _{0}^{+}$
and
$\alpha _{0}\in \mathbb {R}^{+}$
, while

(with
$\mathfrak {p}_{2}(z)\doteq 0$
for
$z\notin 2\mathbb {Z}$
) are all absolutely summable functions that are invariant with respect to
$90^{\circ } $
-rotations. See Equations (7), (12) and (16). The parameters of the operator H are always fixed and arbitrary, unless we need to specify them to clarify some particular statement. Recall that the invariance under
$90^{\circ }$
-rotation is not that important here. In fact, here, the only important point concerning this symmetry is that it implies that the Fourier transforms
$\hat {\upsilon }$
,
$ \hat {\mathfrak {p}}_{1}$
and
$\hat {\mathfrak {p}}_{2}$
are real-valued, because

that is, the real valued functions
$\upsilon $
,
$\mathfrak {p}_{1}$
and
$ \mathfrak {p}_{2}$
are reflection invariant, as a consequence of their
$ 90^{\circ }$
-rotation invariance. Apart of this technical point, it is mainly relevant for the study of unconventional pairings, which is not done in the present work.
Note additionally that the on-site repulsion
$\mathrm {U}\in \mathbb {R} _{0}^{+}$
appears explicitly in all the quantities defined in Sections 2–3. However, in Section 4, this parameter is only important for the Subsections 4.4 and 4.6. Therefore, unless the parameter
$\mathrm {U}$
is important for our discussions or statements, we omit it in order to shorten the notation, by writing

for any function
$f(\mathrm {U},k)$
of the parameters
$\mathrm {U}$
and k.
4.2 Computation of the fiber decomposition of the Hamiltonian
For completeness, we first proof in a simple lemma that the fiber Hamiltonians defined by (42) yield an element of
$ L^{\infty }\left ( \mathbb {T}^{2},\mathcal {B}(\mathcal {H})\right ) $
. Then, we prove Proposition 2.1.
Lemma 4.1 (Elementary properties of fiber Hamiltonians).
Fix
$h_{b},\epsilon ,\mathrm {U}\in \mathbb {R}_{0}^{+}$
. Then,
$A:\mathbb {T} ^{2}\rightarrow \mathcal {B}(\mathcal {H})$
, as defined by (42), is continuous and, in particular,
$A\left ( \cdot \right ) \in L^{\infty }\left ( \mathbb {T}^{2},\mathcal {B}(\mathcal {H})\right ) $
.
Proof. Since
$\cos :\mathbb {R}^{2}\rightarrow \mathbb {R}$
, as defined by (35), is a nonexpansive mapping with period
$2\pi $
, given
$ k,k^{\prime },p\in \mathbb {T}^{2}$
, the quantity

(see (33)) is bounded for any
$q\in \mathbb {Z}^{2}$
by

Hence, taking the minimum over all
$q\in \mathbb {Z}^{2}$
and the supremum over all
$p\in \mathbb {T}^{2}$
, we obtain from (22) and (38) that

for all
$k,k^{\prime }\in \mathbb {T}^{2}$
. In other words, the mapping

is (
$\epsilon $
-Lipschitz) continuous with respect to the metric
$d_{\mathbb { T}^{2}}$
. Similarly, we see that
$\mathfrak {b}:\mathbb {T}^{2}\rightarrow \mathbb {R}$
, defined by (32), is continuous with respect to
$d_{\mathbb { T}^{2}}$
, and hence, so is
$A_{2,2}:\mathbb {T}^{2}\rightarrow \mathcal {L}( \mathbb {C})$
(see (41)). In addition, by the triangle and Cauchy-Schwarz inequalities, for any
$k,k^{\prime }\in \mathbb {T}^{2}$
and
$ \varphi \in L^{2}(\mathbb {T}^{2})$
,

Because of (12) and (16),
$\mathfrak {d}(k),\hat { \upsilon }\in C\left ( \mathbb {T}^{2}\right ) $
. So, since
$\mathbb {T}^{2}$
is (
$d_{\mathbb {T}^{2}}$
-)compact and

we deduce from the last inequality and (39) that
$A_{2,1}:\mathbb {T} ^{2}\rightarrow L^{2}(\mathbb {T}^{2})^{\ast }$
is continuous. As
$ A_{1,2}(k)=A_{2,1}(k)^{\ast }$
for all
$k\in \mathbb {T}^{2}$
, we conclude that the mapping
$A:\mathbb {T}^{2}\rightarrow \mathcal {B}(\mathcal {H})$
is continuous, and hence bounded on the
$d_{\mathbb {T}^{2}}$
-compact set
$ \mathbb {T}^{2}$
.
We now compute the following unitary transformation of the Hamiltonian H (see (18)):

with
$\mathbb {U}$
defined by (26)–(31). In fact, the remaining part of this section is devoted to the computations leading to Proposition 2.1.
To begin with, we observe that, for any lattice site
$x\in \mathbb {Z}^{2}$
and spin
$s\in \{\uparrow ,\downarrow \}$
,
$b_{x}\doteq b(\mathfrak {e}_{x})$
and
$a_{x,s}\doteq a(\mathfrak {e}_{(x,s)})$
, where
$\{\mathfrak {e}_{x}\doteq \delta _{x,\cdot }\}_{x\in \mathbb {Z}^{2}}$
is the canonical orthonormal basis (5) of
$\ell ^{2}(\mathbb {Z}^{2})$
and
$a_{x,s}$
(
$ b_{x}^{\ast }$
) denotes the annihilation operator acting on the fermionic (bosonic) Fock space
$\mathfrak {F}_{-}$
(
$\mathfrak {F}_{+}$
) of a fermion (boson). In both cases,
$\Omega $
denotes the vacuum state. We compute each term of the the right-hand side (68) separately:
$\underline{\mathrm{Computation\ of}\ U_{f}H_{f}U_{f}^{\ast}\ \mathrm{in\ relation\ to}\ {A_{1,1}}}.$
We first note from (A.6) that, for any
$ x,y,u\in \mathbb {Z}^{2}$
and
$s\in \{\uparrow ,\downarrow \}$
,

vanishes whenever
$(x,s)\notin \{(y,\uparrow ),(u,\downarrow )\}$
. Using this observation and (A.7), one concludes that, for any
$y,u\in \mathbb {Z}^{2}$
,

Likewise, we see that, for any
$y,u\in \mathbb {Z}^{2}$
,

Moreover, as
$\mathrm {u}:\mathbb {Z}^{2}\rightarrow \mathbb {R}$
is absolutely summable and invariant with respect to
$180^{\circ }$
-rotations (cf. (7)), we also get that

for any
$y,u\in \mathbb {Z}^{2}$
, which, for
$\mathrm {u}\left ( z\right ) =\delta _{z,0}$
, is equal to

We thus infer from (6) combined with (69)–(72) that

for any
$y,u\in \mathbb {Z}^{2}$
. Then, conjugating
$H_{f}$
by the unitary operator
$U_{f}$
(28)–(31) gives the equality

for any
$y,u\in \mathbb {Z}^{2}$
. By first evaluating the above expression at
$k\in \mathbb {T}^{2}$
, and then at
$p\in \mathbb {T}^{2}$
, and using (33) and (35), we obtain that

for any
$y,u\in \mathbb {Z}^{2}$
. By (37)–(38), it follows that

As
$\{\mathfrak {\hat {e}}_{y}\left ( \cdot \right ) \mathfrak {\hat {e}} _{y-u}\}_{y,u\in \mathbb {Z}^{2}}$
is an orthonormal basis for the Hilbert space

we deduce from (73) that

$\underline{\mathrm{Computation\ of}\ \mathcal {F}H_{b}\mathcal {F}^{\ast }\ \mathrm{in\ relation\ to}\ A_{2,2}.}$
Using (A.8) and (A.9), we conclude that, for any
$y\in \mathbb {Z}^{2}$
,

so that

Therefore, using that
$\mathfrak {\hat {e}}_{y}\equiv \mathcal {F(\mathfrak {e}} _{y}\mathcal {)=}\,\mathrm {e}^{ik\cdot y}$
(see (31)) as well as (32), (35) and (41), we arrive at the result

for all
$y\in \mathbb {Z}^{2}$
and
$k\in \mathbb {T}^{2}$
. As
$\{\hat { \mathfrak {e}}_{y}\}_{y\in \mathbb {Z}^{2}}$
is an orthonormal basis for
$ L^{2}(\mathbb {T}^{2})$
, it follows that

$\underline{\mathrm{Computation\ of}\ \mathcal {F}W_{\mathrm {f\rightarrow b} }U_{f}^{\ast }\ \mathrm{and}\ U_{f}W_{\mathrm {b\rightarrow f}}\mathcal {F}^{\ast }\ \mathrm{in\ relation\ to}\ A_{2,1}\ \mathrm{and}\ A_{1,2}}.$
Observe from (11) and (A.7) that, for all
$y\in \mathbb {Z}^{2}$
,

and as a consequence, using (9)–(10) as well as ( A.8), we get that, for any
$u\in \mathbb {Z}^{2}$
,

Therefore, by (28)–(31), for any
$u\in \mathbb {Z }^{2}$
,

In particular, as
$\upsilon ,\mathfrak {p}_{1},\mathfrak {p}_{2}:\mathbb {Z} ^{2}\rightarrow \mathbb {R}$
are absolutely summable and invariant with respect to
$180^{\circ }$
-rotations (cf. (12) and (16)), we deduce from the last equality that, for any
$u\in \mathbb {Z}^{2}$
and
$k,p\in \mathbb {T}^{2}$
,

Using now that

(
$\mathfrak {p}_{2}\left ( z\right ) \doteq 0$
for
$z\notin (2\mathbb {Z})^{2}$
), we arrive at the equalities

with
$\mathfrak {d}(k)(p)$
and
$A_{1,2}(k)$
defined by (34) and (40), respectively. As
$u\in \mathbb {Z}^{2}$
and
$k,p\in \mathbb {T}^{2}$
are arbitrary in the above equations, this shows that
$U_{f}W_{\mathrm {b} \rightarrow \mathrm {f}}\mathcal {F}^{\ast }$
coincides with the bounded linear transformation

on the orthonormal basis
$\{\hat {\mathfrak {e}}_{u}\}_{u\in \mathbb {Z}^{2}}$
and, therefore,
$U_{f}W_{\mathrm {b}\rightarrow \mathrm {f}}\mathcal {F}^{\ast }=J $
. By taking adjoints on both sides, we also obtain
$\mathcal {F}W_{ \mathrm {f}\rightarrow \mathrm {b}}U_{f}^{\ast }=J^{\ast }$
. Finally, one can easily check from (39) that

This completes the proof of Proposition 2.1.
4.3 Spectrum of the fiber Hamiltonians
We start with the study of the essential spectrum of fiber Hamiltonians (42) at any total quasi-momentum
$k\in \mathbb {T}^{2}$
, before considering afterwards the discrete one in Section 4.3.2. Then, in Section 4.3.3, we study the bottom of the spectrum (
$k\in \mathbb {T}^{2}$
being fixed). The whole study leads to important spectral properties of H, as previously explained, via Proposition 2.1 combined with Theorem A.3.
4.3.1 Essential spectrum
The essential spectrum
$\sigma _{\mathrm {ess}}(A(k))$
of the fiber Hamiltonian

defined by (42) at fixed
$\mathrm {U}\in \mathbb {R} _{0}^{+}$
and total quasi-momentum
$k\in \mathbb {T}^{2}$
, is completely determined by the following proposition:
Proposition 4.2 (Essential spectrum of fiber Hamiltonians).
For any
$k\in \mathbb {T}^{2}$
and
$h_{b},\epsilon ,\mathrm {U}\in \mathbb {R} _{0}^{+}$
, one has

where
$M_{\mathfrak {f}\left ( k\right ) }$
stands for the multiplication operator associated with the function
$\mathfrak {f}(k)$
(33), while
$ B_{1,1}(k)$
and
$A_{1,1}(k)\equiv A_{1,1}(\mathrm {U},k)$
are defined by (37) and (38), respectively.
Proof. Recall that
$\nu $
is the normalized Haar measure defined by (23) on
$\mathbb {T}^{2}$
. Fix
$k\in \mathbb {T}^{2}$
. If
$\lambda $
is an eigenvalue of
$M_{\mathfrak {f}\left ( k\right ) }$
with associated eigenvector
$\varphi \in L^{2}(\mathbb {T}^{2})$
, then

for almost every
$p\in \mathbb {T}^{2}$
. As
$\varphi \neq 0$
, there exists
$ \Omega \subseteq \mathbb {T}^{2}$
with strictly positive measure
$\nu (\Omega )>0$
such that the above equality holds true with
$\varphi (p)\neq 0$
for every
$p\in \Omega $
. Thus,
$\mathfrak {f}(k)(p)=\lambda $
for all
$p\in \Omega $
. Because

we can assume without loss of generality that
$\Omega \subseteq (-\pi ,\pi )^{2}$
. Since
$\mathfrak {f}(k)-\lambda $
is real analytic on the open domain
$(-\pi ,\pi )^{2}$
in
$\mathbb {R}^{2}$
and the zeros of any nonconstant real analytic function have null Lebesgue measure [Reference Mityagin49], we would have
$\nu (\Omega )=0$
, which contradicts our choice of the set
$\Omega $
. Recall indeed that
$\nu $
is the Lebesgue measure, up to a normalization constant (see (23)). Hence,
$M_{\mathfrak {f}\left ( k\right ) } $
has no eigenvalues and, thus,

The last equality holds true, for
$\mathfrak {f}$
is a continuous function on a compact domain – namely, the torus
$\mathbb {T}^{2}$
. Clearly,

Observing that
$A_{1,2}(k)$
,
$A_{2,1}(k)$
,
$A_{2,2}(k)$
and
$P_{x}$
are all rank-one linear transformations, we can apply the stability of the essential spectrum under compact perturbations (see [Reference Schmüdgen95, Corollary 8.16]) to conclude that

In fact, from the absolute summability of the function
$\mathrm {u}:\mathbb {Z} ^{2}\rightarrow \mathbb {R}_{0}^{+}$
(see (7)), along with the closedness of the subspace of compact operators in the Banach space of all bounded operators, the operator defined by the infinite sum

is not only bounded, but even compact on the Hilbert space
$L^{2}(\mathbb {T} ^{2})$
. Recall that
$P_{x}$
denotes the orthogonal projection onto the one-dimensional subspace
$\mathbb {C}\hat {\mathfrak {e}}_{x}\subseteq L^{2}( \mathbb {T}^{2})$
for any
$x\in \mathbb {Z}^{2}$
. For the same reasons,

Corollary 4.3 (Bottom of the spectrum of
$A_{1,1}(k)$
and
$B_{1,1}(k)$
).
For any
$k\in \mathbb {T}^{2}$
, one has that

Proof. Fix
$k\in \mathbb {T}^{2}$
. Since
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, one has the operator inequalities

for the set of positive operators on a Hilbert space forms a norm-closed convex cone. By combining the last inequalities with Proposition 4.2, one arrives at the assertion.
4.3.2 Discrete spectrum
In the following, it is technically convenient to assume that
$\mathfrak {b} (k) $
, which is the kinetic energy of a boson with quasi-momentum k, is below the bottom of the spectrum of
$A_{1,1}(k)$
– that is, the minimum energy of the fermion pair for the same total quasi-momentum. In other words, we assume from now on that

for all
$k\in \mathbb {T}^{2}$
, with equality only at
$k=0$
. See Equation (32) and Corollary 4.3. By direct computations,Footnote
17
one verifies that this amounts to take
$h_{b}$
in the interval
$ [0,1/2]$
. This means that we consider a regime where the boson mass is at least the mass of the two fermions, as physically expected for cuprate superconductors; see [Reference Bru, de Siqueira Pedra and de Pasquale21, Section 3.1].
Proposition 4.4 (Eigenvalues of fiber Hamiltonians – I).
Take any
$k\in \mathbb {T}^{2}$
and
$h_{b}\in \lbrack 0,1/2]$
.
-
i.)
$\lambda \neq \mathfrak {b}(k)$ is an eigenvalue of
$A(k)$ iff there is a nonzero vector
$\varphi \in L^{2}(\mathbb {T}^{2})$ in the kernel of the bounded operator
$$ \begin{align*} A_{1,1}\left( k\right) -\lambda \mathbf{1}-\left( \mathfrak{b}\left( k\right) -\lambda \right) ^{-1}A_{1,2}\left( k\right) A_{2,1}\left( k\right) \in \mathcal{B}\left( L^{2}(\mathbb{T}^{2})\right) . \end{align*} $$
In this case,
$\lambda $ is an eigenvalue of
$A(k)$ with associated eigenvector
$$ \begin{align*} \left( \varphi ,-\left( \mathfrak{b}\left( k\right) -\lambda \right) ^{-1}A_{2,1}\left( k\right) \varphi \right) \in \mathcal{H}\doteq L^{2}\left( \mathbb{T}^{2}\right) \oplus \mathbb{C}. \end{align*} $$
-
ii.)
$\mathfrak {b}(k)$ is an eigenvalue of
$A(k)$ iff
$\hat {\upsilon } (k)=0$ .
Proof. Fix
$k\in \mathbb {T}^{2}$
and
$h_{b}\in \lbrack 0,1/2]$
. We start with the proof of Assertion (i): If
$\lambda \neq \mathfrak {b}(k)$
is an eigenvalue of
$A(k)$
with associated eigenvector
$(\varphi ,z)\in \mathcal {H}\backslash \{0\}$
, then we directly deduce from (42) that


By combining these two equations, we obtain

and thus,

We have that
$\varphi \neq 0$
, for otherwise z would also be zero, by (78), and this would contradict the fact that
$(\varphi ,z)$
is a nonzero vector. The converse is obvious and Assertion (i) holds true.
We now prove Assertion (ii): It is easy to check from (42) that
$\hat {\upsilon }(k)=0$
implies that

Conversely, suppose that
$\mathfrak {b}(k)$
is an eigenvalue of
$A(k)$
with associated eigenvector
$(\varphi ,z)\in \mathcal {H}\backslash \{0\}$
, but
$ \hat {\upsilon }(k)\neq 0$
. Then, by (39) and (42),

Remark that the second equality says that
$\varphi \,\bot \,\mathfrak {d}(k)$
, since we assume
$\hat {\upsilon }(k)\neq 0$
. Considering the scalar product of
$\varphi $
with both sides of the first equation, we then get that

see (40). Because
$h_{b}\in \lbrack 0,1/2]$
, if
$k\neq 0$
, then (75) holds true with a strict inequality, and therefore,

thanks to Corollary 4.3. Hence,

for some constant
$c>0$
, which, combined with (81), in turn implies that
$\varphi =0$
. If now
$k=0$
, then
$\mathfrak {b}(0)=0$
(see (32)) and we obtain from (81) that

since
$M_{\mathfrak {f}\left ( 0\right ) }\leq A_{1,1}(\mathrm {U},0)$
(see (37)–(38)). As

(see (33) and (35)) defines a positive and continuous function that vanishes at
$p=0$
only, one deduces from (82) that
$ \varphi =0$
also when
$k=0$
. In any case,
$\varphi =0$
and so, (80) combined with (40) yields

Since
$\mathfrak {d}(k)\neq 0$
and
$\hat {\upsilon }(k)\neq 0$
, we must have that
$z=0$
.Thus, we arrive at
$(\varphi ,z)=(0,0)$
, which contradicts the fact that
$(\varphi ,z)$
is a nonzero vector. Therefore, if
$\mathfrak {b} (k) $
is an eigenvalue of
$A(k)$
, then we must have
$\hat {\upsilon }(k)=0$
.
The Birman-Schwinger principle (Theorem A.10) allows us to transform the eigenvalue problem for the fiber Hamiltonian
$ A(k) $
into a nonlinear equation on the resolvent set
$\rho (A_{1,1}( \mathrm {U},k))$
of the operator
$A_{1,1}(\mathrm {U},k)$
, which is the resolvent set for a fermion pair with total quasi-momentum
$k\in \mathbb {T} ^{2}$
:
Theorem 4.5 (Characteristic equation for eigenvalues).
Fix
$h_{b}\in \lbrack 0,1/2]$
and
$k\in \mathbb {T}^{2}$
. Then,
${\lambda \in \rho (A_{1,1}(k))}$
is an eigenvalue of
$A(k)$
iff it is a solution to the equation

where
$\mathfrak {T}$
is the function defined by (45); that is,

Proof. Fix
$h_{b}\in \lbrack 0,1/2]$
and
$k\in \mathbb {T}^{2}$
. We divide the proof in several cases:
Case 1: We first consider the case
$\hat {\upsilon } (k)=0 $
and
$k\neq 0$
. In that situation,
$\mathfrak {b}(k)$
is trivially the only solution to (83). However, we already know from Proposition 4.4 (ii) that
$\mathfrak {b}(k)$
is an eigenvalue of
$A(k)$
. We must therefore prove that there is no other eigenvalue
$\lambda $
of
$A(k)$
in
$\rho (A_{1,1}(k))$
but
$\mathfrak {b}(k)$
. In fact, if such a
$\lambda \in \rho (A_{1,1}(k))$
exists, then, by Proposition 4.4 (i) with
$\hat {\upsilon }(k)=0$
,
$ A_{1,1}(k)-\lambda \mathbf {1}$
would have a nontrivial kernel, which is not possible, for
$\lambda $
is in the resolvent set of
$A_{1,1}(k)$
.
Case 2: Suppose that
$k=0$
and
$\hat {\upsilon }(0)=0$
. We observe that (83) has no solution because

thanks to Proposition 4.2. In addition, by applying Proposition 4.4 (i) and noting that
$ \mathfrak {b}(0)=0$
, we see that
$A(0)$
has no eigenvalues in
$\rho (A_{1,1}(0))$
.
Case 3: Finally, assume that
$\hat {\upsilon }(k)\neq 0$
and take
$\lambda \in \rho (A_{1,1}(k))$
. Observe from Proposition 4.4 (ii) that
$\mathfrak {b}(k)$
cannot be an eigenvalue of
$A(k)$
. Additionally,
$\mathfrak {b}(k)$
cannot be a solution to Equation ( 83). This last observation is proven as follows: When
$k=0$
, this is clear because
$\mathfrak {b}(0)=0$
is not even in the domain of the equation to be solved in (83). For
$k\neq 0$
, if
$\mathfrak { b}(k)$
is a solution to (83), then
$\mathfrak {T}(k, \mathfrak {b}(k))=0$
, but we know from Corollary 4.3 and
$h_{b}\in \lbrack 0,1/2]$
that

for some constant
$c>0$
. Therefore,
$\mathfrak {T}(k,\mathfrak {b}(k))=0$
would yield
$\mathfrak {d}(k)=0$
, which is obviously wrong, by (34). Therefore, in all cases,
$\mathfrak {b}(k)$
cannot be a solution to Equation ( 83), and we can assume that
$\lambda \neq \mathfrak {b}(k)$
. Now, the remaining part of the proof is essentially the same as the one of [Reference Bru, de Siqueira Pedra and de Pasquale22, Proposition 10], but we reproduce it for completeness. By ( 39)–(40), the orthogonal projection S onto the subspace
$ \mathbb {C}\mathfrak {d}(k)\subseteq L^{2}(\mathbb {T}^{2})$
can be written as

Then, observe from Proposition 4.4 (i) that
$\lambda $
is an eigenvalue of
$A(k)$
iff
$\lambda $
is an eigenvalue of
$T-V^{2}$
with


Thus, by applying Theorem A.10, we deduce that
$ \lambda $
is an eigenvalue of
$A(k)$
iff
$1$
is an eigenvalue of the corresponding Birman-Schwinger operator, which, with the above operators T and V, is equal to

Remark in this case that

since, obviously,

We thus conclude that
$\lambda $
is an eigenvalue of
$A(k)$
iff

This completes the proof.
Corollary 4.6 (Eigenspaces of fiber Hamiltonians).
Fix
$h_{b}\in \lbrack 0,1/2]$
and
$k\in \mathbb {T}^{2}$
. If
$\lambda \in \rho (A_{1,1}(k))$
is an eigenvalue of the fiber Hamiltonian
$A(k)$
, then the associated eigenspace is

where

In particular,
$\lambda $
is a nondegenerated eigenvalue of
$A(k)$
.
Proof. Assume that
$\hat {\upsilon }(k)\neq 0$
. Recall from the proof of Theorem 4.5 that in this case,
$\mathfrak { b}(k)$
is not an eigenvalue of
$A(k)$
, and so we assume without loss of generality that
$\lambda \neq \mathfrak {b}(k)$
. In this case, observe that we have (87) and a close look at the proof of Theorem A.10, in particular Lemma A.9, leads us to

where

by (85) and (86). From Proposition 4.4 (i) (
$\hat {\upsilon }(k)\neq 0$
), one then obtains that

In view of Equation (39), (88) and Theorem 4.5, the last vector can be rewritten as follows:

whenever
$\hat {\upsilon }(k)\neq 0$
. Finally, if
$\hat {\upsilon }(k)=0$
and
$ \lambda \in \rho (A_{1,1}(k))$
is an eigenvalue of
$A(k)$
, then it is straightforward to check that
$\lambda =\mathfrak {b}(k)$
with eigenspace generated by the vector
$(0,1)\in \mathcal {H}$
; see, for instance, (76)–(77) and (79).
Corollary 4.7 (Eigenvalues of fiber Hamiltonians – II).
Fix
$h_{b}\in \lbrack 0,1/2]$
and
$k\in \mathbb {T}^{2}$
. There is at most one eigenvalue of
$A(k)$
in each connected component of
$\rho (A_{1,1}(k))\cap \mathbb {R}$
.
Proof. In view of Theorem 4.5, it suffices to show that the derivative of the mapping

is strictly positive. For any
$x_{0}\in \rho (A_{1,1}(k))\cap \mathbb {R}$
, we have that

4.3.3 Bottom of the spectrum
As is well-known, physical properties of quantum systems at very low temperatures are essentially determined by the bottom of the spectrum of the corresponding Hamiltonian. In our case, having in mind the application to superconductivity in cuprates, we would like to study the bottom of the spectrum of the Hamiltonian
$H\in \mathcal {B}(\mathfrak {H})$
defined by (21). By Proposition 2.1 and Theorem A.3, we thus study the bottom of the spectrum of the fiber Hamiltonian
$A(k)$
(42) at fixed total quasi-momentum
$ k\in \mathbb {T}^{2}$
, similar to [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21].
Theorem 4.8 (Bottom of the spectrum of
$A(k)$
).
Fix
$h_{b}\in \lbrack 0,1/2]$
. If
$k \neq 0$
, then there is exactly one eigenvalue
$\mathrm {E}\left ( k\right ) \equiv \mathrm {E}\left ( \mathrm {U} ,k\right )$
of the Hamiltonian
$A(k)$
strictly below
$\sigma _{\mathrm {ess} }\left ( A\left ( k\right ) \right )$
. In this case, the eigenvalue is nondegenerated and
$\mathrm {E}(k) < \mathfrak {b}(k)$
when
$\hat {\upsilon } (k) \neq 0$
, whereas
$\mathrm {E}(k) = \mathfrak {b}(k)$
if
$\hat {\upsilon }(k) = 0$
. This statement remains valid for
$k = 0$
provided
$\hat {\upsilon }(0) \neq 0$
.
Proof. Assume that
$k\neq 0$
. Recall that
$\mathfrak {z}(k)$
is defined in Corollary 4.3. From Corollaries 4.3 and 4.7, the interval
$(-\infty ,\mathfrak {z}(k))$
contains at most one eigenvalue of
$A(k)$
. By Corollary 4.6, the eigenvalue is nondegenerate, if it exists. If
$\hat {\upsilon }(k)=0$
, we know from Proposition 4.4 (ii) that
$\mathfrak {b}(k)$
is such an eigenvalue. Recall that for
$k\neq 0$
, one has that
$\mathfrak {b} (k)<\mathfrak {z}(k)$
, because
$h_{b}\in \lbrack 0,1/2]$
. Now, suppose that
$ \hat {\upsilon }(k)\neq 0$
. When
$\mathfrak {b}(k)\leq x<\mathfrak {z}(k)$
, we have

for some constant
$c>0$
, and hence,
$\mathfrak {T}(k,x)>0$
; see (84). Consequently,

which means that
$A(k)$
has no eigenvalues in the interval
$[\mathfrak {b}(k), \mathfrak {z}(k))$
, by Theorem 4.5. We shall now look for an eigenvalue in the interval
$(-\infty , \mathfrak {b}(k))$
. On the one hand, using Corollary 4.3, observe that

whenever
$x<\mathfrak {b}(k)$
. Taking
$x\rightarrow -\infty $
,
$\mathfrak {T} (k,x)$
tends to zero, and hence,

On the other hand, the continuity of the mapping

on
$(-\infty ,\mathfrak {b}(k)]$
gives us

By the intermediate value theorem, there is
$\mathrm {E}(k)\in (-\infty , \mathfrak {b}(k))$
such that

By Theorem 4.5,
$\mathrm {E} (k)$
must be an eigenvalue of
$A(k)$
.
The proof for
$k=0$
is done in a similar way. Basically, the only difference is that, in this case,
$\mathfrak {b}(0)=\mathfrak {z}(0)=0$
and

occurs due to other reasons. Indeed, from Corollary 4.3, we deduce that
$\mathfrak {T}(0,\cdot )$
is strictly positive on the interval
$(-\infty ,0)$
. Because of (89), we also have that
$\partial _x\mathfrak {T}(0,x)|_{x = x_0} \geq 0$
whenever
$x_0 < 0$
. Thus, the limit of
$\mathfrak {T}(0,x)$
as
$x \to 0^-$
exists, being possibly infinite.
If
$\hat {\upsilon }(0)=0$
, then, by Theorem 4.5, the fiber Hamiltonian
$A(0)$
has no negative eigenvalues. In this case, we set
$\mathrm {E}(0)=0$
, which is obviously an eigenvalue of
$ A(0)$
with associated eigenvector
$(0,1)$
. (Note that
$\sigma _{\mathrm {ess} }(A(0))=[0,8\epsilon ]$
, by Proposition 4.2.) With this definition, observe that, for all
$k\in \mathbb {T}^{2}$
,
$ \mathrm {E}(k)$
is the minimum spectral value of
$A(k)$
:

The lowest eigenvalue
$\mathrm {E}(k)$
of
$A(k)$
, when
$\mathrm {E}(k)<0$
, is related to the formation of dressed bound fermion pairs of fermions with total quasi-momentum
$k\in \mathbb {T}^{2}$
. In Theorem 4.20, we make this claim more precise and prove the spatial localization of such bounded pairs. Before doing that, we study the regularity of the real-valued function

on the two-dimensional torus.
To this end, we rewrite the characteristic equation given by Theorem 4.5 via the function
$\Phi : \mathcal {O}\rightarrow \mathbb {R}$
defined by

where
$\mathcal {O}$
is the open set

Observe from Equation (89) that
$\partial _{x}\Phi>0$
over the whole domain of
$\Phi $
.
We now study the continuity of the function
$\mathrm {E}:\mathbb {T} ^{2}\rightarrow \mathbb {R}$
and give a sufficient condition for
$\mathrm {E}$
to be of class
$C^{d}$
on
$\mathbb {S}^{2}$
for every
$d\in \mathbb {N}\cup \{\omega ,a\}$
, where
$C^{d}(\Omega )$
,
$d\in \mathbb {N}$
, stands for the space of
$d$
times continuously differentiable functions on
$\Omega $
, while
$C^{\omega }(\Omega )$
and
$C^{a}(\Omega )$
refer to the space of smooth and real analytic functions on
$\Omega $
, respectively.
Theorem 4.9 (Regularity of the function
$\mathrm {E}$
).
Let
$h_{b}\in \lbrack 0,1/2]$
.
-
i.) The family
$\{\mathrm {E}(\mathrm {U},\cdot )\}_{\mathrm {U}\in \mathbb {R}_{0}^{+}}$ of real-valued functions on
$\mathbb {T}^{2}$ is equicontinuous with respect to the metricFootnote 18
$d_{\mathbb {T}^{2}}$ .
-
ii.) If
$\hat {\upsilon }\in C^{d}(\mathbb {S}^{2})$ (
$\mathbb {S} ^{2}\subseteq \mathbb {R}^{2}$ ) for some
$d\in \mathbb {N}\cup \{\omega ,a\}$ , then
$\mathrm {E}\equiv \mathrm {E}(\mathrm {U},\cdot )\in C^{d}(\mathbb {S} ^{2}) $ for all
$\mathrm {U}\in \mathbb {R}_{0}^{+}$ . In this case,
(93)$$ \begin{align} \partial _{k_{j}}\mathrm{E}\left( k\right) =-\left( \partial _{x}\Phi \left( {k,\mathrm{E}\left( k\right) }\right) \right) ^{-1}\partial _{k_{j}}\Phi \left( {k,\mathrm{E}\left( k\right) }\right) ,\qquad k\in \mathbb{S}^{2},\ j\in \{1,2\}. \end{align} $$
Proof. By the spectral theorem, we deduce from (90) that, for any
$ \mathrm {U}\in \mathbb {R}_{0}^{+}$
,

Given any
$\varepsilon>0$
and
$k_{0}\in \mathbb {T}^{2}$
, by the (operator norm) continuity of the mapping
$A\left ( 0,\cdot \right ) :\mathbb {T} ^{2}\rightarrow \mathcal {B}(\mathcal {H})$
at the point
$k_{0}$
, we can find
$ \delta>0$
such that

for every
$k\in \mathbb {T}^{2}$
with
$d_{\mathbb {T}^{2}}(k,k_{0})<\delta $
. Recall that
$\mathcal {H}$
stands for the (fiber) Hilbert space
$L^{2}( \mathbb {T}^{2})\oplus \mathbb {C}$
(see (25)), while
$d_{\mathbb {T} ^{2}}$
is the metric (22) on the torus
$\mathbb {T}^{2}$
. Therefore,
$\mathrm {E}:\mathbb {T}^{2}\rightarrow \mathbb {R}$
can be expressed as the infimum over the equicontinuous family
$\{\langle \psi ,A( \mathrm {U},\cdot )\psi \rangle \}_{\mathrm {U}\in \mathbb {R}_{0}^{+},\psi \in \mathcal {H},\Vert \psi \Vert =1}$
of (continuous) functions and Assertion (i) follows.
Take again some
$k_{0}\in \mathbb {S}^{2}$
. Assume that
$\hat {\upsilon }$
is of class
$C^{d}$
on
$\mathbb {S}^{2}\subseteq \mathbb {R}^{2}$
with
$d\in \mathbb {N}\cup \{\omega ,a\}$
. Let
$\vartheta =(k_{0},\mathrm {E}(k_{0}))\in \mathcal {O}$
. Using Theorems 4.5 and 4.8 as well as Equation (89) and the (operator norm) continuity of the mapping

one checks that
$\Phi \in C^{d}(\mathcal {O})$
with
$d\geq 1$
,
$\Phi (\vartheta )=0$
and
$\partial _{x}\Phi (\vartheta )\neq 0$
. See, for instance, (89). We can thus apply the implicit function theorem (see, for example, [Reference de Oliveira91] for an ordinary version and [Reference Toland92] for an analytic version) to obtain open subsets
$ U\subseteq \mathbb {R}^{2}$
and
$J\subseteq \mathbb {R}$
such that
$\vartheta \in U\times J\subseteq \mathcal {O}$
and, for each
$k\in U$
, there is a unique real number
$\xi (k)\in J$
satisfying
$\Phi (k,\xi (k))=0$
. Moreover, the mapping
$\xi :U\rightarrow J$
defined in this way is of class
$C^{d}$
and its partial derivatives are given by

As
$\xi (k_{0})=\mathrm {E}(k_{0})<\mathfrak {z}(k_{0})$
(see (90)), by continuity, there exists a neighborhood
$V\subseteq U$
of
$k_{0}$
such that
$\xi (k)<\mathfrak {z}(k)$
for every
$k\in V$
. It follows that, for all
$ k\in V$
,
$\xi (k)$
and
$\mathrm {E}(k)$
are in the same connected component
$ (-\infty ,\mathfrak {z}(k))$
, and from
$\partial _{x}\Phi>0$
, we conclude that
$\mathrm {E}\upharpoonright V=\xi \upharpoonright V$
. So,
$\mathrm {E}$
is of class
$C^{d}$
near
$k_{0}$
and (96) yields (93) for any
$k\in V$
– in particular, for
$k=k_{0}$
. As
$k_{0}$
is arbitrary, Assertion (ii) follows.
We can now deduce from Theorem 4.9 that
$\mathrm {E}$
is a dispersion relation (see Definition 3.7) when the function
$\hat {\upsilon }:\mathbb {S}^{2}\rightarrow \mathbb {R}$
is at least
$ 2 $
times continuously differentiable, and in this case, we can even compute the group velocity. To see this, recall that, for any
$f\in C^{2}(\mathbb {S} ^{2})$
, we define in (48) the subset

with
$\mathsf {GL}_{2}\left ( \mathbb {R}\right ) $
being the set of invertible
$ 2\times 2$
matrices with real coefficients.
Corollary 4.10 (
$\mathrm {E}$
as a dispersion relation and group velocity).
Let
$h_{b}\in \lbrack 0,1/2]$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. Then,
$ \mathrm {E}\equiv \mathrm {E}(\mathrm {U},\cdot )\in C(\mathbb {T}^{2})$
and is of class
$C^{2}$
on the open set
$\mathbb {S}^{2}\subseteq \mathbb {R}^{2}$
whenever
$\hat {\upsilon }$
is of class
$C^{2}$
on
$\mathbb {S}^{2}$
. In this case, the corresponding group velocity is

Moreover, if
$\hat {\upsilon }$
is real analytic (i.e., of class
$C^{a}$
in the above terminology) on
$\mathbb {S}^{2}$
, then either
$\mathfrak {M}_{ \mathrm {E}}$
has full measure or is empty.
Proof. The first part of the assertion is a direct application of Theorem 4.9. It remains to study the set
$\mathfrak {M}_{\mathrm {E}}$
. If
$\hat {\upsilon }$
is real analytic then, from Equation (47) and Theorem 4.9, the function
$f:\mathbb {S} ^{2}\longrightarrow \mathbb {R}$
defined by

is real analytic and satisfies
$f^{-1}(\{0\})=\mathbb {S}^{2}\backslash \mathfrak {M}_{\mathrm {E}}$
. Since the zeros of any nonconstant real analytic function have null Lebesgue measure (see, for example, [Reference Mityagin49]), either
$\mathfrak {M}_{\mathrm {E}}$
has full measure or is empty.
It is natural to derive now the ground state energy of the Hamiltonian
$H\in \mathcal {B}(\mathfrak {H})$
defined by (21), which is related to the ground state energy of fiber Hamiltonians, thanks to Proposition 2.1 and Theorem A.3. As expected, one has the following equality for the ground state energy

A proof can be done like [Reference Bru, de Siqueira Pedra and de Pasquale22, Lemma 8] by using Kato’s perturbation theory. In the sequel, we provide an alternative way of proving the equality, which is much more direct.
Proposition 4.11 (Bottom of the spectrum of H).
We have that

Proof. We first remark that the union

of the spectra of all fiber Hamiltonians is closed. To see this, let
$ (\lambda _{n})_{n\in \mathbb {N}}$
be a sequence of real numbers converging to
$\lambda \in \mathbb {R}$
with
$\lambda _{n}\in \sigma (A(k_{n}))$
for some
$k_{n}\in \mathbb {T}^{2}$
at
$n\in \mathbb {N}$
. By compactness of
$ \mathbb {T}^{2}$
, we can assume without loss of generality that
$ (k_{n})_{n\in \mathbb {N}}$
converges to some point
$k_{0}\in \mathbb {T}^{2}$
. By the (operator norm) continuity of the mapping
$A:\mathbb {T} ^{2}\rightarrow \mathcal {B}(\mathcal {H})$
, it follows that

(in operator norm). Hence,
$\lambda \in \sigma (A(k_{0}))$
, for otherwise
$ A(k_{n})-\lambda _{n}\mathbf {1}$
would be invertibleFootnote
19
for sufficiently large n. Thus,
$\mathcal {K}$
is a closed set, and as a consequence, for any
$s\notin \mathcal {K}$
, there is
$\varepsilon>0$
such that

With this property, we infer from Proposition 2.1 and Theorem A.3 that
$\sigma (H)\subseteq \mathcal {K}$
, which yields the inequality

Note that the last equality results from (90). By Theorem 4.9 (i),
$\mathrm {E}:\mathbb {T}^{2}\rightarrow \mathbb {R}$
is continuous. From the compactness of the torus
$\mathbb {T}^{2}$
and the Weierstrass extreme value theorem,
$\mathrm {E}$
has a minimizer in
$\mathbb {T }^{2}$
, say
$k_{0}\in \mathbb {T}^{2}$
. The continuity of
$\mathrm {E}$
at
$ k_{0}$
implies that, for every
$\varepsilon>0$
, there is
$\delta>0$
such that, for all
$k\in \mathbb {T}^{2}$
satisfying
$d_{\mathbb {T} ^{2}}(k,k_{0})<\delta $
,

and as a consequence,

where
$B_{\delta }(k_{0})$
is the open ball (for the metric
$d_{\mathbb {T} ^{2}}$
) centered at
$k_{0}\in \mathbb {T}^{2}$
of radius
$\delta \in \mathbb {R }^{+}$
. By Theorem A.3, this implies that
$\mathrm {E}(k_{0})\in \sigma (H)$
, which, combined with (97), yields the equalities

Using Equations (75) and (90), we note that

4.4 Spectral properties in the hard-core limit
We now study the spectral properties of fiber Hamiltonians
$A(k)$
(42) in the hard-core limit. It refers to the limit
$ \mathrm {U}\rightarrow \infty $
. In fact, a very strong on-site repulsion
$ \mathrm {U}$
(see Equation (6)) prevents two fermions of opposite spins from occupying the same lattice site. We study in particular the continuous function
$\mathrm {E}:\mathbb {T}^{2}\rightarrow \mathbb {R}$
defined by Theorem 4.8, which corresponds to the continuous family of nondegenerate eigenvalues at lowest energies in each fiber, in this limit.
An important result in this context is the characterization of such eigenvalues via the Birman-Schwinger principle, given by Theorem 4.5. In particular, we need first to study the hard-core limit of the characteristic equation, which amounts to determine the limit
$\mathrm {U}\rightarrow \infty $
of the quantity (84); that is,

for
$h_{b}\in \lbrack 0,1/2]$
and
$k\in \mathbb {T}^{2}$
. We start with this point, which allows us to study afterwards the limit of the lowest eigenvalues and all the derived quantities, like, for instance, the group velocity.
4.4.1 The characteristic equation in the hard-core limit
Let
$\mathfrak {s}=\mathfrak {\hat {e}}_{0}\in L^{2}(\mathbb {T}^{2})$
denote the constant function
$\ 1$
on the torus
$\mathbb {T}^{2}$
. For any fixed
$ k\in \mathbb {T}^{2}$
and
$\lambda \in \rho (B_{1,1}(k))$
, define the following four constants:




where we recall that
$B_{1,1}\left ( k\right ) $
is defined by (37). In the following lemma, we write
$\mathfrak {T}(\mathrm {U},k,\lambda )$
, which is defined by (98), in terms of these four quantities.
In the following, it is technically convenient to assume that
$\mathfrak {d} (k)\notin \mathbb {C}\mathfrak {s}$
for all
$k\in \mathbb {T}^{2}$
. Notice that this holds true iff
$\mathfrak {p}_{1}\notin \mathbb {C}\mathfrak {e}_{0}$
or
$ \mathfrak {p}_{2}\notin \mathbb {C}\mathfrak {e}_{0}$
(i.e.,
$r_{\mathfrak {p} }>0 $
). Indeed, recall (36); that is,

where
$\mathrm {e}^{ik\cdot x}\mathfrak {p}_{\sharp }(x)$
stands for the function
$x\mapsto \mathrm {e}^{ik\cdot x}\mathfrak {p}_{\sharp }(x)$
with
$ \sharp \in \{1,2\}$
. See also discussions around Equations 13–14.
Lemma 4.12. Let
$k\in \mathbb {T}^{2}$
,
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
and
$\lambda < \mathfrak {z}(k)$
, with
$\mathfrak {z}(k)\in \mathbb {R}$
defined in Corollary 4.3. Then,

with
$R_{\mathfrak {d},\mathfrak {d}}R_{\mathfrak {s},\mathfrak {s}}-\left \vert R_{\mathfrak {s},\mathfrak {d}}\right \vert ^{2}\geq 0$
. Moreover, if
$r_{ \mathfrak {p}}>0$
(see (14)–(15)), then the inequality is strict.
Proof. The proof of the first part is a slightly more complicated version of the one of [Reference Bru, de Siqueira Pedra and de Pasquale22, Lemma 14]. Fix
$k\in \mathbb {T}^{2}$
,
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
and
$\lambda <\mathfrak {z}(k)$
. Define the complex numbers

where we recall that

by Equation (38). Using Corollary 4.3, note at this point that

In particular,

and the resolvent operators
$(A_{1,1}(\mathrm {U},k)-\lambda \mathbf {1} )^{-1}$
and
$(B_{1,1}(k)-\lambda \mathbf {1})^{-1}$
are strictly positive. By using the second resolvent identity together with Equation (103), we compute that

Recalling that
$P_{0}$
is the orthogonal projection onto
$\mathbb {C} \mathfrak {s=\mathbb {C}\hat {e}}_{0}$
, we have that

In matrix notation, the above equations can be rewritten as

As
$(B_{1,1}(k)-\lambda \mathbf {1})^{-1}\geq 0$
(because of (105)) and
$\mathrm {U}R_{\mathfrak {s},\mathfrak {s}}\geq 0$
,

which means that the matrix appearing in the above determinant is invertible. From this, we conclude that

In particular, since
$R_{\mathfrak {s},\mathfrak {d}}=\overline {R_{\mathfrak {d} ,\mathfrak {s}}}$
,

Because
$\lambda <\mathfrak {z}(k)\leq 0$
and
$(B_{1,1}(k)-\lambda \mathfrak {1 })^{-1}\geq \left \vert \mathfrak {z}(k)-\lambda \right \vert ^{-1}\mathbf {1}$
(see (105)), the sesquilinear form

is a scalar product, and using the Cauchy-Schwarz inequality,Footnote 20 we deduce that

When
$r_{\mathfrak {p}}>0$
, the set
$\{\mathfrak {d}(k),\mathfrak {s}\}$
is linearly independent for every
$k\in \mathbb {T}^{2}$
.
The last lemma is useful to deduce the behavior of the quantity
$\mathfrak {T} (\mathrm {U},k,\lambda )$
at large Hubbard coupling constant
$\mathrm {U}\gg 1$
:
Corollary 4.13 (
$\mathfrak {T}(\mathrm {U}_{0},k, \lambda )$
at large on-site repulsions).
Let
$k\in \mathbb {T}^{2}$
and
$\lambda <\mathfrak {z}(k)$
, with
$\mathfrak {z} (k)\in \mathbb {R}$
defined in Corollary 4.3. Then, for all
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
,

where

Proof. By Lemma 4.12, we have
$R_{ \mathfrak {s},\mathfrak {s}}>0$
and
$R_{\mathfrak {d},\mathfrak {d}}R_{\mathfrak { s},\mathfrak {s}}\geq \left \vert R_{\mathfrak {s},\mathfrak {d}}\right \vert ^{2}$
while

4.4.2 Hard-core dispersion relation of bound pairs of lowest energy
We are now in a position to study the spectral properties of the model
$H\in \mathcal {B}(\mathfrak {H})$
defined by (21) in the hard-core limit. We study in particular its ground state energy
$E(\mathrm {U})$
(see Corollary 3.4) and the limit of the continuous function
$\mathrm {E}: \mathbb {T}^{2}\rightarrow \mathbb {R}$
defined by Theorem 4.8, which corresponds to the continuous family of nondegenerate eigenvalues at lowest energies in the fibers.
We start with the hard-core ground state energy, which is a well-defined quantity that even stays negative:
Lemma 4.14 (Existence of the hard-core ground state energy).
In the hard-core limit
$\mathrm {U}\rightarrow \infty $
, the hard-core ground state energy (44) is well-defined and is equal to

Proof. When
$0\leq \mathrm {U}\leq \mathrm {V}$
, one obviously has

see Equation (42). In other words,
$A(\mathrm {U},k)$
,
$ \mathrm {U}\in \mathbb {R}_{0}^{+}$
, defines an increasing family of bounded operators and by Proposition 4.2 and Equation (75),

whenever
$0\leq \mathrm {U}\leq \mathrm {V}$
. In particular, by taking the minimum over
$k\in \mathbb {T}^{2}$
, if
$0\leq \mathrm {U}\leq \mathrm {V}$
then

See Proposition 4.11. This shows that E is an increasing function of
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, which is bounded from above by
$0$
. This yields the assertion, thanks to the monotone convergence theorem.
We give in the next theorem a hard-core limit version of Theorems 4.5, 4.8 and 4.9. To this end, recall that
$\mathrm {E}\left ( k\right ) $
,
$k\in \mathbb {T}^{2}$
, are given by Theorem 4.8 as a family of nondegenerate eigenvalues. This family depends upon the parameter
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, and we thus use here the notation
$ \mathrm {E}(\mathrm {U},k)\equiv \mathrm {E}(k)$
. This defines a function
$ \mathrm {E}:\mathbb {R}_{0}^{+}\times \mathbb {T}^{2}\rightarrow \mathbb {R}$
.
Theorem 4.15 (Dispersion relation in the hard-core limit).
Let
$h_{b}\in \lbrack 0,1/2]$
. Recall that
$\mathfrak {T}(\infty ,k,\lambda )\geq 0$
is defined by (106).
-
i.) For every
$k\in \mathbb {T}^{2}$ , the following limit exists:
$$ \begin{align*} \mathrm{E}\left( \infty ,k\right) ={\lim\limits_{\mathrm{U}\rightarrow \infty }}\mathrm{E}\left( \mathrm{U},k\right) ={\sup\limits_{\mathrm{U}\in \mathbb{R}_{0}^{+}}}\mathrm{E}\left( \mathrm{U},k\right) \; \end{align*} $$
-
ii.)
$\mathrm {E}\left ( \infty ,\cdot \right ) :\mathbb {T} ^{2}\rightarrow \mathbb {R}$ is a continuous function;
-
iii.) For
$k\neq 0$ ,
$\mathrm {E}(\infty ,k)$ is the unique solution to the equation
(108)$$ \begin{align} \hat{\upsilon}\left( k\right) ^{2}\mathfrak{T}\left( \infty ,k,z\right) +z- \mathfrak{b}\left( k\right) =0,\qquad z<\mathfrak{z}(k). \end{align} $$
-
iv.) If
$\hat {\upsilon }$ is of class
$C^{d}$ on
$\mathbb {S} ^{2}\subseteq \mathbb {R}^{2}$ with
$d\in \mathbb {N}\cup \{\omega ,a\}$ , then so does
$\mathrm {E}(\infty ,\cdot )$ .
If, in addition,
$r_{\mathfrak {p}}>0$
, then
-
v.) For every
$k\in \mathbb {T}^{2}$ ,
$\mathrm {E}(\infty ,k)\leq \mathfrak {b}(k)$ with equality iff
$\hat {\upsilon }(k)=0$ .
Proof. Fix
$h_{b}\in \lbrack 0,1/2]$
. By (90) and (107) together with Corollary 4.3 and Theorem 4.8,

whenever
$0\leq \mathrm {U}\leq \mathrm {V}$
. This shows that, at any fixed
$ k\in \mathbb {T}^{2}$
, the function
$\mathrm {U}\mapsto \mathrm {E}\left ( \mathrm {U},k\right ) $
from
$\mathbb {R}_{0}^{+}$
to
$\mathbb {R}$
is increasing and bounded. Therefore, for any
$k\in \mathbb {T}^{2}$
,

thanks to the monotone convergence theorem. In particular, Assertion (i) holds true.
By Combining Equation (110) with (94), note that

Meanwhile, for any
$\varepsilon>0$
and
$k_{0}\in \mathbb {T}^{2}$
, there is
$ \delta>0$
such that

As a consequence, similar to what is done after (94), the set

is a family of equicontinuous functions on
$\mathbb {T}^{2}$
. Therefore,
$ \mathrm {E}(\infty ,\cdot )$
is continuous and Assertion (ii) holds true.
Fix
$k\neq 0$
. If
$\hat {\upsilon }(k)\neq 0$
and
$\mathrm {U}\in \mathbb {R} ^{+} $
, then we deduce from Corollary 4.3, Theorem 4.5 and Equations (109)–(110) that

and

Invoking next Corollary 4.13, we obtain the inequality

We can take the limit
$\mathrm {U}\rightarrow \infty $
in this last inequality by using (110) and the continuity of the mapping

at the point
$\mathrm {E}(\infty ,k)$
, to arrive at the equality

This proves that
$\mathrm {E}(\infty ,k)$
is a solution to (108). There is no other solution below
$\mathfrak {z}(k)$
because of the following arguments: Given any
$\mathrm {U}\in \lbrack 0,\infty ]$
, let

By Corollary 4.13,
$(f_{\mathrm {U} })_{\mathrm {U}\in \mathbb {R}_{0}^{+}}$
converges pointwise to
$f_{\infty }$
, as
$\mathrm {U}\rightarrow \infty $
. Since the pointwise limit of monotonically increasing function is again monotonically increasing, it follows that
$f_{\infty }$
is monotonically increasing. Given any
$x<y< \mathfrak {z}(k)$
, take any
$r>0$
with
$r\geq f_{\infty }(y)-f_{\infty }(x)\geq 0$
. Then, for some
$\mathrm {U}_{0}\in \mathbb {R}_{0}^{+}$
sufficiently large, one has

so that

Then, the mean value theorem combined with (89) implies that

for some
$c\in (x,y)$
. This implies that the function
$f_{\infty }$
is strictly increasing on
$(-\infty ,\mathfrak {z}(k))$
, and hence, there is a unique solution,
$\mathrm {E}\left ( \infty ,k\right ) $
, to (108). Meanwhile, if
$\hat {\upsilon }(k)=0$
, then Theorem 4.8 implies that
$\mathrm {E}(\mathrm {U},k)=\mathfrak {b }(k)$
for all
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, and obviously,
$\mathrm {E} (\infty ,k)=\mathfrak {b}(k)$
is the unique solution to (108).
Consider now the open set
$\mathcal {O}$
defined by (92). Let
$ \Phi (\infty ,\cdot ):\mathcal {O}\rightarrow \mathbb {R}$
be defined by

By Corollary 4.13, note that
$ \Phi (\infty ,\cdot ):\mathcal {O}\rightarrow \mathbb {R}$
is nothing else than the pointwise limit of the function
$\Phi (\mathrm {U},\cdot ):\mathcal {O }\rightarrow \mathbb {R}$
defined by (91):

The function
$\Phi (\infty ,\cdot ,\cdot )$
is a continuously differentiable function satisfying

thanks to Inequality (112). Observe also that
$\Phi (\infty ,\cdot )\in C^{d}(\mathcal {O})$
if
$\hat {\upsilon }$
is of class
$C^{d}$
on
$ \mathbb {S}^{2}\subseteq \mathbb {R}^{2}$
with
$d\in \mathbb {N}\cup \{\omega ,a\}$
. Therefore, by repeating essentially the same argument used in the proof of Theorem 4.9, one concludes that
$\mathrm {E} (\infty ,\cdot )\in C^{d}(\mathbb {S}^{2})$
whenever
$\hat {\upsilon }$
is of class
$C^{d}$
on
$\mathbb {S}^{2}\subseteq \mathbb {R}^{2}$
with
$d\in \mathbb { N}\cup \{\omega ,a\}$
.
Finally, assume that
$r_{\mathfrak {p}}>0$
; that is,
$\mathfrak {p}_{1}\notin \mathbb {C}\mathfrak {e}_{0}$
or
$\mathfrak {p}_{2}\notin \mathbb {C}\mathfrak {e} _{0}$
.For
$k\neq 0$
, we deduce from (111) and Lemma 4.12 that

To conclude the proof of Assertion (v), it remains to show that
$\hat { \upsilon }(0)\neq 0$
implies
$\mathrm {E}(\infty ,0)<\mathfrak {b}(0)$
. Assume on the contrary that

keeping in mind that
$\mathfrak {b}\left ( 0\right ) =\mathfrak {z}(0)=0$
. Then, we infer from Theorems 4.5 and 4.8 that the following equality must be true:

By Lemma 4.12, it follows that

where, by a slight abuse of notation,

Therefore, we deduce from Corollary 4.13 that

Now, observe that

where

Let
$\varphi _{\mathrm {U}}=\mathfrak {d}(0)-\alpha (\mathrm {U})\mathfrak {s} \neq 0$
and
$W_{\mathrm {U}}=B_{1,1}(0)-\mathrm {E}(\mathrm {U},0)\mathbf {1}$
. Since
$W_{\mathrm {U}}$
is strictly positive, it has an unique square root, which is also strictly positive. Then

As
$\langle \varphi _{\mathrm {U}},W_{\mathrm {U}}^{-1}\varphi _{\mathrm {U} }\rangle $
tends to
$0$
when
$\mathrm {U}\rightarrow \infty $
, it follows that also
$\varphi _{\mathrm {U}}$
vanishes in this limit; that is,

As
$\mathbb {C}\mathfrak {s}$
is a closed subspace of
$L^{2}(\mathbb {T}^{2})$
, we get that
$\mathfrak {d}(0)\in \mathbb {C}\mathfrak {s}$
, which is a contradiction. Therefore, Assertion (v) holds true.
Remark 4.16. Under the additional conditions that
$r_{\mathfrak {p}}>0$
(i.e.,
$\mathfrak {p }_{1}\notin \mathbb {C}\mathfrak {e}_{0}$
or
$\mathfrak {p}_{2}\notin \mathbb {C} \mathfrak {e}_{0}$
) and
$\hat {\upsilon }(0)\neq 0$
, Assertion (iv) remains valid with
$(-\pi ,\pi )^{2}$
instead of
$\mathbb {S}^{2}$
(i.e., by including the zero quasi-momentum case
$k=0$
). Also, Assertion (iii) holds true for
$k=0$
. In fact, under these further conditions, we know from Assertion (v) that
$\mathrm {E}(\infty ,0)<\mathfrak {z}(0)=0$
.
Corollary 4.17 (Hard-core dispersion relation).
Let
$h_{b}\in \lbrack 0,1/2]$
and
$\hat {\upsilon }$
be of class
$C^{2}$
on
$ \mathbb {S}^{2}$
. Then,
$\mathrm {E}(\infty ,\cdot )\in C(\mathbb {T}^{2})$
and is of class
$C^{2}$
on
$\mathbb {S}^{2}$
. In this case, for any
$k\in \mathbb { S}^{2}$
,

Moreover, if
$\hat {\upsilon }$
is real analytic (of class
$C^{a}$
, in our terminology) on
$\mathbb {S}^{2}$
, then either
$\mathfrak {M}_{\mathrm {E} \left ( \infty ,\cdot \right ) }$
has full measure or is empty.
Proof. Recall that
$\mathcal {O}$
is the open set (92) and
$\Phi (\cdot ,\cdot ):\mathbb {R}_{0}^{+}\times \mathcal {O}\rightarrow \mathbb {R}$
is the real-valued function defined by (91). Note that, for any
$\mathrm {U} \in \mathbb {R}_{0}^{+}$
, the function
$\Phi (\mathrm {U},\cdot )$
is a smooth function on
$\mathcal {O}$
, and we estimate its derivatives with respect to the parameter k, at fixed
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
and x, where
$(k,x)\in \mathcal {O}$
: For any
$(k,x)\in \mathcal {O}$
and
$\mathrm {U} \in \mathbb {R}_{0}^{+}$
, Equations 37–38 together with the second resolvent formula yield the derivative

for any
$j\in \{1,2\}$
, where
$k=(k_{1},k_{2})\in \mathbb {S}^{2}$
. Therefore, by (33) and (91), for any
$(k,x)\in \mathcal {O}$
and
$ \mathrm {U}\in \mathbb {R}_{0}^{+}$
,

for any
$j\in \{1,2\}$
, where
$k=(k_{1},k_{2})\in \mathbb {S}^{2}$
. Taking the second derivative, one can easily check that

for
$j\in \{1,2\}$
, where
$k=(k_{1},k_{2})\in \mathbb {S}^{2}$
. In the same way, we deduce from the identities

the following inequalities:


for any
$(k,x)\in \mathcal {O}$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
.
Now, fix
$(k_{0},x_{0})\in \mathcal {O}$
. The function
$\mathfrak {z}$
of Corollary 4.3 is continuous with respect to
$ k_{1},k_{2}$
, where
$k=(k_{1},k_{2})\in \mathbb {S}^{2}$
. Then, there is a closed cube centered at
$(k_{0},x_{0})$
with side length
$\delta \in \mathbb { R}^{+}$
contained in
$\mathcal {O}$
. Suppose that
$\partial _{x}\Phi (\mathrm { U},k_{0},x_{0})$
does not converge to
$\partial _{x}\Phi (\infty ,k_{0},x_{0})$
. Then we can find
$r_{0}\in \mathbb {R}^{+}$
and a sequence
$( \mathrm {U}_{n})_{n\in \mathbb {N}}$
of positive numbers such that
$\mathrm {U} _{n}\rightarrow \infty $
as
$n\rightarrow \infty $
and, for every
$n\in \mathbb {N}$
,

Let

By combining Equation (123) with the mean value theorem, we see that
$\mathfrak {F}$
is Lipschitz equicontinuous. In particular, this family of functions is equicontinuous. Moreover, it follows from (122) that
$\mathfrak {F}$
is bounded in the supremum norm. We can hence apply the (Arzelà-) Ascoli theorem [Reference Rudin44, Theorem A5], according to which
$ \partial _{x}\Phi (\mathrm {U}_{n},k_{0},\cdot )$
, when restricted to the compact interval
$[x_{0}-\delta ,x_{0}+\delta ]$
, converges uniformly along some subsequence. Assume, for simplicity and without loss of generality, that the (full) sequence of functions converges itself. By Equation (114), recall that
$\Phi (\mathrm {U},\cdot ,\cdot )$
converges pointwise to the function
$\Phi (\infty ,\cdot ,\cdot )$
on
$\mathcal {O}$
, which is is the real-valued function defined by (113). Then, by [Reference Rudin93, Theorem 7.17],

For
$x=x_{0}$
, this lead us to a contradiction. Thus,

for every
$(k,x)\in \mathcal {O}$
. In the same way, we invoke Equations (120) and (121) together with the mean value theorem and the (Arzelà-) Ascoli theorem [Reference Rudin44, Theorem A5] to deduce that

for every
$(k,x)\in \mathcal {O}$
. To prove the corollary, we eventually use these observations together with Theorem 4.15 (i), Corollary 4.10 and the equicontinuity of

with
$\mu $
standing for the variables
$k_{1}$
,
$k_{2}$
or x. Note that the last assertion concerning
$\mathfrak {M}_{\mathrm {E}\left ( \infty ,\cdot \right ) }$
is a direct consequence of the fact that the zeros of any nonconstant real analytic function have null Lebesgue measure [Reference Mityagin49]; see the proof of the same assertion for
$\mathrm {U}<\infty $
in Corollary 4.10.
Note that no additional condition is required for
$\mathrm {E}$
to have a well-defined hard-core limit. Compare Corollary 4.10 with Theorem 4.15. Moreover, by Corollary 4.17,
$\mathrm {E}(\infty ,\cdot )$
can be viewed as the (effective) dispersion relation of the dressed bound fermion pairs, with lowest energy, in the hard-core limit.
We close this section by showing the convergence of the low-energy eigenvector of
$A(\mathrm {U},k)$
for large Hubbard couplings. This refers to the (hard-core) limit
$\mathrm {U}\rightarrow \infty $
of the vector

see Equation (130).
Proposition 4.18 (Hard-core limit of eigenvectors).
Let
$h_{b}\in \lbrack 0,1/2]$
. Fix
$k\in \mathbb {T}^{2}\backslash \{0\}$
. The following limit exists:

This statement remains valid for
$k=0$
provided that
$\hat {\upsilon }(0)\neq 0 $
and
$r_{\mathfrak {p}}>0$
(i.e.,
$\mathfrak {p}_{1}\notin \mathbb {C} \mathfrak {e}_{0}$
or
$\mathfrak {p}_{2}\notin \mathbb {C}\mathfrak {e}_{0}$
).
Proof. Fix
$k\in \mathbb {T}^{2}$
with
$k\neq 0$
. By using the first resolvent formula together with Theorem 4.8, Theorem 4.15 (i) and (v), we find that

Note that
$\mathrm {E}(\infty ,k)\leq \mathfrak {b}\left ( k\right ) <\mathfrak {z }\left ( k\right ) $
for any
$k\neq 0$
. (When
$\hat {\upsilon }(0)\neq 0$
and
$ \mathfrak {p}_{1}\notin \mathbb {C}\mathfrak {e}_{0}$
or
$\mathfrak {p} _{2}\notin \mathbb {C}\mathfrak {e}_{0}$
, we also have that
$\mathrm {E}(\infty ,0)<\mathfrak {b}(0)=\mathfrak {z}(0)$
.) However, by Proposition A.14,
$\{(A_{1,1}(\mathrm {U},k)-\mathrm {E} (\infty ,k)\mathbf {1})^{-1}\}_{\mathrm {U}\geq 0}$
is a decreasing family of positive operators, and by Proposition A.15, it converges strongly as
$\mathrm {U}\rightarrow \infty $
. Consequently,
$ (A_{1,1}(\mathrm {U},k)-\mathrm {E}(\mathrm {U},k)\mathbf {1})^{-1}$
also converges strongly.
4.5 Spectral gap and Anderson localization
By Equation (90), the spectral gap of fiber Hamiltonians is equal to

for any Hubbard coupling constant
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. When
$ r_{\mathfrak {p}}>0$
(i.e.,
$\mathfrak {p}_{1}\notin \mathbb {C}\mathfrak {e} _{0} $
or
$\mathfrak {p}_{2}\notin \mathbb {C}\mathfrak {e}_{0}$
) and
$\hat { \upsilon }(0)\neq 0$
, this quantity turns out to be strictly positive, uniformly with respect to the parameter
$\mathrm {U}$
:
Proposition 4.19 (Uniform spectral gap of fiber Hamiltonians).
Fix
$h_{b}\in \lbrack 0,1/2]$
. If
$r_{\mathfrak {p}}>0$
(i.e.,
$\mathfrak {p} _{1}\notin \mathbb {C}\mathfrak {e}_{0}$
or
$\mathfrak {p}_{2}\notin \mathbb {C} \mathfrak {e}_{0}$
) and
$\hat {\upsilon }(0)\neq 0$
, then

Proof. The family
$\{\mathrm {E}(\mathrm {U},\cdot )\}_{\mathrm {U}\in \mathbb {R} _{0}^{+}}$
of real-valued functions on
$\mathbb {T}^{2}$
is equicontinuous, thanks to Theorem 4.9 (i). Since Proposition 4.2 says that, for any
$k\in \mathbb {T}^{2}$
,

we thus deduce from (124) that the family
$\{\mathfrak {g}( \mathrm {U},\cdot )\}_{\mathrm {U}\in \mathbb {R}_{0}^{+}}$
of real-valued functions on
$\mathbb {T}^{2}$
is equicontinuous. It follows that the function

is continuous. Moreover, from the compactness of
$\mathbb {T}^{2}$
,
$ \mathfrak {g}(\mathrm {U},\cdot )$
has a global minimizer, say
$k_{\mathrm {U} }\in \mathbb {T}^{2}$
for all
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. Since
$\hat { \upsilon }(0)\neq 0$
, by Theorem 4.8,

when
$k_{\mathrm {U}}=0$
, while in the case
$k_{\mathrm {U}}\neq 0$
,

In particular,

Using this together with the continuity of the function (126), we arrive at the inequality

for any positive parameter
$c\in \mathbb {R}_{0}^{+}$
. Now, we perform the limit
$\mathrm {U}\rightarrow \infty $
. Since
$\mathbb {T}^{2}$
is compact, the net
$\ (k_{\mathrm {U}})_{\mathrm {U}\in \mathbb {R}_{0}^{+}}$
converges along subnets (in fact, subsequences). Assume without loss of generality that
$(k_{\mathrm {U}})_{\mathrm {U}\in \mathbb {R}_{0}^{+}}$
converges to some
$ k_{\infty }\in \mathbb {T}^{2}$
(otherwise, one uses all the following arguments on subsequences). If
$k_{\infty }\neq 0$
, then

thanks to Theorem 4.8 and the continuity of the functions
$\mathfrak {z}$
and
$\mathfrak {b}$
. Assume now that
$k_{\infty }=0$
. Since, for all
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
,

we infer from the equicontinuity of the family
$\{\mathrm {E}(\mathrm {U} ,\cdot )\}_{\mathrm {U}\in \mathbb {R}_{0}^{+}}$
(Theorem 4.9 (i)) and Theorem 4.15 (i) that

Combined with Theorem 4.15 (v) and
$\hat {\upsilon }(0)\neq 0$
, this last limit in turn implies that

The assertion is therefore a combination of Inequalities (127), (128) and (129).
We study now the space localization of the (dressed) bound pair with total quasi-momentum
$k\in \mathbb {T}^{2}$
and energy
$\mathrm {E}\left ( \mathrm {U} ,k\right ) $
. Assume that
$\hat {\upsilon }(0)\neq 0$
. By Corollary 4.6, for any fixed
$k\in \mathbb {T}^{2}$
, it corresponds to study the fermionic part of the eigenvector

written in the real space
$\mathbb {Z}^{2}$
via the inverse Fourier transform
$\mathcal {F}^{-1}$
(see (31)). This function is denoted by

for any fixed
$k\in \mathbb {T}^{2}$
. One should not be confused here by the parameter k. Recall, for instance, that, given
$k\in \mathbb {T}^{2}$
,
$ \mathfrak {d}(k)\in C\left ( \mathbb {T}^{2}\right ) $
is itself a function on the torus
$\mathbb {T}^{2}$
, defined by

see Equation (34). In particular, observe that

We now show that this function is exponentially localized in the real space:
Theorem 4.20 (Exponentially localized dressed bound fermion pairs).
Fix
$h_{b}\in \lbrack 0,1/2]$
,
$k\in \mathbb {T}^{2}$
and suppose that
$r_{ \mathfrak {p}}>0$
(i.e.,
$\mathfrak {p}\notin \mathbb {C}\mathfrak {e}_{0}$
) and
$\hat {\upsilon }(0)\neq 0$
. There exist positive constants
$C,\alpha>0$
such that, for all
$k\in \mathbb {T}^{2}$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
,

Proof. By (36), we compute that

By Equation (133), it suffices to estimate the exponential decay of this particular function. This is done by using the celebrated Combes-Thomas estimates, which correspond here to Theorem A.13. To this end, several quantities, one of them being related to the spectral gap
$\mathfrak {g}(\mathrm {U},k)$
(124), have to be controlled and, as in Section A.6, we use the notation (A.22), that is,

for the distance between a complex number
$\lambda \in \mathbb {C}$
and the spectrum
$\sigma (T)$
of an operator
$T\in \mathcal {B}(\ell ^{2}(\mathbb {Z} ^{2}))$
, as well as (A.20), which, in the present case, refers to the quantity

for any
$T\in \mathcal {B}(\ell ^{2}(\mathbb {Z}^{2}))$
and
$\mu \in \mathbb {R} _{0}^{+}$
. We do it in three steps: The first one controls the spectral gap
$ \mathfrak {g}(\mathrm {U},k)$
(124) and a quantity like (135) for
$\lambda =\mathrm {E}(\mathrm {U},k)$
, while the second step is an analysis of quantities like (136). These two steps allow us to apply, in the last step, Theorem A.13 in order to get the desired result.
Step 1: Observe from Equation (124) and Proposition 4.19 that we can find
$\alpha>0$
such that, for all
$k\in \mathbb {T}^{2}$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
,

Using now Proposition 4.2 and the fact that

for all
$k\in \mathbb {T}^{2}$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, we deduce from the last inequalities that

see also Equation (135). Since
$\mathcal {F}$
is a unitary transformation,
$A_{1,1}(\mathrm {U},k)$
and
$\mathcal {F}^{\ast }A_{1,1}( \mathrm {U},k)\mathcal {F}$
have the same spectrum, and it follows that

for all
$k\in \mathbb {T}^{2}$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
.
Step 2: By Equations (5) and (31), one easily checks that

while, for any fixed
$k\in \mathbb {T}^{2}$
, the real-valued functions
$ \mathfrak {f}(k)$
, defined by (33) and (35) on the torus
$ \mathbb {T}^{2}$
, can be rewritten as

Therefore, since
$M_{\mathfrak {f}\left ( k\right ) }$
stands for the multiplication operator by
$\mathfrak {f}(k)\in C(\mathbb {T}^{2})$
, for every
$p,k\in \mathbb {T}^{2}$
and
$y\in \mathbb {Z}^{2}$
,

which, by (37)–(38), in turn implies that

keeping in mind that
$P_{x}$
is the orthogonal projection onto the one-dimensional subspace
$\mathbb {C}\hat {\mathfrak {e}}_{x}\subseteq L^{2}( \mathbb {T}^{2})$
. Recall that
$\mathcal {F}^{\ast }=\mathcal {F}^{-1}$
, the Fourier transform being unitary. Thus, since
$\alpha>0$
, for each
$x\in \mathbb {Z}^{2}$
, we obtain that

Hence, taking the supremum over all
$x\in \mathbb {Z}^{2}$
in this equation and using the notation given by (136) as well as (137), we arrive at

for any fixed
$k\in \mathbb {T}^{2}$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
.
Step 3: Thanks to (138), we are now in a position to apply Theorem A.13 for
$H= \mathcal {F}^{\ast }A_{1,1}\left ( \mathrm {U},k\right ) \mathcal {F}$
and
$\mu =\alpha $
to obtain that, for any
$x,y\in \mathbb {Z}^{2}$
,

Combined with Equations (133)–(134) and the triangle inequality as well as the reverse one
$|x-y|\geq |x|-|y|$
, we then arrive at

for all
$x\in \mathbb {Z}^{2}$
,
$k\in \mathbb {T}^{2}$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. By choosing
$\alpha $
sufficiently small (more precisely
$\alpha \leq \alpha _{0}$
), we can assume, without loss of generality, that the above sum is finite; see (12). This completes the proof, because the Fourier transform
$\hat {\upsilon }$
of
$\upsilon $
is a continuous function on the torus
$\mathbb {T}^{2}$
, which is compact, and is consequently bounded.
4.6 Scattering channels
4.6.1 Unbound pair scattering channel
Recall that
$\mathfrak {H}_{f}$
is defined by (53) and
$ \mathrm {H}_{f}$
is the operator defined by (54) for any
$\mathrm {V} \in \mathbb {R}_{0}^{+}$
and absolutely summable function
$\mathrm {v}:\mathbb { Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
.
For any operator Y acting on a Hilbert space
$\mathcal {Y}$
,
$P_{\mathrm {ac} }(Y)$
denotes the orthogonal projection on the absolutely continuous space of Y, defined by (49). In order to show the existence of a unbound pair scattering channel, we need the following technical lemma:
Lemma 4.21 (Absolute continuous space of fermionic Hamiltonians).
For any
$\mathrm {V}\in \mathbb {R}_{0}^{+}$
and every absolutely summable function
$\mathrm {v}:\mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
, the orthogonal projection
$P_{\mathrm {ac}}(\mathrm {H}_{f})$
on the absolutely continuous space of
$\mathrm {H}_{f}$
, defined by (49), is equal to
$\mathbf {1}$
.
Proof. Take any
$\psi \in \mathfrak {H}_{f}$
and observe from Corollary A.5 that

where
$M_{\mathfrak {f}\left ( k\right ) }$
is the fiber Hamiltonian defined as the multiplication operator associated with the continuous function
$ \mathfrak {f}(k)\in C(\mathbb {T}^{2})$
(see (33)). For any
$\mathrm {V} \in \mathbb {R}_{0}^{+}$
and absolutely summable function
$\mathrm {v}:\mathbb { Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
, the operator defined by (55); that is,

is a trace-class operator, where we recall that
$P_{x}$
is the orthogonal projection on the one-dimensional subspace
$\mathbb {C}\mathfrak {\hat {e}} _{x}\subseteq L^{2}(\mathbb {T}^{2})$
. By [Reference Kato40, Theorem 4.4, Chapter X], it follows in this case that

Let
$B\subseteq \mathbb {R}$
be an arbitrary Borel set with zero Lebesgue measure. By using (54), we deduce that

For the first equality, note that we apply Theorem A.3 (iii).
The following results imply that the dynamic generated by the Hamiltonian H (i.e., included the exchange interaction and extended Hubbard repulsions) asymptotically far in the past or future approaches the purely fermionic dynamics for two unbound fermions. This is, of course, physically expected, since all interaction strengths get weak as the distance between the fermions increases. This is a consequence of the next assertions.
To shorten the notation, for any
$\mathrm {V}\in \mathbb {R}_{0}^{+}$
and every absolutely summable function
$\mathrm {v}:\mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
, we define the Hamiltonian

with

Here,
$M_{\mathfrak {f}\left ( k\right ) }$
,
$R\left ( \mathrm {V},\mathrm {v} \right ) $
and
$A_{2,2}(k)$
are respectively the multiplication operator associated with the continuous function
$\mathfrak {f}(k)\in C(\mathbb {T} ^{2}) $
(see (33)), the trace-class operator (55) and the operator defined on
$\mathbb {C}$
by (41). We start with the unbounded pair scattering channel in each fiber:
Lemma 4.22 (Fiberwise unbound pair (scattering) channel).
For any
$\mathrm {V}\in \mathbb {R}_{0}^{+}$
, every absolutely summable function
$\mathrm {v}:\mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
and all
$ k\in \mathbb {T}^{2}$
, the wave operators

as defined by Equation (51), exist and are complete with

Proof. Note that
$P_{\mathrm {ac}}(\mathrm {H}_{f})=\mathbf {1}$
, thanks to Lemma 4.21. By (51), it justifies the strong limit given in the lemma. As we discuss in the proof of Proposition 4.2, for any
$ k\in \mathbb {T}^{2}$
, the operator
$A(k)$
is the sum of
$M_{\mathfrak {f} \left ( k\right ) }\oplus A_{2,2}(k)$
and a compact operator T. In fact, as the function
$\mathrm {u}:\mathbb {Z}^{2}\rightarrow \mathbb {R}$
(defining the fiber Hamiltonian
$A(k)$
) is absolutely summable (see (7)), the operator difference T is even trace-class. As explained in Lemma 4.21,
$R\left ( \mathrm {V},\mathrm {v} \right ) $
is also a trace-class operator, because
$\mathrm {v}:\mathbb {Z} ^{2}\rightarrow \mathbb {R}_{0}^{+}$
is absolutely summable, again by assumption. By using the Kato-Rosenblum theorem (Reference Reed and Simon41, Theorem XI.8), it thus follows that the wave operators

exist and are complete for every
$k\in \mathbb {T}^{2}$
.
We are now in a position to prove Theorem 3.11. Recall that
$\mathfrak {U}:\mathfrak {H} _{f}\rightarrow L^{2}\left ( \mathbb {T}^{2},\mathcal {H}\right ) $
is the operator defined by (56), while
$A_{2,2}(k)$
and
$H^{(1)}\left ( k\right ) $
are respectively defined by (41) and (140). The definition of wave operators
$W^{\pm }$
are given by Equations 50–51.
Theorem 4.23 (Unbound pair (scattering) channel).
For any
$\mathrm {V}\in \mathbb {R}_{0}^{+}$
and every absolutely summable function
$\mathrm {v}:\mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$
,

with

Proof. For almost every
$k\in \mathbb {T}^{2}$
and every
$\psi \in \mathfrak {H}_{f}$
,

In other words,
$\mathfrak {U}$
is an intertwining operator for
$\mathrm {H} _{f}$
and
$H^{(1)}$
, and hence, for their respective complex exponential:

We also observe that, for any
$z\in \mathbb {C}$
and any Borel set
$ B\subseteq \mathbb {R}$
containing
$\mathfrak {b}(k)\in \mathbb {R}$
(see (32)),

even if the Lebesgue measure of B is zero. This last observation, together with Remark 3.10 and Lemma 4.21, yields

In particular,

We can then apply Proposition A.6 together with Lemmata 4.21, 4.22, Theorem A.3 and Equations (141)–(143) to arrive at

Note that Lemma 4.22 combined with (142) implies that

In particular,

Observe that Lemma A.1 allows one to write

as a Dyson series for all bounded operators
$X,Y$
. This can be applied to
$ X=H^{(1)}$
and
$Y=\mathbb {U}H\mathbb {U}^{\ast }-X$
, or in each fiber
$k\in \mathbb {T}^{2}$
to
$X=H^{(1)}\left ( k\right ) \oplus A_{2,2}\left ( k\right ) $
and
$Y=A\left ( k\right ) -X$
. When
$\mathrm {U}=\mathrm {V}\in \mathbb {R} _{0}^{+}$
and
$\mathrm {v}=\mathrm {u}:\mathbb {Z}^{2}\rightarrow \mathbb {R} _{0}^{+}$
in
$H^{(1)}$
, this result is particularly advantageous because the operator family
$(Y_{t})_{t\in \mathbb {R}}$
appearing in Lemma A.1 can be represented in a relatively simple way in this situation:
Lemma 4.24 (Finite-time scattering and wave operators).
For
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
and all
$s,t\in \mathbb {R}$
,

where
$B_{1,2}\left ( k\right ) $
and
$B_{2,1}\left ( k\right ) $
are the operator families defined by (60) for any
$k\in \mathbb {T}^{2}$
, while
$\cos _{\succ }$
and
$\sin _{\succ }$
are respectively defined by (61) and (62).
Proof. Let
$H^{(1)}\equiv H^{(1)}\left ( \mathrm {U},\mathrm {u}\right ) $
. We infer from Lemma A.1 applied to
$X=H^{(1)}$
and
$Y=\mathbb {U}H\mathbb {U}^{\ast }-X$
that

with
$(B^{(t)})_{t\in \mathbb {R}}\subseteq \mathcal {B}(L^{2}(\mathbb {T}^{2}, \mathcal {H}))$
being the norm-continuous family defined by

the operators
$B_{2,1}^{(t)}\left ( k\right ) $
and
$B_{1,2}^{(t)}\left ( k\right ) $
being defined by (60) for any
$t\in \mathbb {R}$
and
$ k\in \mathbb {T}^{2}$
. Then, one combines explicit computations together with Proposition A.6 and A.7 to arrive at the assertion. Notice that the above integrals are Riemann ones,
$(B^{(t)})_{t\in \mathbb {R}}$
being a continuous family in the Banach space
$\mathcal {B}(L^{2}(\mathbb {T}^{2},\mathcal {H}))$
.
4.6.2 Bound pair scattering channel
We start by studying the wave operator (50) with respect to the operators
$X=\mathbb {U}H\mathbb {U}^{\ast }$
and
$Y=M_{\mathrm { E}\left ( \mathrm {U},\cdot \right ) }$
(66), the identification operator J being
$\mathfrak {P}_{\mathrm {U}}$
(65) for any fixed Hubbard coupling constant
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
.
Proposition 4.25 (Wave operators in the bound pair channel).
Let
$h_{b}\in \lbrack 0,1/2]$
and
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
. Then,
$ \mathbb {U}H\mathbb {U}^{\ast }\mathfrak {P}_{\mathrm {U}}=\mathfrak {P}_{\mathrm { U}}M_{\mathrm {E}\left ( \mathrm {U},\cdot \right ) }$
, and for every bounded continuous function
$f\in C_{b}(\mathbb {R})$
,

Proof. Using Proposition 2.1 and Theorem 4.8, we note that, for any
$ \varphi \in L^{2}(\mathbb {T}^{2})$
and almost every
$k\in \mathbb {T}^{2}$
,

(i.e.,
$\mathbb {U}H\mathbb {U}^{\ast }\mathfrak {P}_{\mathrm {U}}=\mathfrak {P}_{ \mathrm {U}}M_{\mathrm {E}\left ( \mathrm {U},\cdot \right ) }$
), keeping in mind Equations (65) and (66). We then obtain the last assertion by using the Stone-Weierstrass theorem and the spectral theorem.
We now study the (dressed) bound pair channel of lowest energy in the hard-core limit. This is a consequence of the following assertion:
Proposition 4.26 (Bound pair channel in the hard-core limit).
Fix
$h_{b}\in \lbrack 0,1/2]$
. Then,

and for every bounded continuous function
$f\in C_{b}(\mathbb {R})$
,

Proof. Fix
$\varphi \in L^{2}(\mathbb {T}^{2})$
. For any
$\mathrm {U}\in \mathbb {R} _{0}^{+}$
and almost every
$k\in \mathbb {T}^{2}$
, one has that

Thus, by Lebesgue’s dominated convergence theorem, we arrive at

Take now a bounded continuous function
$f\in C_{b}(\mathbb {R})$
. In particular, there exists
$L\in \mathbb {R}^{+}$
such that

By Theorem 4.15 (i) and continuity of the function f, one has that

Moreover,

is a composition of continuous functions and is in particular strongly measurable. Using Proposition A.6 (or Lebesgue’s dominated convergence theorem), we arrive at the limit

By using Equations (144), (145), Proposition 4.25, Theorem A.3 (iii) and the fact that
$\Vert \mathfrak {P}( \mathrm {U})\Vert _{\mathrm {op}}=1$
for all
$\mathrm {U}\in \mathbb {R}_{0}^{+}$
, we find that

A Appendix
A.1 Toward a microscopic theory for cuprate superconductivity
Superconductivity was discovered in 1911 through the study of the resistance of solid mercury at very low temperatures, which was found to disappear below the critical temperature
$T_{c}=4.2$
K. This phenomenon was subsequently observed in several other materials, such as lead (in this case,
$T_{c}=7$
K). The first microscopic explanation of this unexpected, but very interesting physical behavior was given in 1957 by J. Bardeen, L. Cooper and J. R. Schrieffer with what is now known as the BCS theory. They were awarded the Nobel Prize in Physics in 1972. Their theory explained all the superconductors known at the time, named today ‘conventional’ superconductors. For more details, see [Reference Kittel51, Chapter 10].
These superconductors not only have zero resistivity (below some critical current value), but also repel magnetic fields. This is the Meissner effect (or Meissner-Ochsenfeld effect). See, for example, the popular images showing a superconducting piece levitating above a magnet. However, when the magnetic field exceeds a critical value, superconductivity can be broken and the Meissner effect disappears abruptly. This is referred to as type I superconductivity, while type II superconductors manisfests the appearance of vortices beyond a first critical magnetic field and the disappearance of any Meissner effect beyond a second critical field. The BCS theory refers to conventional superconductors but applies for both type I and II superconductors.
Superconductors are characterized not only by the critical temperature but also their superconducting coherence length, which quantifies the characteristic exponent that describes variations in the density of the superconducting component. It is often several hundred nanometers for conventional superconductors. More precisely, from (Reference Kittel51, Chapter 10, Table 5), we have the following coherence lengths
$\xi $
and critical temperatures
$T_{c}$
for the following conventional superconductors:

Note that the superconducting coherence length of a Niobium superconductor is much smaller than others, which is consistent with the type-II property, which requires shorter coherence lengths compared to type I superconductors.
In 1986, there was a major breakthrough in physics with the discovery of a new class of superconductors by G. Bednorz and K. A. Müller [Reference Müller and Bednorz16]. They were awarded the Nobel Prize in Physics in 1987. Physically, these materials are antiferromagnetic and insulating at low temperatures, but as with semiconductors, one dopes them with impurities that provide extra charge carriers and break the perfect Mott insulator phase, which is characterized by an integer number of charge carriers on each lattice site. As doping increases, the antiferromagnetic phase turns into a superconducting phase. This was the discovery of high-
$T_{c}$
superconductors in ceramics materials (i.e., cuprates) for critical temperatures
$T_{c}$
that today range in the interval
$[39,164]$
K (approximately).
These superconductors are nonconventional, and the BCS theory fails to explain their properties. Indeed, while the conventional superconductivity results from an effective attraction between fermions (electrons or holes, depending on the charge carriers in each material) via phonons (i.e., lattice excitations), it soon became apparent that this kind of explanation could not work for cuprates, and the question of the mediator that could produce such an attraction has remained an open problem ever since. In fact, even if a large amount of numerical and experimental data is available, there is no pairing mechanism firmly established (through, for instance, antiferromagnetic spin fluctuations, phonons, etc.). See, for example, [Reference Plakida13, Section 7.6]. The debate is strongly polarized [Reference Rodgers54] between researchers using a purely electronic/magnetic microscopic mechanism and those using electron-phonon mechanisms.
This is undoubtedly one of the most important questions in condensed matter physics, even if current research seems to have shown less interest in this fundamental issue in recent years. Quoting the Nobel Prize winner Müller in 2007 [Reference Müller17]: “
$\ldots$
It is a remarkable fact that in these 20 years since the discovery of high temperature superconductivity no other class of materials has been found which exhibits this property above the boiling point of liquid nitrogen. With a view to finding another class, it would be rewarding to understand why these exceptional properties occur, which per se are regarded as among the important unsolved problems in present day physics.”
Our theoretical approach differs from all others and stems from a microscopic model – first proposed in 1985 by Ranninger-Robaszkiewicz [Reference Ranninger and Robaszkiewicz55] (see also [Reference Ranninger and Robin56, Reference Ranninger57] or [Reference Plakida13, Section 7.4.3]) and independently by Ionov [Reference Ionov58] – which, before our works, was never investigated in the presence of strong Coulomb repulsions.
The cuprates are a class of compounds containing copper (
$\mathrm {Cu}$
) atoms in an anion, and cuprate superconductors are oxide-based cuprates with two-dimensional
$\mathrm {CuO}_{2}$
layers made of
$\mathrm {Cu}^{++}$
(cf. ‘cuprate’) and
$\mathrm {O}^{--}$
(cf. ‘oxide’) ions, which generally possess the symmetries of the square, at least for the important family of tetragonal cuprates such as
$\mathrm {La}_{2-x}\mathrm {Sr}_{x}\mathrm {CuO} _{4} $
(LaSr 214) and
$\mathrm {La}_{2-x}\mathrm {Ba}_{x}\mathrm {CuO}_{4}$
. See, for example, [Reference Tsuei, Kirtley, Bennemann and Ketterson31, Section 9.1.2], [Reference Saxena12, Section 2.3] and [Reference Wesche14, Section 6.3.1].
As stressed in [Reference Köppel, Yarkony and Barentzen15, Part VII], the very strong Jahn-Teller (JT) effect associated with copper ions (
$\mathrm {Cu}^{++}$
) and its consequences for polaron formation are largely neglected in much of the physics literature, even though it was the JT effect that led to the discovery of superconductivity in cuprates [Reference Müller and Bednorz16]. See also [Reference Müller17, Reference Keller, Bussmann–Holder and Müller18, Reference Alexandrov and Zhao19]. For nonexperts, let us explain that the JT effect (or JT distortion) is a spontaneous symmetry breaking of molecules and ions that occurs via a geometrical distortion that suppresses the spatial degeneracy of the electronic ground state and lowers the overall energy of the system. See [Reference Köppel, Yarkony and Barentzen15]. In this context, it can produce JT n-polarons. Polarons, bipolarons or more generally, n-polarons,
$n\in \mathbb {N}$
, are charge carriers that are self-trapped inside a strong and local lattice deformation that surrounds them. They are quasi-particle formed from fermions ‘dressed with phonons’. For example, a bipolaron involves two fermions dressed with phonons. A JT polaron is a polaron for which the local lattice deformation is associated with the (geometrical) JT distortion. The existence of JT (bi)polarons in cuprates is attested in numerous experiments on cuprate superconductors [Reference Müller59, Reference Müller17, Reference Keller, Bussmann–Holder and Müller18, Reference Alexandrov and Zhao19], and we have the following experimental facts:
-
• Superconducting transport in cuprates occurs in two-dimensional
$ \mathrm {CuO}_{2}$ layers and only on oxygen atoms – a fact well established experimentally from 1987 by Bianconi and others [Reference Bianconi, Castellano, De Santis, Rudolf, Lagarde, Flank and Marcelli60, Reference Bianconi, Clozza, Castellano, Longa, De Santis, Di Cicco, Garg, Delogu, Gargano, Giorgi, Lagarde, Flank and Marcelli61, Reference Bianconi, Clozza, Castellano, Longa, De Santis, Di Cicco, Garg, Delogu, Gargano, Giorgi, Lagarde, Flank and Marcelli62, Reference Bianconi, Budnick, Flank, Fontaine, Lagarde, Marcelli, Tolentino, Chamberland, Michel, Raveau and Demazeau63, Reference Fujimori, Takayama-Muromachi and Uchida65, Reference Bianconi, Castellano, De Santis, Delogu, Gargano and Giorgi64, Reference Bianconi66] – while bipolarons are related to the strong JT effect of copper ions.
-
• Because of the presence of strong antiferromagnetic correlations of copper-oxides, experimentally proven even outside the antiferromagnetic phase (see, for example, [Reference Le Tacon, Ghiringhelli, Chaloupka, Sala, Hinkov, Haverkort, Minola, Bakr, Zhou, Blanco-Canosa, Monney, Song, Sun, Lin, De Luca, Salluzzo, Khaliullin, Schmitt, Braicovich and Keimer68, Reference Dean, Dellea, Springell, Yakhou-Harris, Kummer, Brookes, Liu, Sun, Strle, Schmitt, Braicovich, Ghiringhelli, Božović and Hill67] and (Reference Plakida13, Chapter 3)), it can be concluded that JT bipolarons have zero total spin and that other types of polaronic configuration are disadvantaged. This is, for instance, stressed in [Reference Müller17, Sect. 5.2].
-
• There is also an experimental evidence of the short lifetime of polarons in cuprates [Reference Mihailovic69], decaying into fermions (pairs of holes or electrons). Expressed in terms of a length
$\ell $ , one sees a lifetime comparable to the lattice spacing [Reference Mihailovic69]. Remarkably, near the critical temperature,
$\ell $ is actually close to the coherence length in cuprate superconductors.
The most straightforward approach would therefore be to consider the JT bipolarons as the charge carriers of cuprate superconductors. That is exactly what Alexandrov and coauthors have done in their bipolaron theory, based on light bipolarons [Reference Hague, Kornilovitch, Samson and Alexandrov70] as charge carriers. Quoting [Reference Alexandrov71, p. 4]: “cuprate bipolarons are relatively light because they are intersite rather than on-site pairs due to the strong on-site repulsion, and because mainly c-axis polarized optical phonons are responsible for the in-plane mass renormalization.” See, for instance, [Reference Alexandrov71, Reference Alexandrov72, Reference Alexandrov73, Reference Alexandrov74] and references therein. However, this approach does not seem consistent with superconducting transport in cuprates occurring on oxygen ions in
$\mathrm {CuO}_{2}$
layers:
In fact, a priori, (strong and local) lattice deformations (or JT distortions) attached to n-polarons should barely move, and this is not in accordance with the known mobility of superconducting charge carriers. This is confirmed in experiments:
-
• Experimental evidence (still controversial [Reference Chen, Birgeneau, Kastner, Preyer and Thio52]) of a large mass (approx. 700 electronic masses) of polarons in cuprates [Reference Reagor37].
-
• Experimental evidence (apparently not controversial) of the small mass (approx. 3–4 electronic masses) of superconducting carriers in cuprates [Reference Padilla, Lee, Dumm, Blumberg, Ono, Segawa, Komiya, Ando and Basov75, Fig. 2.].
For more recent discussions on the (im)mobility of (bi)polarons in cuprates, we recommend, for instance, [Reference Dzhumanov34, Reference Dzhumanov35, Reference Dzhumanov, Karimboev and Djumanov36].
We bypass this problem by using the exchange interaction (10), while seeing the fermions as the true charge carriers. In other words, we use exchange interactions like (10) to define a simplified model for cuprates, taking into account a large mass of bipolarons but non-polaronic superconducting carriers. Since the lifetime of bipolarons in terms of a length
$\ell $
is comparable to the coherence length in cuprate superconductors near the critical temperature, this suggests a strong exchange interaction between the (Bose-like, zero-spin) bipolaronic state and fermion pairs (electrons or holes).
As shown in Figure A1, bipolarons are formed around an oxygen ion (x) binding an adjacent pair of copper ions, because of the JT effect associated with
$\mathrm {Cu}^{++}$
. It leads to JT ‘intersite bipolarons’. That is why we consider an annihilation (creation) operator
$c_{x}$
(
$c_{x}^{\ast }$
) of a fermion pair of zero total spin at
$x\in \mathbb {Z}^{2}$
as defined by Equation (11). One simple example of such an operator is

with
$a_{z,s}$
(
$a_{z,s}^{\ast }$
) being the annihilation (creation) operator of a single fermion of spin
$s\in \{\uparrow ,\downarrow \}$
at lattice site
$z\in \mathbb {Z}^{2}$
. In this example, we set
$\mathfrak {p} _{2}\left ( 2z\right ) =\mathfrak {p}_{1}\left ( z\right ) =1$
when
$|z|\leq 1$
and
$\mathfrak {p}_{1}\left ( z\right ) =\mathfrak {p}_{2}\left ( z\right ) =0$
otherwise. Of course, one can also assign other weights to each space configuration of fermion pairs in Equation (11), as soon as at least one intersite configuration has a nonzero weight.

Figure A1
$\mathrm {CuO}_{2}$
layer.
There is also an undeniable experimental evidence of strong on-site Coulomb repulsions (cf. the Mott insulator phase at zero doping), which forces us to consider terms like

in Equation (6), where we recall that
$n_{x,s}\doteq a_{x,s}^{\ast }a_{x,s}$
is the number operator of fermions at
$x\in \mathbb {Z }^{2}$
and spin
$s\in \{\uparrow ,\downarrow \}$
. It justifies our strong interest in studying in this paper the hard-core limit
$\mathrm {U} \rightarrow \infty $
. See, for example, Theorem 3.5 and, more generally, related results that hold for
$\mathrm {U}\in \left [ 0,\infty \right ] $
.
The exchange interaction as formally given by (10); that is,

with
$b_{x}$
(
$b_{x}^{\ast }$
) being the annihilation (creation) operator of a JT (intersite) bipolaron, is inspired by an interband interaction proposed by Kondo in 1963 for superconducting transition metals. In [Reference Ranninger and Robaszkiewicz55, Reference Ionov58], only an on-site version (i.e.,
$ c_{y}=a_{y,\uparrow }a_{y,\downarrow }$
) was proposed in 1985. Our version ( A.2) of
$c_{y}$
captures the ‘intersite’ character of the bipolarons present in cuprates, and in [Reference Bru, de Siqueira Pedra and de Pasquale21], we specify the form of the coupling function
$ \upsilon $
in (11) based on the presence of large electron-phonon anomaliesFootnote
21
in cuprates at optimum doping for the following points in the normalized Brillouin zone
$\mathbb {T}^{2}\doteq \lbrack -\pi ,\pi )^{2}$
:

The anomalies at
$(0,\pm \pi /2)$
and
$(\pm \pi /2,0)$
are correctly predicted by the Density Functional Theory (DFT) involving electrons and phonons [Reference Reznik, Sangiovanni, Gunnarsson and Devereaux80, Fig. 1 (a)], in contrast to those of
$(0,-\pi ),(-\pi ,0)$
. Moreover, when no superconducting phase appears, DFT works very well at all quasi-momenta, including
$(0,-\pi ),(-\pi ,0)$
[Reference Reznik79, Fig. 18 (b)].
The above anomalies at quasi-momenta
$(0,-\pi ),(-\pi ,0)$
in the superconducting phase, which cannot be reproduced by the DFT, is expected to be a consequence of the existence of polaronic quasiparticles. Indeed, the DFT used in [Reference Reznik, Sangiovanni, Gunnarsson and Devereaux80, Fig. 1 (a)] does not take into account the formation of compound particles out of phonons and fermions like polaronic modes. In our theory, they are interpreted as being JT (intersite) bipolarons which should then interact strongly with charge carriers only at quasi-momenta
$(-\pi ,0)$
and
$(0,-\pi )$
(at moderate doping). The congruence between the DFT and experimental data for phonon dispersions at
$ (\pm \pi /2,0)$
and
$(0,\pm \pi /2)$
indeed makes the formation of such quasiparticles unlikely in this region of the (normalized) Brillouin zone
$ \mathbb {T}^{2}$
, and more generally in any other region relatively far from
$ (-\pi ,0)$
and
$(0,-\pi )$
. Consequently, the Fourier transform
$\hat { \upsilon }$
of
$\upsilon $
is chosen to take its maximum absolute value at the points
$(-\pi ,0)$
and
$(0,-\pi )$
. This property is fundamental to explaining the superconductivity of cuprates in our microscopic theory.
There is indeed one very important property of superconducting carriers (pairs) in cuprates that differs from conventional superconductors, their d -wave symmetry. The (fiber) space of a fermion pair at constant quasimomentum K is the Hilbert space
$L^{2}\left ( \mathbb {T}^{2},\mathbb {C} ,\nu \right ) $
, see Section 2.3. Define by

the unitary operator
$R_{\perp }$
implementing the
$\pi /2$
-rotation on
$ L^{2}\left ( \mathbb {T}^{2},\mathbb {C},\nu \right ) $
. Then define the mutually orthogonal projectors

Since
$P_{s}+P_{d}+P_{p}=\mathbf {1}$
, any wave function
$\Psi _{f}\in L^{2}\left ( \mathbb {T}^{2},\mathbb {C},\nu \right ) $
of a fermion pair can be uniquely decomposed into orthogonal s-, d- and p-wave components as

In other words, an arbitrary (fermionic pair) function
$\Psi _{f}$
can be uniquely decomposed into ‘s-, d- and p -wave’ components, denoted respectively by
$\Psi _{f}^{(s)},\Psi _{f}^{(d)},\Psi _{f}^{(p)}$
. Observe that

So, each component has a well-defined parity with respect to the group
$ \{0,\pi /2,\pi ,3\pi /2\}$
of rotations: The s-wave component
$\Psi _{f}^{(s)}$
is invariant under these 4 rotations, the d-wave one
$\Psi _{f}^{(d)}$
is antisymmetric with respect to the
$\pi /2$
-rotation and the
$ p $
-wave one
$\Psi _{f}^{(p)}$
is antisymmetric with respect to the
$\pi $
-rotation (reflection over the origin), just like ‘s’ , ‘d’ and ‘p’ atomic orbitals.
In conventional superconductivity, one has s-wave symmetry. For superconducting cuprates, it is more complex. It is firmly established that fermion pairs in cuprate superconductors have zero total spin [Reference Tsuei and Kirtley33], which is believed to lead to s- or d-wave superconductivity only. The
$ s $
-wave symmetry is expected to correspond to fermion pairs on same lattice sites, which should be problematic in the presence of the strong on-site Coulomb repulsion. Therefore, d-wave superconductivity is anticipated in cuprate superconductors. This prediction is experimentally confirmed. See [Reference Tsuei and Kirtley33, Reference Keimer, Kivelson, Norman, Uchida and Zaanen30, Reference Plakida13]. In cuprates, d-wave pairing is therefore predominant, but experiments (involving bulk properties) still indicate the presence of a non-negligible s-wave superconducting part; see [Reference Müller17, Reference Müller59]. This is what our theory demonstrates, using the fact that the Fourier transform
$\hat {\upsilon }$
of
$\upsilon $
is maximal at the points
$ (-\pi ,0)$
and
$(0,-\pi )$
, but first we need to say a few words about the quantitative choice of its parameters. For example, if we take the hard-core limit
$\mathrm {U}\rightarrow \infty $
, we get pure d-wave superconductivity, as shown in the first article [Reference Bru, de Siqueira Pedra and de Pasquale22].
In the second paper [Reference Bru, de Siqueira Pedra and de Pasquale21], we study the ground state
$\Psi ( \mathrm {U},k)\doteq (\hat {\psi }_{k}\left ( \mathrm {U}\right ) ,-1)$
of Theorem 3.1 to give estimates on key features of cuprate superconductors by using real parameters taken from experiments on the prototypical cuprates based on hole-doped cuprates
$\mathrm {La}_{2} \mathrm {CuO}_{4}$
(e.g., LaSr 214) and
$\mathrm {YBa}_{2}\mathrm {Cu}_{3} \mathrm {O}_{7}$
(YBCO), near optimal doping:
-
• The hopping amplitude
$\epsilon $ of charge carriers (here holes) in (6) is accessible using the lattice spacing and the effective mass of charge carriers. Both quantities are known for cuprates: The lattice spacing is
$\mathbf {a}=0.2672\ \mathrm {nm}$ (Reference Wesche14, Section 6.3.1) of the oxygen ions and the effective mass of mobile holes
$ m^{\ast }\simeq 4m_{e}$ [Reference Padilla, Lee, Dumm, Blumberg, Ono, Segawa, Komiya, Ando and Basov75, Fig. 2.], where
$m_{e}$ is the electron mass. This corresponds to
$\epsilon =\hbar ^{2}/\left ( m^{\ast } \mathbf {a}^{2}\right ) \simeq 0.266 $ eV.
-
• In the same way, the hopping amplitude
$\epsilon h_{b}$ of JT bipolarons in (8) is accessible using the lattice spacing and the effective mass of bipolarons. The former is known, and the latter is estimated [Reference Reagor37]. It leads to
$h_{b}\simeq 0.00575\ll 1$ . That is, JT bipolarons can barely move, compared to fermions.
-
• The coefficient
$\mathrm {U}$ in (6) can be fixed by using the first electronic affinity of oxygen – that is, the energy difference between the
$\mathrm {O}^{-}$ anion state (one hole added to the
$\mathrm {O} ^{--}$ anion) and the neutral state (two holes added to
$\mathrm {O}^{--}$ ). These values are known with great precision: By [Reference Chaibi, Peláez, Blondel, Drag and Delsart81],
$ \mathrm {U}\simeq 1.461 $ eV. Note that
$\mathrm {U}\epsilon ^{-1}\simeq 5.5$ , which refers to a strong coupling regime, but it is not the hard-core limit
$\mathrm {U}\rightarrow \infty $ yet, from the perspective of real physical estimates.
-
• The intersite repulsion represented by the function
$\mathrm {u}: \mathbb {Z}^{2}\rightarrow \mathbb {R}_{0}^{+}$ in (6) results from the screening of the Coulomb repulsion, usually estimated via the Thomas-Fermi screening length
$\boldsymbol {\lambda }_{\mathrm {TF}}$ . However, in two dimensions, the decay of the screened Coulomb repulsion is not exponential but rather polynomial [Reference Giuliani and Vignale82, Eq. (5.41)]. In particular, even if
$\boldsymbol {\lambda }_{\mathrm {TF}}\leq \mathbf {a}$ , we consider the Coulomb repulsion for a few neighboring sites, with, of course, decaying strengths (for
$z\neq 0$ ). For example,
$\mathrm {u}\left ( z\right ) =0$ only when
$|z|\geq r$ for some
$r\leq 2$ , with
$\mathrm {u}\left ( 0\right ) =0$ and
$\mathrm {u}\left ( z\right ) <\mathrm {U}$ .
It remains to fix the exchange strength function
$\upsilon :\mathbb {Z} ^{2}\rightarrow \mathbb {R}$
in (10), taking into account the special choice (A.2) for the annihilation and creation operators of a fermion pair of zero total spin. We already know that the absolute value of the Fourier transform
$\hat {\upsilon }$
of
$ \upsilon $
takes its maximum at the points
$(-\pi ,0)$
and
$(0,-\pi )$
, but its precise amplitude has to be still determined. This is performed indirectly through a phenomenological relationship with the density of the superconducting charge carriers (also named superfluid): From recent experimental data [Reference Božović, He, Wu and Bollinger53], for optimum doping, around
$90\%$
of the charge carriers inserted via the doping do not form superfluid. If
$\hat { \psi }_{K}\left ( \mathrm {U}\right ) $
and
$-1$
are respectively the fermionic and bosonic parts of the eigenvector
$\Psi (\mathrm {U},k)\doteq (\hat {\psi } _{K}\left ( \mathrm {U}\right ) ,-1)$
associated with the eigenvalue
$\mathrm {E} \left ( \mathrm {U},K\right ) $
and
$K=(\pi ,0),(0,\pi )$
, then we can interpret

as the proportion of charge carriers forming JT bipolarons. Computing this quantity, we can identify the unique value
$\hat {\upsilon }(K)\simeq 0.11 $
eV making
$\varrho =90\%$
. Similar to [Reference Kabanov and Mihailovic83, Reference Kabanov84], note that we choose
$\hat {v}$
of the form

for quasimomenta
$(k_{x},k_{y})\in \mathbb {T}^{2}$
near
$(\pi ,0)$
or
$ (0,\pi )$
, where
$\alpha>0$
determines the effective mass
$m^{\ast \ast \ast }$
of (dressed) bound fermion pairs. Conversely,
$\alpha $
can be recovered from
$m^{\ast \ast \ast }$
.
Using our mathematical results and rigorous numerical computations, we show in [Reference Bru, de Siqueira Pedra and de Pasquale21] that the model gives the following quantitative estimates in relation with properties of hole-doped cuprates
$\mathrm {La}_{2} \mathrm {CuO}_{4}$
(LaSr 214 or LSCO) and
$\mathrm {YBa}_{2}\mathrm {Cu}_{3} \mathrm {O}_{7}$
(YBCO):
-
• Pairing symmetry.
-
– Prediction:
$16.5\% s$ -wave,
$83.5\% d$ -wave,
$0\% p$ -wave. See [Reference Bru, de Siqueira Pedra and de Pasquale21, p. 10 and Corollary 1.1].
-
– Experimental data:
$\sim 20-25\% s$ -wave,
$\sim 75-80\% d$ -wave,
$ \sim 0\% p$ -wave. Indirect measurement with rough estimates for the s -wave/d-wave ratio, see [Reference Müller17, Reference Müller59].
-
-
• Pseudogap temperature
$T_{\ast }$ (i.e., pair dissociation energy).
-
– Prediction of the binding energy of (d-wave) pairs:
$\mathrm {E} =1250\ \mathrm {K}$ found for the quasi-momenta
$(-\pi ,0)$ and
$(0,-\pi )$ . See Theorem 3.1 (ii)–(iii) and [Reference Bru, de Siqueira Pedra and de Pasquale21, Fig. 6].
-
– Experimental data:
-
* Experiments on cuprates demonstrates a pseudogap appears at
$(-\pi ,0)$ and
$(0,-\pi )$ . See [Reference Keimer, Kivelson, Norman, Uchida and Zaanen30, Fig. 4] and references therein.
-
*
$T_{\ast }\simeq 100-750\ \mathrm {K}$ , depending on the doping [Reference Timusk and Statt85, Fig. 26]. For example,
$T_{\ast }\simeq 400\ \mathrm {K}$ around optimal doping for
$\mathrm {La}_{1.85}\mathrm {Sr}_{0.16}\mathrm {CuO}_{4}$ and
$ T_{\ast }\simeq 200\ \mathrm {K}$ for
$\mathrm {La}_{1.8}\mathrm {Sr}_{0.2} \mathrm {CuO}_{4}$ . The ratio between the theoretical bond energy (in
$ \mathrm {K}$ ) and the dissociation temperatureFootnote 22 of dressed fermion pairs should be between
$ \mathrm {E}/T_{\ast }$ and
$\mathrm {E}/T_{c}$ . For
$\mathrm {La}_{1.8}\mathrm { Sr}_{0.2}\mathrm {CuO}_{4}$ , the coherence length of which perfectly matches our prediction below,
$\mathrm {E}/T_{\ast }\simeq 6.2$ and
$\mathrm {E} /T_{c}=34.247$ , with an average ratio of around
$20$ .
-
* Binding energy of bipolarons [Reference Müller, Zhao, Conder and Keller87, Fig. 2]:
$1500\ \mathrm {K }$ at zero doping and
$500\ \mathrm {K}$ at optimal doping for LaSr 214.
Figure 5 Normalized density
$|\mathcal {F}^{-1}[\hat { \psi }_{1.461,(0,- \pi )}]|^{2}$ of the dressed bound fermion pair as a function of the (relative) position space at total quasimomentum
$(0,- \pi )$ for the prototypical parameters. It is a reproduction of (Reference Bru, de Siqueira Pedra and de Pasquale21, Fig. 5).
-
* To compare with, for standard diatomic molecules, the ratio between the bond energy (in
$\mathrm {K}$ ) and their dissociation temperatures rangesFootnote 23 from
$10$ to
$40$ , with an average of around
$20$ .
-
-
-
• Superconducting coherence length
$\xi $ (i.e., pair radius).
-
– Prediction:
$\xi _{a}=1.6\ \mathrm {nm}$ in one direction,
$\xi _{b}=2.1\ \mathrm {nm}$ in the orthogonal one at quasi-momenta
$(-\pi ,0)$ and
$(0,-\pi )$ . See Figure 5 (in lattice units), reproducing [Reference Bru, de Siqueira Pedra and de Pasquale21, Fig. 5]. It refers approximately to
$6$ lattice sites in one direction and
$8$ lattice sites in the other one. Compare this result with the exponential localization of the fermionic component
$\mathcal {F} ^{-1}[\hat {\psi }_{\mathrm {U},k}]$ of the eigenstate given by Theorem 3.1 (iii).
-
-
•
-
– Experimental data:
-
*
$\xi _{ab}=1.6\ \mathrm {nm}$ is obtained for an optimally doped
$ \mathrm {YBa}_{2}\mathrm {Cu}_{3}\mathrm {O}_{6.9}$ for which
$T_{c}=95\ \mathrm {K}$ . See [Reference Hwang89, above the “Summary and conclusion”].
-
*
$\xi _{ab}=2.1\ \mathrm {nm}$ is obtained for
$\mathrm {La}_{1.8}\mathrm { Sr}_{0.2}\mathrm {CuO}_{4}$ for which
$T_{c}=36.5\ \mathrm {K}$ . See [Reference Panson, Wagner, Braginski, Gavaler, Janocko, Pohl and Talvacchio86, Table II].
-
*
$\xi _{ab}=3.8\ \mathrm {nm}$ is obtained for an optimally doped
$ \mathrm {La}_{1.85}\mathrm {Sr}_{0.16}\mathrm {CuO}_{4}$ for which
$T_{c}=38\ \mathrm {K}$ . See [Reference Hwang89, above the “Summary and conclusion”].
-
* More generally,
$\xi _{ab}$ is in the nanometer range, between
$1\ \mathrm {nm}$ and
$3.8\ \mathrm {nm}$ for various other examples of
$\mathrm {La }_{2}\mathrm {CuO}_{4}$ and
$\mathrm {YBa}_{2}\mathrm {Cu}_{3}\mathrm {O}_{7}$ . See, for example, [Reference Wesche14, Table 9.1] and (Reference Mourachkine88, Table 3.2 on page 60). The coherence length is very small compared to conventional superconductors, for which it is generally several tens or hundreds of nanometers. See, for example, the table in (A.1).
-
-
The pseudo-gap temperature is the temperature below which the Fermi surface of a material exhibits a partial energy gap, in fact a gap in a particular direction, as in the quasi-momenta
$(-\pi ,0)$
and
$(0,-\pi )$
. Compare with Theorem 3.1 (iii). The 2-fermion 1-boson problem studied here and in [Reference Bru, de Siqueira Pedra and de Pasquale21] cannot a priori explain the superconducting phase, which is a collective phenomenon, but only the pseudogap regime which is expected to be related to the formation of fermion pairs (mainly for quasi-momenta
$(-\pi ,0)$
and
$(0,-\pi )$
).
To conclude, this paper together with [Reference Bru, de Siqueira Pedra and de Pasquale22, Reference Bru, de Siqueira Pedra and de Pasquale21] contributes a mathematically rigorous microscopic model for cuprate superconductors that includes Jahn-Teller-type bipolarons with zero spin and local repulsions. This model captures the following phenomenological aspects of these materials:
-
• d-wave pairing not based on anisotropy.
-
• Low density superconducting superfluid.
-
• Pseudogap temperature.
-
• Very accurate coherence lengths.
-
• Solution to the ‘large bipolaron mass vs. small mass of superconducting carrier pairs’.
In addition, as proven in the Ph.D. thesis [Reference de Pasquale24], in a mean-field-like approximation, the many-body version of the model considered here also explains another very special feature of cuprate superconductors – namely, the density waves [Reference Arpaia, Martinelli, Moretti Sala, Caprara, Nag, Brookes, Camisa, Li, Gao, Zhou, Garcia-Fernandez, Zhou, Schierle, Bauch, Peng, Di Castro, Grilli, Lombardi, Braicovich and Ghiringhelli23]. We therefore think that the model we present here deserves to be studied in much more detail, in view of a microscopic theory for cuprate superconductivity.
A.2 The Fock-space formalism
In quantum mechanics, one generally starts with a (one-particle) Hilbert space
$\mathfrak {h}$
, often realized as a space
$L^{2}(\mathcal {M})$
of square-integrable, complex-valued functions on a measure space
$(\mathcal {M}, \mathfrak {a})$
. The states of a quantum system of
$n\in \mathbb {N}$
quantum particles are then represented within the n-fold tensor product
$\mathfrak { h}^{\otimes n}$
of
$\mathfrak {h}$
. However, identical quantum particles are indistinguishable, meaning that they cannot be differentiated from one another, not even in principle. In this situation, the states of these indistinguishable quantum particles are only taken from a subspace of
$ \mathfrak {h}^{\otimes n}$
.
Recall meanwhile that all quantum particles possess an intrinsic form of angular momentum known as spin, characterized by a quantum number
$\mathfrak { s}\in \mathbb {N}/2$
. If
$\mathfrak {s}$
is half-integer, then the corresponding particles are named fermions; otherwise, we have bosons. By the celebrated spin-statistics theorem, fermionic wave functions are antisymmetric with respect to permutations of particles, whereas the bosonic ones are symmetric. The states of a system of
$n\in \mathbb {N}$
fermions correspond then to vectors in the subspace
$\wedge ^{n}\mathfrak {h}$
of totally antisymmetric n-particle wave functions in
$\mathfrak {h}^{\otimes n}$
, while the states of a system of
$n\in \mathbb {N}$
bosons are vectors in the subspace
$\vee ^{n}\mathfrak {h}^{n}$
of totally symmetric n-particle wave functions in
$\mathfrak {h}^{\otimes n}$
.
In most many-body quantum systems, the exact number of particles is not known. In quantum statistical mechanics, physical properties are typically studied in the limit
$n\rightarrow \infty $
of infinite number of particles. Quantum field theory deals with situations where the particle number and species vary with time. The so-called Fock spaces are used to encode both situations. For fermionic systems, the Fock space is, by definition, the Hilbert space

while, for bosonic systems,

The respective scalar products are denoted by
$\left \langle \cdot ,\cdot \right \rangle _{\mathfrak {F}_{\pm }}$
. The two scalar products are the sum over
$n\in \mathbb {N}$
of each canonical scalar product on the sector
$ \wedge ^{n}\mathfrak {h}$
and
$\vee ^{n}\mathfrak {h}$
, respectively. In both cases, we denote the vacuum state by
$\Omega \doteq (1,0,\ldots )$
.
The Fock space proved very useful, not least because it allows so-called creation and annihilation operators:
Fermionic case. The annihilation operator
$a\left ( \varphi \right ) \in \mathcal {B}(\mathfrak {F}_{-})$
of a fermion with wave function
$\varphi \in \mathfrak {h}$
is the (linear) bounded operator uniquely defined by the conditions
$a\left ( \varphi \right ) \Omega =0$
and

for any
$n\in \mathbb {N}$
and
$\psi _{1},\ldots ,\psi _{n}\in \mathfrak {h}$
, where
$\Pi _{n}$
is the set of all permutations
$\pi $
of n elements and
$ \mathrm {sgn}:\Pi _{n}\rightarrow \{-1,1\}$
denotes the sign of these permutations, while
$\Omega =(1,0,0,\ldots )$
is the vacuum state and
$\psi _{1}\wedge \cdots \wedge \psi _{n}$
is the orthogonal projection of
$\psi _{1}\otimes \cdots \otimes \psi _{n}\in \mathfrak {h}^{\otimes n}$
onto the subspace of antisymmetric n–particle wave functions:

The creation operator of a fermion with wave function
$\varphi \in \mathfrak { h}$
is the adjoint
$a^{\ast }(\varphi )\doteq a\left ( \varphi \right ) ^{\ast }$
of
$a\left ( \varphi \right ) $
– namely,
$a^{\ast }(\varphi )\Omega =\varphi $
and

Such operators are known to satisfy the so-called Canonical Anticommutation Relations (CAR): For all
$\varphi _{1},\varphi _{2}\in \mathfrak {h}$
,

See [Reference Bratteli and Robinson90, p. 10]. Here,
$\mathbf {1}$
stands for the identity operator on the Fock space
$\mathfrak {F}_{-}$
.
Bosonic case. The annihilation operator
$b\left ( \varphi \right ) $
of a boson with wave function
$\varphi \in \mathfrak {h}$
is the (linear) unbounded operator acting on
$\mathfrak {F}_{+}$
and uniquely defined by the conditions
$b\left ( \varphi \right ) \Omega =0$
and

for any
$n\in \mathbb {N}$
and
$\psi _{1},\ldots ,\psi _{n}\in \mathfrak {h}$
, where
$\psi _{1}\vee \cdots \vee \psi _{n}$
is the orthogonal projection of
$ \psi _{1}\otimes \cdots \otimes \psi _{n}\in \mathfrak {h}^{\otimes n}$
onto the subspace of symmetric n–particle wave functions:

As in the fermionic case, the creation operator of a boson with wave function
$\varphi \in \mathfrak {h}$
is the adjoint
$b^{\ast }(\varphi )\doteq b\left ( \varphi \right ) ^{\ast }$
of
$b\left ( \varphi \right ) $
, where
$b^{\ast }(\varphi )\Omega =\varphi $
and

Such operators are known to satisfy the so-called Canonical Commutation Relations (CCR): For all
$\varphi _{1},\varphi _{2}\in \mathfrak {h}$
,

See [Reference Bratteli and Robinson90, p. 10]. Here,
$\mathbf {1}$
stands again for the identity operator on the Fock space
$\mathfrak {F}_{+}$
.
The interest of Fock spaces lies in the use of creation and annihilation operators, which not only give a mathematically rigorous definition for precesses of creation or annihilation of physical particles, but also possess essential algebraic properties: the CAR and CCR relations given above. Although Fock spaces and the creation and annihilation operators are not strictly necessary for our proofs, we use them in this paper because they allow us to define the model in a very intuitive way, which makes its physical meaning easy to understand once the Fock-space formulation is familiar.
A.3 Non-autonomous evolution equations and scattering theory
This section collects simple results on wave and scattering operators (50)–(52) for bounded Hamiltonians, related to their approximation by Dyson series. We start with the following elementary lemma, resulting from the theory of non-autonomous evolution equations:
Lemma A.1 (Finite-time scattering and wave operators).
For any self-adjoint
$X,Y\in \mathcal {B}\left ( \mathcal {X}\right ) $
acting on a Hilbert space
$\mathcal {X}$
and all
$s,t\in \mathbb {R}$
,

with
$(Y_{t})_{t\in \mathbb {R}}\subseteq \mathcal {B}(\mathcal {X})$
being the norm-continuous family

Proof. We compute that, for any
$s,t\in \mathbb {R}$
,

as well as

both in
$\mathcal {B}(\mathcal {X})$
. In other words, the family

of (uniformly) bounded operators is a norm-continuous two-parameter family of unitary operators solving the non-autonomous evolution equations

in
$\mathcal {B}(\mathcal {X})$
, where
$(Y_{t})_{t\in \mathbb {R}}\subseteq \mathcal {B}(\mathcal {X})$
is the norm-continuous family defined by (A.11). As is well-known, there is a unique solution to this non-autonomous evolution equation (A.13), which is given by the Dyson series (A.10). This series is absolutely summable in
$ \mathcal {B}(\mathcal {X})$
. Notice that the integrals appearing in it are Riemann integrals, for their arguments are continuous functions taking values in a Banach space.
Corollary A.2 (Approximation of scattering and wave operators).
Let
$X,Y\in \mathcal {B}\left ( \mathcal {X}\right ) $
be two self-adjoint operators acting on a Hilbert space
$\mathcal {X}$
. Assume that the waves operators

exist. Let
$\varepsilon \in \mathbb {R}^{+}$
. Then
-
i.) For any
$\varphi \in \mathrm {ran}\left ( P_{\mathrm {ac}}\left ( X\right ) \right ) $ , there is
$T>0$ such that
$$ \begin{align*} T<t\implies \left\Vert \left( W^{+}\left( X+Y,X\right) -V_{0,t}\right) \varphi \right\Vert _{\mathcal{X}}\leq \varepsilon , \end{align*} $$
$$ \begin{align*} t<-T\implies \left\Vert \left( W^{-}\left( X+Y,X\right) -V_{0,t}\right) \varphi \right\Vert _{\mathcal{X}}\leq \varepsilon . \end{align*} $$
-
ii.) For any
$\varphi ,\psi \in \mathrm {ran}\left ( P_{\mathrm {ac} }\left ( X\right ) \right ) $ , there is
$T>0$ such that
$$ \begin{align*} \left\langle \psi ,S\left( X+Y,X\right) \varphi \right\rangle _{\mathcal{X} }=\left\langle \psi ,W^{+}\left( X+Y,X\right) ^{\ast }W^{-}\left( X+Y,X\right) \varphi \right\rangle _{\mathcal{X}}=\left\langle \psi ,V_{t,s}\varphi \right\rangle _{\mathcal{X}}+\mathcal{O}\left( \varepsilon \right) \ \end{align*} $$
$s<-T<T<t$ .
Here,

the norm-continuous family
$(Y_{t})_{t\in \mathbb {R}}\subseteq \mathcal {B}( \mathcal {X})$
being defined by (A.11).
Proof. Assertion (i) is a direct consequence of Lemma A.1. Concerning the scattering operator, remark in particular that, for any
$r,s,t\in \mathbb {R}$
,
$V_{t,s}V_{s,r}=V_{t,r}$
and
$V_{t,s}^{\ast }=V_{s,t}$
. Given
$\varphi ,\psi \in \mathrm {ran}\left ( P_{ \mathrm {ac}}\left ( Y\right ) \right ) $
, we have that

Assertion (ii) therefore follows from assertion (i).
A.4 Constant fiber direct integrals
For more details, we refer to [Reference Reed and Simon38, Section XIII.16] as well as [Reference Nielsen45] for the general theory.
Let
$(\mathcal {X},\mu )$
be any semifinite measure space and
$\mathcal {Y}$
any separable Hilbert space. The constant fiber direct integral of
$ \mathcal {Y}$
over
$\mathcal {X}$
is, by definition, the Hilbert space

of equivalence classes of square-integrable
$\mathcal {Y}$
–valued functions with scalar productFootnote
24

and the pointwise vector space operations

If
$\mathcal {Y}=\mathbb {C}$
, then we use the shorter notation
$L^{2}\left ( \mathcal {X},\mu \right ) \equiv L^{2}\left ( \mathcal {X},\mathbb {C},\mu \right ) $
.
A mapping
$A:\mathcal {X}\rightarrow \mathcal {B}(\mathcal {Y})$
is strongly measurable whenever the mapping
$x\mapsto \left \langle \varphi ,A(x)\psi \right \rangle _{\mathcal {Y}}$
from
$\mathcal {X}$
to
$\mathbb {C}$
is measurable for all
$\varphi ,\psi \in \mathcal {Y}$
. Let
$L^{\infty }( \mathcal {X},\mathcal {Y},\mu )$
be the
$C^{\ast }$
-algebra of equivalence classes of strongly measurable functions
$A:\mathcal {X}\rightarrow \mathcal {B }(\mathcal {Y})$
with

Here,
$\mathrm {ess-}\sup $
denotes the essential supremum and
$\left \Vert \cdot \right \Vert _{\mathrm {op}}$
stands for the operator norm. If
$\mathcal { Y}=\mathbb {C}$
, then we use the shorter notation
$L^{\infty }\left ( \mathcal {X },\mu \right ) \equiv L^{\infty }\left ( \mathcal {X},\mathbb {C},\mu \right ) $
.
A bounded operator D on
$L^{2}(\mathcal {X},\mathcal {Y},\mu )$
is decomposable if there is
$A\in L^{\infty }(\mathcal {X},\mathcal {Y},\mu )$
such that, for all
$\psi \in L^{2}(\mathcal {X},\mathcal {Y},\mu )$
,

If such an A exists, then it is unique. Moreover, the mapping
$A\mapsto D$
defined by the above equality is a
$\ast $
-homomorphism which is isometric. See [Reference Reed and Simon38, Theorem XIII.83]. The operators
$A(x)\in \mathcal {B}( \mathcal {Y})$
,
$x\in \mathcal {X}$
, are called the fibers of D and we write

For the reader’s convenience, we now give three essential properties of decomposable operators used in the paper, referring to [Reference Reed and Simon38, Theorem XIII.85 (a), (c) and (d)]. Note that
$\sigma (X)$
denotes below the spectrum of any operator X acting on some Hilbert space, as is usual.
Theorem A.3 (Properties of decomposable operators).
Let D be a decomposable operator on
$L^{2}(\mathcal {X},\mathcal {Y},\mu )$
, the fibers
$A(x)\in \mathcal {B}(\mathcal {Y})$
,
$x\in \mathcal {X}$
, of which are all self-adjoint. Then
-
i.) D is self-adjoint.
-
ii.)
$\lambda \in \sigma (D)$ iff, for all
$\varepsilon \in \mathbb {R} ^{+}$ ,
$$ \begin{align*} \mu \left( \left\{ x\in \mathcal{X}:\sigma \left( A\left( x\right) \right) \cap \left( \lambda -\varepsilon ,\lambda +\varepsilon \right) \neq \emptyset \right\} \right)>0. \end{align*} $$
-
iii.) For any bounded Borel function f on
$\mathbb {R}$ ,
$f(D)$ is decomposable and has fibers
$f(A(x))$ ,
$x\in \mathcal {X}$ ; that is,
$$ \begin{align*} f(D)=\int_{\mathcal{X}}^{\oplus }f\left( A\left( x\right) \right) \,\mu \left( \mathrm{d}x\right) {\ .} \end{align*} $$
The above theorem can be used to elegantly prove the following well-known results about multiplication operators
$M_{\varphi }$
by any bounded measurable function
$\varphi \in L^{\infty }(\mathcal {X},\mu )$
, defined for any
${\psi \in L^{2}(\mathcal {X},\mu )}$
, by

Corollary A.4 (Properties of multiplication operators).
The multiplication operators
$M_{\varphi }$
by
${\varphi \in L^{\infty }( \mathcal {X},\mu )}$
have the following properties:
-
i.) For any bounded Borel function f on
$\mathbb {R}$ , one has
$ f(M_{\varphi })=M_{f\circ \varphi }$ .
-
ii.)
$\sigma (M_{\varphi })$ is the essential range
$\mathrm {ess-im} (\varphi )$ of
$\varphi $ .
-
iii.) Its operator norm
$\Vert M_{\varphi }\Vert _{\mathrm {op}}$ is equal to
$\Vert \varphi \Vert _{\infty }$ .
Proof. Noting that
$M_{\varphi }$
is a decomposable operator on

with
$\varphi (x)$
, seen as a linear operator on
$\mathbb {C}$
, being its fibers, we can use Theorem A.3 (iii) to get the equality

This proves Assertion (i). For the second one, we use that
$\sigma (\varphi (x))=\{\varphi (x)\}$
for all
$x\in \mathcal {X}$
and thus infer from Theorem A.3 (ii) that
$\lambda \in \sigma (M_{f})$
iff, for all
$\varepsilon \in \mathbb {R} ^{+}$
,

In other words, one gets Assertion (ii). Assertion (iii) is an elementary application of [Reference Reed and Simon38, Theorem XIII.83].
Below, we study the special case of multiplication operators on
$L^{2}( \mathbb {T}^{2},\nu )$
, where
$\mathbb {T}^{2}\doteq \lbrack -\pi ,\pi )^{2}$
is the torus and
$\nu $
is the normalized Haar measure (23) on
$\mathbb {T}^{2}$
. It is again an elementary result, used in the paper. To this end, we recall that, for any self-adjoint operator Y acting on a Hilbert space
$\mathcal {Y}$
,
$P_{\mathrm {ac}}(Y)$
is the orthogonal projection onto the absolutely continuous space of Y, which is defined by ( 49).
Corollary A.5 (Absolutely continuous space of multiplication operators on
$L^{2}(\mathbb {T}^{2}, \nu )$
).
Let
$\varphi :\mathbb {T}^{2}\rightarrow \mathbb {R}$
be a bounded Borel function with the property that, for every Borel set
$\Omega \subseteq \mathbb {R}$
with zero Lebesgue measure, one has
$\nu (\varphi ^{-1}(\Omega ))=0$
. Then,
$P_{\mathrm {ac}}\left ( M_{\varphi }\right ) =\mathbf {1}$
; that is,

Proof. Given any Borel set
$\Omega \subseteq \mathbb {R}$
, we deduce from Corollary A.4 (i) that

which in turn implies that, for any
$\psi \in L^{2}(\mathbb {T}^{2},\nu )$
,

Hence, if
$\Omega \subseteq \mathbb {R}$
has zero Lebesgue measure, then under the conditions of the corollary,

In other words, for any
$\psi \in L^{2}(\mathbb {T}^{2},\nu )$
,
$\langle \psi ,\chi _{(\cdot )}(M_{\varphi })\psi \rangle $
is absolutely continuous with respect to the Lebesgue measure.
We next provide a result on the strong operator convergence and a version of Fubini’s theorem for (constant fiber) direct integrals, which are also used in our proofs.
Proposition A.6 (Strong operator convergence).
Let
$(A_{n})_{n\in \mathbb {N}}$
be any bounded sequence in
$L^{\infty }( \mathcal {X},\mathcal {Y},\mu )$
. If

then
$A\in L^{\infty }(\mathcal {X},\mathcal {Y},\mu )$
and

Proof. The assertion is well-known, but for the reader’s convenience, we give here its complete proof. For any
$\varphi ,\psi \in \mathcal {Y}$
, it follows from the fact that
$A_{n}(x)\psi \rightarrow A(x)\psi $
everywhere and the continuity of
$\left \langle \varphi ,\cdot \right \rangle \in \mathcal {Y} ^{\ast }$
that

This shows that A is strongly measurable, because the pointwise limit of a sequence of real-valued measurable functions is measurable as well. Now, let

For
$\mu $
-a.e.
$x\in \mathcal {X}$
and any
$\varphi \in \mathcal {Y}$
,

Taking the limit
$n\rightarrow \infty $
, one thus gets that, for
$\mu $
-a.e.
$x\in \mathcal {X}$
and any
$\varphi \in \mathcal {Y}$
,

Hence, M is an essential upper bound for
$\{\Vert A(x)\Vert _{\mathrm {op} }\}_{x\in \mathcal {X}}$
and, therefore,
$A\in L^{\infty }(\mathcal {X}, \mathcal {Y},\mu )$
. Finally, given any element
$\varphi \in L^{2}(\mathcal {X} ,\mathcal {Y},\mu )$
, by (A.15)–(A.16) and the triangle inequality, we have the estimate

Since
$A_{n}\left ( x\right ) \varphi \left ( x\right ) \rightarrow A\left ( x\right ) \varphi \left ( x\right ) $
for all
$x\in \mathcal {X}$
, we can therefore apply Lebesgue’s dominated convergence theorem to conclude that, for any
$\varphi \in L^{2}(\mathcal {X},\mathcal {Y},\mu )$
,

Before proving a version of Fubini’s theorem for constant fiber direct integrals, we fix some terminology concerning the Riemann integral: A partition of the interval
$[a,b]$
is a finite set
$P=\{t_{0}<t_{1}<\cdots <t_{k}\}$
where
$t_{0}=a$
and
$t_{k}=b$
. The norm of the partition P is the number
$|P|=\max _{1\leq i\leq k}(t_{i}-t_{i-1})$
. A tagged partition is a pair
$P^{\ast }=(P,\xi )$
where P is a partition and
$\xi =(\xi _{1},\dotsc ,\xi _{k})$
is such that
$t_{i-1}\leq \xi _{i}<t_{i}$
for every
$ i=1,\dotsc ,k$
. If
$P^{\ast }$
is a tagged partition of
$[a,b]$
, the corresponding Riemann sum for
$f:[a,b]\rightarrow \mathcal {Z}$
, with
$ \mathcal {Z}$
being a vector space, is

Proposition A.7 (Fubini’s Theorem for direct integrals).
Let
$A_{(\cdot )}:[a,b]\rightarrow L^{\infty }(\mathcal {X},\mathcal {Y},\mu )$
be a continuous function. Then
-
i.) The mapping
$$ \begin{align*} \mathcal{X}\ni x\mapsto {\int_{a}^{b}}A_{t}\left( x\right) \,\mathrm{d}t\in \mathcal{B}(\mathcal{Y}) \end{align*} $$
$L^{\infty }(\mathcal {X},\mathcal {Y},\mu )$ ;
-
ii.) The mapping
$$ \begin{align*} \lbrack a,b]\ni t\mapsto {\int_{\mathcal{X}}^{\oplus }}A_{t}\left( x\right) \,\mu \left( \mathrm{d}x\right) \in \mathcal{B}\left( \int_{\mathcal{X} }^{\oplus }\mathcal{Y}\,\mu \left( \mathrm{d}x\right) \right) \end{align*} $$
$$ \begin{align*} {\int_{a}^{b}\int_{\mathcal{X}}^{\oplus }}A_{t}\left( x\right) \,\mu \left( \mathrm{d}x\right) \mathrm{d}t={\int_{\mathcal{X}}^{\oplus }\int_{a}^{b}} A_{t}(x)\,\mathrm{d}t\,\mu \left( \mathrm{d}x\right) . \end{align*} $$
Proof. If
$A_{(\cdot )}:[a,b]\rightarrow L^{\infty }(\mathcal {X},\mathcal {Y},\mu )$
is continuous, then so is
$A_{(\cdot )}(x):[a,b]\rightarrow \mathcal {B}( \mathcal {Y})$
for
$x\in \mathcal {X} \mu $
-a.e. For simplicity, we may assume that
$A_{(\cdot )}(x)$
is even continuous for all
$x\in \mathcal {X}$
. If fact, as this is true for
$x\in \mathcal {X} \mu $
-a.e., for some Borel set
$\mathcal {X}_{0}\subseteq \mathcal {X}$
with
$\mu ( \mathcal {X}_{0})=0$
,
$\mathbf {1}\left [ x\notin \mathcal {X}_{0}\right ] A_{(\cdot )}(x)$
is continuous for all
$x\in \mathcal {X}$
. Note that, for all
$t\in \lbrack a,b]$
, the functions
$A_{t}$
and
$\mathbf {1}\left [ (\cdot )\notin \mathcal {X}_{0}\right ] A_{t}$
are strongly mensurable and represent the same element (i.e., equivalence class of strongly mensurable functions
$ \mathcal {X}\rightarrow \mathcal {B}(\mathcal {Y})$
) of
$L^{\infty }(\mathcal {X} ,\mathcal {Y},\mu )$
. Moreover, as
$[a,b]$
is compact,
$\{A_{t}\}_{t\in \lbrack a,b]}$
is bounded as a subset of the metric space
$\left ( L^{\infty }(\mathcal {X},\mathcal {Y},\mu ),\Vert \cdot \Vert _{\infty }\right ) $
. Thus,

Let
$P_{n}^{\ast }$
be a tagged partition whose norm of the corresponding partition
$P_{n}$
goes to zero as
$n\rightarrow \infty $
. Then for every
$ \varphi ,\psi \in \mathcal {Y}$
and
$x\in \mathcal {X} \mu $
-a.e,

For the first equality, we used the fact that the Riemann integral commutes with bounded linear transformations. Observing that for
$x\in \mathcal {X} \mu $
-a.e., the right-hand side is a pointwise limit of a linear combination of continuous (hence Riemann integrable) functions, this proves assertion (i).
Note that the mapping defined in Assertion (ii) is a composition of two continuous functions – namely,
$A_{(\cdot )}:[a,b]\rightarrow L^{\infty }( \mathcal {X},\mathcal {Y},\mu )$
and

Given any
$\varphi ,\psi \in L^{2}(\mathcal {X},\mathcal {Y},\mu )$
, observe that the function

is measurable when
$A_{(\cdot )}(x)\in C([a,b],\mathcal {B}(\mathcal {Y}))$
for any
$x\in \mathcal {X}$
.
To prove this, for each
$n\in \mathbb {N}$
, define the function
$f_{n}: \mathcal {X}\times \lbrack a,b]\rightarrow \mathbb {C}$
by
$ f_{n}(x,t)=f(x,s_{t})$
, with
$s_{t}=\min \{m_{t,n}/n,b\}$
and
$m_{t,n}\in \mathbb {Z}$
being such that
$(m_{t,n}-1)/n\leq t<m_{t,n}/n$
. In particular,

Characteristic functions are measurable on
$[a,b]$
, and
$f(\cdot ,t)$
is also measurable on
$\mathcal {X}$
for every
$t\in \lbrack a,b]$
. So,
$f_{n}$
is measurable for all
$n\in \mathbb {N}$
. It is easy to check that
$ m_{t,n}/n\rightarrow t$
for any
$t\in \lbrack a,b]$
, which in turn implies that
$f_{n}$
pointwise converges to f, as
$n\rightarrow \infty $
, because of the continuity of
$f(x,\cdot )$
for any fixed
$x\in \mathcal {X}$
. The last continuity property is a direct consequence of the assumption
$ A_{(\cdot )}(x)\in C([a,b],\mathcal {B}(\mathcal {Y}))$
together with elementary estimates using the Cauchy-Schwarz inequality.
As a consequence, f is measurable on
$\mathcal {X}\times \lbrack a,b]$
. Note also that

thanks to the Cauchy-Schwarz inequality for both spaces
$\mathcal {Y}$
and
$ L^{2}(\mathcal {X},\mathcal {Y},\mu )$
. We can then apply (usual) Fubini’s theorem to obtain

As
$\varphi ,\psi $
are arbitrary, we arrive at Assertion (ii).
We conclude this short account on constant fiber direct integrals by providing a representation of them as tensor products:
Proposition A.8 (Direct integrals and tensor products).
There is a unique unitary transformation
$\mathbf {V}:L^{2}(\mathcal {X},\mu )\otimes \mathcal {Y}\rightarrow L^{2}(\mathcal {X},\mathcal {Y},\mu )$
such that

Proof. See [Reference Nielsen45, Proposition 5.2].
A.5 The Birman-Schwinger principle
There are various versions of the Birman-Schwinger principle in the literature, and we give below the precise version that is used in our proofs. To this end, we first define Birman-Schwinger operators: For any operator T acting on some complex vector space, recall that
$\rho (T)\subseteq \mathbb {C}$
denotes its resolvent set. Given two (bounded) operators
$T,V$
acting on some complex vector space and any
$\lambda \in \rho (T)$
, we define the associated Birman-Schwinger operator to be

It turns out that, for all
$\lambda \in \rho (T)$
,
$1$
is an eigenvalue of
$ \mathrm {B}\left ( \lambda \right ) $
iff
$\lambda $
is an eigenvalue of
$ T-V^{2}$
:
Lemma A.9 (The eigenvalues of Birman-Schwinger operators).
Let
$T,V$
be two bounded operators acting on a vector space
$\mathcal {X}$
over
$\mathbb {C}$
. Assume that
$\lambda \in \rho (T)$
is an eigenvalue of
$ T-V^{2}$
and let
$\{\varphi _{i}\}_{i\in I}$
denote any basis of the corresponding eigenspace. Define
$\gamma _{i}\doteq V\varphi _{i}$
,
$i\in I$
. Then,

and
$\{\gamma _{i}\}_{i\in I}$
is a linearly independent set satisfying

Proof. Suppose that
$\lambda $
is an eigenvalue of
$T-V^{2}$
and let
$\{\varphi _{i}\}_{i\in I}$
be a basis of the corresponding eigenspace. Set
$\gamma _{i}=V\varphi _{i}$
,
$i\in I$
. Then

so that (A.18) holds true. By (A.18),
$\gamma _{i}$
is a nonzero vector for any
$i\in I$
since
$\varphi _{i}\neq 0$
for all
$i\in I $
. As linear transformations map a linearly dependent set onto a linearly dependent set, we conclude that
$\{\gamma _{i}\}_{i\in I}$
is a linearly independent set. Equation (A.19) is a direct consequence of (A.17) and (A.18).
This last lemma is explicitly used in the proof of Corollary 4.6 and allows meanwhile to prove the Birman-Schwinger principle for eigenvalues. Below, for any operator T, we use the notation
$\mathcal {E} _{T}(\lambda )$
for the eigenspace associated with the eigenvalue
$\lambda $
of T.
Theorem A.10 (Birman-Schwinger).
Let
$T,V$
be two linear operators acting on a vector space
$\mathcal {X}$
over
$\mathbb {C}$
and
$\lambda \in \rho (T)$
. Then
$\lambda $
is an eigenvalue of
$T-V^{2}$
iff
$1$
is an eigenvalue of
$\mathrm {B}(\lambda )\equiv \mathrm {B}(\lambda ,T,V)$
. In this case,

that is, the corresponding (geometric) multiplicities of eigenvalues are equal to each other.
Proof. If
$\lambda $
is an eigenvalue of
$T-V^{2}$
, then Lemma A.9 implies that
$1$
is an eigenvalue of
$\mathrm {B}(\lambda )$
and the eigenspace of
$\mathrm {B}(\lambda )$
corresponding to the eigenvalue
$1$
has at least dimension
$|I|=\dim \mathcal {E}_{T-V^{2}}(\lambda )$
. Conversely, if
$\{\phi _{j}\}_{j\in J}$
is a basis of the eigenspace of
$\mathrm {B} (\lambda )$
corresponding to the eigenvalue
$1$
, then we set

Then, by (A.17),

which implies that
$\{\psi _{j}\}_{j\in J}$
is a linearly independent set. Thus,

and hence, the eigenspace of
$T-V^{2}$
corresponding to the eigenvalue
$ \lambda $
has at least dimension
${|J|=\dim \mathcal {E}_{\mathrm {B}(\lambda )}(1)}$
.
A.6 Combes-Thomas estimates
We give here a version of the celebrated Combes-Thomas estimates, first proven in 1973 [Reference Combes and Thomas47], which is well-adapted to our framework. For the nonexpert reader, we provide also its proof, which is relatively short and easy to understand in the particular situation we are interested in.
Fix a countable set
$\Lambda $
and a pseudometric
$d:\Lambda \times \Lambda \rightarrow \mathbb {R}_{0}^{+}$
on
$\Lambda $
. Let
$\ell ^{2}(\Lambda )$
be the (separable) Hilbert space of square summable functions
$\Lambda \rightarrow \mathbb {C}$
. Similar to (5), its canonical orthonormal basis is defined by

where
$\delta _{\mathfrak {i},\mathfrak {j}}$
is the Kronecker delta. For simplicity, as before, we use the shorter notation
$\langle \cdot ,\cdot \rangle \equiv \langle \cdot ,\cdot \rangle _{\ell ^{2}(\Lambda )}$
for its scalar product.
For each bounded operator
$T\in \mathcal {B}(\ell ^{2}(\Lambda ))$
and positive parameter
$\mu \in \mathbb {R}_{0}^{+}$
, we define the quantity

Compare with (136). By definition of a pseudometric, the function d is symmetric with respect to the variables x and y. The same occurs with the factor
$|\langle \mathfrak {e}_{x},T\mathfrak {e} _{y}\rangle |$
, provided that T is self-adjoint. Thus, in this particular case,
$\mathbf {S}(T,\mu )$
is equal to

The lemma below provides an estimate of the operator norm of T in terms of quantities that are similar to (A.20) and (A.21):
Lemma A.11. For any bounded operator
$T\in \mathcal {B}(\ell ^{2}(\Lambda ))$
,

Proof. Assume without loss of generality that the above bound is finite for
$T\in \mathcal {B}(\ell ^{2}(\Lambda ))$
. Otherwise the assertion would be trivial. Let
$V:\ell ^{1}(\Lambda )+\ell ^{\infty }(\Lambda )\rightarrow \ell ^{1}(\Lambda )+\ell ^{\infty }(\Lambda )$
be the mapping defined by

If
$f\in \ell ^{\infty }(\Lambda )$
, then
$Vf\in \ell ^{\infty }(\Lambda )$
because

while, for any
$f\in \ell ^{1}(\Lambda )$
, we also have
$Vf\in \ell ^{1}(\Lambda )$
because

using Tonelli’s theorem. It then follows from the Riesz-Thorin theorem [Reference Folland94, Theorem 6.27] that, for any function
$f\in \ell ^{2}(\Lambda )\subseteq \ell ^{1}(\Lambda )+\ell ^{\infty }(\Lambda )$
,

Finally, we observe that

whenever
$x\in \Lambda $
and
$f\in \ell ^{2}(\Lambda )$
.
We now state another, well-known, technical lemma, which is given here for completeness. Recall that, here,
$\rho (T)\subseteq \mathbb {C}$
and
$\sigma (T)\doteq \mathbb {C}\backslash \rho (T)$
respectively denote the resolvent set and the spectrum of any element T in some unital
$C^{\ast }$
-algebra (like the space of bounded operators on some Hilbert space). Similar to (135), we use the notation

for the distance between a complex number
$\lambda \in \mathbb {C}$
and the spectrum
$\sigma (T)$
of any element T in some unital
$C^{\ast }$
-algebra.
Lemma A.12 (Norm estimates of resolvents).
Let
$\mathcal {X}$
be an unital
$C^{\ast }$
-algebra with norm
$\left \Vert \cdot \right \Vert $
. Take
$T,B\in \mathcal {X}$
with T being self-adjoint and let
$\lambda \in \rho (T)$
. If
$\Vert B\Vert <\Delta (\lambda ;T)$
, then
$ \lambda \in \rho (T+B)$
and

Proof. Assume all conditions of the lemma, in particular that
$\Vert B\Vert <\Delta (\lambda ;T)$
. Then,

and using the Neumann series [Reference Bru and Pedra46, Lemma 4.24] for
$ -(T-\lambda \mathbf {1})^{-1}B$
, the element
$\mathbf {1}+(T-\lambda \mathbf {1})^{-1}B$
is invertible with norm bounded by

Finally, one uses the equality

for any
$\lambda \in \rho (T)$
to deduce that
$\lambda \in \rho (T+B)$
and

We can now prove the following version of Combes-Thomas estimates:
Theorem A.13 (Combes-Thomas estimates).
Let
$T\in \mathcal {B}(\ell ^{2}(\Lambda ))$
be a self-adjoint operator. Given
$\mu \in \mathbb {R}_{0}^{+}$
and
$\lambda \in \mathbb {C}$
with
$\Delta (\lambda ;T)>\mathbf {S}(T,\mu )$
, the following inequality holds true:

Proof. Fix
$y\in \Lambda $
and
$R\in \mathbb {R}^{+}$
. Define the function
$\varphi :\Lambda \rightarrow \lbrack 1,\mathrm {e}^{\mu R}]$
by

Clearly,
$\varphi $
and
$1/\varphi $
are bounded, and the inverse of the multiplication operator
$M_{\varphi }\in \mathcal {B}(\ell ^{2}(\Lambda ))$
by
$\varphi $
is nothing else than
$M_{1/\varphi }$
. Because
$\varphi $
is a real-valued function,
$M_{\varphi }^{\ast }=M_{\varphi }$
and, for any
$x\in \Lambda $
,
$\mathfrak {e}_{x}$
is of course an eigenvector of
$M_{\varphi }$
with associated eigenvalue
$\varphi (x)$
. In particular,

Since
$(x,z)\mapsto \min \left \{ d\left ( x,z\right ) ,R\right \} $
is another pseudometric on
$\Lambda $
, for all
$x,z\in \Lambda $
, we have that

In particular, the operator
$B\doteq M_{\varphi }TM_{\varphi }^{-1}-T$
satisfies the bound

By Lemma A.11 together with Equations (A.20) and (A.21), it follows that

Applying now Lemma A.12, we then arrive at the bound

Finally, for any
$x\in \Lambda $
such that
$d(x,y)\leq R$
, we observe from the last upper bound that

Since
$y\in \Lambda $
and
$R\in \mathbb {R}^{+}$
are arbitrary parameters, the above inequality in fact holds true for all
$x,y\in \Lambda $
.
A.7 Elementary observations
For completeness and the reader’s convenience, we conclude the appendix by given a few elementary results related to the space of bounded operators on a Hilbert space.
Proposition A.14 (Monotonicity of the inverse on operators).
Let B and C be two positive bounded operators on a Hilbert space with bounded (positive) inverse. If
$B\leq C$
, then
$C^{-1}\leq B^{-1}$
.
Proof. The proof is standard. Since it is very short, we give it for completeness. If B and C commute, then

because
$B^{-1}$
,
$C^{-1}$
and
$C-B$
are positive commuting operators. If B and C do not commute, then we observe that
$B\leq C$
yields

which in turn implies that

because
$\mathbf {1}$
and
$C^{-\frac {1}{2}}BC^{-\frac {1}{2}}$
commute. From the last inequality it follows that
$B^{-1}\geq C^{-1}$
.
Proposition A.15 (Monotone convergence theorem for operators).
Let
$\mathcal {X}$
be any complex Hilbert space. Any increasing (decreasing) monotone net
$(A_{i})_{i\in I}$
of self-adjoint elements in
$\mathcal {B}( \mathcal {X})$
that is bounded from above (below) has a supremum (infimum) in
$\mathcal {B}(\mathcal {X})$
. The supremum (infimum) is itself also self-adjoint and is the strong operator limit of the net.
Proof. [Reference Bru and Pedra46, Proposition 2.17] already tells us that any increasing (decreasing) monotone net
$(A_{i})_{i\in I}$
of self-adjoint elements in
$\mathcal {B}(\mathcal {X})$
that is bounded from above (below) has a supremum (infimum)
$A_{\infty }$
in
$\mathcal {B}(\mathcal {X})$
. The supremum (infimum)
$A_{\infty }$
is itself also self-adjoint and is the weak operator limit of the net. This is proven by using the polarization identity and the Riesz representation theorem together with elementary estimates. To conclude the proof, it remains to show that
$A_{\infty }\in \mathcal {B}( \mathcal {X})$
is the limit of the increasing net
$(A_{i})_{i\in I}$
also in the strong operator topology. To this end, assume that
$(A_{i})_{i\in I}$
is an increasing net and define the (decreasing) net
$(B_{i})_{i\in I}$
of positive operators by
$B_{i}\doteq A_{\infty }-A_{i}\geq 0$
. By construction, this net converges in the weak operator topology to
$0\in \mathcal {B}(\mathcal {X})$
, which in turn implies that the net
$ (B_{i}^{1/2})_{i\in I}$
converges in the strong operator topology to
$0\in $
$\mathcal {B}(\mathcal {X})$
. As the net
$(B_{i}^{1/2})_{i\in I}$
is norm-bounded and
$B_{i}=B_{i}^{1/2}B_{i}^{1/2}$
, we then conclude that also
$ (B_{i})_{i\in I}$
converges in the strong operator topology to
$0\in \mathcal {B}(\mathcal {X})$
; that is,
$A_{\infty }$
is the strong operator limit of the net
$(A_{i})_{i\in I}$
. If
$(A_{i})_{i\in I}$
is a decreasing net, then we consider the increasing net
$(-A_{i})_{i\in I}$
to conclude that
$(A_{i})_{i\in I}$
has a infimum
$A_{\infty }=A_{\infty }^{\ast }\in \mathcal {B}(\mathcal {X})$
, which is again the strong operator limit of the net
$(A_{i})_{i\in I}$
.
Acknowledgements
We would like to express our gratitude to the reviewers for their constructive suggestions, particularly the discussion of enhanced binding in Quantum Field Theory in the introduction. We are also grateful for their interest in improving the paper and for their help in correcting a calculation error by providing an explicit proof of Proposition 2.1.
Funding statement
This work was supported by the Basque Government through the grant IT1615-22 and BERC 2022-2025 program, by the Ministry of Science and Innovation: PID2020-112948GB-I00 funded by MCIN/AEI/10.13039/501100011033 and by ‘ERDF A way of making Europe’, as well as by CNPq (303682/2025-6).
Competing interest
The authors have no competing interests to declare.
Data availability statement
No data used.
Ethical standards
The research meets all ethical guidelines, including adherence to the legal requirements of the study country.
Author contributions
The article is the result of a collective work by all the three authors.