1 Introduction
One way to understand arithmetic information encoded in Shimura varieties is to study the geometry and cohomology of the special fiber of a suitable integral model. This method has been applied with great success, for example, in the work of Harris and Taylor on the local Langlands correspondence for $GL_n$ . The moduli description which is available at least in the case of Shimura varieties of Hodge type, together with the particular structure obtained (ultimately) from the Frobenius morphism, allows to define certain stratifications of the special fiber whose strata can be studied one by one. In particular, there is the Newton stratification, which is defined, roughly speaking, by grouping those points into one stratum whose corresponding abelian varieties have isogenous p-divisible groups. Another important stratification is the Ekedahl–Kottwitz–Oort–Rapoport (EKOR) stratification defined in [Reference He and Rapoport27], a stratification which simultaneously generalizes the Ekedahl–Oort stratification for hyperspecial level and the Kottwitz–Rapoport stratification for Iwahori level.
It has been observed, over the past few decades, that in certain cases the unique closed Newton stratum, the so-called basic locus, has a simple description. More precisely, it has a stratification as a union of Deligne–Lusztig varieties, where the index set of the union and the combinatorics of the closure relations can be described in terms of a certain Bruhat–Tits building attached to the situation at hand. While for the Siegel moduli space of principally polarized g-dimensional abelian varieties this works only when $g \leqslant 2$ , there are several infinite families where such a description is possible; the cases studied in most detail so far arise from unitary Shimura varieties for unitary groups of signature $(1, n-1)$ . See the paper [Reference Vollaard and Wedhorn56] by Vollaard and Wedhorn for a prototypical example, and Section 6 for a more detailed discussion of individual cases and further references. Results of this type have found applications of different kinds:
-
• Explicit descriptions of the basic locus have been used to compute intersection numbers of special cycles on the special fiber of a Shimura variety, in order to prove results predicted by the Kudla–Rapoport program, which relates such intersection numbers to Fourier coefficients of modular forms (in a general sense). As examples, we mention [Reference Kudla and Rapoport35, Reference Terstiege52].
-
• Similar intersection numbers on Rapoport–Zink spaces play a role in the arithmetic fundamental lemma and in arithmetic transfer conjectures. See, for instance, [Reference Li and Zhu37, Reference Rapoport, Smithling and Zhang43, Reference Zhang63].
-
• In a different direction, a good understanding of the basic locus has been of high importance for some recent results around the Tate conjecture for the special fiber of certain Shimura varieties. See, for example, [Reference Helm, Tian and Xiao29, Reference Tian and Xiao54, Reference Xiao and Zhu61].
In this paper, we give a group-theoretic view on this phenomenon, extending previous work (see [Reference Görtz and He13, Reference Görtz, He and Nie14]) in this direction.
Let us explain the main results of this paper. We fix a connected reductive group ${\mathbf G}$ over a non-archimedean local field F and a conjugacy class of cocharacters $\mu $ of ${\mathbf G}$ over a (fixed) algebraic closure of F. Let $\tau \in B({\mathbf G}, \mu )$ be the unique basic element. Fix a rational level structure K. See Section 2.1 for the notation used here and for more details.
The central object of this paper is the generalized affine Deligne–Lusztig variety $X(\mu , \tau )_K$ , which can be viewed as a group-theoretic model of the basic locus mentioned above, in those cases where ${\mathbf G}$ and $\mu $ come from a Shimura datum. This is a perfect scheme, locally perfectly of finite type over an algebraic closure of the residue class field of F, when F has mixed characteristic. It is a scheme locally of finite type over an algebraic closure of the residue class field of F, if F has equal characteristic.
The following definition (which originates from [Reference Görtz and He13]) singles out a class of particularly well-behaved cases. The idea behind it is to express the condition that $X(\mu , \tau )_K$ is a union of (perfections of) classical Deligne–Lusztig varieties attached to a twisted Coxeter element (in some finite Weyl group).
We define (cf. Definition 2.4 for further details and equivalent formulations) the following result.
Definition 1.1 The datum $({\mathbf G}, \mu , K)$ is said to be of Coxeter type if every EKOR stratum that occurs in $X(\mu , \tau )_K$ is the EKOR stratum of a Weyl group element w that is a twisted Coxeter element in a finite standard parabolic subgroup of the Iwahori–Weyl group $\tilde W$ .
The notion of EKOR strata that we use here is the local version of the EKOR strata introduced in [Reference He and Rapoport27], an interpolation between Ekedahl–Oort and Kottwitz–Rapoport strata. See Sections 2.3 and 2.4 for further details.
The main novelties in this paper are new characterizations of the cases of Coxeter type, on the one hand by a simple dimension condition, on the other hand, equivalently, by an explicit group-theoretic condition which involves neither affine Deligne–Lusztig varieties, nor the $\mu $ -admissible set. As a consequence, we obtain a classification of all Coxeter cases.
Note that the study of affine Deligne–Lusztig varieties can be reduced to the simple groups over F. Moreover, if $\mu $ is central, then $X(\mu , \tau )_K$ is a discrete set and hence is zero-dimensional. The study of $X(\mu , \tau )_K$ can then be reduced to the case where $\mu $ is noncentral in every simple factor of the adjoint group ${\mathbf G}_{\text {ad}}$ over F.
We start by establishing the following general lower bound.
Theorem 1.2 (Theorem 3.5)
Let ${\mathbf J}_{\tau }$ denote the $\sigma $ -centralizer of $\tau $ . Suppose that $\mu $ is noncentral in every simple factor of the adjoint group ${\mathbf G}_{\text {ad}}$ over F. We have that
We can characterize the cases that are of Coxeter type as precisely those cases where equality holds in the previous theorem.
Theorem 1.3 (Theorem 4.6)
Suppose that $\mu $ is noncentral in every simple factor of the adjoint group ${\mathbf G}_{\text {ad}}$ over F. The following conditions are equivalent:
-
(1) The triple $({\mathbf G}, \mu , K)$ is of Coxeter type.
-
(2) We have that $\dim X(\mu , \tau )_K={\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ .
-
(3) For any admissible triple $(\xi , J, K')$ with $K' \supseteq K$ , we have that
$$\begin{align*}\langle \underline{\mu}, 2\rho\rangle \leqslant \sharp \{\sigma\text{-orbits of } K_{\xi}'\} + {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau}). \end{align*}$$
We list condition (3) in this theorem to indicate that we have a simple group-theoretic characterization of these cases which involves neither the dimension of $X(\mu , \tau )_K$ , nor the more subtle combinatorics of the admissible set. For the notation used here, we refer to Section 4. This condition allows us to classify all cases of Coxeter type (without using the classification results of [Reference Görtz and He13, Reference Görtz, He and Nie14]).
Theorem 1.4 (Theorem 4.6, Table 1)
Assume that ${\mathbf G}$ is quasi-simple over F and $\mu $ is noncentral in every $\breve F$ -simple component. Denote by $W_a$ the corresponding affine Weyl group, by $\underline {\mu }$ the image of a dominant representative of $\mu $ in the translation lattice of the Iwahori–Weyl group, and by $\sigma $ the automorphism of $W_a$ induced by Frobenius.
The property whether $({\mathbf G}, \mu , K)$ is of Coxeter type depends only on the tuple $(W_a, \sigma , \underline {\mu }, K)$ .
The quadruples $(W_a, \sigma , \underline {\mu }, K)$ of Coxeter type with K minimal are classified as follows (up to isomorphism; see Section 2.2 for the notation):
-
(i) $(\tilde A_{n-1}, \mathrm {id} , \omega ^{\vee }_1, \emptyset )$ ,
-
(ii) $(\tilde A_{n-1}, \varrho _{n-1}, \omega ^{\vee }_1, \emptyset )$ ,
-
(iii) $(\tilde A_{2 m}, \varsigma _0, \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0\})$ ,
$(\tilde A_{2 m+1}, \varsigma _0, \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0, s_{m+1}\})$ ,
-
(iv) $(\tilde A_{n-1}, \mathrm {id} , \omega ^{\vee }_1+\omega ^{\vee }_{n-1}, \tilde {\mathbb {S}} -\{s_0\})$ , $n \geqslant 3$ ,
$(\tilde A_{n-1} \times \tilde A_{n-1}, {}^1 \varsigma _0, (\omega _1^{\vee }, \omega _{n-1}^{\vee }), \sqcup _{i=1}^2(\tilde {\mathbb {S}} _i-\{0\}))$ ,
-
(v) $(\tilde B_n, \mathrm {id} , \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0, s_n\})$ ,
$(\tilde B_n, {\mathrm{Ad}}(\tau _1), \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_n\})$ ,
-
(vi) $(\tilde C_n, \mathrm {id} , \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0, s_n\})$ ,
-
(vii) $(\tilde D_n, \mathrm {id} , \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0, s_n\})$ ,
$(\tilde D_n, \varsigma _0, \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0\})$ ,
-
(viii) Exceptional cases:
$(\tilde A_1, \mathrm {id} , 2 \omega ^{\vee }_1, \emptyset )$ , $(\tilde A_3, \mathrm {id} , \omega ^{\vee }_2, \{s_1, s_2\})$ , $(\tilde A_3, \varsigma _0, \omega ^{\vee }_2, \tilde {\mathbb {S}} -\{s_0\})$ ,
$(\tilde C_2, \mathrm {id} , \omega ^{\vee }_2, \{s_0\})$ , $(\tilde C_2, {\mathrm{Ad}}(\tau _2), \omega ^{\vee }_2, \{s_0, s_2\})$ .
It is easy to see (Remark 2.5) that whenever $(W_a, \sigma , \underline {\mu }, K)$ is of Coxeter type and $K \subseteq K'$ , then $(W_a, \sigma , \underline {\mu }, K')$ is of Coxeter type. From the classification, we also obtain the following, quite surprising result, which does not seem to follow directly from the characterization above: in all cases, there is a unique (up to isomorphism) minimal set $K\subset \tilde {\mathbb {S}} $ such that $(W_a, \sigma , \underline {\mu }, K)$ is of Coxeter type. Note, however, that the situation is quite subtle: starting with a datum $({\mathbf G}, \mu )$ , the minimal sets K such that the triples $({\mathbf G}, \mu , K)$ are of Coxeter type are not unique (up to isomorphism; see the $C\text {-}BC_2$ cases in Table 3) because isomorphisms of the Dynkin diagram might not be automorphisms of the oriented affine Dynkin diagram.
In Section 5, we discuss consequences of our results for Rapoport–Zink spaces. For many of the pairs $({\mathbf G}, \mu )$ in our list, a stratification of the reduced special fiber by classical Deligne–Lusztig varieties has already been established. In most cases, these results deal with maximal parahoric level structure. The stratification is called the “Bruhat–Tits stratification” because its index set is related to a Bruhat–Tits building.
It is expected—and known in many cases—that one can identify the perfection of the special fiber with a generalized affine Deligne–Lusztig variety of the form $X(\mu , b)_K$ (see Section 5, in particular property ( $\diamondsuit $ )). This makes the connection with the group-theoretic results, and the Bruhat–Tits stratification on that side of the story (see Section 2.4).
Proposition 5.8 allows us to establish a stratification of the special fiber for many nonmaximal level structures in cases of Coxeter type, before passing to the perfection. Note that to prove this result, we need to know a priori that $X(\mu , \tau )_K$ has a Bruhat–Tits stratification.
1.1 Comparison with previous results
This article can be seen as a continuation of [Reference Görtz and He13] where a classification of all cases of Coxeter type was obtained under the following additional (and, a priori, quite restrictive) assumption: $K = \tilde {\mathbb {S}} -\{v\}$ is the complement of a single element of $\tilde {\mathbb {S}} $ (and, as above, is assumed to be preserved by $\sigma $ , i.e., v is a fix point of $\sigma $ ).
In this paper, we remove this restriction on K and obtain different more conceptual characterizations of the cases of Coxeter type.
In [Reference Görtz, He and Nie14], we introduced and studied the notion of fully Hodge–Newton decomposable pairs $({\mathbf G}, \mu )$ (see Section 2.3) and gave a classification of those cases. Note, however, that the classification results in the paper at hand do not make use of the classification of fully Hodge–Newton decomposable cases in [Reference Görtz, He and Nie14]. While in all fully Hodge–Newton decomposable cases, the space $X(\mu , b)_K$ (for basic b) has a stratification into classical Deligne–Lusztig varieties (called the weak Bruhat–Tits stratification in Section 2.4), this stratification has additional nice properties in the cases of Coxeter type.
1.2 Outline of the paper
We recall some preliminary notions and explain the general setting in Section 2. After recalling the method of Deligne–Lusztig reduction in Section 3.1, we prove the dimension formula characterizing Coxeter-type cases in Section 3 and prove the classification in Section 4. In Section 5, we discuss consequences for Rapoport–Zink spaces and relate our results to previous work on the side of Shimura varieties and Rapoport–Zink spaces. See Table 3 in Section 6.1 for a summary. In Section 7, we study the smoothness of the closures of strata.
2 Coxeter type
2.1 Setup
Let F be a non-archimedean local field, fix an algebraic closure $\overline {F}$ , denote by $\breve F$ the completion of its maximal unramified extension $F^{\mathrm{un}}\subset \overline {F}$ , and denote by $\sigma $ the Frobenius automorphism of $\breve F$ over F. We usually think of F being of mixed characteristic, i.e., F is a finite extension of $\mathbb Q_p$ . Everything has an equal-characteristic counterpart, however, where F is of the form $\mathbb F_q((t))$ , the Laurent series field over a finite field $\mathbb F_q$ . In either case, we denote by p the residue characteristic of F.
We fix a connected reductive group ${\mathbf G}$ over F. Write $\breve {G} = {\mathbf G}(\breve F)$ . Let $\breve {\mathcal I} \subseteq \breve {G}$ be a $\sigma $ -invariant Iwahori subgroup, our standard Iwahori subgroup, and let T be a maximal torus of G such that the alcove $\mathfrak a$ corresponding to $\breve {\mathcal I}$ , the standard alcove in the Bruhat–Tits building of ${\mathbf G}$ over $\breve F$ , lies in the apartment attached to $T_{\breve F}$ . Attached to these data, we have the extended affine Weyl group $\tilde W$ and the (relative) finite Weyl group $W_0$ . We fix a special vertex of the base alcove and obtain a splitting $\tilde W = X_* \rtimes W_0$ , where $X_* := X_*(T)_{\Gamma _0}$ denotes the coinvariants of the cocharacter lattice of T with respect to $\Gamma _0 = \mathop {\mathrm{Gal}}(\overline {F}/F^{\mathrm{un}})$ . See [Reference Tits, Borel and Casselman55] and [Reference Görtz, He and Rapoport15, Section 2].
We denote by $\tilde {\mathbb {S}} $ the set of simple affine reflections (defined by our base alcove) inside the affine Weyl group $W_a\subseteq \tilde W$ and denote by $\Omega $ the set of length-zero elements in $\tilde W$ . The Frobenius $\sigma $ acts on $\tilde {\mathbb {S}} $ (since by assumption the base alcove is fixed by $\sigma $ ). Likewise, if $\tau \in \tilde W$ has length $0$ , then it fixes the base alcove and thus acts by conjugation on $\tilde {\mathbb {S}} $ ; we denote this action by ${\mathrm{Ad}}(\tau )$ . For an element $w = w' \tau \in W_a\tau $ , the $\sigma $ -support $\operatorname {\mathrm{supp}}_{\sigma }(w)$ is the smallest subset of $\tilde {\mathbb {S}} $ , which is ${\mathrm{Ad}}(\tau )\circ \sigma $ -stable and contains all simple affine reflections that occur in a reduced expression for $w'$ . The final condition can also be rephrased as $w'\in W_{\operatorname {\mathrm{supp}}_{\sigma }(w)}$ , where for a subset $K\subseteq \tilde {\mathbb {S}} $ we write $W_K$ for the subgroup of $W_a$ generated by the elements of K. See Example 2.10 for explicit examples. We denote by ${}^K \tilde W$ the set of minimal length representatives of the cosets in $W_K\backslash \tilde W$ .
For $b\in \breve {G}$ , we denote by ${\mathbf J}_b$ the $\sigma $ -centralizer of b, i.e.,
If b is understood, we just write ${\mathbf J}$ instead of ${\mathbf J}_b$ .
Below, we always work with the unique reduced root system $\Phi $ underlying the relative root system of ${\mathbf G}$ over $\breve F$ (the échelonnage root system).
2.2 (Enhanced) Tits data and Coxeter data
To specify the classification results below, two types of data typically arise. On the one hand, we will refer to affine Weyl groups together with an automorphism and a coweight; these kinds of data we will call Coxeter data. On the other hand, we will refer to algebraic groups over F together with a conjugacy class of cocharacters; this is what we will call Tits data below (using Tits’s translation between isomorphism classes of such data with affine Dynkin diagrams). In both cases, we often enhance these data by including a “level structure,” i.e., a subset $K\subset \tilde {\mathbb {S}} $ with $W_K$ finite, of the set of simple affine reflections. In the group case, this gives rise to a standard parahoric subgroup. Our level structure will always be assumed to be rational, i.e., K is fixed by the automorphism $\sigma $ (the automorphism induced by the Frobenius over F in the group case).
We hope that no confusion will arise between the notions of Coxeter datum and of being of Coxeter type, the latter being a property that certain Coxeter data have, and others do not—similarly as some elements of a Coxeter group are Coxeter elements.
Definition 2.1 (Cf. [Reference He, Pappas and Rapoport26] and [Reference Görtz, He and Rapoport15, Section 2.6])
-
(1) A Coxeter datum (over F) is a tuple $((W_a, \tilde {\mathbb {S}} ), \sigma , \lambda )$ consisting of an affine Coxeter system, a length-preserving automorphism $\sigma $ , and a $W_0$ -conjugacy class $\lambda $ in $X_*$ , the coweight lattice. Here, $W_0$ denotes the finite Weyl group of the given affine Coxeter system. An enhanced Coxeter datum is a tuple $((W_a, \tilde {\mathbb {S}} ), \sigma , \lambda , K)$ whose first three entries constitute a Coxeter datum and where $K\subsetneq \tilde {\mathbb {S}} $ is a subset with $\sigma (K)=K$ . Below, we often just write $W_a$ instead of $(W_a, \tilde {\mathbb {S}} )$ , or we replace this item by the corresponding affine Dynkin type.
-
(2) A Tits datum (over F) is a tuple $(\tilde {\Delta }, \sigma , \lambda )$ consisting of an absolute affine Dynkin diagram (cf. [Reference Tits, Borel and Casselman55]), a diagram automorphism $\sigma $ , and a $W_0$ -conjugacy class $\lambda $ in the coweight lattice $X_*$ . An enhanced Tits datum is a tuple $(\tilde {\Delta }, \sigma , \lambda , K)$ whose first three entries constitute a Tits datum and where K is a type of rational parahoric subgroups in the corresponding group.
It is worth pointing out that the Coxeter diagram associated with an affine Dynkin diagram is obtained by disregarding the arrows in the affine Dynkin diagram. We refer to [Reference He, Pappas and Rapoport26, Section 5.2] for the discussion on the relationship and difference between the (enhanced) Coxeter data and the (enhanced) Tits data.
2.2.1. Notation for automorphisms of Dynkin diagrams
We use the same labeling of the Coxeter graph as in [Reference Bourbaki5, Plates I–X]. As in [Reference Görtz, He and Rapoport15], we use the following notation for automorphisms of affine Dynkin diagrams. In case the fundamental coweight $\omega ^{\vee }_i$ is minuscule, we denote the corresponding length $0$ element $\tau (t^{\omega ^{\vee }_i})$ by $\tau _i$ ; conjugation by $\tau _i$ is a length preserving automorphism of $\tilde W$ which we denote by ${\mathrm{Ad}}(\tau _i)$ . For type $A_n$ , the automorphism ${\mathrm{Ad}}(\tau _i)$ is the rotation of the affine Dynkin diagram by i steps (i.e., the simple reflection $s_0$ is mapped to $s_i$ , $s_1$ is mapped to $s_{i+1}$ , and so on), and we also denote it by $\varrho _i$ . We write $\varsigma _0$ for the automorphism which fixes the vertex $0$ , and is the unique nontrivial diagram automorphism of the finite Dynkin diagram, if $W_0$ is of type $A_n, D_n$ (with $n \geqslant 5$ ) or $E_6$ . For type $D_4$ , we also denote by $\varsigma _0$ the diagram automorphism which interchanges $\alpha _3$ and $\alpha _4$ .
For the product $\tilde {A}_{n-1}\times \tilde {A}_{n-1}$ , we denote by ${}^1\varsigma _0$ the automorphism which switches the two factors.
2.3 Fully Hodge–Newton decomposable pairs $({\mathbf G}, \mu )$
We now fix a conjugacy class $\mu $ of cocharacters ${\mathbf G}_{m, \overline {F}}\rightarrow {\mathbf G}_{\overline {F}}$ over the algebraic closure $\overline {F}$ of F. We fix the representative $\mu _+\in X_*(T)$ of this conjugacy class whose image $\underline {\mu }$ in the coweight lattice $X_* = X_*(T)_{\Gamma _0}$ , i.e., the translation lattice of the Iwahori–Weyl group, is dominant (i.e., translates the base alcove into the dominant chamber). We use here the Bruhat–Tits convention that for $\lambda \in X_*(T)$ , the element $\lambda (\pi )$ for a uniformizer $\pi $ acts by translation by $-\lambda $ . Cf. [Reference Görtz, He and Rapoport15, Section 2.2].
We also fix a length $0$ element $\tau \in \tilde W$ whose $\sigma $ -conjugacy class is the unique basic element in $B({\mathbf G}, \mu )$ . By abusing the notation, we also use $\tau $ for a chosen representative in $\breve G$ .
Denote by $X_w(b)$ the affine Deligne–Lusztig variety for $w\in \tilde W$ and $b\in \breve {G}$ , a subvariety of the affine flag variety for ${\mathbf G}$ , $X_w(b) = \{g\breve {\mathcal I}\in \breve {G}/\breve {\mathcal I};\ g^{-1}b\sigma (g)\in \breve {\mathcal I} w\breve {\mathcal I}\}$ . Here, we view $\breve {G}/\breve {\mathcal I}$ as the $\mathbf k$ -valued points of the affine flag variety, where $\mathbf k$ is the residue class field of the ring of integers of $\breve F$ . Depending on whether F has characteristic $> 0$ or characteristic $0$ , the affine flag variety is an ind-scheme, or an ind-(perfect scheme), over $\mathbf k$ (see [Reference Pappas and Rapoport40] and [Reference Bhatt and Scholze3, Reference Zhu64], respectively). All affine Deligne–Lusztig varieties are equipped with the reduced scheme structure, so that we can identify them with their $\mathbf k$ -valued points.
Let $\pi = \pi _K\colon \breve {G}/\breve {\mathcal I} \rightarrow \breve {G}/\breve {\mathcal K}$ denote the projection from the affine flag variety to the partial affine flag variety of level K ( $\breve {\mathcal K}$ denotes the standard parahoric subgroup of type K). Recall that
denotes the $\mu $ -admissible set. By definition,
We write ${}^{K}\!{\mathrm{Adm}}(\mu ) = \operatorname {\mathrm{Adm}}(\mu )\cap {}^K\tilde W$ for the subset of $\operatorname {\mathrm{Adm}}(\mu )$ consisting of all elements w which are of minimal length in their right $W_K$ -coset $W_K w$ . Below, we sometimes write $X_{K, w}(\tau ) = \pi (X_w(\tau ))$ .
It is shown in [Reference Görtz and He13, Reference He24] that
The subsets $\pi (X_w(\tau ))$ are called the EKOR strata of $X(\mu , \tau )_K$ .
Note that if $W_{\operatorname {\mathrm{supp}}_{\sigma }(w)}$ is finite, then $X_w(\tau ) \ne \emptyset $ (see [Reference He23, Lemma 3.2]). Let us recall the following characterization of fully Hodge–Newton decomposable pairs $({\mathbf G}, \mu )$ (see [Reference Görtz, He and Nie14, Definition 3.1]).
Theorem 2.2 ([Reference Görtz, He and Nie14, Theorem B])
The pair $({\mathbf G}, \mu )$ is fully Hodge–Newton decomposable, if the following equivalent conditions are satisfied:
-
(1) The coweight $\mu $ is minute [Reference Görtz, He and Nie14, Definition 3.2].
-
(2) For every $w \in {}^{K}\!{\mathrm{Adm}}(\mu )$ with $X_w(\tau ) \ne \emptyset $ , we have that $W_{\operatorname {\mathrm{supp}}_{\sigma }(w)}$ is finite.
Note that this property, as shown by condition (1), is independent of K, and depends only on the Coxeter datum $(W_a, \sigma , \underline {\mu })$ . Set
Then, in the fully Hodge–Newton decomposable cases, we may rewrite (2.1) as
Definition 2.3 We say an element $w \in W_a \tau $ is a $\sigma $ -Coxeter element (or is a twisted Coxeter element) if from each ${\mathrm{Ad}}(\tau ) \circ \sigma $ -orbit on $\tilde {\mathbb {S}} $ at most one simple reflection appears in some (or equivalently, any) reduced expression of $w \tau ^{-1}$ .
Note that we deviate here from the usual definition of (twisted) Coxeter elements where one asks that exactly one simple reflection from each orbit occurs. It would be more precise to say that w is a $\sigma $ -Coxeter element in $W_{\operatorname {\mathrm{supp}}_{\sigma }(w)} \tau $ , but for our purposes, it is useful to shorten this as in the above definition. See Example 2.10 for examples.
We denote by ${}^K\!{\mathrm{Cox}}(\mu ) \subset {}^{K}\!{\mathrm{Adm}}(\mu )_0$ the subset of ${}^{K}\!{\mathrm{Adm}}(\mu )_0$ consisting of $\sigma $ -Coxeter elements.Footnote 1 If $K=\emptyset $ , then we may simply omit the superscript and write $\operatorname {\mathrm{Adm}}(\mu )_0$ and $\mathop {\mathrm{Cox}}(\mu )$ . We define the following notion.
Definition 2.4 We say that the triple $({\mathbf G}, \mu , K)$ or the corresponding quadruple $(W_a, \sigma , \mu , K)$ is of Coxeter type if $(W_a, \sigma , \mu )$ is fully Hodge–Newton decomposable and ${}^K\!{\mathrm{Cox}}(\mu )={}^{K}\!{\mathrm{Adm}}(\mu )_0$ .
Observe that in the definition, $(W_a, \sigma , \mu )$ being fully Hodge–Newton decomposable implies in particular that $W_{\operatorname {\mathrm{supp}}_{\sigma }(w)}$ is finite for all $w\in {}^{K}\!{\mathrm{Adm}}(\mu )_0$ . Therefore, the above is the same definition as the one given in the introduction. Furthermore, this definition is equivalent to the one in [Reference Görtz and He13, Section 5.1]. In fact, the definition in loc. cit. is easily seen to be equivalent to saying that
The equivalence of the two definitions then follows from (2) in Theorem 2.2.
Remark 2.5 Let $K \subset K'$ be proper $\sigma $ -stable subsets of $\tilde {\mathbb {S}} $ . If $(W_a, \sigma , \mu , K)$ is of Coxeter type, then ${}^K\!{\mathrm{Cox}}(\mu )={}^{K}\!{\mathrm{Adm}}(\mu )_0$ and
Hence, $(W_a, \sigma , \mu , K')$ is of Coxeter type.
2.4 The Bruhat–Tits stratification
Let $({\mathbf G}, \mu )$ be fully Hodge–Newton decomposable. As pointed out above, we have
where as before $\pi \colon \breve {G}/\breve {\mathcal I} \rightarrow \breve {G}/\breve {\mathcal K}$ denotes the projection from the full affine flag variety to the partial affine flag variety of type K.
For each $w\in {}^{K}\!{\mathrm{Adm}}(\mu )_0$ , we define the following standard parahoric subgroups of $\breve G$ . Let $\mathcal P^{\flat }_w$ be the standard parahoric subgroup generated by $\operatorname {\mathrm{supp}}_{\sigma }(w)$ and the standard Iwahori subgroup $\breve {\mathcal I}$ . Let $\mathcal P_w$ be the standard parahoric generated by $\operatorname {\mathrm{supp}}_{\sigma }(w)$ and $I(K, w, \sigma )$ and $\breve {\mathcal I}$ . Here, $I(K, w, \sigma )$ is the maximal ${\mathrm{Ad}}(w)\circ \sigma $ -stable subset of K. Then [Reference Görtz, He and Nie14, Proposition 5.7] shows that $W_{\operatorname {\mathrm{supp}}_{\sigma }(w) \cup I(K, w, \sigma )}$ is finite. Finally, let $\breve {\mathcal K}\subset \breve {G}$ be the standard parahoric subgroup of type K, and let $\mathcal Q_w := \mathcal P_w^{\flat } \cap \breve {\mathcal K}$ be the intersection.
We then have
where
Here, $\cdot _{\sigma }$ means that the group on the left acts by $\sigma $ -conjugation, i.e., we have $\mathcal Q_w \cdot _{\sigma } (\breve {\mathcal I} w \breve {\mathcal I})=\{g^{-1} h \sigma (g);\ g\in \mathcal Q_w, h \in \breve {\mathcal I} w \breve {\mathcal I}\}$ . We can identify $\mathcal P_w^{\flat } / \mathcal Q_w$ with a classical partial flag variety for the maximal reductive quotient of the special fiber of the parahoric group scheme attached to $\mathcal P_w^{\flat }$ . The variety $Y(w)$ is a “fine Deligne–Lusztig variety” in that partial flag variety, i.e., the image of a classical Deligne–Lusztig variety in the corresponding full flag variety. See Example 2.10 for examples.
More precisely, in the mixed characteristic case, $Y(w)$ is the perfection of a classical Deligne–Lusztig variety; even though we sometimes drop the adjective perfect for notational convenience, we do not have an actual scheme structure on $X(\mu , \tau )_K$ and therefore cannot talk about the strata as usual schemes, if F has mixed characteristic.
If w is a twisted Coxeter element, then we have the following simple description of the set $I(K, w, \sigma )$ , which appears in the definition of $\mathcal P_w$ .
Lemma 2.6 Suppose that w is a $\sigma $ -Coxeter element in $W_{\operatorname {\mathrm{supp}}_{\sigma }(w)}$ , i.e., from each ${\mathrm{Ad}}(\tau )\circ \sigma $ -orbit on $\operatorname {\mathrm{supp}}_{\sigma }(w)$ , at most (equivalently: exactly) one simple reflection occurs in any reduced expression of w. Then $I(K,w, \sigma )$ is the set of all $s\in \tilde {\mathbb {S}} - \operatorname {\mathrm{supp}}_{\sigma }(w)$ with the following two properties:
-
(a) s commutes with every element of $\operatorname {\mathrm{supp}}_{\sigma }(w)$ .
-
(b) The ${\mathrm{Ad}}(\tau )\circ \sigma $ -orbit of s is contained in K.
Proof It is shown in [Reference Görtz and He13, Lemma 4.6.1] that every element $s\in I(K, w, \sigma )$ commutes with all elements of $\operatorname {\mathrm{supp}}_{\sigma }(w)$ , and that $I(K, w, \sigma )\cap \operatorname {\mathrm{supp}}_{\sigma }(w) = \emptyset $ . It remains to show that $I(K,w, \sigma )$ is ${\mathrm{Ad}}(\tau )\circ \sigma $ -stable, and that every element of $\tilde {\mathbb {S}} - \operatorname {\mathrm{supp}}_{\sigma }(w)$ which satisfies (a) and (b) lies in $I(K, w, \sigma )$ .
By definition, $I(K, w, \sigma )$ is ${\mathrm{Ad}}(w)\circ \sigma $ -stable. It is also ${\mathrm{Ad}}(w\tau ^{-1})$ -stable by property (a). It follows that $I(K,w, \sigma )$ is ${\mathrm{Ad}}(\tau )\circ \sigma $ -stable.
Now, let $s\in \tilde {\mathbb {S}} - \operatorname {\mathrm{supp}}_{\sigma }(w)$ such that (a) and (b) hold. We need to show that $({\mathrm{Ad}}(w)\circ \sigma )^i (s) \in K$ for all $i\geqslant 1$ . However, (a) ensures that ${\mathrm{Ad}}(w\tau ^{-1})^{-1}(s) = s$ , so ${\mathrm{Ad}}(w)\circ \sigma (s) = {\mathrm{Ad}}(\tau )\circ \sigma (s)$ , and this is an element of K by (b). Since ${\mathrm{Ad}}(\tau )\circ \sigma (s)$ again satisfies (a) and (b), we can apply induction, and the lemma follows.
In terms of the Dynkin diagram, we can express this as saying that $I(K, w, \sigma )$ is the union of those ${\mathrm{Ad}}(\tau )\circ \sigma $ -orbits in K in which no element is connected to any vertex in $\operatorname {\mathrm{supp}}_{\sigma }(w)$ .
In particular, this implies that the projection $\pi $ restricts to an isomorphism from $\{ g\breve {\mathcal I};\ g\in {\mathcal P}^{\flat }_w, g^{-1}\tau \sigma (g)\in \breve {\mathcal I} w\breve {\mathcal I}\}$ onto its image $Y(w)$ . So $Y(w)$ is isomorphic to the classical Deligne–Lusztig variety attached to $w\tau ^{-1}$ in the (finite-dimensional) flag variety $\mathcal P^{\flat }_w/\breve {\mathcal I}$ , for the Frobenius given by ${\mathrm{Ad}}(\tau )\circ \sigma $ .
See also [Reference Görtz and He13, Corollary 4.6.2, Section 7.2] (cf. also [Reference Görtz, He and Nie14, Section 5.10] for further details).
Putting together these stratifications for all the different w, we obtain the decomposition of $X(\mu , \tau )_K$ as a union of classical Deligne–Lusztig varieties “in a natural way.” We call this stratification the weak Bruhat–Tits stratification.
The closure of the stratum $Y(w)$ (and likewise of $jY(w)$ ) in $X(\mu , \tau )_K$ is isomorphic to (the perfection of) the closure of $Y(w)$ inside $\mathcal P_w^{\flat } / (\mathcal P_w^{\flat } \cap \breve {\mathcal K})$ , because $\mathcal P_w^{\flat } / (\mathcal P_w^{\flat } \cap \breve {\mathcal K})$ is proper and hence closed in $\breve {G}/\breve {\mathcal K}$ . Even if w is a twisted Coxeter element, then this closure is typically not isomorphic to the closure of $Y(w)$ inside $\mathcal P_w^{\flat } /\breve {\mathcal I}$ (see Example 2.7).
Example 2.7 Let us illustrate by an example that the closure of $Y(w)$ may (and often will) differ depending on whether it is taken in the full or partial flag variety. This phenomenon occurs already on the level of finite-dimensional (partial) flag varieties. For instance, consider the Deligne–Lusztig variety X in the flag variety for $GL_n$ , $n\geqslant 3$ , over a finite field k attached to the Coxeter element $w = s_1s_2\cdots s_{n-1}$ . Cf. [Reference Deligne and Lusztig8, Section 2.2]. Its projection to the projective space $\mathbb P^{n-1}_k$ of lines in $k^n$ is the Drinfeld space (over the finite field k), i.e., the complement in $\mathbb P^{n-1}_k$ of all k-rational hyperplanes. In particular, the closure of the image of X in $\mathbb P^{n-1}_k$ is the whole projective space $\mathbb P^{n-1}_k$ . On the other hand, the closure of X in the full flag variety is the union of all Deligne–Lusztig varieties attached to elements $w'\leqslant w$ . It is easy to check that this closure does not project isomorphically onto $\mathbb P^{n-1}_k$ . For example, for $i\geqslant 2$ , the one-dimensional Deligne–Lusztig variety $X(s_i)$ is mapped onto a finite set (namely the set of k-rational points of $\mathbb P^{n-1}_k$ ).
The closure relations between strata are given as follows (cf. [Reference Görtz and He13, Sections 3.3 and 7]). The closure of a stratum is a union of strata. The closure of a stratum $j Y(w)$ contains a stratum $j' Y(w')$ if and only if:
-
(1) $w' \leqslant _{K, \sigma }w$ , which means by definition that there exists $u\in W_K$ such that $u^{-1} w' \sigma (u)\leqslant w$ , and
-
(2) $j'({\mathbf J}(F)\cap \mathcal P_{w'}) \cap j({\mathbf J}(F)\cap \mathcal P_w) \ne \emptyset $ .
We can express the second condition in an equivalent way in terms of the building, as follows (see [Reference Görtz and He13, Erratum, Proposition 7.2.2]). Let $\breve {\kappa }$ be the Kottwitz homomorphism (see Section 5.1). We identify the set
with the set of simplices of type w, i.e., with stabilizer $\mathcal P_w\cap {\mathbf J}(F)$ , in the rational building of ${\mathbf J}$ , and similarly for $w'$ . Then (2) above is equivalent to requiring that $\breve {\kappa }(j') = \breve {\kappa }(j)$ and that the simplices attached to j and $j'$ via the above identification are contained in the closure of some alcove.
If moreover $({\mathbf G}, \mu , K)$ is of Coxeter type, then this stratification has further nice properties.
Proposition 2.8 Let $({\mathbf G}, \mu , K)$ be of Coxeter type. Let $w, w' \in {}^{K}\!{\mathrm{Adm}}(\mu )_0$ . The following are equivalent:
-
(1) $w' \leqslant w$ (where $\leqslant $ denotes the Bruhat order).
-
(2) $w' \leqslant _{K, \sigma } w$ (where $\leqslant _{K, \sigma }$ is the partial order arising in the above description of the closure relations between strata).
-
(3) The inclusion $\operatorname {\mathrm{supp}}(w'\tau ^{-1}) \subseteq \operatorname {\mathrm{supp}}(w\tau ^{-1})$ holds.
-
(4) The inclusion $\operatorname {\mathrm{supp}}_{\sigma }(w') \subseteq \operatorname {\mathrm{supp}}_{\sigma }(w)$ holds.
In particular, for $w, w' \in {}^K\!{\mathrm{Cox}}(\mu )$ with $w \ne w'$ , we have $\operatorname {\mathrm{supp}}_{\sigma }(w') \neq \operatorname {\mathrm{supp}}_{\sigma }(w)$ .
Note that by definition, we automatically have $(1) \Rightarrow (2) \Rightarrow (4)$ and $(1) \Rightarrow (3) \Rightarrow (4)$ . The nontrivial part is $(4) \Rightarrow (1)$ , which follows by analyzing all Coxeter-type cases in Section 4 and the explicit description of ${}^K\!{\mathrm{Cox}}(\mu )$ . See Section 6.1 for some detailed discussion for the Drinfeld case.
We have the following consequence.
Corollary 2.9 Let $({\mathbf G}, \mu , K)$ be of Coxeter type. The set $\operatorname {\mathrm{supp}}_{\sigma }(w) \cup I(K, w, \sigma )$ determines the element $w\in {}^K\!{\mathrm{Cox}}(\mu )$ . Hence, for $w, w' \in {}^K\!{\mathrm{Cox}}(\mu )$ with $w \ne w'$ , we have ${\mathbf J}(F)\cap \mathcal P_w \ne {\mathbf J}(F)\cap \mathcal P_{w'}$ .
Proof By the proposition above, w is determined by $\operatorname {\mathrm{supp}}_{\sigma }(w)$ . So it is enough to show that we can recover $\operatorname {\mathrm{supp}}_{\sigma }(w)$ from the union $\operatorname {\mathrm{supp}}_{\sigma }(w) \cup I(K, w, \sigma )$ . Since $w\in {}^K\tilde W$ , every connected component of $\operatorname {\mathrm{supp}}_{\sigma }(w)$ must meet $\tilde {\mathbb {S}} -K$ . On the other hand, by definition, we have $I(K, w, \sigma )\subseteq K$ . Therefore, the $\sigma $ -support of w consists exactly of those connected components of the union $\operatorname {\mathrm{supp}}_{\sigma }(w) \cup I(K, w, \sigma )$ , which intersect $\tilde {\mathbb {S}} -K$ .
By Corollary 2.9, if $({\mathbf G}, \mu , K)$ is of Coxeter type, then the index set
of the weak Bruhat–Tits (BT) stratification can be seen as a subset of the set of all simplices in the Bruhat–Tits building of ${\mathbf J}$ over F (up to fixing the connected component, i.e., the image under $\breve \kappa $ ). In view of these particularly favorable properties, we call the resulting stratification the Bruhat–Tits stratification of $X(\mu , \tau )_K$ . See Example 2.10 for a discussion in a specific case.
Example 2.10 Let us illustrate some of the notions we introduced in the specific case of Tits datum $({}^2\tilde {A}^{\prime }_{n-1}, \omega _1^{\vee })$ , i.e., for ${\mathbf G}$ (say over $F=\mathbb Q_p$ to be specific) a unitary group which splits over an unramified quadratic extension and $\underline {\mu } = \omega _1^{\vee }$ . This case arises from Shimura varieties for unitary groups $GU(1, n-1)$ at primes p where the group splits over an unramified quadratic extension of $\mathbb Q_p$ (but does not split over $\mathbb Q_p$ ), and is the case that was studied by Vollaard and Wedhorn in [Reference Vollaard and Wedhorn56] for hyperspecial level structure.
The corresponding Coxeter datum is $(\tilde {A}_{n-1}, \varsigma _0, \omega _1^{\vee })$ , i.e., the affine Dynkin diagram is a circle with vertices $0$ , $1$ , …, $n-1$ , where the automorphism $\sigma = \varsigma _0$ is the “reflection” fixing $0$ and exchanging $1$ and $n-1$ , $2$ and $n-2$ , and so on. The action by ${\mathrm{Ad}}(\tau )$ is the rotation $0\mapsto 1\mapsto 2\mapsto \cdots $ . Hence, ${\mathrm{Ad}}(\tau )\circ \sigma $ has order $2$ and maps
Depending on whether n is even or odd, ${\mathrm{Ad}}(\tau )\circ \sigma $ has no fix point, or the one fix point $\frac {n+1}{2}$ .
An element $w\in W_a\tau $ is a twisted Coxeter element in the sense of Definition 2.3 if and only if at most one simple reflection from each pair $\{0, 1\}$ , $\{n-1, 2\}$ , …, occurs in a reduced expression of $w\tau ^{-1}$ .
Now, let us come back to the case of general n and consider the case of hyperspecial level structure, more specifically we set $K=\tilde {\mathbb {S}} - \{0\}$ . (If n is odd, then $\tilde {\mathbb {S}} - \{ (n+1)/2 \}$ is another choice of K that corresponds to hyperspecial level structure.) We then have
For $w = s_0 s_{n-1}\cdots s_j\tau \in {}^{K}\!{\mathrm{Adm}}(\mu )_0$ , we have
(where we write i instead of $s_i$ ). Together with the restriction of the automorphism ${\mathrm{Ad}}(\tau )\circ \sigma $ , this is the Dynkin diagram of a unitary group over the finite field $\mathbb F_p$ . Denote by $\overline {G}$ its base change to an algebraic closure of $\mathbb F_p$ . Let $\overline {B}\subset \overline {Q}\subset \overline {G}$ denote the Borel group of $\overline {G}$ induced by the Borel of ${\mathbf G}$ and the maximal parabolic subgroup corresponding to the vertex $0$ (the complement of K in $\tilde {\mathbb {S}} $ ), viewed as a vertex of the Dynkin diagram of $\overline {G}$ . The Deligne–Lusztig variety $Y(w)$ is isomorphic to a Deligne–Lusztig variety in the “flag variety” $\overline {G}/\overline {B}$ , and also isomorphic to its image (a “fine” Deligne–Lusztig variety) in the “Grassmannian” $\overline {G}/\overline {Q}$ . The dimension of $Y(w)$ equals the length $\ell (w)$ of w, in particular, in this case all strata of a fixed dimension are translates of the same Deligne–Lusztig variety.
Using the lattice interpretation of the Bruhat–Tits building of ${\mathbf J}$ , we can describe the index set of the Bruhat–Tits stratification in terms of vertex lattices as in [Reference Vollaard and Wedhorn56]. We then recover precisely the result of Theorem B of loc. cit., after perfection.
If n is odd, then we can also consider the parahoric level structure given by $K= \tilde {\mathbb {S}} - \{ 0, \frac {n+1}{2} \}$ . The corresponding parahoric subgroup is not maximal, and a fortiori not hyperspecial. Since K is smaller than in the previous case, the set ${}^{K}\!{\mathrm{Adm}}(\mu )_0$ is larger than before (see Table 1). In particular, there exist different elements in ${}^{K}\!{\mathrm{Adm}}(\mu )_0$ of the same length. See Sections 6.1 and 6.1.
3 Some dimension formulas
We first prove (in Section 3.2) the following inequality on the dimension of affine Deligne–Lusztig varieties. As before, $\tau $ is a fixed representative of a length $0$ element in $\tilde W$ whose $\sigma $ -conjugacy class is the basic element in $B({\mathbf G}, \mu )$ .
Proposition 3.1 Let $w \in W_a \omega $ with $\omega \in \Omega $ such that $X_w(\tau ) \neq \emptyset $ . Then
3.1 Deligne–Lusztig reduction
We first recall the Deligne–Lusztig reduction method.
Let $x, x' \in \tilde W$ and $s \in \tilde {\mathbb {S}} $ . We write $x {\xrightarrow {s}}_{\sigma } x'$ if $x' = s x \sigma (s)$ and $\ell (x') \leqslant \ell (x)$ . We write $x \rightarrow _{\sigma } x'$ if there exists a sequence $x_0, x_1, \dots , x_r$ in $\tilde W$ and a sequence $s_1, s_2, \dots , s_r$ in $\tilde {\mathbb {S}} $ such that $x = x_0 {\xrightarrow {s_1}}_{\sigma } x_1 {\xrightarrow {s_2}}_{\sigma } \cdots {\xrightarrow {s_r}}_{\sigma } x_r = x'$ . We write $x \approx _{\sigma } x'$ if $x \rightarrow _{\sigma } x'$ and $x' \rightarrow _{\sigma } x$ .
Theorem 3.2 ([Reference He and Nie25])
For each $x \in \tilde W$ , there exists an element $y \in \tilde W$ which is of minimal length inside its $\sigma $ -conjugacy class such that $x \rightarrow _{\sigma } y$ .
The following theorem, which is referred to as the reduction à la Deligne and Lusztig, is proved in [Reference Deligne and Lusztig8, Proof of Theorem 1.6] (parts (i) and (ii)) and [Reference He23, Theorem 4.8] (see also [Reference Görtz and He12, Corollary 2.5.3]).
Theorem 3.3 Let $b \in \breve G$ . Let $x, x' \in \tilde W$ such that $x {\xrightarrow {s}}_{\sigma } x'$ for some $s \in \tilde {\mathbb {S}} $ .
-
(i) If $\ell (x) = \ell (x')$ , then $\dim X_x(b) = \dim X_{x'}(b)$ .
-
(ii) If $\ell (x)> \ell (x')$ , then $\dim X_x(b) = 1 + \max \{\dim X_{x'}(b), \dim X_{s x}(b)\}$ .
-
(iii) If x is of minimal length in its $\sigma $ -conjugacy class, then $X_x(b) \neq \emptyset $ if and only if $[x]=[b]$ , in which case, $\dim X_x(b) = \ell (x) - \langle \bar {\nu }_b, 2\rho \rangle $ , where $\bar {\nu }_b$ denotes the Newton vector of b.
3.2 Proof of Proposition 3.1
We argue by induction on the length of w. If w is of minimal length in its $\sigma $ -conjugacy class, by Theorem 3.3, $\dim X_w(\tau ) = \ell (w)$ and the statement follows. Otherwise, by Theorem 3.2, there exist $u \in \tilde W$ and $s \in \tilde {\mathbb {S}} $ such that $w \approx _{\sigma } u$ and $s u \sigma (s) < u$ . Thus, $\operatorname {\mathrm{supp}}_{\sigma }(u) = \operatorname {\mathrm{supp}}_{\sigma }(w)$ and
Moreover, $\dim X_w(\tau ) = \dim X_u(\tau )$ and either $X_{s u \sigma (s)}(\tau ) \neq \emptyset $ or $X_{s u}(\tau ) \neq \emptyset $ . Let us assume that the former case occurs; the proof in the other case is basically the same. By induction hypothesis,
3.3 A general dimension bound
For any reductive group $\mathbf H$ over F, we denote by ${\mathrm{rank}_F^{\mathrm{ss}}}(\mathbf H)$ the semisimple F-rank of $\mathbf H$ . By [Reference Kottwitz34, Section 1.9], if our group ${\mathbf G}$ is quasi-simple over F, then ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau }) = \sharp \{({\mathrm{Ad}}(\tau ) \circ \sigma )\text {-orbits on } \tilde {\mathbb {S}} \} - 1$ .
Corollary 3.4 Let $w \in \tilde W$ with $\operatorname {\mathrm{supp}}_{\sigma }(w) = \tilde {\mathbb {S}} $ and $X_w(\tau ) \neq \emptyset $ . Then $\dim X_w(\tau )> {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ .
Proof Let $\omega \in \Omega $ such that $w \in W_a \omega $ . As $X_w(\tau ) \neq \emptyset $ , there exists $\epsilon \in \Omega $ such that $\epsilon ^{-1} \omega \sigma (\epsilon ) = \tau $ . This implies that ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\omega }) = {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ . The result thus follows from Proposition 3.1.
Now, we prove the main result of this section.
Theorem 3.5 Suppose that $\mu $ is noncentral in every simple factor of the adjoint group ${\mathbf G}_{\text {ad}}$ over F. Then
If moreover the equality holds, then $({\mathbf G}, \mu )$ is fully Hodge–Newton decomposable.
Proof We may reduce to the case where ${\mathbf G}$ is quasi-simple over F and that it is semisimple of adjoint type. Under this assumption, we have
where each ${\mathbf G}_i$ is a simple reductive group over $\breve F$ , and $\sigma ({\mathbf G}_j) = {\mathbf G}_{j+1}$ for all j. Here, we set ${\mathbf G}_{r+1} = {\mathbf G}_1$ . Write $\tilde {\mathbb {S}} = \tilde {\mathbb {S}} _1 \sqcup \cdots \sqcup \tilde {\mathbb {S}} _r$ and $\underline {\mu } = (\underline {\mu }_1, \dots , \underline {\mu }_r)$ with respect to the decomposition above. For $J \subseteq \tilde {\mathbb {S}} $ , we set $J_j = J \cap \tilde {\mathbb {S}} _j$ . We may write $\rho $ as $\rho =\rho _1+\cdots +\rho _r$ , where $\rho _i$ is the half sum of positive roots corresponding to the root system associated with $\tilde {\mathbb {S}} _i$ . We also have that $\Omega =\Omega _1 \times \cdots \times \Omega _r$ . We write $\tau = (\tau _1, \ldots , \tau _r)$ , where $\tau _i \in \Omega _i$ .
It is easy to see that $\ell (t^{\underline {\mu }_1}) = \langle \underline {\mu }_1, 2 \rho _1\rangle \geqslant \sharp \mathbb {S} _1 \geqslant {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ . Let $\xi =(\xi _1, \ldots , \xi _r) \in W_0 \cdot \underline {\mu }$ such that $t^{\xi } \in {}^K \tilde W$ . We choose a reduced expression $t^{\xi }=\tau s_{i_1} \cdots s_{i_k}$ of $t^{\xi }$ . Let $w=\tau s_{i_1} \cdots s_{i_m}$ , where $m={\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ . Then $w \leqslant t^{\xi }$ and $w \in {}^K \tilde W$ . Hence, $w \in {}^{K}\!{\mathrm{Adm}}(\mu )$ . Since $\ell (w)=m<\sharp \{({\mathrm{Ad}}(\tau ) \circ \sigma )\text {-orbits on } \tilde {\mathbb {S}} \}$ , the Weyl group $W_{\operatorname {\mathrm{supp}}_{\sigma }(w)}$ is finite and hence $\dim X_{K, w}(\tau )=\dim X_w(\tau )=\ell (w)$ . Thus, $\dim X(\mu , \tau )_K \geqslant \ell (w)=m$ .
Now, we assume that $\dim X(\mu , \tau )_K = {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ . Let $w \in {}^{K}\!{\mathrm{Adm}}(\mu )$ such that $\breve {\mathcal K} \cdot _{\sigma } \breve {\mathcal I} w \breve {\mathcal I} \cap [\tau ] \neq \emptyset $ , that is, $\breve {\mathcal I} w \breve {\mathcal I} \cap [\tau ] \neq \emptyset $ or in other words, $X_w(\tau ) \neq \emptyset $ . Then $ \dim X_w(\tau )=\dim X_{K, w}(\tau ) \leqslant \dim X(\mu , \tau )_K = {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ . By Corollary 3.4, we have $\operatorname {\mathrm{supp}}_{\sigma }(w) \subsetneq \tilde {\mathbb {S}} $ . By [Reference He23, Lemma 3.2], $\breve {\mathcal K} \cdot _{\sigma } \breve {\mathcal I} w \breve {\mathcal I} \subseteq \breve G \cdot _{\sigma } \breve {\mathcal I} \tau =[\tau ]$ . Noticing that
we deduce that
By Section 2.3, $({\mathbf G}, \mu )$ is fully Hodge–Newton decomposable.
4 Classification
Note that the fully Hodge–Newton decomposable cases are classified in [Reference Görtz, He and Nie14]. By further studying these cases via a case-by-case analysis, one may get a classification of the Coxeter types. However, there is a more direct approach (without using the classification of the Hodge–Newton decomposable cases). This approach classifies the cases where $\dim X(\mu , \tau )_K = {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ , and in particular, by analyzing all these cases, we show that this equality implies that $({\mathbf G}, \mu , K)$ is of Coxeter type and thus we obtain a classification of the Coxeter types. This is what we will do in this section.
We may assume that ${\mathbf G}$ is quasi-simple over F, and that it is semisimple of adjoint type (cf. [Reference Görtz, He and Nie14, Section 3.3]). Under this assumption, we have a decomposition
as in the proof of Theorem 3.5. As before, here each ${\mathbf G}_i$ is a simple reductive group over $\breve F$ , and $\sigma ({\mathbf G}_j) = {\mathbf G}_{j+1}$ for all j. We set ${\mathbf G}_{r+1} = {\mathbf G}_1$ . Write $\tilde {\mathbb {S}} = \tilde {\mathbb {S}} _1 \sqcup \cdots \sqcup \tilde {\mathbb {S}} _r$ and $\underline {\mu } = (\underline {\mu }_1, \dots , \underline {\mu }_r)$ with respect to the decomposition above. For $J \subseteq \tilde {\mathbb {S}} $ , we set $J_j = J \cap \tilde {\mathbb {S}} _j$ .
We assume further that each factor $\underline {\mu }_j$ is noncentral.
Recall that $\Phi $ denotes the unique reduced root system underlying the relative root system of ${\mathbf G}$ over $\breve F$ (the échelonnage root system).
4.1 Admissible triples
Let $s \in \tilde {\mathbb {S}} $ . If $s \in W_0$ , set $\alpha _s$ to be the simple root corresponding to s. Otherwise, set $\alpha _s = -\theta $ , where $\theta \in \Phi ^+$ is the highest root of the j-component of the decomposition (4.1), where $s\in \tilde {\mathbb {S}} _j$ ; then $s = t^{\theta ^{\vee }} s_{\theta }$ . Let $J \subseteq \tilde {\mathbb {S}} $ such that $W_J$ is finite. We denote by $\Phi _J$ the root system spanned by $\alpha _s$ for $s \in J$ .
Let $p: \tilde W \rtimes \langle \sigma \rangle \rightarrow \mathrm {GL}({X_*} \otimes \mathbb {R} )$ be the natural projection.
Lemma 4.1 For $\lambda \in X_*$ we have $t^{\lambda } \in {}^K \tilde W$ if and only if $\langle \lambda , \alpha _s\rangle \geqslant 0$ for all $s \in K$ . In particular, there exists $\xi \in W_0 \cdot \underline {\mu }$ such that $t^{\xi } \in {}^K \tilde W$ .
Proof The first statement follows immediately from the definitions. For the second one, notice that $\{\alpha _s; s \in K\}$ is the set of simple roots for $\Phi _K$ whose Weyl group is $p(W_K) \subseteq W_0$ . Thus, each $p(W_K)$ -orbit in $W_0 \cdot \underline {\mu }$ contains a unique cocharacter $\xi $ such that $\langle \xi , \alpha _s\rangle \geqslant 0$ for $s \in K$ , that is, $t^{\xi } \in {}^K \tilde W$ , as desired.
Let $\xi \in W_0 \cdot \underline {\mu }$ , and let $J \subsetneq \tilde {\mathbb {S}} $ be a maximal proper $\sigma $ -stable subset. Let $\xi _J \in \mathbb {R} \Phi _J^{\vee }$ be such that $\langle \xi _J, \alpha \rangle = \langle \xi , \alpha \rangle $ for $\alpha \in \Phi _J$ . We denote by $\xi _J^{\diamond }$ the $p(\sigma )$ -average of $\xi _J$ .
Definition 4.2 We say the triple $(\xi , J, K)$ with $K = \sigma (K) \subseteq J$ is admissible if $t^{\xi } \in {}^K \tilde W$ and $\xi _J^{\diamond } \in \mathbb {R} \Phi _K^{\vee }$ .
In this case, we define
where C ranges over the connected components C of K on which the $p(\sigma )$ -average $\xi ^{\diamond }$ is nonzero. In other words, $K_{\xi }$ is the minimal $\sigma $ -stable subset of K such that $\xi _J^{\diamond } \in \mathbb {R} \Phi _{K_{\xi }}^{\vee }$ .
Lemma 4.3 Let $(\xi , J, K)$ be an admissible triple. Then there exists some $\sigma $ -Coxeter element $c \in W_{K_{\xi }}$ such that
In particular, $t^{\xi } c \in {}^{K}\!{\mathrm{Adm}}(\mu )$ and the Newton point of $t^{\xi } c$ is central.
Proof The existence of c such that $\ell (t^{\xi } c) = \ell (t^{\xi }) - \ell (c)$ and hence $t^{\xi } c \in {}^{K}\!{\mathrm{Adm}}(\mu )$ follows exactly along the same lines as [Reference Görtz, He and Nie14, Lemma 6.4 and Proposition 6.7]. It remains to show that the Newton point of $t^{\xi } c$ is central. By the proof of [Reference Görtz, He and Nie14, Lemma 6.4], it suffices to show that the $p(c\sigma )$ -average $\nu $ of $\xi _J$ is zero. Write $\xi _J = v' + v"$ such that $v" \in \mathbb {R} \Phi _{K_{\xi }}^{\vee }$ and $v'$ is orthogonal to $\mathbb {R} \Phi _{K_{\xi }}^{\vee }$ . As c is a $\sigma $ -Coxeter element of $W_{K_{\xi }}$ , we see that $p(c\sigma ) - \mathrm {id} $ is invertible on $\mathbb {R} \Phi _{K_{\xi }}^{\vee }$ , which means that $\nu $ equals the $p(\sigma )$ -average of $v'$ . In particular, $\nu $ is orthogonal to $\mathbb {R} \Phi _{K_{\xi }}^{\vee }$ . On the other hand, $\nu - \xi _J^{\diamond } \in \mathbb {R} \Phi _{K_{\xi }}^{\vee }$ . By assumption, $\xi _J^{\diamond } \in \mathbb {R} \Phi _{K_{\xi }}^{\vee }$ , which means $\nu \in \mathbb {R} \Phi _{K_{\xi }}^{\vee }$ and $\nu = 0$ , as desired.
Lemma 4.4 Suppose that $\dim X(\mu , \tau )_K={\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ and that $(\xi , J, K')$ is an admissible triple such that $K' \supseteq K$ . Then
Proof Let c be as in Lemma 4.3. Then we need to show that $\ell (t^{\xi } c) \leqslant {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ . However, our assumption implies, by Theorem 3.5, that $({\mathbf G}, \mu )$ is fully Hodge–Newton decomposable, and hence we have $\ell (t^{\xi } c) = \dim X_{t^{\xi } c}(\tau )$ (note that this is an easy consequence of (3.2) and does not require the use of classification results).
Altogether we obtain
as desired.
Given a $\sigma $ -stable subset $K \subseteq \tilde {\mathbb {S}} $ with $W_K$ finite, it follows from Lemma 4.1 that there always exists some admissible triple $(\xi , J, K)$ .
Corollary 4.5 If $\dim X(\mu , \tau )_K={\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ , then
In particular,
We can now state the following equivalent characterizations of being of Coxeter type. Note that there is an obvious notion of product of Coxeter data. We call a Coxeter datum irreducible, if it cannot be decomposed as a product in a nontrivial way.
Theorem 4.6 Consider an enhanced Tits datum $({\mathbf G}, \mu , K)$ with corresponding enhanced Coxeter datum $(W_a, \sigma , \underline {\mu }, K)$ . Assume that all components of $\underline {\mu }$ as in (4.1) are noncentral. The following conditions are equivalent:
-
(1) The enhanced Tits datum $({\mathbf G}, \mu , K)$ is of Coxeter type.
-
(2) We have that $\dim X(\mu , \tau )_K={\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ .
-
(3) For any admissible triple $(\xi , J, K')$ with $K' \supseteq K$ , we have that
$$\begin{align*}\langle \underline{\mu}, 2\rho\rangle \leqslant \sharp \{\sigma\text{-orbits of } K_{\xi}'\} + {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau}). \end{align*}$$ -
(4) The enhanced Coxeter datum $(W_a, \sigma , \underline {\mu }, K)$ is a product of irreducible enhanced Coxeter data, where for each factor the Coxeter datum is one of those listed in Table 1, and the level structure K contains the minimal one listed in that table. See Section 2.2 for the notation.
4.2 Strategy
In Table 1, we list the minimal irreducible enhanced Coxeter data (up to isomorphism) satisfying condition (3) of Theorem 4.6 together with the set ${}^{K}\!{\mathrm{Adm}}(\mu )_0$ . It is easy to see that in all these cases, ${}^{K}\!{\mathrm{Adm}}(\mu )_0={}^K\!{\mathrm{Cox}}(\mu )$ . Therefore, we also have ${}^{K'}\!\operatorname {\mathrm{Adm}}(\mu )_0={}^{K'}\mathop {\mathrm{Cox}}(\mu )$ for all $K' \supset K$ . This shows $(4) \Rightarrow (1)$ . Note that $(1) \Rightarrow (2)$ is obvious and $(2) \Rightarrow (3)$ follows from Lemma 4.4. It remains to show that $(3) \Rightarrow (4)$ .
Note that condition (3), although a bit technical, is the most elementary one among the four conditions and only involves the root system. In the rest of the section, we will analyze condition (3) and give a classification of the irreducible enhanced Coxeter data that satisfy this condition, and finally show that those cases are of Coxeter type. This finishes the direction $(3) \Rightarrow (4)$ .
Our strategy is as follows.
In Step (I), we show that condition (3) implies that the Coxeter datum $(W_a, \sigma , \underline {\mu })$ is one of those listed in Table 2.
This is done by the inequalities (b) and (c) in Corollary 4.5 in most of the cases. The only exception is in Type D, where in some cases we have to use the full strength of condition (3).
Note that the cases in this table are the fully Hodge–Newton decomposable cases. We will then further analyze these cases and give a complete classification.
In Step (II), we show that condition (3) implies that for the Coxeter datum in Table 2, the parahoric subgroup K must be as specified in Theorem 4.6/Table 1.
4.3 Step (I): The Coxeter datum $(W_a, \sigma , {\mu })$
Recall the decomposition (4.1). We will argue on the type of the irreducible affine Dynkin diagram $\tilde {\mathbb {S}} _j$ , which does not depend on j. For $i \in \tilde {\mathbb {S}} _j$ , we denote by $\omega _{i, j}^{\vee }$ the corresponding fundamental coweight in ${\mathbf G}_j$ . If $r = 1$ , we write $\omega _i^{\vee } = \omega _{i, 1}^{\vee }$ for simplicity.
4.3.1. Exceptional types
We use the inequality (c) to exclude all the exceptional types.
Type $\tilde E_6$ : $\langle \underline {\mu }, 2 \rho \rangle \geqslant \langle \omega ^{\vee }_{1, j}, 2 \rho \rangle =16>2 \times 6$ .
Type $\tilde E_7$ : $\langle \underline {\mu }, 2 \rho \rangle \geqslant \langle \omega ^{\vee }_{7, j}, 2 \rho \rangle =27>2 \times 7$ .
Type $\tilde E_8$ : $\langle \underline {\mu }, 2 \rho \rangle \geqslant \langle \omega ^{\vee }_{8, j}, 2 \rho \rangle =58>2 \times 8$ .
Type $\tilde F_4$ : $\langle \underline {\mu }, 2 \rho \rangle \geqslant \langle \omega ^{\vee }_{4, j}, 2 \rho \rangle =16>2 \times 4$ .
Type $\tilde G_2$ : $\langle \underline {\mu }, 2 \rho \rangle \geqslant \langle \omega ^{\vee }_{2, j}, 2 \rho \rangle =6>2 \times 2$ .
Now, we come to the classical groups.
4.3.2. Type $\tilde A_{n-1}$
By applying a suitable automorphism, we may assume that $K_i \subset \tilde {\mathbb {S}} _i - \{0\}$ . By (b), we deduce that (up to isomorphism) one of following cases occurs:
-
(1) $r = 1$ and $\underline {\mu } \in \{\omega _k^{\vee }; 1 \leqslant k \leqslant n-1\}$ .
-
(2) $r = 1$ and $\underline {\mu } \in \{2\omega _1^{\vee }, \omega _1^{\vee } + \omega _{n-1}^{\vee }, 2\omega _{n-1}^{\vee }\}$ .
-
(3) $r = 2$ and $\underline {\mu } = \omega _{1, 1}^{\vee } + \omega _{n-1, 2}^{\vee }$ .
In the last two cases, we have by (b) that ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf G}) = {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau }) = n-1$ , which means that $\underline {\mu } = \omega _1^{\vee } + \omega _{n-1}^{\vee }$ (which equals $2\omega ^{\vee }_1$ if $n=2$ ) and $\sigma = \mathrm {id} $ or $\underline {\mu } = \omega _{1, 1}^{\vee } + \omega _{n-1, 2}^{\vee }$ and $\sigma = {}^1\varsigma _0$ .
Now, we assume $r = 1$ . Suppose $\underline {\mu } = \omega _i^{\vee }$ for some $1 \leqslant i \leqslant n-1$ . Notice that ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_1) \leqslant \frac {n}{2} $ (resp. ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau }) \leqslant \frac {n}{2}$ ) if $\sigma \neq \mathrm {id} $ (resp. ${\mathrm{Ad}}(\tau ) \circ \sigma \neq \mathrm {id} $ ). So either $\underline {\mu } \in \{\omega _1^{\vee }, \omega _{n-1}^{\vee }\}$ or $\underline {\mu }=\omega ^{\vee }_2$ with $n=4$ .
Suppose $\underline {\mu } = \omega _1^{\vee }$ and $\sigma ={\mathrm{Ad}}(\tau _k)$ for $0 \leqslant k \neq n-1$ . Then, by (b), we have
which implies $k = n-1$ or $k = 0$ . Otherwise, we deduce that (up to isomorphism) $\sigma = \varsigma _0$ .
Suppose $\underline {\mu } = \omega ^{\vee }_2$ and $n = 4$ . We have (up to isomorphism) $\sigma = \mathrm {id} $ , or $\sigma = \varsigma _0$ , or $\sigma = {\mathrm{Ad}}(\tau _1)$ , or $\sigma = {\mathrm{Ad}}(\tau _1) \circ \sigma _0$ . The last case does not occur since (b) fails.
4.3.3. Type $\tilde B_n$ for $n \geqslant 3$
Here, $\langle \omega ^{\vee }_{i, j}, 2 \rho \rangle =i (2n-i)$ . Therefore, $\langle \underline {\mu }, 2 \rho \rangle \leqslant 2n$ implies that $r = 1$ and $\underline {\mu }=\omega ^{\vee }_1$ . In this case, $\sigma =\mathrm {id} $ or $\sigma ={\mathrm{Ad}}(\tau _1)$ .
4.3.4. Type $\tilde C_n$ for $n \geqslant 2$
Here,
Therefore, $\langle \underline {\mu }, 2 \rho \rangle \leqslant 2n$ implies that $r = 1$ and either $\underline {\mu }=\omega ^{\vee }_1$ or $\underline {\mu }=\omega ^{\vee }_n$ with $n \leqslant 3$ .
If $\underline {\mu }=\omega ^{\vee }_1$ , then $\langle \underline {\mu }, 2 \rho \rangle =2n$ and hence ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })=n$ . Therefore, $\sigma =id$ .
For $n=2$ and $\underline {\mu }=\omega ^{\vee }_2$ , we have $\sigma =\mathrm {id} $ or $\sigma ={\mathrm{Ad}}(\tau _2)$ .
For $n=3$ and $\underline {\mu }=\omega ^{\vee }_3$ , we have $\langle \underline {\mu }, 2 \rho \rangle =6$ and hence ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })=3$ . Therefore, $\sigma ={\mathrm{Ad}}(\tau _3)$ and $\langle \underline {\mu }, 2\rho \rangle = 6> 4 = {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf G}) + {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ , contradicting (b).
4.3.5. Type $\tilde D_n$ for $n \geqslant 4$
Here,
Therefore, $\langle \underline {\mu }, 2 \rho \rangle \leqslant 2n$ implies that $r = 1$ and (up to isomorphism) either $\underline {\mu }=\omega ^{\vee }_1$ or $\underline {\mu }=\omega ^{\vee }_n$ with $n=5$ .
If $n=5$ and $\underline {\mu }=\omega ^{\vee }_5$ , we have $\langle \underline {\mu }, 2 \rho \rangle =10$ and hence ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })=5$ . Therefore, $\sigma ={\mathrm{Ad}}(\tau _4)$ . However, $\langle \underline {\mu }, 2 \rho \rangle = 10> 6 = {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf G}) + {\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau })$ , contradicting (b).
If $\underline {\mu }=\omega ^{\vee }_1$ , then $\langle \underline {\mu }, 2 \rho \rangle =2(n-1)$ and ${\mathrm{rank}_F^{\mathrm{ss}}}({\mathbf J}_{\tau }) \geqslant n-2$ . Thus, we have $\sigma =\mathrm {id} $ , $\sigma =\sigma _0$ , or $\sigma ={\mathrm{Ad}}(\tau _1)$ . Suppose $\sigma ={\mathrm{Ad}}(\tau _1)$ and $i \notin K$ for some $1 \leqslant i \leqslant n-1$ . Let $J = \tilde {\mathbb {S}} - \{i, \sigma (i)\}$ . Let $\xi = \underline {\mu }$ if $i = 1$ and $\xi = s_i \cdots s_2 s_1(\underline {\mu })$ if $2 \leqslant i \leqslant n-1$ . Then $J_{\xi } = \emptyset $ if $i \in \{1, n-1\}$ and $J_{\xi } = \{i+1, \dots , n-1, n\}$ otherwise. Hence, (a) fails for the admissible triple $(\xi , J, J)$ .
In Table 2, we list the remaining cases together with the $\sigma $ -orbits and the ${\mathrm{Ad}}(\tau ) \circ \sigma $ -orbits on $\tilde {\mathbb {S}} $ . This information will be used in the case-by-case analysis in the remainder of this section.
4.4 Step (II): Exclude certain K
4.4.1. $(W_a, \sigma , \underline {\mu }) = (\tilde A_{2m}, \varsigma _0, \omega _1^{\vee })$
If $K \neq \tilde {\mathbb {S}} -\{0\}$ , then $K \subset J = \tilde {\mathbb {S}} - \{i, 2m+1-i\}$ for some $1 \leqslant i \leqslant m$ . Let $\xi =s_i s_{i-1} \cdots s_1(\underline {\mu })=(0, \ldots , 0, 1, 0, \ldots , 0)$ with the $(i+1)$ th entry equal to $1$ . Then $J_{\xi }=\{i+1, i+2, \ldots , 2m-i\}$ . Hence, the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
4.4.2. $(W_a, \sigma , \underline {\mu }) = (\tilde A_{2m+1}, \varsigma _0, \omega _1^{\vee })$
If $\tilde {\mathbb {S}} -\{0, m+1\} \nsubseteq K$ , then $K \subset J = \tilde {\mathbb {S}} - \{i, 2m+2-i\}$ for some $1 \leqslant i \leqslant m$ . Let $\xi =s_i s_{i-1} \cdots s_1(\underline {\mu })=(0, \ldots , 0, 1, 0, \ldots , 0)$ with the $(i+1)$ th entry equal to $1$ . Then $J_{\xi }=\{i+1, i+2, \ldots , 2m+1-i\}$ . Hence, the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
4.4.3. $(W_a, \sigma , \underline {\mu }) = (\tilde A_{2m+1}, \varrho _{n-1} \circ \varsigma _0, \omega _1^{\vee })$
After applying a suitable inner diagram automorphism, we may assume that $K \subseteq J = \tilde {\mathbb {S}} - \{i, 2m+1-i\}$ for some $1 \leqslant i \leqslant m$ . Let $\xi = s_i s_{i-1} \cdots s_1(\underline \mu ) = (0, \dots , 0, 1, 0 \dots , 0)$ otherwise, where the $(i+1)$ th entry equals $1$ . Then $J_{\xi } =\{i+1, i+2, \ldots , 2m-i\}$ otherwise. Hence, the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
4.4.4. $(W_a, \sigma , \underline {\mu }) = (\tilde A_{n-1}, \mathrm {id} , \omega _1^{\vee } + \omega _{n-1}^{\vee })$ for $n \geqslant 3$
If $|K|<n-1$ , then after applying the diagram automorphism $\varsigma _0$ , we may assume that $K \subseteq K':= \tilde {\mathbb {S}} - \{0, i\}$ for some $1 \leqslant i \leqslant n-2$ . Let $J=\tilde {\mathbb {S}} -\{0\}$ and $\xi =s_i s_{i-1} \cdots s_1(\underline {\mu })=(0, \ldots , 0, 1, 0, \ldots , 0, -1)$ with the $(i+1)$ th entry equal to $1$ and the nth entry equal to $-1$ . Then $K^{\prime }_{\xi }=\{i+1, i+2, \ldots , n-1\}$ . Hence, the inequality (a) fails for the admissible triple $(\xi , J, K')$ .
4.4.5. $(W_a, \sigma , \underline {\mu }) = (\tilde A_{n-1} \times \tilde A_{n-1}, {}^1\varsigma _0, \omega _{1, 1}^{\vee } + \omega _{n-1, 2}^{\vee })$
If $|K_1| < n-1$ , then after applying a suitable inner diagram automorphism, we may assume that $K_1 \subseteq \tilde {\mathbb {S}} _1 - \{0, i\}$ for some $1 \leqslant i \leqslant n-1$ . Let $J = \sigma (J)$ such that $J_1 = \tilde {\mathbb {S}} _1 -\{0\}$ and $\xi =(\xi _1, \xi _2)$ , where $\xi _2 = (0, \dots , 0, -1)$ and $\xi _2 = (0, \ldots , 0, 1, 0, \ldots , 0)$ with the $(i+1)$ th entry equal to $1$ . Then $J_{\xi } = \{i+1, i+2, \ldots , n-1\}$ . Hence, the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
4.4.6. $(W_a, \sigma , \underline {\mu }) = (\tilde A_3, \mathrm {id} , \omega _2^{\vee })$
Suppose K does not contain two consecutive vertices in the affine Dynkin diagram. Then, up to a suitable inner diagram automorphism, we may assume that $K \subset K' := \{1, 3\}$ . Let $J = \tilde {\mathbb {S}} - \{0\}$ and $\xi = s_2(\underline {\mu }) = (1, 0, 1, 0)$ . Then the inequality (a) fails for the admissible triple $(\xi , J, K')$ .
4.4.7. $(W_a, \sigma , \underline {\mu }) = (\tilde A_3, \varsigma _0, \omega _2^{\vee })$
Suppose $K \subseteq K' := \{1, 3\}$ . Let $J = \tilde {\mathbb {S}} - \{0\}$ and $\xi = s_2(\underline {\mu }) = (1, 0, 1, 0)$ . Then the inequality (a) fails for the admissible triple $(\xi , J, K')$ .
Suppose $K \subseteq J := \{0, 2\}$ . Let $\xi = s_1 s_2(\underline {\mu }) = (0, 1, 1, 0)$ . Then $J_{\xi } = \emptyset $ and hence the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
4.4.8. $(W_a, \sigma , \underline {\mu }) = (\tilde A_3, {\mathrm{Ad}}(\tau _1), \omega _2^{\vee })$
The semisimple rank of ${\mathbf J}_{\tau }$ is zero, and the inequality (b) fails.
4.4.9. $(W_a, \sigma , \underline {\mu }) = (\tilde B_n, \mathrm {id} , \omega _1^{\vee })$
Suppose $K \subseteq J := \tilde {\mathbb {S}} - \{i\}$ with $2 \leqslant i \leqslant n-1$ . Let $\xi = s_i s_{i-1} \dots s_1(\underline {\mu }) = (0, \dots , 0, 1, 0, \dots , 0)$ with the $(i+1)$ th entry being $1$ . Then $J_{\xi } = \{i+1, i+2, \dots , n\}$ and hence the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
4.4.10. $\ \ \,(W_a, \sigma , \underline {\mu }) = (\tilde B_n, {\mathrm{Ad}}(\tau _1), \omega _1^{\vee })$
Suppose $K \subseteq J := \tilde {\mathbb {S}} - \{i\}$ with $2 \leqslant i \leqslant n-1$ . Let $\xi = s_i s_{i-1} \dots s_1(\underline {\mu }) = (0, \dots , 0, 1, 0, \dots , 0)$ with the $(i+1)$ th entry being $1$ . Then $J_{\xi } = \{i+1, i+2, \dots , n\}$ and hence the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
Suppose $K \subseteq J := \tilde {\mathbb {S}} - \{0, 1\}$ . Then $J_{\underline {\mu }} = \emptyset $ and hence the inequality (a) fails for the admissible triple $(\underline {\mu }, J, J)$ .
4.4.11. $\ \ \,(W_a, \sigma , \underline {\mu }) = (\tilde C_n, \mathrm {id} , \omega _1^{\vee })$
If $K \subseteq J := \tilde {\mathbb {S}} - \{i\}$ for $1 \leqslant i \leqslant n-1$ . Let $\xi = s_i s_{i-1} \dots s_1(\underline {\mu }) = (0, \dots , 0, 1, 0, \dots , 0)$ with the $(i+1)$ th entry being $1$ . Then $J_{\xi } = \{i+1, i+2, \dots , n\}$ and hence the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
4.4.12. $\ \ \,(W_a, \sigma , \underline {\mu }) = (\tilde C_2, \mathrm {id} , \omega _2^{\vee })$
Suppose $K \subseteq K' := \{1\}$ . Let $J = \{1, 2\}$ and $\xi = s_2(\underline {\mu })$ . Then the inequality (a) fails for the admissible triple $(\xi , J, K')$ .
4.4.13. $\ \ \,(W_a, \sigma , \underline {\mu }) = (\tilde C_2, {\mathrm{Ad}}(\tau _2), \omega _2^{\vee })$
Suppose $K \subseteq J := \{1\}$ . Then $J_{\underline {\mu }} = \emptyset $ and hence the inequality (a) fails for the admissible triple $(\underline {\mu }, J, J)$ .
4.4.14. $\ \ \,(W_a, \sigma , \underline {\mu }) = (\tilde D_n, \mathrm {id} , \omega _1^{\vee })$
Suppose $K \subseteq J := \tilde {\mathbb {S}} - \{i\}$ for $2 \leqslant i \leqslant n-1$ . Let $\xi = s_i s_{i-1} \dots s_1(\underline {\mu }) = (0, \dots , 0, 1, 0, \dots , 0)$ with the $(i+1)$ th entry being $1$ . Then $J_{\xi } = \{i+1, i+2, \dots , n\}$ and hence the inequality (a) fails for the admissible triple $(\underline {\mu }, J, J)$ .
Suppose $K \subseteq K' := \tilde {\mathbb {S}} - \{0, 1\}$ . Let $J = \tilde {\mathbb {S}} - \{0\}$ and $\xi = s_1(\underline {\mu })$ . Then the inequality (a) fails for the admissible triple $(\underline {\mu }, J, K')$ .
4.4.15. $\ \ \,(W_a, \sigma , \underline {\mu }) = (\tilde D_n, \varsigma _0, \omega _1^{\vee })$
Suppose $K \subseteq J := \tilde {\mathbb {S}} - \{i\}$ for $2 \leqslant i \leqslant n-2$ . Let $\xi = s_i s_{i-1} \dots s_1(\underline {\mu }) = (0, \dots , 0, 1, 0, \dots , 0)$ with the $(i+1)$ th entry being $1$ . Then $J_{\xi } = \{i+1, i+2, \dots , n\}$ and hence the inequality (a) fails for the admissible triple $(\underline {\mu }, J, J)$ .
Suppose $K \subseteq K' := \tilde {\mathbb {S}} - \{0, 1\}$ . Let $J = \tilde {\mathbb {S}} - \{0\}$ and $\xi = s_1(\underline {\mu })$ . Then the inequality (a) fails for the admissible triple $(\underline {\mu }, J, K')$ .
Suppose $K \subseteq J := \tilde {\mathbb {S}} - \{n-1, n\}$ . Let $\xi = (s_{n-1} \cdots s_1(\underline {\mu }) = (0, \dots , 0, 1)$ . Then $J_{\xi } = \emptyset $ and hence the inequality (a) fails for the admissible triple $(\xi , J, J)$ .
5 Consequences for RZ spaces
In this section, we explain consequences of our results for Rapoport–Zink spaces (see [Reference Hamacher and Kim19, Reference Rapoport and Zink46]). Moreover, compare the paper [Reference Wang58] by Wang for applications to Shimura varieties.
5.1 Definitions
We consider the situation where $F = \mathbb Q_p$ , and where the pair $({\mathbf G}, \mu )$ corresponds to a Rapoport–Zink space. As before, we consider the basic case, i.e., we denote by b the basic $\sigma $ -conjugacy class in $B({\mathbf G}, \mu )$ .
Since there are different constructions of Rapoport–Zink (RZ) spaces in the PEL case and the more general case of Hodge type, we will axiomatize the properties that we require, rather than fixing one of the constructions.
In all cases, the RZ space $\mathcal M({\mathbf G}, \mu , b)_K$ is a formal scheme over $\breve O$ , the ring of integers of $\breve F$ . We denote by $\mathbf k$ the residue class field of $\breve F$ , and by a subscript $-_{\mathbf k}$ indicate the base change to $\mathbf k$ .
Note that the results in the previous sections are group-theoretic in nature and hence concern parahoric level structures, but the known constructions of RZ spaces work for stabilizers of facets in the Bruhat–Tits building. See the discussion in Section 5.5. To take this possible difference into account right from the beginning, we change notation as follows: we denote by P the stabilizers of facets of the base alcove, and by $P^{\circ }$ the corresponding parahoric.
The setup of the theory entails that P and $P^{\circ }$ are always defined over F, i.e., fixed by $\sigma $ .
We denote by $\mathrm {Gr}_{P^{\circ }}$ the partial affine flag variety for $P^{\circ }$ (i.e., with $\mathbf k$ -valued points $\breve {G}/P^{\circ }$ , and similarly by $\mathrm {Gr}_{P}$ the “partial affine flag variety” for P with $\mathbf k$ -valued points $\breve {G}/P$ ; cf. [Reference Bhatt and Scholze3], [Reference Zhu64, Theorem 1.4]). Let $\pi \colon \mathrm {Gr}_{P^{\circ }}\rightarrow \mathrm {Gr}_{P}$ denote the projection.
Since $P^{\circ } \subseteq P$ is a normal subgroup, the (finite) quotient group $P/P^{\circ }$ acts on $\mathrm {Gr}_{P^{\circ }}$ on the right by $gP^{\circ } \cdot p = gpP^{\circ }$ , $p\in P/P^{\circ }$ , and $\mathrm {Gr}_P$ is the quotient by this action. The quotient here is of a very simple nature, since the action just identifies certain connected components of $\mathrm {Gr}_{P^{\circ }}$ . In fact, let $\breve {\kappa }\colon \breve {G}\rightarrow \pi _0(\mathrm {Gr}_{P^{\circ }}) = \pi _1({\mathbf G})_{\Gamma _0}$ be the Kottwitz homomorphism (cf. [Reference Zhu64, Proposition 1.21]). Now, $P^{\circ }$ is the kernel of $\breve {\kappa }_{|P}$ (see [Reference Pappas and Rapoport40, Appendix Proposition 3], and note that the Appendix covers the case of mixed characteristic, which we need here). Correspondingly, the parahoric group scheme corresponding to $P^{\circ }$ is the connected component of the “stabilizer group scheme” corresponding to P. So the above action by $P/P^{\circ }$ permutes the connected components of $\mathrm {Gr}_{P^{\circ }}$ and is free. In particular, the restriction of $\pi $ to any connected component of $\mathrm {Gr}_{P^{\circ }}$ is an isomorphism onto a connected component of $\mathrm {Gr}_{P}$ .
In this way, we obtain a perfect ind-scheme $\mathrm {Gr}_{P}$ with $\mathbf k$ -valued points $\breve {G}/P$ and such that the projection $\pi \colon \mathrm {Gr}_{P^{\circ }}\rightarrow \mathrm {Gr}_{P}$ is an isomorphism when restricted to a connected component of $\mathrm {Gr}_{P^{\circ }}$ .
We write
which inherits the structure of a perfect ind-scheme and is even a scheme locally of finite type over $\mathbf k$ (cf. [Reference Hamacher and Viehmann20, Reference Rapoport and Zink47]).
Similarly as in [Reference Görtz, He and Nie14], we consider the following condition:
( $\diamondsuit $ ) For facet stabilizers $P\subset P'$ , we have a projection $\mathcal M({\mathbf G}, \mu , b)_P\rightarrow \mathcal M({\mathbf G}, \mu , b)_{P'}$ , and there are isomorphisms
of perfect schemes, compatible with the projections for inclusions $P\subset P'$ .
The second condition that we need to impose is the following compatibility between the RZ spaces for levels $P_i$ and the RZ space attached to the intersection $P:=\bigcap _i P_i$ . It follows from property ( $\diamondsuit $ ) that the morphism $\mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}^{\mathrm{red}}\rightarrow \prod _i \mathcal M({\mathbf G}, \mu , b)_{P_i, \mathbf k}^{\mathrm{red}}$ is a homeomorphism onto a closed subscheme of its target. In fact, by ( $\diamondsuit $ ) it is enough to check the analogous property for the affine Deligne–Lusztig varieties, where it follows from the fact that the natural morphism $\mathrm {Gr}_P \rightarrow \prod _i\mathrm {Gr}_{P_i}$ is a closed immersion. We will need the corresponding property “before perfection” on the RZ-space side and hence impose the following stronger statement as our second axiom:
( $\clubsuit $ ) The natural morphism $\mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}^{\mathrm{red}}\rightarrow \prod _i \mathcal M({\mathbf G}, \mu , b)_{P_i, \mathbf k}^{\mathrm{red}}$ is a closed immersion of $\mathbf k$ -schemes.
5.2 The PEL case
For most RZ spaces of PEL type, it is known that assumptions ( $\diamondsuit $ ) and ( $\clubsuit $ ) are satisfied.
Consider an RZ space attached to a PEL datum as in [Reference Rapoport and Zink46, Chapter 3] (see also [Reference Hartwig22, Reference Rapoport and Viehmann45] for summaries and further discussions). Let ${\mathbf G}$ and $\mu $ be the group and cocharacter attached to it. In this context, the level structure is given by a polarized chain of lattices (i.e., we use a lattice model for the relevant Bruhat–Tits building). Denote by $P \subset \breve G$ the stabilizer of this fixed standard chain.
Denote by $\mathcal M^{\mathrm{naive}}_P$ the corresponding RZ space [Reference Rapoport and Zink46, Definition 3.21] over $O_{\breve E}$ , the ring of integers of the completion of the maximal unramified extension of the local reflex field E. It depends on the choice of a “framing object” $\mathbf X$ (a p-divisible group with additional structure corresponding to the group ${\mathbf G}$ ) and parameterizes pairs $((X_{\Lambda }), (\rho _{\Lambda }))$ , where $(X_{\Lambda })$ is a chain of isogenies of p-divisible groups (over a scheme S on which p is locally nilpotent) indexed by the fixed lattice chain, and $\rho _{\Lambda }$ is a quasi-isogeny between $X_{\Lambda }$ and $\mathbf X$ over the closed subscheme $V(p)\subseteq S$ . These data are required to satisfy certain compatibilities (see [Reference Rapoport and Zink46, Definition 3.12]).
It is clear that property ( $\diamondsuit $ ) cannot in general be expected to hold for the “naive” RZ spaces, because their special fiber in general comprises strata not reflected in the set $\operatorname {\mathrm{Adm}}(\mu )$ . Therefore, we pass to the corresponding “flat RZ space.”
As shown in [Reference Rapoport and Zink46], the space $\mathcal M^{\mathrm{naive}}_P$ admits a local model diagram
where $M^{\mathrm{naive}}_P$ denotes the local model of [Reference Rapoport and Zink46, Definition 3.27] and $-^{\wedge }$ denotes the p-adic completion. See [Reference Rapoport and Zink46, Chapter 3], in particular Sections 3.26–3.35. Denote by $M^{\mathrm{flat}}_P$ the flat closure inside $M^{\mathrm{naive}}_P$ of its generic fiber.
Then $\mathcal M_P$ is defined by pulling back and pushing forward the inclusion $M^{\mathrm{flat}}_P \subseteq M^{\mathrm{naive}}_P$ along the local model diagram (after passing to the completion along the special fiber), i.e., we have a diagram
where the vertical arrows are closed immersions and both squares are Cartesian. Then $\mathcal M_P$ is flat over $O_{\breve E}$ .
Forgetting the endomorphism and polarization structure, we obtain a closed embedding into the corresponding RZ space for the general linear group and the same lattice chain, now considered without additional structure. We will denote this space by $\mathcal M^{\prime }_{P'}$ (imitating the notation of [Reference Hamacher and Kim19]), in particular $P'$ is the stabilizer of our lattice chain inside the general linear group $G' = GL(\mathbf N)(\breve {\mathbb Q}_p)$ of automorphisms of the rational Dieudonné module $\mathbf N$ of $\mathbf X$ .
By Dieudonné theory, we obtain a commutative diagram of inclusions:
Here, the right vertical map maps a point $(X_{\bullet }, \rho )\in \mathcal M^{\prime }_{P'}$ , where $X_{\bullet }$ is a chain of isogenies of p-divisible groups indexed by the fixed periodic lattice chain, and $\rho $ is a quasi-isogeny with the framing object $\mathbb X$ , to $gP'$ where g maps the fixed (partial) standard lattice chain to the chain of Dieudonné modules of $X_{\bullet }$ (considered inside the rational Dieudonné module of $\mathbb X$ via $\rho $ ). Since in the $GL_n$ case the vertical map is induced by an isomorphism of perfect schemes onto its image (cf. [Reference Zhu64, Proposition 3.11], which easily generalizes to general parahoric level structure in that case), the same is true for this map.
The map $\mathcal M^{\mathrm{naive}}_P(\mathbf k)\rightarrow \mathcal M^{\prime }_{P'}(\mathbf k)$ , which forgets the additional structure is an inclusion, because the additional structure is uniquely determined (by that structure on $\mathbf X$ and the quasi-isogenies $\rho _{\Lambda }$ ), if it exists. Cf. [Reference Rapoport and Zink46, Proof of Theorem 3.25].
The Frobenius morphism on $\mathbf N$ is given by $b\sigma $ for some $b\in \breve {G}$ .
Consider a point $(\mathcal F_{\Lambda })_{\Lambda }\in M^{\mathrm{naive}}(\mathbf k)$ . By definition, each $\mathcal F_{\Lambda }$ is a subspace of $\Lambda \otimes _{\mathbb Z_p}\mathbf k$ (with further properties which we do not state here; in comparison to [Reference Rapoport and Zink46, Definition 3.27], we switch from quotients to subspaces). Equivalently, we can record these data as a lattice $\tilde {\mathcal F}$ lying between $\Lambda $ and $p\Lambda $ . This defines an inclusion $M^{\mathrm{naive}}(\mathbf k) \rightarrow \breve {G}/P$ , and the action of P on $\breve {G}/P$ on the left preserves the subset $M^{\mathrm{naive}}(\mathbf k)$ .
By the definition of the local model diagram, we obtain a commutative diagram
where the lower horizontal map maps $gP$ to the double coset of $g^{-1}b\sigma (g)$ . Likewise, P acts on $M_P^{\mathrm{flat}}(\mathbf k)\subseteq M^{\mathrm{naive}}(\mathbf k)$ since this action comes from an action of the (smooth) stabilizer group scheme associated with P on the $O_{\breve E}$ -scheme $M^{\mathrm{naive}}$ . We write $\mathcal A(\mu )_P = P\backslash M^{\mathrm{flat}}(\mathbf k) \subset P\backslash \breve {G} / P$ . Similarly, we write $\operatorname {\mathrm{Adm}}(\mu )_P = P\backslash P\operatorname {\mathrm{Adm}}(\mu )P/P$ .
Proposition 5.1 If $\mathcal A(\mu )_P = \operatorname {\mathrm{Adm}}(\mu )_P$ , then assumption ( $\diamondsuit $ ) is satisfied. More precisely, the inclusion $\mathcal M_P(\mathbf k) \subset \breve G/P$ is induced by an isomorphism
of perfect schemes, and these isomorphisms are compatible with the projections for passing to sub-lattice chains.
Proof Since we know (from the embedding into a $GL$ situation) that the map is induced from a morphism of perfect schemes, it is enough to check the claim on $\mathbf k$ -valued points.
The above discussion shows, together with our assumption $\mathcal A(\mu )_P = \operatorname {\mathrm{Adm}}(\mu )_P$ and the definition of $M^{\mathrm{flat}}_P$ , that we have a diagram
in which both squares are Cartesian. Viewing $\mathcal M^{\mathrm{naive}}_P(\mathbf k)\subset \breve {G}/P$ as before, the lower horizontal map is given by $gP\mapsto P g^{-1}b\sigma (g)P$ . So we see that $\mathcal M_P(\mathbf k) \subseteq X(\mu , b)_P$ , and that it is enough to show that $X(\mu , b)_P\subseteq \mathcal M^{\mathrm{naive}}_P(\mathbf k)$ in order to complete the proof.
For this inclusion, note that the square in the diagram (5.1), while not Cartesian in general, is close to being Cartesian. More precisely, $\mathcal M^{\mathrm{naive}}_P(\mathbf k)$ is defined inside the intersection $\mathcal M^{\prime }_{P'}(\mathbf k)\cap \breve {G}/P$ by imposing the Kottwitz determinant condition. Since this condition can be checked on the local model, and since it is satisfied by definition on $M^{\mathrm{naive}}$ , and a fortiori on $M^{\mathrm{flat}}$ , it is enough to show that $X(\mu , b)_P\subseteq \mathcal M^{\prime }_{P'}$ .
Denote by $\mu '$ the composition of $\mu $ with the inclusion ${\mathbf G}\rightarrow GL(\mathbf N)$ . Since the identification of $\mathcal M^{\prime }_{P'}(\mathbf k)$ with the corresponding generalized affine Deligne–Lusztig variety $X^{GL}(\mu ', b)_{P'}$ for the general linear group is easy to check, we see that is enough to show that $X_P$ embeds into $X^{GL}(\mu ', b)_{P'}$ under the embedding $\breve G/P\subseteq G'/P'$ . This follows once we can prove that $\operatorname {\mathrm{Adm}}(\mu )_P$ embeds into the admissible set $\operatorname {\mathrm{Adm}}(\mu ')_{P'}\subset P'\backslash G'/P'$ .
This compatibility of admissible sets follows from the inclusions $M^{\mathrm{flat}}_P(\mathbf k) \subseteq M^{\mathrm{naive}}_P(\mathbf k) \subseteq M^{GL}_{P'}(\mathbf k)$ , where $M^{GL}_{P'}$ denotes the local model for the general linear group, using once more the assumption $\mathcal A(\mu )_P = \operatorname {\mathrm{Adm}}(\mu )_P$ and the fact that for the general linear group the corresponding equality is known, as well.
It is clear that everything above is compatible with the projections arising from forgetting some of the lattices in our chain.
Remark 5.2 The condition $\mathcal A(\mu )_P = \operatorname {\mathrm{Adm}}(\mu )_P$ is known to hold in many cases. Note that almost the same condition is posed as Axiom 3.2 in [Reference He and Rapoport27]; the only difference is that in our setting we can (and need to) be a little bit more precise as to how this identification arises.
-
(1) Assume that ${\mathbf G}/\mathbb Q_p$ is connected and splits over a tamely ramified extension and that the stabilizer P of our lattice chain is a parahoric subgroup. Then, by the work of Pappas and Zhu [Reference Pappas and Zhu42, Theorems 1.1 and 1.2] and Haines and Richarz [Reference Haines and Richarz18, Theorem 6.12] (and in many individual cases, their predecessors), the special fiber of $M^{\mathrm{flat}}_P$ is the union of Schubert varieties (in an equal characteristic affine Grassmannian) indexed by the admissible set $\operatorname {\mathrm{Adm}}(\mu )_P$ . See also [Reference Pappas and Zhu42, Section 8.2], and see [Reference Haines and Richarz17] for further cases in the presence of wild ramification. See [Reference Anschütz, Gleason, Lourenço and Richarz1, Theorem 6.16] for the point of view of v-sheaves.
-
(2) The condition has been checked in many individual cases, including cases where the stabilizer P is not parahoric. Specifically, see Smithling’s papers [Reference Smithling48, Reference Smithling50] for ramified unitary groups, and [Reference Smithling49] for split even orthogonal groups.
Proposition 5.3 For RZ spaces $\mathcal M_P$ of PEL type, the projections in condition ( $\diamondsuit $ ) exist and condition ( $\clubsuit $ ) is satisfied.
Proof It is clear that for $P\subseteq P'$ there is a projection morphism between the corresponding RZ spaces. Now, for the “naive” versions, the definition in terms of chains of p-divisible groups shows immediately that the morphism in question is a monomorphism. Since the irreducible components of the source are proper (cf. [Reference Rapoport and Zink46, Proposition 2.32]) and the source is locally of finite type over $\mathbf k$ [Reference Rapoport and Zink46, Theorem 3.25], it follows that the morphism is a closed immersion. The “flat” RZ spaces are closed formal subschemes of the naive RZ spaces, so the above property continues to hold.
5.3 RZ spaces of Hodge type
In the work of Kim [Reference Kim33] and Hamacher and Kim [Reference Hamacher and Kim19] where RZ spaces for data of Hodge type are constructed, the bijection $\mathcal M({\mathbf G}, \mu , b)_{P}(\mathbf k) \cong X(\mu , b)_P(\mathbf k)$ on $\mathbf k$ -valued points is an essential feature of the construction (see [Reference Hamacher and Kim19, Proposition 4.3.5]). Zhu [Reference Zhu64, Proposition 3.11] proved that this set-theoretical equality implies the above isomorphism of perfect schemes, using results of Gabber and Lau on Dieudonné theory over perfect rings to handle the case ${\mathbf G}=GL_n$ , and then embedding the general situation into a suitable $GL_n$ -situation. While in [Reference Zhu64] it was assumed that K is hyperspecial, the only reason for this assumption is that at the time of writing RZ spaces of Hodge type had been constructed only in this special situation; the paper [Reference Hamacher and Kim19] appeared only later.
Note, however, that the previous paragraph concerns only the situation for a fixed level P. Because of the way RZ spaces are defined in the Hodge-type situation, it is not clear that these bijections are compatible with the projection maps attached to a pair $P\subset P'$ of parahoric subgroups. In fact, the definition relies on embedding the situation into an RZ space of Siegel type (i.e., associated with a group of symplectic similitudes and hyperspecial level structure). However, it is not evident whether the result is independent of the choice of embedding, and it does not seem clear whether such embeddings can be chosen in a compatible way given $P\subset P'$ .
5.4 The weak Bruhat–Tits stratification
Next, we discuss the question of defining a (weak) Bruhat–Tits stratification on Rapoport–Zink spaces. Hence, we now restrict to the fully Hodge–Newton decomposable case. Then we have the weak Bruhat–Tits stratification on $X(\mu , b)_{P^{\circ }}$ (Section 2.4).
Since property ( $\diamondsuit $ ) involves P instead of $P^{\circ }$ , we first need to discuss the question of defining a weak BT stratification on $X(\mu , b)_P$ . To this end, we impose the following additional assumption:
( $\heartsuit $ ) The projection $X_{P^{\circ }} \rightarrow X_P$ is surjective.
See Section 5.5 for two sufficient criteria for this assumption. Of course, it is trivially satisfied if $P=P^{\circ }$ . It is also satisfied in those cases that have been studied in detail on the Shimura variety side (attached to ramified unitary groups). We do not know of an example where this property fails.
Let us denote by $X_{P^{\circ }, c}$ the “component” indexed by $c\in \pi _0(\mathrm {Gr}_{P^{\circ }})$ , i.e., $X_{P^{\circ }, c} = X_{P^{\circ }}\cap {\breve \kappa }^{-1}(c)$ . As explained in Section 5.5, $X_{P^{\circ }, c} = \emptyset $ unless $c\in \pi _0(\mathrm {Gr}_{P^{\circ }})^{\sigma }$ .
For each $c\in \pi _0(\mathrm {Gr}_{P^{\circ }})^{\sigma }$ , the projection $X_{P^{\circ }}\rightarrow X_{P}$ restricts to an isomorphism
Here, we denote by $\pi (c)$ the connected component of $\mathrm {Gr}_{P}$ , which is the image of c, and by $X_{P, \pi (c)}$ its intersection with $X_{P}$ . Then $\pi ^{-1}(X_{P, \pi (c)})$ is the union of the $X_{P^{\circ }, c'}$ where $c'$ ranges over the $P/P^{\circ }$ -orbit of c in $\pi _0(\mathrm {Gr}_{P^{\circ }})$ .
To establish that $\gamma _c$ above is an isomorphism, recall that each connected component of $\mathrm {Gr}_{P^{\circ }}$ maps isomorphically onto a connected component of $\mathrm {Gr}_{P}$ . Therefore, the only question is the surjectivity, which follows from ( $\heartsuit $ ).
Since all BT strata in $X_{P^{\circ }}$ are connected, each stratum lies in one of the components $X_{P^{\circ }, c}$ . Using the isomorphisms $\gamma _c$ , we obtain a Bruhat–Tits stratification on $X_{P}$ which is independent of the choice of c, as the following lemma shows.
Lemma 5.4 Let $c, c'\in \pi _0(\mathrm {Gr}_{P^{\circ }})^{\sigma }$ such that $c'c^{-1}\in P/P^{\circ }$ . Let $j Y(w) \subseteq X_{P^{\circ }, c'}$ be a BT stratum. Then
is a BT stratum.
Proof The map $\gamma _{c}^{-1}\circ \gamma _ {c'}$ is given by $gP^{\circ }\mapsto g(c')^{-1}cP^{\circ }$ . We may represent $(c')^{-1}c$ by an element of $P\cap N_G(T)(\breve F)$ that induces a length $0$ element in $\tilde W$ . To simplify the notation, we change notation and denote this element by c. Then we have $\sigma (c) = c$ inside $\tilde W$ (passing to a different representative, if necessary we could achieve the same property on the level of elements of $\breve G$ ).
If $g\in jY(w)$ , then $g^{-1}\tau \sigma (g)\in P^{\circ } \cdot _{\sigma } (\breve {\mathcal I} w\tau \breve {\mathcal I})$ . But then,
Now, $c^{-1}w\tau \sigma (c) = c^{-1} w\tau c$ is again an EKOR element for $P^{\circ }$ (i.e., it lies in ${}^K\tilde W\cap \operatorname {\mathrm{Adm}}(\mu )$ , where $K\subset \tilde {\mathbb {S}} $ denotes the set of simple affine reflections generating $P^{\circ }$ ).
Since the element $c^{-1} w\tau c$ is independent of g, the lemma follows.
We hence get a well-defined “Bruhat–Tits stratification” on $X_{P}$ and the projection $X_{P^{\circ }}\rightarrow X_{P}$ is compatible with the stratifications.
Corollary 5.5 In the fully Hodge–Newton decomposable case, the isomorphisms $X_c \rightarrow \pi (X_c)$ , for a connected component c of $\mathrm {Gr}_{P^{\circ }}$ , define a weak Bruhat–Tits stratification on $X_P$ (i.e., the stratification is independent of the choices of c).
Consider an inclusion $P\subset P'$ of facet stabilizers and the corresponding inclusion $P^{\circ } \subset {P'}^{\circ }$ . We obtain a commutative diagram
The sets of connected components of $\mathrm {Gr}_{P^{\circ }}$ and $\mathrm {Gr}_{{P'}^{\circ }}$ coincide. Fixing a connected component c of $\mathrm {Gr}_{P^{\circ }}$ and denoting by $c'$ its image in $\mathrm {Gr}_{{P'}^{\circ }}$ , we can restrict the above diagram to c and $c'$ , and obtain a diagram
(with $\pi '\colon \mathrm {Gr}_{{P'}^{\circ }}\rightarrow \mathrm {Gr}_{P'}$ the projection) where the vertical morphisms are isomorphisms. Since the upper horizontal map maps each BT stratum in $X_c$ isomorphically onto a BT stratum in $X_{c'}$ , the same holds true for the lower horizontal map.
We also record the following easy lemma.
Lemma 5.6 Let $\mathbf f$ , $\mathbf f_i$ , $i\in I$ , be facets of the base alcove (viewed as simplices, i.e., as subsets of the set of vertices of the base alcove). Denote by P, $P_i$ the facet stabilizers, and by $P^{\circ }$ , $P^{\circ }_i$ the corresponding parahoric subgroups. The following are equivalent:
-
(1) $\mathbf f = \bigcup _i \mathbf f_i$ .
-
(2) $P = \bigcap _i P_i$ .
-
(3) $P^{\circ } = \bigcap _i P^{\circ }_i$ .
Proof Clearly, (1) and (2) are equivalent. Furthermore, (3) and (1) are equivalent since a parahoric subgroup is determined by the set of affine simple reflections which it contains.
Now, we can invoke property ( $\diamondsuit $ ) and obtain that in the fully Hodge–Newton decomposable case and under the above assumptions, the spaces $\mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}^{p^{-\infty }}$ carry a weak BT stratification which is compatible with the weak BT stratification on $X_P$ via ( $\diamondsuit $ ). For an inclusion $P\subset P'$ , the corresponding map of RZ spaces maps each BT stratum in the source isomorphically (in the sense of perfect schemes) onto a BT stratum in the target.
Since the morphism $\mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}^{p^{-\infty }} \rightarrow \mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}^{\mathrm{red}}$ is a homeomorphism, we likewise get the following corollary.
Corollary 5.7 In the fully Hodge–Newton decomposable case and under assumptions ( $\diamondsuit $ ), ( $\clubsuit $ ), and ( $\heartsuit $ ), the spaces $\mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}$ carry a weak Bruhat–Tits stratification into locally closed reduced subschemes which is compatible with the weak BT stratification on $X_P$ via passing to perfections and the isomorphism ( $\diamondsuit $ ). For an inclusion $P\subset P'$ , the corresponding map of RZ spaces maps each BT stratum in the source homeomorphically onto a BT stratum in the target.
The perfection of each stratum is isomorphic to the perfection of a classical Deligne–Lusztig variety.
The goal of this section is to investigate the following property (under the assumption of full Hodge–Newton decomposability).
Property $BT_P$ : In the (weak) Bruhat–Tits stratification of $\mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}^{\mathrm{red}}$ , each stratum is isomorphic to the corresponding classical Deligne–Lusztig variety, without passing to the perfection. The closure of each stratum is isomorphic to the closure of this classical Deligne–Lusztig variety in the corresponding finite-dimensional partial flag variety.
Our result will be that if ( $BT_{P_i}$ ) holds for all i, then ( $BT_P$ ) also holds for $P=\cap _i P_i$ . Note that we need to use in the proof that we already know the result after perfection for level P.
For most maximal parahoric level structures P, property ( $BT_P$ ) has been established by now, and this result will allow us to deduce the same property for many nonmaximal level structures, and in particular for most level structures of Coxeter type. See Section 6 for a discussion of the individual cases.
Proposition 5.8 Let $({\mathbf G}, \mu )$ be fully Hodge–Newton decomposable. Assume that properties ( $\diamondsuit $ ), ( $\clubsuit $ ), and ( $\heartsuit $ ) hold. As above, let $P_i$ be facet stabilizers (of facets of the base alcove which are fixed by $\sigma $ ) such that $BT_{P_i}$ holds for all i. Then $BT_P$ holds for $P := \bigcap _i P_i$ .
Proof To shorten the notation, we write $\mathcal M_P$ for $\mathcal M({\mathbf G}, \mu , b)_{P, \mathbf k}^{\mathrm{red}}$ . The assumption implies that $\mathcal M_P$ has a Bruhat–Tits stratification as explained above. Consider a BT stratum $S\subset \mathcal M_P^{p^{-\infty }}$ (i.e., S is isomorphic to the perfection of a classical Deligne–Lusztig variety X). We denote by Y the reduced locally closed subscheme inside $\mathcal M_P$ corresponding to S.
For each i, S maps isomorphically onto a BT stratum $S_i \subset \mathcal M_{P}^{p^{-\infty }}$ . So $S_i$ is the perfection of the same DL variety X. By the assumption $BT_{P_i}$ , the stratum $Y_i \subset \mathcal M_{P_i}$ corresponding to $S_i$ is isomorphic to X.
Now, consider the closed embedding $\mathcal M_P \rightarrow \prod _i \mathcal M_{P_i}$ given by property ( $\clubsuit $ ). Restricting to Y, we obtain a closed embedding $Y\rightarrow \prod _i Y_i$ . Passing to perfections, we obtain an embedding $S\rightarrow \prod _i S_i$ .
Then the reduced closed subscheme of $\prod _i Y_i$ with underlying space S is isomorphic to the Deligne–Lusztig variety X: both are reduced locally closed subschemes, so it is enough to check that they have the same underlying topological space. However, this can be checked on the perfections, where we know the result from the group-theoretic situation.
This implies that the reduced subscheme Y of $\mathcal M_{P}$ with same topological space as S is isomorphic to X, as desired. Exactly the same reasoning works when we replace each of S, Y, $S_i$ , $Y_i$ with its closure.
5.5 Stabilizers versus parahoric subgroups
It remains to discuss in more detail in which cases assumption ( $\heartsuit $ ) is satisfied, i.e., when the map $X_{P^{\circ }}\rightarrow X_{P}$ is surjective. Let us briefly recall the setup.
We consider $({\mathbf G}, \mu )$ as in Section 5.1 (not necessarily fully Hodge–Newton decomposable) and $b\in B({\mathbf G}, \mu )$ . Let $P\subset \breve G$ be the stabilizer of a (poly-)simplex in the Bruhat–Tits building. We denote by $L\breve {G}$ the loop group attached to $\breve {G}$ , an ind-perfect scheme over $\mathbf k$ . The intersection $P^{\circ }$ of P with the kernel of the Kottwitz homomorphism $\breve {\kappa }\colon \breve G\rightarrow \pi _0(L\breve {G}) = \pi _1({\mathbf G})_{\Gamma _0}$ is a parahoric subgroup (and all parahoric subgroups arise in this way), but in general, $P^{\circ } \subset P$ is a proper normal subgroup.
Remark 5.9 This phenomenon occurs in several cases:
-
(i) The case of ramified unitary groups has been discussed in detail by Pappas and Rapoport in [Reference Pappas and Rapoport40, Section 4] and [Reference Pappas and Rapoport39, Sections 1.2 and 1.3].
-
(ii) For even special orthogonal groups (with hyperspecial level structure), see Yu’s notes [Reference Yu, Cunningham and Nevins62, Example 2.2.3.1].
-
(iii) Applying restriction of scalars along an unramified extension to the above examples, one can construct further examples, and in particular obtains examples where the action of $\sigma $ on $P/P^{\circ }$ is not trivial.
If ${\mathbf G}$ is semisimple and simply connected, or more generally if $\pi _0(L\breve {G})$ is torsion-free, then the finite subgroup $P/P^{\circ } \subseteq \pi _0(L\breve {G})$ must be trivial, so that we have $P^{\circ } = P$ .
In the sequel, we assume that $P^{\circ }$ is a standard parahoric subgroup, or in other words, that P is the stabilizer of a face of the base alcove. As mentioned above, the group-theoretic results obtained in this paper concern a priori the “parahoric setting,” but RZ spaces have been defined, so far, in the “stabilizer setting.” See also Remark 5.4 in the paper [Reference Hamacher and Kim19] by Hamacher and Kim.
Throughout the following discussion, we fix $\mu $ and $b\in B({\mathbf G}, \mu )$ , and we write $X_{P^{\circ }} = X(\mu , b)_{P^{\circ }}$ , and as above write $X_{P} := X(\mu , b)_{P} = \{ g\in \mathrm {Gr}_{P};\ g^{-1}b\sigma (g) \in P \operatorname {\mathrm{Adm}}(\mu ) P\}$ . We have
Lemma 5.10 (Cf. [Reference Hamacher and Kim19, Remark 5.4])
-
(i) If $v \in W_a$ , $\omega \in \Omega $ with $v\omega \in P$ , then $\omega \in P$ . (Here, we do not distinguish between elements of $\tilde W$ and representatives of these elements in $\breve G$ .)
-
(ii) For $\omega \in \Omega $ , we have $\omega \operatorname {\mathrm{Adm}}(\mu ) \omega ^{-1} = \operatorname {\mathrm{Adm}}(\mu )$ .
-
(iii) We have $P \operatorname {\mathrm{Adm}}(\mu ) P = P^{\circ } \operatorname {\mathrm{Adm}}(\mu ) P$ .
Proof To prove part (1), say that P is the stabilizer of the face $\mathbf f$ of the base alcove. Under the assumption that $v\omega $ stabilizes $\mathbf f$ , we need to show that the same is true for $\omega $ . Assume that $\omega \mathbf f \ne \mathbf f$ . Since $\omega $ preserves the base alcove, this means that $\omega \mathbf f$ is a face of a type different from the type of $\mathbf f$ . Because the action of $W_a$ preserves the type of faces, it is then impossible that $v\omega \mathbf f = \mathbf f$ .
For (2), note that conjugation preserves the orbit $W_0(\mu )$ and conjugation by length $0$ elements preserves the Bruhat order.
Now, we prove part (3). By part (1), we can write $P = \bigcup _p P^{\circ } p$ , where p runs through a system of representatives of $P/P^{\circ }$ given by (representatives of) length $0$ elements in $\tilde W$ . Since $P^{\circ }\subset P$ is normal, the claimed statement follows from (2).
Now, for $gP^{\circ }\in \mathrm {Gr}_{P^{\circ }}$ , we have
and
The lemma shows that $P \operatorname {\mathrm{Adm}}(\mu ) P = P^{\circ } \operatorname {\mathrm{Adm}}(\mu ) P$ , and we obtain that
Recall that our assumption $b\in B({\mathbf G}, \mu )$ implies in particular that b and $\mu $ have the same image in $\pi _1({\mathbf G})_{\Gamma }$ , where $\Gamma = \operatorname {\mathrm{Gal}}(\overline {F}/F)$ is the absolute Galois group of F. Since $\breve \kappa $ takes its images in the coinvariants for the (smaller) inertia group $\Gamma _0$ , this does, however, not necessarily entail $\breve \kappa (b)=\breve \kappa (\mu )$ . We choose an element $c_{b,\mu }\in \pi _1({\mathbf G})_{\Gamma _0}$ such that $\sigma (c_{b,\mu })-c_{b,\mu } = \breve {\kappa }(\mu )-\breve {\kappa }(b)$ . Such an element exists because the difference $\breve {\kappa }(\mu )-\breve {\kappa }(b)$ vanishes in $\pi _1({\mathbf G})_{\Gamma }$ , as we just recalled. See also [Reference He and Zhou28, Section 6].
By the previous equation, an element $gP^{\circ } \in \pi ^{-1}(X_{P})$ lies in $X_{P^{\circ }}$ if and only if $\breve {\kappa }(g^{-1}b\sigma (g)) = \breve {\kappa }(\mu )$ , or equivalently $\sigma (\breve {\kappa }(g))-\breve {\kappa }(g)= \breve {\kappa }(\mu )-\breve {\kappa }(b) = \sigma (c_{b,\mu })-c_{b,\mu }$ . We thus have
Choosing a (set-theoretic) section of the projection $\pi _0(\mathrm {Gr}_{P^{\circ }}) \rightarrow \pi _0(\mathrm {Gr}_{P}) = \pi _0(\mathrm {Gr}_{P^{\circ }})/(P/P^{\circ })$ , we obtain a section $\iota \colon \mathrm {Gr}_{P} \rightarrow \mathrm {Gr}_{P^{\circ }}$ of $\pi $ , and then can identify
i.e., $\mathrm {Gr}_{P^{\circ }}$ is isomorphic to a disjoint union of copies of $\mathrm {Gr}_{P}$ . (That the union is in fact disjoint follows since $P/P^{\circ }$ acts freely on $\pi _0(\mathrm {Gr}_{P^{\circ }})$ , which in turn follows from the fact that $P^{\circ } = P \cap \operatorname {\mathrm{Ker}}(\breve {\kappa })$ .)
Proposition 5.11 Assume that we can decompose $\pi _0(\mathrm {Gr}_{P^{\circ }})$ as a product $P/P^{\circ } \times C$ for some subgroup C, such that $\sigma $ preserves this product decomposition, and fix such an identification $\pi _0(\mathrm {Gr}_{P^{\circ }}) = P/P^{\circ } \times C$ . Write $c_{b,\mu } = (c', c")$ according to this decomposition.
This choice defines a section $\iota \colon \mathrm {Gr}_{P}\rightarrow \mathrm {Gr}_{P^{\circ }}$ of $\pi $ , and an identification
With respect to this decomposition, the space $X_{P^{\circ }}$ is a disjoint union of copies of $X_{P}$ . More precisely,
In particular, in this situation, assumption ( $\heartsuit $ ) is satisfied.
Proof The decomposition $\pi _0(\mathrm {Gr}_{P^{\circ }})=P/P^{\circ } \times C$ gives us a section $\pi _0(\mathrm {Gr}_{P}) \cong C \rightarrow \pi _0(\mathrm {Gr}_{P^{\circ }})$ to which we apply the above discussion. We then have an isomorphism $\bigsqcup _{c\in C} (\mathrm {Gr}_{P^{\circ }})_c \rightarrow \mathrm {Gr}_{P}$ . Here, $(\mathrm {Gr}_{P^{\circ }})_c = \breve {\kappa }^{-1}(c)$ denotes the connected component corresponding to c. We obtain a section $\iota \colon \mathrm {Gr}_{P}\rightarrow \mathrm {Gr}_{P^{\circ }}$ of $\pi $ , which in turn gives us the identification
Now, fix $p\in P/P^{\circ }$ and let $gP^{\circ }\in \pi ^{-1}(X_P) \cap \iota (\mathrm {Gr}_{P})p$ . We have $\breve {\kappa }(g) = (p, c) \in P/P^{\circ } \times C$ for some $c\in C$ . Then $gP^{\circ }\in \pi ^{-1}(X_P)$ implies $c \in c" + C^{\sigma }$ . Thus, the condition $\breve {\kappa }(g) \in c_{b,\mu } + \pi _0(\mathrm {Gr}_{P^{\circ }})^{\sigma }$ is equivalent to $p\in c' + (P/P^{\circ })^{\sigma }$ .
Let us also investigate when we have equality $X_{P^{\circ }} = \pi ^{-1}(X_{P})$ .
Lemma 5.12 The following conditions are equivalent:
-
(1) For all $x\in \pi _0(\mathrm {Gr}_{P^{\circ }})$ with $x - \sigma (x) \in P/P^{\circ }$ , we have $x = \sigma (x)$ .
-
(2) We have that $\sigma $ fixes all elements of $P/P^{\circ }$ , and that the sequence
$$\begin{align*}0 \longrightarrow P/P^{\circ} \longrightarrow (\pi_0(\mathrm{Gr}_{P^{\circ}}))^{\sigma} \longrightarrow C^{\sigma} \longrightarrow 0 \end{align*}$$is exact, where C denotes the quotient of $\pi _0(\mathrm {Gr}_{P^{\circ }})$ by $P/P^{\circ }$ .
Proof This is a purely group-theoretic reformulation which uses only that $P/P^{\circ }$ is a $\sigma $ -invariant subgroup of the abelian group $\pi _0(\mathrm {Gr}_{P^{\circ }})$ .
Note that the conditions in the lemma are satisfied, for example, in the following cases:
-
(a) The group $\pi _0(\mathrm {Gr}_{P^{\circ }})$ is a direct product $\pi _0(\mathrm {Gr}_{P^{\circ }}) = C\times P/P^{\circ }$ for some subgroup C, the operation of $\sigma $ preserves this product decomposition, and all elements of $P/P^{\circ }$ are fixed by $\sigma $ .
-
(b) The action of $\sigma $ on $\pi _0(\mathrm {Gr}_{P^{\circ }})$ is trivial. (For instance, this holds if ${\mathbf G}$ splits over a totally ramified extension of F.)
Proposition 5.13 If the equivalent conditions of the previous lemma are satisfied, then the diagram
is Cartesian, i.e., $X_{P^{\circ }} = \pi ^{-1}(X_{P})$ . In particular, in this situation, assumption ( $\heartsuit $ ) is satisfied.
Proof In view of the above discussion, we need to show that $\breve {\kappa }(g) \in c_{b,\mu }+\pi _0(\mathrm {Gr}_{P^{\circ }})^{\sigma }$ for all $gP^{\circ }\in \pi ^{-1}(X_{P})$ . However, those g satisfy
which yields
Thus, part (1) in the above lemma implies that $\breve {\kappa }(g) - c_{b,\mu }\in \pi _0(\mathrm {Gr}_{P^{\circ }})^{\sigma }$ , as desired.
Remark 5.14 Questions related to Propositions 5.11 and 5.13 are also discussed in [Reference Pappas and Rapoport41], in particular Proposition 4.3.7 and Remark 4.3.8.
6 Known results, new results, and open cases
As in the previous section, we assume that $({\mathbf G}, \mu )$ comes from a Rapoport–Zink space and $({\mathbf G}, \mu , K)$ is of Coxeter type.
6.1 Discussion of individual cases
Table 3 lists the cases that come from Shimura varieties and where $\mu $ is noncentral in each $\overline {F}$ -factor of ${\mathbf G}$ . As we stated before, it has been checked in many cases that the strata of the Bruhat–Tits stratification are classical Deligne–Lusztig varieties (also before passing to perfections), and that the closures of strata are isomorphic to the closures of these classical Deligne–Lusztig varieties in the corresponding finite-dimensional partial flag varieties. The strategy in the papers cited below consists of the following steps: define a set-theoretic bijection in terms of Dieudonné theory, extend it to a morphism of schemes using Zink’s display theory, and check that one obtains an identification of schemes using Zariski’s main theorem and the normality of Deligne–Lusztig varieties. In all cases, the stratification is given by viewing the Bruhat–Tits building of the group ${\mathbf J}$ in terms of lattices (“vertex lattices”), and defining the strata by considering relative positions of Dieudonné modules and such vertex lattices. This means that Dieudonné theory gives a suitable way to set up the identifications ( $\diamondsuit $ ), and that then the BT stratification as defined in this paper coincides with the stratifications defined in the papers mentioned below.
The meaning of the symbols used in the table is as follows:
Below, we give further remarks on some of the cases.
6.1.1. $(A_n, \omega _1^{\vee }, \emptyset )$ —Harris–Taylor type
The automorphism $\tau $ acts by rotation
on the affine Dynkin diagram. We have $\dim X(\mu , \tau )_K= 0$ for all $K\subsetneq \tilde {\mathbb {S}} $ , so the set-theoretic bijection defining the Bruhat–Tits stratification is automatically an isomorphism before passing to the perfection. This case arises from unitary Shimura varieties attached to groups that split over $\mathbb Q_p$ . Cf. the book [Reference Harris and Taylor21] by Harris and Taylor.
6.1.2. $({}^d A_{d-1}, \omega ^{\vee }_1, \emptyset )$ —Drinfeld case
The automorphism $\tau $ is the same as in Section 6.1, so the composition ${\mathrm{Ad}}(\tau )\circ \sigma $ is the identity. The only rational level structure is $K=\emptyset $ . We have $\dim X(\mu , \tau )= n-1$ , and in this case, the set $B({\mathbf G}, \mu )$ has only one element: the basic locus equals the whole Shimura variety. The description given by Drinfeld ([Reference Drinfeld9], see also [Reference Rapoport and Zink46, Theorem 3.72] and the subsequent discussion) of the RZ space as a formal scheme (that can be constructed by gluing pieces indexed by simplices in the Bruhat–Tits building of the corresponding group ${\mathbf J}$ [split of type $A_{n-1}$ ]) shows in particular that property $BT_{\emptyset }$ holds.
The maximal elements in $\operatorname {\mathrm{Adm}}(\mu )$ are
The automorphism ${\mathrm{Ad}}(\tau )\circ \sigma $ is the identity map on $\tilde {\mathbb {S}} $ , and in particular the $\sigma $ -support of an element is simply the support of $w\tau ^{-1}$ . From this description, one sees that each element $w\in \operatorname {\mathrm{Adm}}(\mu )$ is determined by its $\sigma $ -support $\operatorname {\mathrm{supp}}_{\sigma }(w)$ , and that for all K the order $\leqslant _{K,\sigma }$ (see Section 2.4) coincides with the Bruhat order on ${}^K\tilde W$ .
The individual strata of the BT stratification are isomorphic to classical Deligne–Lusztig varieties in products of general linear groups. In fact, for each w, the ambient group is the reductive group with Dynkin diagram given by $\operatorname {\mathrm{supp}}_{\sigma }(w)$ (i.e., all other vertices and all edges that involve one of those are discarded) with Frobenius action given by ${\mathrm{Ad}}(\tau )\circ \sigma = \mathrm {id} $ .
For each w, the index set for the strata of type w is given by ${\mathbf J}(F)/({\mathbf J}(F)\cap \mathcal P_w)$ , where $\mathcal P_w$ is the standard parahoric subgroup generated by $\operatorname {\mathrm{supp}}_{\sigma }(w) \sqcup I(K, w, \sigma )$ , and where $I(K, w, \sigma )$ is the subset of K comprising all those vertices which are not connected to $\operatorname {\mathrm{supp}}_{\sigma }(w)$ (cf. Lemma 2.6).
6.1.3. $({}^2 A^{\prime }_{2m}, \omega ^{\vee }_1, \mathbb {S} )$ —Odd unramified unitary group case
The automorphism $\tau $ is the same as in Section 6.1, so the composition ${\mathrm{Ad}}(\tau )\circ \sigma $ acts as the reflection
The only level structure of Coxeter type, $K=\tilde {\mathbb {S}} - \{ s_0\}$ , is hyperspecial, and it was shown in [Reference Vollaard and Wedhorn56] that property $BT_K$ holds.
6.1.4. $({}^2 A^{\prime }_{2m+1}, \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{0, {m+1}\})$ —Even unramified unitary group case
The automorphism $\tau $ is the same as in Section 6.1, so the composition ${\mathrm{Ad}}(\tau )\circ \sigma $ acts as the reflection
In this case, $\sigma $ fixes two vertices in the affine Dynkin diagram, namely $0$ and m. Both level structures $\tilde {\mathbb {S}} - \{ s_0\}$ and $\tilde {\mathbb {S}} - \{ s_m\}$ are hyperspecial, and we can again apply the results of [Reference Vollaard and Wedhorn56] to see that property $BT_K$ holds for K hyperspecial. It then follows from Proposition 5.8 that the same is true for $K=\tilde {\mathbb {S}} - \{ s_0, s_m\}$ . Note that Propositions 5.1 and 5.3 ensure that ( $\diamondsuit $ ) and ( $\clubsuit $ ) hold in this case. The sets $\mathcal A(\mu )_P$ and $\operatorname {\mathrm{Adm}}(\mu )_P$ consist only of the double coset of $t^{\mu }$ . Since P is a parahoric subgroup, ( $\heartsuit $ ) is satisfied trivially.
Let us make the case of level structure $\tilde {\mathbb {S}} - \{ s_0, s_m\})$ more explicit: as listed in Table 1, we have
so this index set depends on two parameters $i, j$ and in particular an element of ${}^{K}\!{\mathrm{Adm}}_0$ is not determined by its length in general. (As before, we set $s_{2m+2} := s_0$ and use the interval notation defined in the caption of Table 1.)
For $w = s_{[2m+2, 2m+2-i]}s_{[m+1, m+1-j]}\tau $ , we have
Note that the condition $i+j \leqslant m-2$ ensures that this is a proper subset of $\tilde {\mathbb {S}} $ . We see that each element is determined by its $\sigma $ -support. We write the $\sigma $ -support as a disjoint union of two intervals (possibly empty, depending on the choice of i and j) which are disconnected in the Dynkin diagram. The individual strata of the BT stratification are classical Deligne–Lusztig varieties in the group (over the finite residue class field of F) specified by the Dynkin diagram $\operatorname {\mathrm{supp}}_{\sigma }(w)$ , i.e., a product of two unitary groups (or just one if one of the intervals is empty).
For each w, the index set for the strata of type w is given by ${\mathbf J}(F)/({\mathbf J}(F)\cap \mathcal P_w)$ , where $\mathcal P_w$ is the standard parahoric subgroup generated by $\operatorname {\mathrm{supp}}_{\sigma }(w) \sqcup I(K, w, \sigma )$ , and where $I(K, w, \sigma )$ can be described as in Lemma 2.6.
Remark 6.1 Cho [Reference Cho7] studied the unramified unitary group case for nonhyperspecial maximal parahoric level structure. This case is fully HN decomposable, but not Coxeter type.
6.1.5. $(A_3, \omega ^{\vee }_2, \{1, 2\})$
This case corresponds to a “split $U(2, 2)$ .” Note that for each $i\in \mathbb Z / 4\mathbb Z$ , $(\tilde {A}_{3}, \mathrm {id} , \omega _2^{\vee }, \tilde {\mathbb {S}} - \{ s_i, s_{i+1}\})$ is isomorphic to the above datum. For every i, the level structure $\tilde {\mathbb {S}} - \{ s_i\}$ is hyperspecial and for those cases it was shown by Fox [Reference Fox10] that the Bruhat–Tits strata are isomorphic to Deligne–Lusztig varieties before perfection.
Note that Propositions 5.1 and 5.3 ensure that ( $\diamondsuit $ ) and ( $\clubsuit $ ) hold in this case. It is well known that the sets $\mathcal A(\mu )_P$ and $\operatorname {\mathrm{Adm}}(\mu )_P$ coincide in this case. Since P is a parahoric subgroup, ( $\heartsuit $ ) is satisfied trivially. Hence, Proposition 5.8 can be applied.
6.1.6. $(B$ - $C_n, \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0, s_n\})$ —Even ramified unitary group case
To apply Proposition 5.8, we need to check that assumptions ( $\diamondsuit $ ), ( $\clubsuit $ ), and ( $\heartsuit $ ) are satisfied. For the first two, we can use Propositions 5.1 and 5.3. In fact, it follows from Smithling’s paper [Reference Smithling50] that $\mathcal A(\mu )_P = \operatorname {\mathrm{Adm}}(\mu )_P$ . It follows from Proposition 5.11 that ( $\heartsuit $ ) is satisfied.
6.1.7. $(C$ - $BC_n, \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{s_0, s_n\})$ —Odd ramified unitary group case
To apply Proposition 5.8, we need to check that assumptions ( $\diamondsuit $ ), ( $\clubsuit $ ), and ( $\heartsuit $ ) are satisfied. For the first two, we can use Propositions 5.1 and 5.3. In fact, it follows from Smithling’s paper [Reference Smithling48] that $\mathcal A(\mu )_P = \operatorname {\mathrm{Adm}}(\mu )_P$ . In this case, P is a parahoric subgroup, and hence ( $\heartsuit $ ) is satisfied.
6.1.8. $(C$ - $BC_2, \omega ^{\vee }_2, \{s_0\})$
In terms of the affine Weyl group, this is the case $(\tilde {C}_2, \mathrm {id} , \omega ^{\vee }_2)$ . Let $K = \{ s_0\}$ , one of the two minimal level structures of Coxeter type (the other one being $\{ s_2\}$ —while the two enhanced Coxeter data are isomorphic, this is not true for the enhanced Tits data, because for them we must take the orientation of the affine Dynkin diagram into account). We have
The automorphism ${\mathrm{Ad}}(\tau ) = {\mathrm{Ad}}(\tau )\circ \sigma $ is given by interchanging $s_0$ and $s_2$ , and fixing $s_1$ . The strata are points (for $w=\tau $ ), and classical Deligne–Lusztig varieties in $SL_2$ (for $w=s_1\tau $ , which has $\operatorname {\mathrm{supp}}_{\sigma }(w)= \{ s_1\}$ ) and in the restriction of scalars of $SL_2$ along a quadratic extension (for $w=s_2\tau $ , which has $\operatorname {\mathrm{supp}}_{\sigma }(w)=\{s_0, s_2\}$ ), respectively.
For each w, the index set for the strata of type w is given by ${\mathbf J}(F)/({\mathbf J}(F)\cap \mathcal P_w)$ , where $\mathcal P_w$ is the standard parahoric subgroup generated by $\operatorname {\mathrm{supp}}_{\sigma }(w)$ , since in this case, $I(K, w, \sigma ) = \emptyset $ for all $w\in {}^{K}\!{\mathrm{Adm}}(\mu )_0$ (cf. Lemma 2.6).
Remark 6.2 In [Reference Wang59], Wang studied the ${}^2C_2$ case with Iwahori level structure.
6.1.9. $(D_n, \omega ^{\vee }_1, \tilde {\mathbb {S}} -\{0, n\})$
Since this case is not of PEL type, it is less clear that the RZ spaces for the different level structures (as defined by Kim [Reference Kim33] and Howard and Pappas [Reference Howard and Pappas31] in the hyperspecial case, and by Hamacher and Kim [Reference Hamacher and Kim19] in general) are compatible in the sense of assumption ( $\clubsuit $ ) above. Once this has been established, the result of Howard and Pappas together with Proposition 5.8 implies the result for level structure $\tilde {\mathbb {S}} -\{s_0, s_n\}$ .
7 Smoothness of closures of strata
In this section, we study the smoothness of the closures of strata in cases of Coxeter type.
Studying the smoothness requires us to consider the actual schemes rather than perfect schemes. The natural deperfection (in the mixed characteristic case) is given by (fine) classical Deligne–Lusztig varieties and their closures in the corresponding partial flag variety, as recalled below. In particular, the discussion applies
-
• in the equal characteristic case, and
-
• in cases of RZ spaces where property $BT_P$ is satisfied (Section 5.4).
Recall the Bruhat–Tits stratification as in Section 2.4. Fix a Coxeter type case $({\mathbf G}, \mu , K)$ and an element $w\in {}^K\!{\mathrm{Cox}}(\mu )$ . Write $w':= w\tau ^{-1}$ .
Recall from Section 2.4 that we denote by $\mathcal P^{\flat }_w$ the standard parahoric subgroup generated by $\operatorname {\mathrm{supp}}_{\sigma }(w)$ . The quotient $\mathcal P^{\flat }_w/\breve {\mathcal I}$ is naturally identified with the full flag variety $\overline {G}/\overline {B}$ for the maximal reductive quotient $\overline {G}$ of the special fiber of the parahoric group scheme attached to $\mathcal P_w^{\flat }$ and its standard Borel $\overline {B}$ . Let $\overline {W}$ be the Weyl group of $\overline {G}$ (for the maximal torus induced from our choice of maximal torus in ${\mathbf G}$ ). The automorphism ${\mathrm{Ad}}(\tau )\circ \sigma $ induces an automorphism on $\overline {G}$ , $\overline {W}$ , and so on, which we will denote by $\bar \sigma $ . The Dynkin diagram of $\overline {G}$ is $\operatorname {\mathrm{supp}}_{\sigma }(w)=\operatorname {\mathrm{supp}}_{\bar \sigma }(w')$ (i.e., we keep from the affine Dynkin diagram of ${\mathbf G}$ the vertices lying in $\operatorname {\mathrm{supp}}_{\sigma }(w)$ and the edges involving only vertices in this subset).
The parahoric subgroup generated by $\operatorname {\mathrm{supp}}_{\sigma }(w)\cap K$ induces a parabolic subgroup $\overline {Q}\subseteq \overline {G}$ . By abuse of notation, we also denote by $\overline {Q}$ the corresponding set of simple reflections inside $\overline {W}$ so that we have $\overline {W}_{\overline {Q}}$ , the set ${}^{\overline {Q}}\overline {W}$ of minimal length representatives, and so on.
The strata of type w are isomorphic to the “fine” Deligne–Lusztig variety
The isomorphism between the BT stratum and $Y(w)$ extends to an isomorphism between the closure of the stratum and the closure of $Y(w)$ inside $\overline {G}/{\overline {Q}}$ .
7.1 Relating Deligne–Lusztig varieties and Schubert varieties
We first show that the singularities in the closure of $Y(w)$ in $\overline {G}/{\overline {Q}}$ are smoothly equivalent to singularities in a certain Schubert variety. To do so, we need to show that the closure of $Y(w)$ in $\overline {G}/{\overline {Q}}$ equals the closure of a certain “coarse” Deligne–Lusztig variety. It was already explained in [Reference Görtz and He13] how to do so, but for the sake of completeness, we repeat the short argument here.
Let
We have $Y(w) \subseteq X_{\overline {Q}}(w')$ , and hence we get the same inclusion for the closures. Because $\overline {W}$ is generated by $\operatorname {\mathrm{supp}}_{\sigma }(w)$ , $X_Q(w')$ is irreducible. To show the equality $\overline {Y(w)} = \overline {X_{\overline {Q}}(w')}$ of the two closures, it is therefore enough to show that $\dim Y(w)$ equals $\dim X_{\overline {Q}}(w')$ , i.e., that
where $w_{0, {\overline {Q}}}$ denotes the longest element of $\overline {W}_{\overline {Q}}$ .
Since $\ell (w) = \ell (w_{{\overline {Q}},0}w') - \ell (w_{{\overline {Q}},0})$ , this is equivalent to saying that $w_{{\overline {Q}}, 0}w'$ is the longest element of $\overline {W}_Qw'\overline {W}_{\bar \sigma (Q)}$ , or equivalently that $w'$ is the longest element of ${}^{\overline {Q}} \overline {W} \cap \overline {W}_{\overline {Q}}w'\overline {W}_{\bar \sigma ({\overline {Q}})}$ .
The truth of this final statement can be established by a case-by-case check for all cases in Table 1, which we omit here.
The closure $\overline {X_{\overline {Q}}(w')}$ is smoothly equivalent to the closure of ${\overline {Q}}w' \bar \sigma ({\overline {Q}}) /\bar \sigma ({\overline {Q}}) \subseteq \overline {G}/\bar \sigma ({\overline {Q}})$ , a Schubert variety in the partial flag variety $\overline {G}/\bar \sigma ({\overline {Q}})$ . In fact, let ${\mathrm{pr}}\colon \overline {G}\rightarrow \overline {G}/\overline {Q}$ denote the projection, and consider the Lang map $L\colon \overline {G} \rightarrow \overline {G}$ , $g\mapsto g^{-1}\bar \sigma (g)$ . Then L is a finite étale morphism, and we obtain a diagram
of surjective smooth morphisms, which restricts to a diagram
that shows the claimed smooth equivalence. See also [Reference Görtz11, Section 2].
7.2 Smoothness of Schubert varieties
It remains to analyze the smoothness of the Schubert varieties in partial flag varieties which arise in our situation. This type of question has been extensively studied. Let us list the few results that we will use below.
-
(1) All closures of Schubert varieties are normal. In particular, if its dimension is $\leqslant 1$ , then it is smooth. The consequence for us is that all closures of strata of dimension $\leqslant 1$ , i.e., whenever $\ell (w)\leqslant 1$ , are smooth.
-
(2) If S is itself a $\overline {Q}$ -orbit (or more generally, a homogeneous space under any parabolic subgroup—i.e., if the boundary in the sense of Section 7.3 is empty), then S is smooth. In the cases relevant to us, if $w'\in \overline {W}_{\bar \sigma (\overline {Q})}$ , then $\overline {Q}w' \bar \sigma (\overline {Q})/\bar \sigma (\overline {Q}) = \overline {Q} \bar \sigma (\overline {Q})/\bar \sigma (\overline {Q})$ is closed in $\overline {G}/\bar \sigma (\overline {Q})$ . This settles almost all cases where $\overline {Q}\ne \bar \sigma (\overline {Q})$ .
-
(3) The case where $\overline {Q}$ is the maximal parabolic subgroup attached to a minuscule or cominuscule weight has been studied in [Reference Brion and Polo6, Reference Lakshmibai and Weyman36]. See Section 7.3 for further details.
-
(4) Every Schubert variety in the full flag variety attached to a Coxeter element is smooth. For instance, this follows because in this case the Schubert variety is isomorphic to its Bott–Samelson resolution.
There is another method to check the smoothness of Schubert varieties in the full flag variety of a classical group, building on the pattern avoidance criteria in terms of the corresponding Weyl group element, viewed as a permutation (see [Reference Billey and Lakshmibai4, Chapter 8]). Furthermore, every $\overline {Q}$ -Schubert variety in $\overline {G}/\bar \sigma (\overline {Q})$ is smoothly equivalent to its inverse image in $\overline {G}/\overline {B}$ , the Schubert variety for the maximal element in $\overline {Q} w' \bar \sigma (\overline {Q})$ . We will not resort to this method here.
7.3 The case of a (co-)minuscule parabolic
Assume that $\overline {G}$ is absolutely simple. Since we will apply the following results in the situation that $\bar \sigma (\overline {Q})=\overline {Q}$ we change notation slightly here and consider Schubert varieties in $\overline {G}/\overline {Q}$ .
We briefly state here the answers to the smoothness question in the case of a parabolic subgroup $\overline {Q}$ attached to a minuscule weight (in other words, the simple reflections in $\overline {Q}$ are precisely those stabilizing a fixed minuscule weight), and in the case of Dynkin type $C_n$ and $\overline {Q}$ the Siegel parabolic, i.e., the parabolic that belongs to the unique minuscule coweight (the “cominuscule case”; cf. [Reference Brion and Polo6, Reference Lakshmibai and Weyman36]).
We can start out slightly more generally than before and consider closures of $\overline {B}$ -orbits in $\overline {G}/\overline {Q}$ , rather than only closures of $\overline {Q}$ -orbits. Given a Schubert variety $S = \overline {{\overline {B}}v\overline {Q}/\overline {Q}} \subseteq \overline {G}/\overline {Q}$ , let $\overline {Q}' \subseteq \overline {G}$ be its stabilizer, and let the boundary $\mathop {\mathrm{Bd}}(S)$ be the complement of the open $\overline {Q}'$ -orbit in S (cf. [Reference Brion and Polo6]). Note that $\overline {Q}'$ can be different from B or $\overline {Q}$ ! Of course, if S is the closure of a $\overline {Q}$ -orbit, then $\overline {Q}\subseteq \overline {Q}'$ , so if in addition $\overline {Q}$ is maximal and $S\ne \overline {G}/\overline {Q}$ , then $\overline {Q}=\overline {Q}'$ .
Note that in [Reference Görtz and He13], the end of the proof of Proposition 7.3.2, we incorrectly claim that [Reference Brion and Polo6, Proposition 3.3] would imply that for all Schubert varieties $\overline {C}_{\overline {Q}, v}$ for $\overline {G}$ of type B and $\overline {Q}$ the minuscule maximal parabolic, the singular locus equals the boundary in the above sense.
7.3.1. The simply laced case
If $\overline {G}$ is simply laced, i.e., the Dynkin diagram has no multiple edges or in other words, the Dynkin type is A, D, or E, then by [Reference Brion and Polo6, Proposition 3.3] the boundary $\mathop {\mathrm{Bd}}(S)$ equals the singular locus of S.
7.3.2. Type B, minuscule case
Let ${\overline {G}}$ be of type $B_n$ (say ${\overline {G}}$ is the special orthogonal group $SO_{2n-1}$ over some algebraically closed field), fix a maximal torus and a Borel subgroup, and let ${\overline {Q}}$ be the maximal parabolic subgroup attached to the unique minuscule weight $\omega _n$ .
The result of [Reference Lakshmibai and Weyman36, Section 5] in this case is the following. We can identify ${}^{\overline {Q}}{\overline {W}}$ with the set
(In other words, precisely one of i, $2n-i$ occurs, and $n+1$ must not occur.)
To an element $(d_i)_i$ , we attach a partition $(a_1, \dots , a_n)$ by the rule
Then $(a_i)_i$ is a self-dual partition fitting into an $n\times n$ square.
Now, [Reference Lakshmibai and Weyman36, Corollary 5.10] states that the Schubert variety corresponding to $(d_i)_i$ is smooth if and only if the partition $(a_i)_i$ is either a square, or a hook.
Among all the Schubert varieties, precisely the $n+1$ cases in the table below correspond to ${\overline {Q}}$ -orbit closures:
There is exactly one line where the Schubert variety is smooth but has nonempty boundary, namely the “hook” in Line 2 of the table.
7.3.3. Type C, minuscule case
Let ${\overline {G}}$ be of type $C_n$ (to pin things down, let us say ${\overline {G}}$ is the symplectic group attached to a symplectic vector space V of dimension $2n$ over some algebraically closed field, $n\geqslant 2$ ), fix a maximal torus and a Borel subgroup, and let ${\overline {Q}}$ be the maximal parabolic subgroup attached to the unique minuscule weight $\omega _1$ ; it is the stabilizer of a line $L\subset V$ . Since ${\overline {G}}$ acts transitively on the set of all lines in V, we can identify ${\overline {G}}/{\overline {Q}}$ with the projective space $\mathbb P(V)$ . There are $2n$ different ${\overline {B}}$ -orbits in ${\overline {G}}/{\overline {Q}}$ , and their closures are projective spaces of dimension $0,\dots , 2n-1 = \dim \mathbb P(V)$ —the same Schubert varieties which arise from $GL(V) \cong GL_{2n}$ and ${\overline {Q}}' \subset GL_{2n}$ the maximal parabolic ${\mathrm{Stab}}_{GL(V)}(L)$ . In particular, in this situation, all Schubert varieties are smooth.
Note though that the Schubert variety of dimension $d=2n-2$ (i.e., codimension $1$ ) is the set of all lines in $L^{\perp }$ and is therefore fixed by Q. In other words, it is equal to a ${\overline {Q}}$ -orbit closure, and its stabilizer is equal to ${\overline {Q}}$ . It follows that the boundary in the above sense is not empty.
7.3.4. Type C, cominuscule case
In this case, [Reference Brion and Polo6, Proposition 4.4] shows that for all Schubert varieties the boundary equals the singular locus.
7.4 Results for the individual cases
Now, we restrict to the situation of Theorem 1.4, Table 1. Note that the answer to our question will depend not only on the Coxeter datum, but also on the actual group over the finite field, i.e., on the orientation of the Dynkin diagram.
Theorem 7.1 Assume that ${\mathbf G}$ is quasi-simple over F and $\mu $ is noncentral in every $\breve F$ -simple component. Suppose that the triple $({\mathbf G}, \mu , K)$ is of Coxeter type. Then all closures of BT strata in are smooth, except for the following cases:
-
(1) the case $(\tilde A_{n-1}, \mathrm {id} , \omega _1^{\vee } + \omega _{n-1}^{\vee }, \tilde {\mathbb {S}} -\{0\})$ for $n \geqslant 4$ ,
-
(2) $\dim X(\mu , \tau )_K \geqslant 2$ and at least one of the long roots of the affine Dynkin diagram is not contained in K.
Remark 7.2 In this theorem, long roots and short roots occur in the affine Dynkin diagrams whose associated affine Weyl groups are of type $\tilde B$ or $\tilde C$ . In these cases, $\dim X(\mu , \tau )_K<2$ only when the enhanced Coxeter datum is of type $(\tilde C_2, \mathrm {id} , \omega _2^{\vee }, \{0\})$ .
Proof In the following cases, all strata have dimension $0$ or $1$ : $(\tilde A_{n-1}, \mathrm {id} , \omega _1^{\vee }, K)$ , $(\tilde A_1, \mathrm {id} , 2 \omega _1^{\vee }, K)$ , $(\tilde A_3, \mathrm {id} , \omega _2^{\vee }, K)$ , $(\tilde C_2, \mathrm {id} , \omega _2^{\vee }, K)$ .
In the following cases, the corresponding Schubert variety has empty boundary (cf. Section 7.2(2)): $(\tilde A_{2m}, \varsigma _0, \omega _1^{\vee }, K)$ , $ (\tilde A_{2m+1}, \varsigma _0, \omega _1^{\vee }, K)$ , $(\tilde A_{n-1} \times \tilde A_{n-1}, {}^1 \varsigma _0, (\omega _1^{\vee }, \omega _{n-1}^{\vee }), K)$ , $(\tilde A_3, \varsigma _0, \omega _2^{\vee }, K)$ , $(\tilde D_n, \mathrm {id} , \omega _1^{\vee }, K)$ , $(\tilde D_n, \varsigma _0, \omega _1^{\vee }, K)$ . In fact, it is enough to check the condition $w'\in \overline {W}_{\bar {\sigma }(\overline {Q})}$ for the minimal possible K, because for larger K, there are fewer elements $w'$ to check, and $\overline {Q}$ becomes larger.
In the case $(\tilde A_{n-1}, \varrho _{n-1}, \omega _1^{\vee }, \emptyset )$ , the closures of strata are smoothly equivalent to Schubert varieties attached to Coxeter elements in a full flag variety, and hence smooth (see Section 7.2(4)).
In the cases $(\tilde B_n, {\mathrm{Ad}}(\tau _1), \omega _1^{\vee }, \tilde {\mathbb {S}} -\{n\})$ and $(\tilde C_2, {\mathrm{Ad}}(\tau _2), \omega _2^{\vee }, \{0, 2\})$ , we have, depending on the orientation of the affine Dynkin diagram, Schubert varieties in a group of type B for a minuscule parabolic (leading to the smooth case in Line 2 of the table in Section 7.3), or in a group of type $C_n$ for the cominuscule maximal parabolic (singular, unless of dimension $\leqslant 1$ ).
In the case $(\tilde B_n, \mathrm {id} , \omega _1^{\vee }, \tilde {\mathbb {S}} -\{0, n\})$ , the situation decomposes as a product, because for all $w\in {}^K\!{\mathrm{Cox}}(\mu )$ which do not lie in ${}^{K'}\mathop {\mathrm{Cox}}(\mu )$ for a larger $K'$ , $\operatorname {\mathrm{supp}}_{\sigma }(w)$ is a disjoint union of two Dynkin diagrams, the group $\overline {G}$ accordingly decomposes as a product, and the situation can be analyzed by handling the two components separately. One factor contributes a smooth Schubert variety with empty boundary, and the other factor gives a Schubert variety of type B or C, as in the previous paragraph. In the case $(\tilde C_n, \mathrm {id} , \omega _1^{\vee }, \tilde {\mathbb {S}} -\{0, n\})$ , the situation decomposes as a product where both factors are of type B or C, depending on the orientation of the affine Dynkin diagram, as in the previous paragraph. If the level K is a maximal proper subset of $\tilde {\mathbb {S}} $ , then only one of the two factors of this product decomposition occurs. The statement on the smoothness then follows from our discussion above.
Regarding the case $(\tilde A_{n-1}, \mathrm {id} , \omega _1^{\vee } + \omega _{n-1}^{\vee }, \tilde {\mathbb {S}} -\{0\})$ , see Example 7.3.
For Deligne–Lusztig varieties in orthogonal groups, cf. also Oki’s paper [Reference Oki38, Section 6]. Note that this theorem corrects a few omissions of smooth cases in [Reference Görtz and He13, Section 7] and the erratum, and the incorrect claim regarding nonsmoothness in type B in [Reference Görtz and He13].
Example 7.3 Type $(\tilde A_{n-1}, \mathrm {id} , \omega _1^{\vee } + \omega _{n-1}^{\vee }, \tilde {\mathbb {S}} -\{0\})$ .
If $n=3$ , then ${}^{K}\!{\mathrm{Adm}}(\mu )_0=\{1, s_0, s_0 s_1, s_0 s_2\}$ . In this case, the closures of strata are smoothly equivalent to Schubert varieties attached to Coxeter elements in a full flag variety, and hence smooth (see Section 7.2(4)).
If $n \geqslant 4$ , then $s_0 s_1 s_{n-1} \in {}^{K}\!{\mathrm{Adm}}(\mu )_0$ . In this case, the Dynkin diagram of $\overline {G}$ is of type $A_3$ and consists of $s_0, s_1$ , and $s_{n-1}$ , and the parabolic subgroup $\overline {Q}$ is the standard parabolic subgroup of $\overline {G}$ associated with $\{s_1, s_{n-1}\}$ . The Schubert variety $\overline {Q} s_0 s_1 s_{n-1} \overline {Q}/\overline {Q}$ has nonempty boundary, and hence is nonsmooth by Section 7.3. In fact, one may show by the same argument that the closure of a BT stratum of type $w \in {}^{K}\!{\mathrm{Adm}}(\mu )_0$ is nonsmooth if and only if $s_0 s_1 s_{n-1} \leqslant w$ .
Example 7.4 Type $(\tilde C_n, \mathrm {id} , \omega _1^{\vee }, \tilde {\mathbb {S}} -\{0, n\})$ .
Let $-1 \leqslant i \leqslant j-2 \leqslant n-1$ and $w=s_{[i, 0]} ^{-1} s_{[n, j]}$ . Then $\operatorname {\mathrm{supp}}(w)=\mathbb {S} _1 \sqcup \mathbb {S} _2$ , where $\mathbb {S} _1=\{0, 1, \ldots , i\}$ and $\mathbb {S} _2=\{j, j+1 ,\ldots , n\}$ . Set $w_1=s_{[i, 0]} ^{-1}$ and $w_2=s_{[n, j]}$ (where we use the interval notation defined in the caption of Table 2). Let $\overline {G}_1$ and $\overline {G}_2$ be the connected semisimple groups associated with $\mathbb {S} _1$ and $\mathbb {S} _2$ , respectively. Here, if $i=-1$ , then $\mathbb {S} _1=\emptyset $ and $\overline {G}_1$ is the trivial group. Similarly, if $j=n+1$ , then $\overline {G}_2$ is the trivial group. Let $\overline {B}_1$ and $\overline {B}_2$ be the standard Borel subgroups of $\overline {G}_1$ and $\overline {G}_2$ , respectively. Let $\overline {Q}_1 \subset \overline {G}_1$ be the standard parabolic subgroup of type $\mathbb {S} _1-\{0\}$ , and let $\overline {Q}_2 \subset \overline {G}_2$ be the standard parabolic subgroup of type $\mathbb {S} _2-\{n\}$ . Then the closure of a stratum of type w is isomorphic to $X_1 \times X_2$ , where $X_i=\{g \in \overline {G}_i/\overline {Q}_i; g ^{-1} \bar \sigma (g) \in \overline {\overline {Q}_i w_i \overline {Q}_i}\}$ .
By Sections 7.3 and 7.3, $X_1$ is smooth if and only if $\{0\}$ is not the long root in the Dynkin diagram of $\mathbb {S} _1$ or $i=0$ or $1$ . Similarly, $X_2$ is smooth if and only if $\{n\}$ is not the long root in the Dynkin diagram of $\mathbb {S} _2$ or $j=n$ or $n+1$ . Hence, the closure of all BT strata are smooth if and only if both $0$ and n are short roots in the affine Dynkin diagram of ${\mathbf G}$ .
Acknowledgment
We would like to thank George Pappas for answering questions on parahoric subgroups, Chao Li for pointing us to a mistake in [Reference Görtz and He13, Section 7] (cf. Section 7), and Michael Rapoport for pointing out an inaccuracy in a previous version. We would also like to thank the referees for their comments.