To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Consider a subcritical branching Markov chain. Let $Z_n$ denote the counting measure of particles of generation n. Under some conditions, we give a probabilistic proof for the existence of the Yaglom limit of $(Z_n)_{n\in\mathbb{N}}$ by the moment method, based on the spinal decomposition and the many-to-few formula. As a result, we give explicit integral representations of all quasi-stationary distributions of $(Z_n)_{n\in\mathbb{N}}$, whose proofs are direct and probabilistic, and do not rely on Martin boundary theory.
This paper investigates the asymptotic properties of parameter estimation for the Ewens–Pitman partition with parameters $0\lt\alpha\lt1$ and $\theta\gt-\alpha$. Specifically, we show that the maximum-likelihood estimator (MLE) of $\alpha$ is $n^{\alpha/2}$-consistent and converges to a variance mixture of normal distributions, where the variance is governed by the Mittag-Leffler distribution. Moreover, we show that a proper normalization involving a random statistic eliminates the randomness in the variance. Building on this result, we construct an approximate confidence interval for $\alpha$. Our proof relies on a stable martingale central limit theorem, which is of independent interest.
We prove the central limit theorem (CLT), the first-order Edgeworth expansion and a mixing local central limit theorem (MLCLT) for Birkhoff sums of a class of unbounded heavily oscillating observables over a family of full-branch piecewise $C^2$ expanding maps of the interval. As a corollary, we obtain the corresponding results for Boolean-type transformations on $\mathbb {R}$. The class of observables in the CLT and the MLCLT on $\mathbb {R}$ include the real part, the imaginary part and the absolute value of the Riemann zeta function. Thus obtained CLT and MLCLT for the Riemann zeta function are in the spirit of the results of Lifschitz & Weber [Sampling the Lindelöf hypothesis with the Cauchy random walk. Proc. Lond. Math. Soc. (3)98 (2009), 241–270] and Steuding [Sampling the Lindelöf hypothesis with an ergodic transformation. RIMS Kôkyûroku BessatsuB34 (2012), 361–381] who have proven the strong law of large numbers for sampling the Lindelöf hypothesis.
QuickSelect (also known as Find), introduced by Hoare ((1961) Commun. ACM4 321–322.), is a randomized algorithm for selecting a specified order statistic from an input sequence of $n$ objects, or rather their identifying labels usually known as keys. The keys can be numeric or symbol strings, or indeed any labels drawn from a given linearly ordered set. We discuss various ways in which the cost of comparing two keys can be measured, and we can measure the efficiency of the algorithm by the total cost of such comparisons.
We define and discuss a closely related algorithm known as QuickVal and a natural probabilistic model for the input to this algorithm; QuickVal searches (almost surely unsuccessfully) for a specified population quantile $\alpha \in [0, 1]$ in an input sample of size $n$. Call the total cost of comparisons for this algorithm $S_n$. We discuss a natural way to define the random variables $S_1, S_2, \ldots$ on a common probability space. For a general class of cost functions, Fill and Nakama ((2013) Adv. Appl. Probab.45 425–450.) proved under mild assumptions that the scaled cost $S_n / n$ of QuickVal converges in $L^p$ and almost surely to a limit random variable $S$. For a general cost function, we consider what we term the QuickVal residual:
\begin{equation*} \rho _n \,{:\!=}\, \frac {S_n}n - S. \end{equation*}
The residual is of natural interest, especially in light of the previous analogous work on the sorting algorithm QuickSort (Bindjeme and Fill (2012) 23rd International Meeting on Probabilistic, Combinatorial, and Asymptotic Methods for the Analysis of Algorithms (AofA'12), Discrete Mathematics, and Theoretical Computer Science Proceedings, AQ, Association: Discrete Mathematics and Theoretical Computer Science, Nancy, pp. 339–348; Neininger (2015) Random Struct. Algorithms46 346–361; Fuchs (2015) Random Struct. Algorithms46 677–687; Grübel and Kabluchko (2016) Ann. Appl. Probab.26 3659–3698; Sulzbach (2017) Random Struct. Algorithms50 493–508). In the case $\alpha = 0$ of QuickMin with unit cost per key-comparison, we are able to calculate–àla Bindjeme and Fill for QuickSort (Bindjeme and Fill (2012) 23rd International Meeting on Probabilistic, Combinatorial, and Asymptotic Methods for the Analysis of Algorithms (AofA'12), Discrete Mathematics and Theoretical Computer Science Proceedings, AQ, Association: Discrete Mathematics and Theoretical Computer Science, Nancy, pp. 339–348.)–the exact (and asymptotic) $L^2$-norm of the residual. We take the result as motivation for the scaling factor $\sqrt {n}$ for the QuickVal residual for general population quantiles and for general cost. We then prove in general (under mild conditions on the cost function) that $\sqrt {n}\,\rho _n$ converges in law to a scale mixture of centered Gaussians, and we also prove convergence of moments.
The Hawkes process is a popular candidate for researchers to model phenomena that exhibit a self-exciting nature. The classical Hawkes process assumes the excitation kernel takes an exponential form, thus suggesting that the peak excitation effect of an event is immediate and the excitation effect decays towards 0 exponentially. While the assumption of an exponential kernel makes it convenient for studying the asymptotic properties of the Hawkes process, it can be restrictive and unrealistic for modelling purposes. A variation on the classical Hawkes process is proposed where the exponential assumption on the kernel is replaced by integrability and smoothness type conditions. However, it is substantially more difficult to conduct asymptotic analysis under this setup since the intensity process is non-Markovian when the excitation kernel is non-exponential, rendering techniques for studying the asymptotics of Markov processes inappropriate. By considering the Hawkes process with a general excitation kernel as a stationary Poisson cluster process, the intensity process is shown to be ergodic. Furthermore, a parametric setup is considered, under which, by utilising the recently established ergodic property of the intensity process, consistency of the maximum likelihood estimator is demonstrated.
Competing and complementary risk (CCR) problems are often modelled using a class of distributions of the maximum, or minimum, of a random number of independent and identically distributed random variables, called the CCR class of distributions. While CCR distributions generally do not have an easy-to-calculate density or probability mass function, two special cases, namely the Poisson–exponential and exponential–geometric distributions, can easily be calculated. Hence, it is of interest to approximate CCR distributions with these simpler distributions. In this paper, we develop Stein’s method for the CCR class of distributions to provide a general comparison method for bounding the distance between two CCR distributions, and we contrast this approach with bounds obtained using a Lindeberg argument. We detail the comparisons for Poisson–exponential, and exponential–geometric distributions.
We investigate geometric properties of invariant spatio-temporal random fields $X\colon\mathbb M^d\times \mathbb R\to \mathbb R$ defined on a compact two-point homogeneous space $\mathbb M^d$ in any dimension $d\ge 2$, and evolving over time. In particular, we focus on chi-squared-distributed random fields, and study the large-time behavior (as $T\to +\infty$) of the average on [0,T] of the volume of the excursion set on the manifold, i.e. of $\lbrace X(\cdot, t)\ge u\rbrace$ (for any $u >0$). The Fourier components of X may have short or long memory in time, i.e. integrable or non-integrable temporal covariance functions. Our argument follows the approach developed in Marinucci et al. (2021) and allows us to extend their results for invariant spatio-temporal Gaussian fields on the two-dimensional unit sphere to the case of chi-squared distributed fields on two-point homogeneous spaces in any dimension. We find that both the asymptotic variance and limiting distribution, as $T\to +\infty$, of the average empirical volume turn out to be non-universal, depending on the memory parameters of the field X.
The Wright–Fisher model, originating in Wright (1931) is one of the canonical probabilistic models used in mathematical population genetics to study how genetic type frequencies evolve in time. In this paper we bound the rate of convergence of the stationary distribution for a finite population Wright–Fisher Markov chain with parent-independent mutation to the Dirichlet distribution. Our result improves the rate of convergence established in Gan et al. (2017) from $\mathrm{O}(1/\sqrt{N})$ to $\mathrm{O}(1/N)$. The results are derived using Stein’s method, in particular, the prelimit generator comparison method.
We show that $\alpha $-stable Lévy motions can be simulated by any ergodic and aperiodic probability-preserving transformation. Namely we show that: for $0<\alpha <1$ and every $\alpha $-stable Lévy motion ${\mathbb {W}}$, there exists a function f whose partial sum process converges in distribution to ${\mathbb {W}}$; for $1\leq \alpha <2$ and every symmetric $\alpha $-stable Lévy motion, there exists a function f whose partial sum process converges in distribution to ${\mathbb {W}}$; for $1< \alpha <2$ and every $-1\leq \beta \leq 1$ there exists a function f whose associated time series is in the classical domain of attraction of an $S_\alpha (\ln (2), \beta ,0)$ random variable.
In this paper, we study the asymptotic behavior of the generalized Zagreb indices of the classical Erdős–Rényi (ER) random graph G(n, p), as $n\to\infty$. For any integer $k\ge1$, we first give an expression for the kth-order generalized Zagreb index in terms of the number of star graphs of various sizes in any simple graph. The explicit formulas for the first two moments of the generalized Zagreb indices of an ER random graph are then obtained from this expression. Based on the asymptotic normality of the numbers of star graphs of various sizes, several joint limit laws are established for a finite number of generalized Zagreb indices with a phase transition for p in different regimes. Finally, we provide a necessary and sufficient condition for any single generalized Zagreb index of G(n, p) to be asymptotic normal.
We introduce the exponentially preferential recursive tree and study some properties related to the degree profile of nodes in the tree. The definition of the tree involves a radix $a\gt 0$. In a tree of size $n$ (nodes), the nodes are labeled with the numbers $1,2, \ldots ,n$. The node labeled $i$ attracts the future entrant $n+1$ with probability proportional to $a^i$.
We dedicate an early section for algorithms to generate and visualize the trees in different regimes. We study the asymptotic distribution of the outdegree of node $i$, as $n\to \infty$, and find three regimes according to whether $0 \lt a \lt 1$ (subcritical regime), $a=1$ (critical regime), or $a\gt 1$ (supercritical regime). Within any regime, there are also phases depending on a delicate interplay between $i$ and $n$, ramifying the asymptotic distribution within the regime into “early,” “intermediate” and “late” phases. In certain phases of certain regimes, we find asymptotic Gaussian laws. In certain phases of some other regimes, small oscillations in the asymototic laws are detected by the Poisson approximation techniques.
Let X be the sum of a diffusion process and a Lévy jump process, and for any integer $n\ge 1$ let $\phi_n$ be a function defined on $\mathbb{R}^2$ and taking values in $\mathbb{R}$, with adequate properties. We study the convergence of functionals of the type
where [x] is the integer part of the real number x and the sequences $(\Delta_n)$ and $(\alpha_n)$ tend to 0 as $n\to +\infty$. We then prove the law of large numbers and establish, in the case where $\frac{\alpha_n}{\sqrt{\Delta_n}}$ converges to a real number in $[0,+\infty)$], a new central limit theorem which generalizes that in the case where X is a continuous Itô’s semimartingale.
We consider the count of subgraphs with an arbitrary configuration of endpoints in the random-connection model based on a Poisson point process on ${\mathord{\mathbb R}}^d$. We present combinatorial expressions for the computation of the cumulants and moments of all orders of such subgraph counts, which allow us to estimate the growth of cumulants as the intensity of the underlying Poisson point process goes to infinity. As a consequence, we obtain a central limit theorem with explicit convergence rates under the Kolmogorov distance and connectivity bounds. Numerical examples are presented using a computer code in SageMath for the closed-form computation of cumulants of any order, for any type of connected subgraph, and for any configuration of endpoints in any dimension $d{\geq} 1$. In particular, graph connectivity estimates, Gram–Charlier expansions for density estimation, and correlation estimates for joint subgraph counting are obtained.
We model voting behaviour in the multi-group setting of a two-tier voting system using sequences of de Finetti measures. Our model is defined by using the de Finetti representation of a probability measure (i.e. as a mixture of conditionally independent probability measures) describing voting behaviour. The de Finetti measure describes the interaction between voters and possible outside influences on them. We assume that for each population size there is a (potentially) different de Finetti measure, and as the population grows, the sequence of de Finetti measures converges weakly to the Dirac measure at the origin, representing a tendency toward weakening social cohesion as the population grows large. The resulting model covers a wide variety of behaviours, ranging from independent voting in the limit under fast convergence, a critical convergence speed with its own pattern of behaviour, to a subcritical convergence speed which yields a model in line with empirical evidence of real-world voting data, contrary to previous probabilistic models used in the study of voting. These models can be used, e.g., to study the problem of optimal voting weights in two-tier voting systems.
We consider uniformly random lozenge tilings of simply connected polygons subject to a technical assumption on their limit shape. We show that the edge statistics around any point on the arctic boundary, that is not a cusp or tangency location, converge to the Airy line ensemble. Our proof proceeds by locally comparing these edge statistics with those for a random tiling of a hexagon, which are well understood. To realize this comparison, we require a nearly optimal concentration estimate for the tiling height function, which we establish by exhibiting a certain Markov chain on the set of all tilings that preserves such concentration estimates under its dynamics.
We consider interacting urns on a finite directed network, where both sampling and reinforcement processes depend on the nodes of the network. This extends previous research by incorporating node-dependent sampling and reinforcement. We classify the sampling and reinforcement schemes, as well as the networks on which the proportion of balls of either colour in each urn converges almost surely to a deterministic limit. We also investigate conditions for achieving synchronisation of the colour proportions across the urns and analyse fluctuations under specific conditions on the reinforcement scheme and network structure.
Let $\{\omega _n\}_{n\geq 1}$ be a sequence of independent and identically distributed random variables on a probability space $(\Omega , \mathcal {F}, \mathbb {P})$, each uniformly distributed on the unit circle $\mathbb {T}$, and let $\ell _n=cn^{-\tau }$ for some $c>0$ and $0<\tau <1$. Let $I_{n}=(\omega _n,\omega _n+\ell _n)$ be the random interval with left endpoint $\omega _n$ and length $\ell _n$. We study the asymptotic property of the covering time $N_n(x)=\sharp \{1\leq k\leq n: x\in I_k\}$ for each $x\in \mathbb {T}$. We prove the quenched central limit theorem for the covering time, that is, $\mathbb {P}$-almost surely,
Gut and Stadmüller (2021, 2022) initiated the study of the elephant random walk with limited memory. Aguech and El Machkouri (2024) published a paper in which they discuss an extension of the results by Gut and Stadtmüller (2022) for an ‘increasing memory’ version of the elephant random walk without stops. Here we present a formal definition of the process that was hinted at by Gut and Stadtmüller. This definition is based on the triangular array setting. We give a positive answer to the open problem in Gut and Stadtmüller (2022) for the elephant random walk, possibly with stops. We also obtain the central limit theorem for the supercritical case of this model.
Early investigation of Pólya urns considered drawing balls one at a time. In the last two decades, several authors have considered multiple drawing in each step, but mostly for schemes involving two colors. In this manuscript, we consider multiple drawing from urns of balls of multiple colors, formulating asymptotic theory for specific urn classes and addressing more applications. The class we consider is affine and tenable, built around a ‘core’ square matrix. We examine cases where the urn is irreducible and demonstrate its relationship to matrix irreducibility for its core matrix, with examples provided. An index for the drawing schema is derived from the eigenvalues of the core. We identify three regimes: small, critical, and large index. In the small-index regime, we find an asymptotic Gaussian law. In the critical-index regime, we also find an asymptotic Gaussian law, albeit with a difference in the scale factor, which involves logarithmic terms. In both of these regimes, we have explicit forms for the structure of the mean and the covariance matrix of the composition vector (both exact and asymptotic). In all three regimes we have strong laws.
We establish a Central Limit Theorem for tensor product random variables $c_k:=a_k \otimes a_k$, where $(a_k)_{k \in \mathbb {N}}$ is a free family of variables. We show that if the variables $a_k$ are centered, the limiting law is the semi-circle. Otherwise, the limiting law depends on the mean and variance of the variables $a_k$ and corresponds to a free interpolation between the semi-circle law and the classical convolution of two semi-circle laws.