We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings.
To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Consider an inhomogeneous Poisson process X on [0, T] whose unknown intensity function ‘switches' from a lower function g∗ to an upper function h∗ at some unknown point θ∗. What is known are continuous bounding functions g and h such that g∗(t) ≤ g(t) ≤ h(t) ≤ h∗(t) for 0 ≤ t ≤ T. It is shown that on the basis of n observations of the process X the maximum likelihood estimate of θ∗ is consistent for n →∞, and also that converges in law and in pth moment to limits described in terms of the unknown functions g∗ and h∗.
We introduce the notion of weakly approaching sequences of distributions, which is a generalization of the well-known concept of weak convergence of distributions. The main difference is that the suggested notion does not demand the existence of a limit distribution. A similar definition for conditional (random) distributions is presented. Several properties of weakly approaching sequences are given. The tightness of some of them is essential. The Cramér-Lévy continuity theorem for weak convergence is generalized to weakly approaching sequences of (random) distributions. It has several applications in statistics and probability. A few examples of applications to resampling are given.
We define a class of anticipative flows on Poisson space and compute its Radon-Nikodym derivative. This result is applied to statistical testing in an anticipative queueing problem.
This paper arose from interest in assessing the quality of random number generators. The problem of testing randomness of a string of binary bits produced by such a generator gained importance with the wide use of public key cryptography and the need for secure encryption algorithms. All such algorithms are based on a generator of (pseudo) random numbers; the testing of such generators for randomness became crucial for the communications industry where digital signatures and key management are vital for information processing.
The concept of approximate entropy has been introduced in a series of papers by S. Pincus and co-authors. The corresponding statistic is designed to measure the degree of randomness of observed sequences. It is based on incremental contrasts of empirical entropies based on the frequencies of different patterns in the sequence. Sequences with large approximate entropy must have substantial fluctuation or irregularity. Alternatively, small values of this characteristic imply strong regularity, or lack of randomness, in a sequence. Pincus and Kalman (1997) evaluated approximate entropies for binary and decimal expansions of e, π, √2 and √3 with the surprising conclusion that the expansion of √3 demonstrated much less irregularity than that of π. Tractable small sample distributions are hardly available, and testing randomness is based, as a rule, on fairly long strings. Therefore, to have rigorous statistical tests of randomness based on this approximate entropy statistic, one needs the limiting distribution of this characteristic under the randomness assumption. Until now this distribution remained unknown and was thought to be difficult to obtain. To derive the limiting distribution of approximate entropy we modify its definition. It is shown that the approximate entropy as well as its modified version converges in distribution to a χ2-random variable. The P-values of approximate entropy test statistics for binary expansions of e, π and √3 are plotted. Although some of these values for √3 digits are small, they do not provide enough statistical significance against the randomness hypothesis.
This paper considers a branching process generated by an offspring distribution F with mean m < ∞ and variance σ2 < ∞ and such that, at each generation n, there is an observed δ-migration, according to a binomial law Bpvn*Nnbef which depends on the total population size Nnbef. The δ-migration is defined as an emigration, an immigration or a null migration, depending on the value of δ, which is assumed constant throughout the different generations. The process with δ-migration is a generation-dependent Galton-Watson process, whereas the observed process is not in general a martingale. Under the assumption that the process with δ-migration is supercritical, we generalize for the observed migrating process the results relative to the Galton-Watson supercritical case that concern the asymptotic behaviour of the process and the estimation of m and σ2, as n → ∞. Moreover, an asymptotic confidence interval of the initial population size is given.
Is the Ewens distribution the only one-parameter family of partition structures where the total number of types sampled is a sufficient statistic? In general, the answer is no. It is shown that all counterexamples can be generated via an urn scheme. The urn scheme need only satisfy two general conditions. In fact, the conditions are both necessary and sufficient. However, in particular, for a large class of partition structures that naturally arise in the infinite alleles theory of population genetics, the Ewens distribution is the only one in this class where the total number of types is sufficient for estimating the mutation rate. Finally, asymptotic sufficiency for parametric families of partition structures is discussed.
In this paper, we consider the question of which convergence properties of Markov chains are preserved under small perturbations. Properties considered include geometric ergodicity and rates of convergence. Perturbations considered include roundoff error from computer simulation. We are motivated primarily by interest in Markov chain Monte Carlo algorithms.
A two-term Edgeworth expansion for the distribution of an M-estimator of a simple linear regression parameter is obtained without assuming any Cramér-type conditions. As an application, it is shown that certain modification of the naive bootstrap procedure is second order correct even when the error variables have a lattice distribution. This is in marked contrast with the results of Singh on the sample mean of independent and identically distributed random variables.
A Bayesian approach for analyzing layered defense systems is presented. This approach incorporates the dependence of penetration probabilities on the size of attackers going into any layer. A general formula is developed for computing the predictive distribution of the number of attackers surviving any layer as well as the posterior distribution of the penetration probabilities under the a priori assumptions that: (i) the probabilities are dependent and their joint distribution is Dirichlet, and (ii) the probabilities are independent. Positive dependence of the penetration probabilities as well as the number of attackers surviving the different layers is also established.
In the Bayesian estimation of higher-order Markov transition functions on finite state spaces, a prior distribution may assign positive probability to arbitrarily high orders. If there are n observations available, we show (for natural priors) that, with probability one, as n → ∞ the Bayesian posterior distribution ‘discriminates accurately' for orders up to β log n, if β is smaller than an explicitly determined β0. This means that the ‘large deviations' of the posterior are controlled by the relative entropies of the true transition function with respect to all others, much as the large deviations of the empirical distributions are governed by their relative entropies with respect to the true transition function. An example shows that the result can fail even for orders β log n if β is large.
Under the assumptions of the neutral infinite alleles model, K (the total number of alleles present in a sample) is sufficient for estimating θ (the mutation rate). This is a direct result of the Ewens sampling formula, which gives a consistent, asymptotically normal estimator for θ based on K. It is shown that the same estimator used to estimate θ under neutrality is consistent and asymptotically normal, even when the assumption of selective neutrality is violated.
Consider a sequence of possibly dependent random variables having the same marginal distribution F, whose tail 1−F is regularly varying at infinity with an unknown index − α < 0 which is to be estimated. For i.i.d. data or for dependent sequences with the same marginal satisfying mixing conditions, it is well known that Hill's estimator is consistent for α−1 and asymptotically normally distributed. The purpose of this paper is to emphasize the central role played by the tail empirical process for the problem of consistency. This approach allows us to easily prove Hill's estimator is consistent for infinite order moving averages of independent random variables. Our method also suffices to prove that, for the case of an AR model, the unknown index can be estimated using the residuals generated by the estimation of the autoregressive parameters.
The stationary distribution for the population frequencies under an infinite alleles model is described as a random sequence (x1, x2, · ··) such that Σxi = 1. Likelihood ratio theory is developed for random samples drawn from such populations. As a result of the theory, it is shown that any parameter distinguishing an infinite alleles model with selection from the neutral infinite alleles model cannot be consistently estimated based on gene frequencies at a single locus. Furthermore, the likelihood ratio (neutral versus selection) converges to a non-trivial random variable under both hypotheses. This shows that if one wishes to test a completely specified infinite alleles model with selection against neutrality, the test will not obtain power 1 in the limit.
We study convergence in total variation of non-stationary Markov chains in continuous time and apply the results to the image analysis problem of object recognition. The input is a grey-scale or binary image and the desired output is a graphical pattern in continuous space, such as a list of geometric objects or a line drawing. The natural prior models are Markov point processes found in stochastic geometry. We construct well-defined spatial birth-and-death processes that converge weakly to the posterior distribution. A simulated annealing algorithm involving a sequence of spatial birth-and-death processes is developed and shown to converge in total variation to a uniform distribution on the set of posterior mode solutions. The method is demonstrated on a tame example.
Consider a spatial point pattern realized from an inhomogeneous Poisson process on a bounded Borel set , with intensity function λ (s; θ), where . In this article, we show that the maximum likelihood estimator and the Bayes estimator are consistent, asymptotically normal, and asymptotically efficient as the sample region . These results extend asymptotic results of Kutoyants (1984), proved for an inhomogeneous Poisson process on [0, T] , where T →∞. They also formalize (and extend to the multiparameter case) results announced by Krickeberg (1982), for the spatial domain . Furthermore, a Cramér–Rao lower bound is found for any estimator of θ. The asymptotic properties of and are considered for modulated (Cox (1972)), and linear Poisson processes.
We describe a computational procedure for evaluating the quasi-stationary distributions of a continuous-time Markov chain. Our method, which is an ‘iterative version' of Arnoldi's algorithm, is appropriate for dealing with cases where the matrix of transition rates is large and sparse, but does not exhibit a banded structure which might otherwise be usefully exploited. We illustrate the method with reference to an epidemic model and we compare the computed quasi-stationary distribution with an appropriate diffusion approximation.
This paper considers the histogram of unit cell size built up from m independent observations on a Poisson (μ) distribution. The following question is addressed: what is the limiting probability of the event that there are no unoccupied cells lying to the left of occupied cells of the histogram? It is shown that the probability of there being no such isolated empty cells (or isolated finite groups of empty cells) tends to unity as the number m of observations tends to infinity, but that the corresponding almost sure convergence fails. Moreover this probability does not tend to unity when the Poisson distribution is replaced by the negative binomial distribution arising when μ is randomized by a gamma distribution. The relevance to empirical Bayes statistical methods is discussed.
We consider the likelihood ratio tests to detect an epidemic alternative in the following two cases of normal observations: (1) the alternative specifies a square wave drift in the mean value of an i.i.d. sequence; (2) the alternative permits a square wave drift in the intercept of a simple linear regression model. To develop the approximations for the significance levels leads us to consider boundary-crossing problems of some two-dimensional discrete-time Gaussian fields. By the method which was proposed originally by Woodroofe (1976) and adapted to study maxima of some random fields by Siegmund (1988), some large deviations for the conditional non-linear boundary-crossing probabilities are developed. Some results of Monte Carlo experiments confirm the accuracy of these approximations.
Let {Xn, n ≧ 1} be an i.i.d. sequence of positive random variables with a continuous distribution function having a regularly varying upper tail. Denote by {X(n), n ≧ 1} the corresponding sequence of record values. We introduce two statistics based on the sequence of successive record values and investigate their asymptotic behaviour. We also give some numerical results.
The paper deals with a problem which arises in the Monte Carlo optimization of steady state or ergodic systems which can be modelled by Markov chains. The transition probability depends on a parameter, and one wishes to find the parameter value at which some performance function is minimum. The only available data are obtained from either simulation or actual operating information. For such a problem one needs good statistical estimates of the derivatives. Conditions are given for the existence of the derivative of the stationary measure with respect to the parameter, in the sense that the derivative is a signed measure, and is the limit of the natural approximating sequence. Some properties and a useful characterization of the derivative are obtained. It is also shown that, under appropriate conditions, the derivative of the n-step transition function converges to the derivative of the stationary measure as n tends to ∞. This latter result is of particular importance whether one is simply estimating or is actually optimizing via some sequential Monte Carlo procedure, since the basic observations are always taken over a finite time interval.