Hostname: page-component-745bb68f8f-hvd4g Total loading time: 0 Render date: 2025-01-27T03:13:01.512Z Has data issue: false hasContentIssue false

Probabilistic models of DNA sequence evolution with context dependent rates of substitution

Published online by Cambridge University Press:  19 February 2016

Jens Ledet Jensen*
Affiliation:
Aarhus University
Anne-Mette Krabbe Pedersen*
Affiliation:
Aarhus University
*
Postal address: Department of Theoretical Statistics, Institute of Mathematics, Ny Munkegade, DK-8000 Aarhus C, Denmark. Email address: jlj@imf.au.dk
∗∗ Postal address: Department of Genetics and Ecology, Institute of Biological Sciences, Ny Munkegade, DK-8000 Aarhus C, Denmark. Email address: annemet@imf.au.dk

Abstract

We consider Markov processes of DNA sequence evolution in which the instantaneous rates of substitution at a site are allowed to depend upon the states at the sites in a neighbourhood of the site at the instant of the substitution. We characterize the class of Markov process models of DNA sequence evolution for which the stationary distribution is a Gibbs measure, and give a procedure for calculating the normalizing constant of the measure. We develop an MCMC method for estimating the transition probability between sequences under models of this type. Finally, we analyse an alignment of two HIV-1 gene sequences using the developed theory and methodology.

Type
General Applied Probability
Copyright
Copyright © Applied Probability Trust 2000 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Felsenstein, J. (1981). Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17, 368376.CrossRefGoogle ScholarPubMed
Felsenstein, J. and Churchill, G. (1996). A hidden Markov model approach to variation among sites in rate of evolution. Mol. Biol. Evol. 13, 93104.CrossRefGoogle ScholarPubMed
Geyer, C. J. (1996). Likelihood inference for spatial point processes. In Proc. Seminaire Europeen de Statistique, Stochastic Geometry, Likelihood and Computation, eds Barndorff-Nielsen, O., Kendall, W. S. and van Lieshout, M. N. M. Chapman and Hall, New York.Google Scholar
Goldman, N. (1993). Statistical tests of models of DNA substitutions. J. Mol. Evol. 36, 182198.CrossRefGoogle Scholar
Goldman, N. and Yang, Z. (1994). A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol. Biol. Evol. 11, 725736.Google ScholarPubMed
Von Haeseler, A. and Schöniger, M. (1998). Evolution of DNA or amino acid sequences with dependent sites. J. Comp. Biol. 5, 149163.CrossRefGoogle ScholarPubMed
Kypr, J., Mrazek, J. and Reich, J. (1989). Nucleotide composition bias and CpG dinucleotide content in the genomes of HIV and HTLV 1/2. Biochim. Biophys. Acta 1009, 280282.CrossRefGoogle ScholarPubMed
Muse, S. V. and Gaut, B. S. (1994). A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome. Mol. Biol. Evol. 11, 715724.Google Scholar
Pedersen, A. K., Wiuf, C. and Christiansen, F. B. (1998). A codon-based model designed to describe lentiviral evolution. Mol. Biol. Evol. 15, 10691081.CrossRefGoogle ScholarPubMed
Thorne, J., Kishino, H. and Felsenstein, J. (1991). An evolutionary model for maximum likelihood alignment of DNA sequences. J. Mol. Evol. 33, 114124.CrossRefGoogle ScholarPubMed
Yang, Z. (1993). Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites. Mol. Biol. Evol. 10, 13961401.Google Scholar