CS Mukhopadhyay and RK Choudhary
School of Animal Biotechnology, GADVASU, Ludhiana
Studies of molecular evolution and construction of phylogenetic trees are based on mathematical models that underpin the process of nucleotide substitution causing the process of speciation. A number of models have been hypothesized, based on certain assumptions of evolution. Some of the important models are described in this chapter.
This is the simplest DNA substitution model, which assumes equal base frequencies and a constant rate of evolution with a base substitution rate (Figure F1) of “α”. The transition (Purine to Purine and Pyrimidine to Pyrimidine) frequency is assumed to be same as the transversion (Purine to Pyrimidine and vice versa) frequency; hence, the model has a single parameter “α”. A substitution matrix (Figure F2) is generated for all possible base substitutions, assuming a fixed rate of changeover.
The substitution matrix is generated from a constant rate of base substitution per unit time, based on the JC69 model.
The substitution matrix generated from a constant rate of base substitution for a specific time period (ζ), based on the JC69 model, is shown in Figure F4.
The amount of base substitution can be calculated, considering a stipulated lapse of time (ζ) from initiation of the evolutionary event in the study.
Mathematically, the amount of base substitution can be reduced to:
Branch length (for substitution of a base by the same base itself):=
Branch length (for substitution of a base by any base other than itself):
Estimate of evolutionary distance (d) = (–3/4 ln(1–4/3p)) where p is the p‐distance that estimates the proportion of sites that differ between two sequences under study.
The JC69 model is applicable under simple conditions, but is not at all suitable for a complex evolutionary model.
The K80 model is an extension of JC69. The model assumes different rates of transitions (α) and transversions (β).
Let:
The substitution matrix at unit time (t = 1) will be as follows:
The rate of substitution can be approximated to the following:
The distance obtained from the Kimura 2 parameter model is:
where:
The JC69 model assumes an equal rate of substitution among all nucleotide residues, which does not hold good for several practical and complex DNA evolution models. The F81 model modifies this assumption with unequal substitution frequencies for different bases. Thus, the substitution rate matrix has four different proportions for the four common bases:
The expected number of substitutions per site .
The other models that have been developed to study DNA evolution are Hasegawa et al. (1985), Tamura (1992), Tamura and Nei (1993), the generalized time reversible model, etc. The scope of this chapter is too limited to discuss the models in length and breadth.
3.131.38.104