Quasispecies
Key Concepts
-
HIV Infection:
- HIV has a short genome (~10,000 bases) and a very high mutation rate (~\(3 \times 10^{-5}\) per base per replication).
- Rapid adaptation due to short generation time, large population sizes, and strong selective pressures (e.g., immune response, antiretroviral therapy).
-
Sequence Space:
- Sequence Space: Describes the set of all possible genetic sequences of a given length \(L\).
- For DNA, the sequence is made from bases \(A, C, G, T\), and for proteins, the sequence is based on 20 amino acids.
- Binary Sequences: Simplified representation using \(0\) and \(1\). For a binary sequence of length \(L\), there are \(2^L\) possible sequences.
- Hamming Distance: Used to measure the difference between two sequences based on the number of mismatched positions.
-
Fitness Landscapes:
- Definition: A fitness landscape is a mapping from genotype space to fitness, \(f: G \rightarrow \mathbb{R}\), representing how well an organism can survive and reproduce.
- Epistasis: Interactions between loci (positions on the genome) where the effect of one gene depends on the presence of others, influencing fitness.
-
Quasispecies Equation:
- Quasispecies: A population of genetically diverse organisms that evolves under mutation and selection pressure.
- Mathematical Representation:
\[
x'(t) = x(t) \cdot Q \cdot f(t) - x(t) \cdot \phi
\]
- \(x(t)\): Genotype frequency vector.
- \(Q\): Mutation matrix.
- \(f\): Fitness landscape vector.
- \(\phi\): Average population fitness.
- The equilibrium solution of this equation predicts the balance between mutation and selection in a population.
-
Properties of the Quasispecies Equation:
- If replication is error-free (\(Q = I\)), the equation simplifies to the selection equation.
- If the mutation matrix \(Q\) is irreducible (strongly connected), there exists a single globally stable equilibrium \(x^*\).
- Mutation reduces overall population fitness compared to what would be expected from selection alone.
-
Adaptation:
- A population adapts by localizing around a peak in the fitness landscape, but mutation can cause individuals to drift away from these peaks.
- Error Threshold: A critical mutation rate above which genetic information cannot be maintained, leading to the "mutational meltdown" phenomenon.
-
Error Threshold and HIV:
- For HIV, with a genome length \(L = 10^4\) and mutation rate \(u = 3 \times 10^{-5}\), the probability of an error-free genome copy is only 0.74.
- With 1 billion new viruses produced each day, a vast number of mutations occur daily.
Key Equations and Models
-
Quasispecies Equation: \[ x'(t) = x(t) \cdot Q \cdot f(t) - x(t) \cdot \phi \]
- Describes the dynamics of a population under mutation and selection.
-
Hamming Distance:
- Measures the difference between two sequences by counting mismatched positions.
-
Error Threshold:
- Describes the mutation rate at which the population cannot maintain genetic information. The condition \(uL > 1\) leads to mutational meltdown.
Summary Points
- HIV's high mutation rate and short generation time lead to extreme evolutionary dynamics, allowing rapid adaptation to selective pressures.
- Sequence space and fitness landscapes provide a framework for understanding how populations evolve under mutation and selection.
- The quasispecies equation models the balance between mutation and selection in populations, particularly for RNA viruses like HIV.
- The concept of an error threshold indicates a mutation rate beyond which genetic information cannot be preserved, leading to population collapse.
- Mutation-selection balance is central to quasispecies theory, explaining viral diversity and resistance to treatment.