6.891 Fall, 2004 Prof. Robert C. Berwick
Outline of Lectures 1-2, 9/8-9/13
1. What does a theory of evolution have to explain about
the world around us – if anything?
2. We might depict any model for ÔevolutionaryÕ change as a sequence of states over time, x1, x2, É where there is some map, T, that carries the state x into a next state xÕ.
In general, we say that a discrete dynamical system consists of a map T from a space x onto itself.
(We havenÕt said anything much yet about constraints that characterize T or x,
e.g., x could be the set of
all the genes in an organism, or their frequencies in the overall population of
organisms; or it could be a list of all the traits in a (set of organisms)
– whatever a ÔtraitÕ is - and T
could be differentiable). As is
usual, we may imagine that this map can be iterated: x, Tx, T(Tx),...Tkx. This
sequence is called the orbit of x.
It will be convenient to write
the map in the form, x«= Tx, or as x«-x= Tx-x. Equations of the second type are called reccurrence or difference equations. We give an explicit example shortly.
3. The general problem is one of constructing a state
space that will be dynamically sufficient, and a set of laws of transformation
in that state space that will transform all the state variables. The laws also have to be empirically
sufficient. This entails an interchange between finding the laws and picking
the right state variables, to predict a future state x« given a current x. The transformational laws cannot be arbitrary
– usually they contain some parameters P that are not themselves a function of time or the
state of the system. Second, the
laws contain the elapsed time except in the case of equilibrium, and may or may
not refer specifically to the absolute time t depending on whether the system
carries in its present state some history of the past. Finally, most importantly, the laws of
transformation must contain the present state of the system and suffice to
produce the next state. Example – you canÕt predict the future position
of a satellite from its current position alone. You need to know 3-D position,
velocity, and acceleration – 9 variables in all, and that is dynamically
sufficient.
4. So what about for evolutionary theory? What is the
state space? What are the laws of
transformation? It is
important to stress that we canÕt go out and describe the world any way we want
and demand that an explanatory and descriptive theory be built on that
description – it might be dynamically insufficient. This has important consequences,
because to model evolution we will necessarily build such dynamical systems
– but this takes some care. What
kind of ÔforceÕ is evolution? What
are the state variables?
5. Much of the remainder of this course will consist in
characterizing and exploring the dynamical system behavior of T and x: rate of evolution; convergence times;
stability properties; measurement and inference problems – how can we
tell from a current state, what has happened in the past; how can we measure
the ÔforcesÕ of evolution; how various characterizations of T and x; how sensitive our these models to our assumptions.
6. We may identify two distinct ÔmodesÕ of accounting for
evolutionary change, transformational
and variational. Transformational
theories have been the most widespread for explaining historical change. The
prototypical transformational example is stellar evolution. In transformational
evolution, the properties of each individual in a group change, and, as a
result, the overall group compositional properties change. This explanatory mode was also common
to perhaps the most popular view of biological evolutionary change in the
1700s, given the name Lamarckism (after Jean-Baptiste Monet, Chevalier de
Lamarck), although he did not originate this approach. Like stellar evolution, on this view
biological change too is transformational: all individual organisms making up a
population change – each giraffeÕs neck gets longer reaching for the
treetops – and as a result of individual change, the population property in question as a
whole changes.
7. In contrast, Darwin seems to have been the first to
introduce a completely novel account for evolution, the variational mode of evolutionary explanation. In a variational
story, no single individual
undergoes evolutionary change within its lifetime. Rather, already existing variation among individuals is selected (sieved) and the group population property as an
ensemble changes because different individuals are selected to make up the
group. Or: In the variational mode, there is variation among the individual
units comprising the whole system. The system changes in time by a change in the
proportions of the different kinds of units, as a consequence of differential
survival and reproduction of the units. In DarwinÕs scheme, the group as a
whole changes through time because objects with different properties leave
different numbers of descendants.
8. It is important to understand that different domains
of phenomena evolve by different modes.
The transformational mode, which is correct for galaxies and embryos, is
not correct for species.
9. The essential nature of the Darwinian revolution was
not the introduction of evolution as a worldview. This is historically not the
case. It is rather the replacement of a metaphysical view of variation among
organisms by a materialistic view. DarwinÕs materialistic view replaced the
Aristotelian, Platonic conception that there is a single ideal ÔtypeÕ –
an ideal squirrel, or whale, or E. coli – and so, and that the ÔfailingsÕ
of organisms in their perfection of adaptation somehow to be related to their
failure attain this ideal – they are imperfect approximations to their
Ôtrue nature.Õ The failure of
individual cases to match the ideal was a measure of the imperfection of
nature. (Compare this to the
Newtonian idealizations about frictionless planes, etc.) Darwin overthrew this
metaphysical picture and replaced it with a materialistic one: what matters in
Darwinian evolution by natural selection is the actual variation in actual
individuals. This individual
variation is of the essence – as weÕll see many times in the course: the rate of evolution by natural selection is directly
proportional to the amount of
standing variation. Variation is
the fuel that evolution burns. (And in addition, unlike a fire, evolution has
to ÔmakeÕ its own fuel.)
10. So: DarwinÕs explanation for the ÔclumpingÕ that we
see in the biological world is the conversion of the standing variation between
individuals into variation between species and groups.
11. All ÔvulgarÕ versions of evolution at the time of
Darwin thus already included variation and inheritance. Darwin added the key notion of selection, dubbed natural selection by analogy with the artificial selection that Darwin observed amongst pigeon, dog,
horse breeders. Selection ÔsievesÕ existing variation by letting Ôpass throughÕ
individuals with certain properties, but not other properties – like
panning for gold. Inheritance then transmits these individuals, with their
selected properties, to the next generation. These comprise the three key
components of DarwinÕs theory:
a. Variation:
Among individual members within any population, there is variation (in genes,
morphology, physiology, behavior, É)
b. Heredity
(information transmission): Offspring resemble their parents more than they
resemble individuals to which they are unrelated
c. Selection:
Some forms are more successful at surviving and reproducing than other forms in
a given environment. (aka, the
ÔmisnomerÕ Òsurvival of the fittestÓ)
It is a valuable exercise to consider whether these
three components are necessary and sufficient for evolution by natural
selection – alternatively, whether it is possible to design some other system that would work to create the living world
around us. At least superficially,
it should be clear that without variation, evolution can do nothing –
there is nothing that selection can do to separate out a group with different
ensemble properties. Similarly,
without some form of inheritance, a differentially selected set of individuals
would not be Ôpassed onÕ to future generations. And of course, some kind of selection must be part of
Ônatural selectionÕ.
12. But what sort of variation, heredity, and selection? Consider heredity. There are interesting constraints on the
kind of Ôinformation transmissionÕ systems that can (easily) lead to evolution by natural selection. One of these constraints was first driven
home in perhaps the most devastating review of Origin of Species – at least the one that troubled Darwin the most
- by the Scottish engineer Fleeming Jenkin. This EECS pioneer almost sunk
DarwinÕs theory. JenkinÕs point had to do with the nature of inheritance. In DarwinÕs time, the most widely
accepted notion of ÔinheritanceÕ was essentially that of blending
inheritance - Ômixing by bloodÕ
– like blending paint (a very old idea). But blending inheritance runs afoul of natural selectionÕs
demand for variation. It is easy to see that if in fact inheritance worked by
blending, then any variation would be quickly swamped within a few generations
– in fact it is halved at ever generation – and so turned into a
kind of uniform brown mud. We wind up with nothing for natural selection to
ÔselectÕ. Darwin himself was
unable to refute this argument, and wound up embracing LamarckÕs inheritance of
acquired characteristics, along with a theory of Ôhyper mutationÕ that would
pump in enough variation to keep selection going (if ½ of all variation
is lost each generation, then ½ the variation we see in the current
generation must be due to mutations introduced by the immediately preceding
generation).
13. The problem was resolved by MendelÕs discovery that
inheritance is quantal, or particulate in nature, and not blending (the
particles of course ultimately turned out to be genes). R.A. Fisher (1911, 1930) demonstrated
exactly how blending inheritance would halve variance at each generation: Let x denote the deviation of one parent from the mean of
any trait (e.g., height, or ÒPaul Newman blue eyesÓ), and y denote the deviation from the mean of that trait for
the other parent. The variance of the trait is the expected (mean) value of the
square of deviations from the mean, or E[x2]. On the
model of perfect blending inheritance, then the deviation of the offspring
would be ½(x+y) – exactly half-way in between. We can now calculate the expected value
of the square of this deviation: E[{½(x+y)}2]=E[¼
(x2+2xy+y2)]. The quantity E(xy)
must be zero, since the expected value of deviations from the mean is always
0. The expected value of x2= the expected value of y2, so we have, E[¼ (x2+2xy+y2)]=
E[¼(2x2]=E[½
(x2)]= ½ E[x2] – that is, exactly half the
original variance. Thus under blending inheritance variance declines
exponentially with each generation.
We also must show how ÔparticulateÕ inheritance removes this problem.
Intuitively, if inheritance works via ÔquantaÕ that we call genes, and these
genes are not blended – neither created nor destroyed – during
reproduction, then the variance remains constant (all other things like
mutation being equal) – we can juggle a handful of different colored
jelly beans around, and mix them, but the individual numbers of different
colored jelly beans will remain the same.
This can be demonstrated more precisely via the so-called
ÒHardy-Weinberg lawsÓ, which we turn to below. Sadly, though MendelÕs work was published during DarwinÕs
lifetime – Darwin even had a copy of MendelÕs publications – it
seems that he never read about them, and the articles were found uncut
(unopened) in DarwinÕs library at the time of his death. MendelÕs algebraic system evidently did
not appeal to Darwin, who claimed in his autobiography to be all in a muddle about
mathematics.
14. We next pursue an elaboration of the notion of a
dynamical system mapping as our model of evolutionary change. We describe any
(array of) organisms via two key descriptive spaces: Genotype space and Phenotype
space. By ÒgenotypeÓ we mean
the full array of genes in an organism or set of organisms (for now, think of
it as a very long vector, say, 20,000 elements in the case of humans.) For now
we donÕt say what the ÔaxesÕ are in this space. Genes occur in variants, called alleles. The variants within a species, over all genes,
specify distinct genotypes. For example, we saw that there is a
particular gene that specifies normal human hemoglobin in red blood cells, and
another, which differs in exactly one DNA ÔletterÕ (the nucleotide adenine, A,
is replaced by a thymine, T), whose corresponding protein has a valine instead
of a glutamic acid, which in turn yields a hemoglobin that forms ÔbentÕ
crystals and causes red blood cells to become crooked and ÔsickleÕ shaped
instead normally elliptical. These
are two alleles of the gene for
hemoglobin (more precisely, the beta-chain of one part of hemoglobin), or two
ÔgenotypesÕ. By ÒphenotypeÓ we
mean the Ôform that showsÕ – i.e., the actual biological form that an
organism presents to the world (and so casually interacts with it in the sense
of affecting the outcome of selection). In our example, the two distinct
hemoglobins (and consequently distinct red blood cell shapes) are two different
ÔphenotypesÕ. Of course,
genotypes can also differ between species as well as between individuals. A
genotype, then, is simple the string of nucleic acids, the Ôgenetic codeÕ that
make up the DNA of an individual organism. Selection acts on phenotypes (note
that we have been a bit vague as to whether a phenotype might include the
notion of a genotype or DNA sequence – doesnÕt this casually interact
with the world? More later.) In the case of normal vs. sickle-shaped red blood
cells, we know that the latter are extremely debilitating and lead to early
death – the sickle-cell phenotype is Ôselected againstÕ. We stress that a full
evolutionary theory must pass back and forth between these spaces by means of
(as yet) unspecified, and current unknown, mappings. Here is a picture:
15.
If we start off in genotype space, in some state G0,
then we must pass back and forth between genotype space and phenotype space
several times in order to reach the next possible genotype state G0Õ:
first, the genotype must be realized as a phenotype P0; (the actual
organisms) – this via a set of Ôdevelopmental lawsÕ T0 that turn genotypes into organisms; then
these organisms must meet and mate, again via some set of laws T1;
then the mated pairs produce the ÔrawÕ DNA, the gametes (e.g., egg and sperm), which brings us back to genotype
space via another mapping T2 specifying how gametes get produced
from mated organisms; the genes then reassort and combine via the rules
commonly known as Mendelism, T3; then the fertilized egg(s) (a zygote) develops into offspring, via developmental transformations
T4; natural selection then acts on the offspring, winnowing them, T5;
then finally these surviving offspring produce an array of genotypes (via their
games, i.e., sperm and eggs) which become the new G0Õ to start the
process all over again. Whew. It almost goes without saying that
except in extremely limited cases we have no knowledge whatsoever about these
TÕs, or how to compute them. We
can say, however, that our theory will have to be both dynamically and
empirically sufficient. As it stands,
to do this would result in a theory concerning a vastly more complex domain
than any yet dealt with by physics or molecular biology.
(Aside 1: We should remark that even under na•ve
assumptions the size of these spaces might be enormous – for example,
suppose we take genotype space to be ÔdiscreteÕ and consist of the set of
possible gene sequences for an organism.
If there are 10,000 genes in an organismÕs gene sequence – not far
off - with 3 gene types or ÔallelesÕ each, that amounts to 310,000
possible genotypes or ÔpointsÕ in this space.) (Aside 2: as correctly pointed
out, we really ought to included another space in this analysis, namely, the external environment or context in
which G and P reside.)
16. If we have no knowledge of these T mappings then how can evolutionary biology
proceed? In fact, there are two
moves that are made, the obvious ones.
We can either Ôassume awayÕ P, and do our modeling only in genotype
space; or we can assume away G, and do our modeling only in phenotype
space. Both strategies are
adopted. We can collapse G=P, and
work entirely in Phenotype space. This is the province of biometrics – we
put this to one side for now. Or, we can collapse G and P, and work entirely in
genotype space. This is the strategy of most of evolutionary population
genetics: to study the origin and dynamics of genetic variation within
populations.
17. While this is a much more modest goal that all of
evolutionary theory, it is an essential ingredient. If we adopt this subgoal,
we are in effect equating evolution as Òchange in gene frequenciesÓ, as is
common in this approach. However,
while population genetics has much to say about changes or the stability of the
frequencies of genes in populations and about the rate of divergences in gene
frequencies in populations, it has contributed little to our understanding of
speciation and nothing to our understanding of extinction.
18. Still, the sufficient set of state variables for
describing an evolutionary process within a population must include some information
about the statistical distribution of gene frequencies. It is for this reason that the
empirical and mathematical study of population genetics has always begun with
and centered on the characterization of the genetic variation in populations.
19.
We begin then by
adopting the Òevolution as change in gene frequenciesÓ view, and describing the
simplest dynamical system modeling this. We want to know what theory says about
the reproduction of genotypes in a population. This results in the derivation
of the so-called Hardy-Weinberg proportions. We imagine a population reproducing without any
natural selection or any interference by any other forces such as mutation or
migration. The Hardy-Weinberg result serves as a kind of ÒNewtonÕs First LawÓ
for the genetics of evolving populations, because it says that under such
conditions gene frequencies (and their variance) will Òremain at restÓ –
that is, gene frequency proportions will remain in equilibrium as long as there is no other force to disturb
them. We use this as a ÔbaselineÕ
model and then introduce selection, migration, etc. as ÔforcesÕ that displace a
population from its equilibrium. H-W is the second half of the demonstration
that Mendelism actually goes hand-in-hand with DarwinÕs theory – it is
virtually a necessary part of Darwinism, since it serves to maintain variation
unless there is some other force to disturb it. It would be an interesting exercise to see whether one could
develop an alternative that could replace Mendelism, and still get the conditions
for the evolution of complex life.
20. The Hardy-Weinberg ÒlawÓ is based on following
assumptions:
á
A single random mating population
á
Infinitely many
individuals (Why do we need this assumption? Follow it out in the analysis
below)
á
No mutation
á
No selection (no
differential fertility, viability)
á
No immigration or
emigration
á
Non-overlapping
generations
21. Suppose we have one gene that comes in two variants,
or alleles, denoted A and a. If we have genotypes with current genotype
frequencies P, Q, and R
of genotypes AA, Aa, and aa, they have a fraction p = P +
1/2 Q of their genes being A rather than a. The value p is the gene
frequency (note the difference between gene frequency and a genotype
frequency, e.g., P). The gene
frequency of the a allele is, for
the same reasons, q = 1/2 Q+R. These
can also be computed by counting the fractions of A and a
among the individuals.
22. Random mating is equivalent to random union of
gametes. Imagine making a pot of female gametes, a pot of male gametes, and
drawing a pair, one from each. The equivalence comes because a random member of
the offspring generation is descended from a random female and a random male,
and Mendelian inheritance ensures that the gametes each contributes contain a
random one of the two copies (at this locus) in that individual. Drawing a
random parent, and then having it choose one of the two copies by Mendelian
segregation, is equivalent to drawing one of the copies from the population at
random. [Indeed, the variance of
this draw, in the case of just two alleles in fractions p and (1-p)
is ½ p(1-p) as per standard sampling theory from a binomial
distribution, a result we shall draw on below – or refer to appendix A in
your Sean Rice textbook.
23. The probability that the offspring gets an A from the female parent is p, and the probability that it gets an A from the male parent is also p. Because these are independent as a result of random
mating, the probability that the individual is AA is then p2.
24. The result is that AA, Aa, and aa have expected genotype
frequencies p2, 2pq and q2.
25. The gene frequency in this offspring population is
again p, since in that generation P is p2
and Q is 2pq, so that p
= P + 1/2 Q = p2
+ 1/2 (2pq) = p.
26. If we again mate these individuals randomly, the gene
frequencies in the second generation are again p and q.
27. Thus the genotype frequencies become these
ÒHardy-WeinbergÓ proportions, and stay that way forever. The gene frequencies
remain forever p. If we are
talking about just one gene
ÔlocationÕ (= ÒlocusÓ) then the frequencies for two alleles are forever p and (1-p).
28. Mendelian
genetic systems thus do not tend to lose genetic variability just because of
random mating. Blending inheritance would lose it. The fundamental reason is
that segregation in a heterozygote yields gametes that are 1/2 A and 1/2 a,
whereas in blending inheritance it is as if they were all medium-sized AÕs. [Q: what would happen in a non-Mendelian
world? Variation lost? No evolution of complex life?]
29. When we relax the assumption of no differential
viability and no differential fertility, we now have natural selection going
on. We can count genes to define the notion of absolute fitness.
30. Let us now consider the simplest case, with just one gene (one ÔlocusÕ) that can take one of two possible
forms (alleles); we generalize this immediately to the case of many alleles
(generalizing to multiple loci is more complex). We retain all
the other assumptions about infinite population size, etc. It is extremely important to think about the biological reality of
these assumptions, and what effect they have. Note that we are, in effect, assuming that the
genotype-phenotype mapping is direct
(i.e., all those TÕs are identity functions). We also assume that a single
geneÕs contribution to the outcome of selection may be calculated, or Ôfactored
outÕ no matter what its interactions with other genes. That is, the effect of
the gene appears directly in the phenotype, and thus selection can directly
affect it.
31. We may imagine a string reaching between a gene and its
phenotypic realization: if we pluck the string at either the gene or the
phenotype end, the corresponding other part wiggles. If there is just a single
string, with gene and trait at either end, this is easy: this is the case
(roughly) in sickle-cell anemia. But the world is more complex. This is a
familiar situation in the AI literature, when we want to construct what are
known as Ôcausal modelsÕ: suppose a number of factors conspire to produce a
particular outcome and we want to draw inferences about Ôwho is to blameÕ (the
Ôcredit assignment problemÕ) – e.g., what caused the great Chicago
fire? The biological reality is
that there are typically many genes that act in concert to produce a given
surface trait, and, importantly, vice-versa. Example: youÕre all familiar with
the ÔtextbookÕ idea that eye color is produced by ÔcrossingÕ B(rown) and b(lue)
eye types – which looks like a locus with two alleles. In reality, there are at least 12
different enzymes that yield eye color; the basal mammalian eye color is
Paul-Newman blue, which in fact is the absence of any color. Of course, itÕs worse (or better) than
that: typically genes and traits are in a many-many relation. What does that say about natural
selection ÔtwiddlingÕ a trait? How does it reach down and pluck on a geneÕs
string? The genotype-phenotype
relation is many-to-many.
32. Please bear in mind all this as we pursue the simplest
possible model: genes are pleiotropic
(one gene can have functions – like a Swiss army knife - e.g., when you
do an expression analysis for a gene, it can play many roles: for example, the
FOXP2 gene mentioned as implicated in neural development also seems to affect
bone growth). Further, genes are epistatic – they interact to yield blue or brown eyes.
33. That said, the absolute fitness of each genotype is the expected contribution a
newborn individual of that genotype makes to the next generation. This is the
product of 1/2 (viability)(fertility). The one-half is because each offspring
it has only gets one-half of its genes from that parent.
34. Many populations are subject to density-dependent
population size regulation. If we can assume that this falls ÒfairlyÓ on all
genotypes, then it simply multiplies all viabilities by the same number, and/or
multiplies all fertilities by the same number. It will do this if the
density-dependent population size regulation acts at a different life stage, in
a way unrelated to whatever causes the other fitness differences.
35. If this is true, then the ratios of the absolute
fitnesses do not change as the population changes density, only the multiplier
that makes them into absolute fitnesses.
36. Then we can define relative fitness of a genotype as the ratio between its absolute
fitness and the absolute fitness of some reference genotype. Thus relative
fitnesses might be 1 : 0.8 : 0.7 for the three genotypes, for example, when AA is the reference genotype. We will use w to denote these relative fitnesses, subscripted as
required to refer to the genotype in question.
37. With this one-locus (=one gene) case, one can compute
the gene frequency after natural selection. The genotype frequencies at the
beginning of the generation are of course in the assumed H-W ratios of p2 : 2pq : q2.
When we count them by their contributions to the next generation (as a result
of differential survival and fertility) they are in the proportions p2wAA : 2pqwAa : q2waa. (Alternatively, we use the
notation w11, w12, and ww22 for the relative fitness values
corresponding to AA, Aa, and aa.
38. These three numbers donÕt add to one, usually. So we
can make them into frequencies by dividing by their sum. The sum is the mean
fitness, denoted = p2wAA + 2pqwAa
+ q2waa, which is also the average value of the
relative fitness of a randomly chosen newborn. [Remark. Note that we slide
between using ÔfrequenciesÕ as a proxy for ÔprobabilitiesÕ – connect this
with our assumption of an Ôarbitrarily largeÕ population.]
39.
Dividing the sum by , we get the three frequencies: (NB – we are using frequencies
as proxies for probabilities again):
40. Taking the frequencies of these three after selection
(i.e. according to their contributions to the next generation) the gene
frequency of A in that next
generation will be the frequency of AA plus half that of Aa:
41. Note the rightmost expression: it says simply that the
new gene frequency is the old one (p)
times the mean fitness of the genotypes that a randomly-chosen A allele happens to find itself in (1), divided by the mean fitness of everybody. In
short, the gene frequency will increase if the mean fitness of AÕs is greater than the mean fitness of random
individuals.
42. We can now compute the rate of change of gene
frequency as a result of natural selection, i.e. dp/dt. We consider
the dynamical system equation (the recurrence relation) described by the
recurrence
43. Calculating pÕ-p we get the fundamental recurrence formula:
44. LetÕs take a look in more detail at this last basic
equation, first formulated by Sewall Wright (1930). Note that Æp, the change in allele frequency, hence the evolutionary
change, depends on two components
that get multiplied together: (1) the factor p(1-p),
which is the genetic variance, or heterozygosity; and (2) , which is the gradient of the mean fitness with respect to allele frequency p. Thus the first factor gives the amount of change, and the second factor says which direction it
is applied in. The direction of change – whether an allele increases or
decreases in frequency - is defined by the equivalent term
- the slope of
the plot of ln
as a function of
p. Since
is the instantaneous per capital rate of growth of the
population, it is as if the population is climbing a slope defined by the
population growth rate. We may
imagine, as Wright did, that this describes a Ôfitness surfaceÕ or Ôadaptive
landscapeÕ and that we move in a direction so as to increase mean fitness
– drive it uphill. The
picture is a useful one because it underscores all the strengths and weakness
of this Ôevolutionary search for maximum fitness:
45. How do we get this interpretation? The p(1-p)
factor comes from binomial sampling theory: we are drawing one allele from a
very large population N, so the variance in the probability of getting frequency
p is p(1-p).
This variance is multiplied by the ÒforceÓ of selection – the
gradient in fitness. In other
words, this variance is changed by the derivative of a potential function, .
Alternatively: it is multiplied by the slope of the fitness function and divided by the mean
population fitness.
46. Without saying anything else, we can already tell that
the change in allele frequency will be large at intermediate values where the
variance term is large (the maximum rate of change where p=q=0.5),
and ever smaller near allele
fixation (i.e., as p approaches 0
or 1).
47. We can thus (approximately!) write down a formula for
the time to move from one allele frequency to another:
48. We can glean a lot just from this form. This tells us
that in general it will be very difficult to catch natural selection operating
in flagrante delecto, as it
were. As the frequency of p approaches unity, the time required for even small
changes will be small, owing to the term 1-p in the denominator. We also note that the equation
has the form of a logistic (we can integrate by partial fractions). (Aside:
note importantly that we are also
assuming that the relative fitnesses
remain constant! We examine
deviations from this assumption below.)
49. Further, we know that mean fitness will always
increase: i.e., - just subtract two wÕs, we get:
which is non-negative. So, on this
model, natural selection acts so as to increase or at least maintain mean
fitness, at least in accord with our intuitions.
50. LetÕs see if we can further understand the dynamics of
this recurrence equation – where are the equillibria points, given
various values for w and p; how much time (number of generations) does it take to
reach an equilibrium, etc.? First,
it is clear that there are fixed points at p=0 or
1. We can understand matters by
comparing the fitnesses of AA, Aa, and aa.
51. If there is just one locus (gene), here it is helpful
to plot the Òadaptive landscape,Ó against p, recalling
the equation:
52. This is the equation of a parabola, and its slope, evolutionÕs
Ôguiding handÕ. We can examine the
possible fixed points and interior equillibria conditions via cases. There turn
out to be seven distinct possibilities (ways of drawing this curve) –
three lines and two parabolas.
Cases (1)–(3): If w11=w12 and w22=w12, then the p2 factor is 0. In this (degenerate) case, , and
is linear. (See
Populus try it out). If all three fitnesses are equal, selection does not
operate,
is constant, and all points are equillibria. Otherwise, the slope
of
is nonzero, and p
converges to 0 or 1. Thus, the homozygote with the highest fitness gets
established. [this makes 3 different linear graphs in all].
Cases
(4)-(7): In the generic case, ,
is a parabola
whose extremum is at the point:
Here there are four subcases.
If the fitness of the heterozygote w12 is between the two homozygotes, w11 and w22, then
Subcase 1: Suppose AA is dominant, w12 =w11.
Then = (w22-w12)/(w22-w12)
= 1. (A increases until it fixes at 1.0)
Subcase 2: Conversely, when w12 =w22, the numerator is 0, and =0. (See Populus
graphs 4 and 5). ItÕs clear
from the way the gradient points – which way the wind blows.
Subcases 3 &4: If the heterozygote fitness is not
between the two homozygotes, the
equilibrium is in (0, 1). There
are two possibilities: (i) the
heterozygote fitness is greater than both homozygotes, or heterosis, AKA overdominance, AKA heterozygote superiority: w12>
w11 and w12 > w22.
Then we have figure 7 – no matter where we start, we get to a
maximum somewhere in the interior. This corresponds to a (simple version of)
the sickle-cell case in malarial regions. (But not quite, as we shall
seeÉ) (ii) w12< w11
and w12 < w22 i.e., underdominance – the heterozygote fitness is less than both
homozygous fitnesses. Then the
fixed point depends on which side of the minima we start at (we can show that
the mapping T is monotonic –
take the derivative and itÕs positive).
Note that case (i) is the only one where the heterozygote is
maintained. Is this why we see so
much variation? It was thought so
at one timeÉbut this turns out not to be true.
53. What of the relation between p and t
(time t measured in generations)?
We can carry out the recurrence calculation directly, and for the cases of
dominance (w for AA and Aa a factor s greater than that for aa) and no dominance (all wÕs equal), to obtain the following table).
54. Table for fixation: # generations spent in various
frequency ranges of p: [Question:
why does it take approximately 200 times longer to eliminate a in the case where A is
dominant?]
Time spent in each
frequency range in terms of # of generations |
||||||
p range |
0.001-0.01 |
0.01-0.1 |
0.1-0.5 |
0.5-0.9 |
0.9-0.99 |
0.99-0.999 |
w11>w12>w22 |
462 |
480 |
439 |
439 |
480 |
462 |
dominance, w11=w12>w22 |
232 |
250 |
309 |
1.020 |
9.240 |
90.231 |
55. Another view of the same sort of data. Here we use a conventional notation s, for Ôselection coefficientÕ. The parameter s is simply the ratio 1-w2/w1 – e.g., if 100 AÕs survive compared to 90 aÕs, then
the ratio is 1-0.0 = 0.1. Values
of s of 0.1 or greater are relatively
unsual biologically.
Selection coefficient, s |
time (generations) |
1.0 |
13.26 |
0.5 |
22.67 |
0.2 |
50.41 |
0.1 |
96.42 |
0.05 |
188.36 |
0.02 |
464.09 |
0.01 |
923.61 |
0.001 |
9194.83 |
56. The times show that while selection ultimately
destroys variation, the times are much much longer than those under blending
inheritance. We might expect to
observe considerable genetic polymorphism in populations even though they are
subject to directional selection.
57. Suppose that the fitnesses of AA, Aa, and aa
stand in the ratio (1 +s)2
: 1 +s : 1. s is called a selection coefficient. [NB this is
geometric mean – multiplicative fitnesses] Note: 1+s: 1 is not the same as 1: 1-s. (Close, but no cigar:) The curve of gene frequency change wrt generation time
is a logistic curve (see Populus simulation). The time taken to change between
any two gene frequencies is (approximately) inversely proportional to s.
58. Overdominance. When the fitness of the heterozygote is
higher than that of either homozygote, natural selection will bring the gene
frequency toward an interior equilibrium, retaining both alleles. This is a
polymorphism. The exact equilibrium gene frequency depends on the fitnesses (in
fact, if fitnesses are written as 1 − s : 1 : 1 − t the equilibrium frequency of A is t/(s + t).[Show
this]) If we plot fitness against gene frequency we get a quadratic curve, with
a peak precisely at this equilibrium gene frequency.
59. This movement of gene frequencies can be rationalized
in terms of the mean fitness of A
compared to the mean fitness of everybody. In an overdominant case, when A is rare it is present mostly in heterozygotes. In
that case A copies have a higher
mean fitness than a (which, being common, are mostly located in aa individuals.
So A increases when rare. When A is common and a is rare, the argument is reversed, with a being mostly in heterozygotes and having the
advantage.
60. Underdominance. When s and t are
both negative, so that the heterozygote is the worst genotype, the gene
frequency will move continually away from the interior equilibrium, which is
now an unstable equilibrium. The gene frequency tends toward 1 or 0. Which one
it goes to depends on which side of the unstable equilibrium it started from.
Note that the outcome depends on the exact starting point. The plot of fitness
against gene frequency is again a quadratic curve, but now it has a minimum at
the unstable equilibrium. The stable equilibria are now 0 and 1.
61. Selection and fitness. In all of these cases the gene
frequency changes so that the mean fitness either improves or remains the same,
it never declines. In each case the population ÒclimbsÓ the adaptive surface or
fitness surface until it comes to rest at the top. Incidentally, this is true
for constant relative fitnesses, and for any number of alleles. It is not
perfectly true when fitness is controlled by multiple loci. But in a lot of
cases it is true that there is a net gain of mean relative fitness from the
beginning of the evolution of the gene frequencies to the end.
62. Is Òall for the best in this best of all possible
worlds?Ó (At least in terms of evolution resulting in optimal organisms). The
underdominance case shows that while evolution at a single locus (with constant
relative fitnesses) results in improvement of the mean fitness, the population
can sometimes come to rest on an equilibrium which is not the highest possible
one. It depends on the starting point. A gene is evaluated by natural selection
against the backgrounds in which it occurs, and that decides whether it will
increase. If the fitnesses of AA, Aa,
and aa are 1.2 : 0.7 : 1, then
when A is rare it is mostly
occurring with aÕs in
heterozygotes, which have fitness 0.7. By comparison, the aÕs are occurring in homozygotes which have fitness 1.
So a seems to be better and copies
of it survive and reproduce better. But in fact, AA would be the best genotype. However natural selection
is not making a global assessment of the effects of combining alleles, so it
misses this and we end up with aa.
Thus the opportunistic nature of natural selection causes us to climb the
nearest peak on the adaptive surface, not the highest one.
63. Thus the opportunistic nature of natural selection
causes us to climb the nearest peak on the adaptive surface, not the highest
one. If one could always do the latter, would we be able to fly (unaided) at
500 miles per hour, swim to the depths of the ocean, while composing brilliant
sonatas all the time? There does not seem to be any way to know, without a
comprehensive understanding of organisms.
64. Returning to our ÔvarianceÕ equationÉThe basic
principle of evolution by natural selection can then be stated as follows: allele
frequencies change in such a way as to maximize the mean fitness of the
population. It is as if ln
plays the role of a potential in physics. And it at first looks like weÕre done
– or close to finding a kind of ÒHamiltonianÓ for biology. Or are we??
65. So here comes the big BUT. The equation above turns out to be inadequate as a
model of selection in all but the simplest case of independent loci,
frequency independent selection, arbitrarily large populations. In reality, neither the maximization of
fitness, nor the simple relation between variance (heterozygosity), p(1-p), is
realized in fact. Only in the case
of a single locus with constant genotypic fitness is this an adequate
representation of selection. In
all other cases there is neither a necessary maximization of fitness – it
may even be minimized! – nor does selection operate most rapidly at intermediate
frequencies. So there is no Fermat
principle of Ôshortest distanceÕ – evolution by natural selection wonÕt
look like physics of the sort weÕre used to. We will use the case of frequency-dependent selection (next
lecture) to illustrate; but even the case of 3 alleles for one gene (below, in
the sickle cell anemia example), and constant fitnesses, we get the same
result.
66. We next immediately generalize to the multi-allele
case. This simply involves converting the 2 x 2 matrix of wÕs we had earlier to general n x n form
w. We now
derive the multidimensional analog of the Ôadaptive landscapeÕ equation, as
well as the correct model for the sickle-cell anemia case, in which there are 3
common alleles: HbA, HbC, and HbS. S
homozygotse have sickle-cell anemia, which occurs when the hemoglobin forms
long crystals under low oxygen tension.
The table below is from Hartl and Clark (1989, p. 171). It gives the
observed genotypic counts and Hardy-Weinberg expectations for all six genotypes
from a sample of 32,898 individuals from 72 West-African populations:
xxxx |
AA |
SS |
CC |
AS |
AC |
SC |
Observed |
25374 |
67 |
108 |
5482 |
1737 |
130 |
Expected |
25616 |
307 |
75 |
4967 |
1769 |
165 |
Obs/Exp |
0.99 |
0.22 |
1.45 |
1.10 |
0.98 |
0.79 |
Relative fitness |
0.89 |
0.20 |
1.31 |
1 |
0.89 |
0.70 |
|
|
|
|
|
|
|
67. Calculating fitness estimate here from H-W (assuming
that H-W ratios would be a population with relative fitnesses all equalÉ is
this correct?)
68. Note first that if a population composed entirely of AA genotypes, with mean fitness 0.89 was invaded by a
single S allele (in a
heterozygote), S would increase in
frequency – a single S has a
fitness of 1.0. With only these
two alleles, we would evolve to the (see Populus simulation) equillbrium value
for S as follows (actual average
value over Africa of 0.09)
69. The mean fitness at equilibrium is 0.9033. [Compute
this!]
70. If a second mutation introduces the C alleles into a population at equilibrium between A and S, its
spread will be determined by its marginal fitness, which is:
71. When C is
rare, pC is
approximately 0 so the third term
can be ignored. So, pA=0.8791,
and the marginal fitness of C when
rare is: 0.8670, less than the mean fitness of 0.90. So, C cannot
invade when rare, even though a population fixed for C would have a global maximum mean fitness. If C were introduced in sufficient numbers to include a
contribution from the third term, then C would fix.
72. It is
even more useful to figure out this Ôadaptive topologyÕ by looking at each pair of alleles at a time: A-S (this we did), C-S, and
A-C. This reveals the existence of
a Ôsaddle pointÕ where movement in one direction lowers fitness, and in the
other, raises fitness.
73. Does Darwinism mean survival of the fittest?
Biology is not physics. Despite the superficial lure of a
Fermat-type maximization/minimization principle, this is not realized.