Abstract In its initial formulation by Motoo Kimura, the neutral theory was concerned solely with the level of variability maintained by random genetic drift of selectively neutral mutations, and the rate of molecular evolution caused by the fixation of such mutations. The original theory considered events at a single genetic locus in isolation from the rest of the genome. It did not take long, however, for theoreticians to wonder whether selection at one or more loci might influence neutral variability at linked sites. Once DNA sequence variability could be studied, and especially when resequencing of whole genomes became possible, it became clear that patterns of neutral variability in genomes are affected by selection at linked sites, and that these patterns could advance our understanding of natural selection, and can be used to detect the action of selection in genomic regions, including selection much weaker than could be detected by direct measurements of the relative fitnesses of different genotypes. We outline the different types of processes that have been studied, in approximate order of their historical development. linkage, associative overdominance, hitchhiking, selective sweep, background selection Introduction The first process to be studied in which selection on sites in genomes affects neutral variants was associative overdominance (AOD), first analyzed in a paper in the same year as Kimura's formulation of the neutral theory (Sved 1968), and later developed by Ohta and Kimura (1970). In a randomly mating population, genetic drift causes linkage disequilibrium (LD) between a neutral locus and one or more loci under selection, even in randomly mating populations. These early studies showed that, if the selected loci experience heterozygote advantage, or segregate for partially recessive deleterious alleles that are maintained by mutation pressure, homozygotes at the neutral locus also appear to have lower fitnesses. The effects on “apparent fitnesses” can be large if the neutral and selected loci are closely linked. In a partially inbreeding population, such as a plant population that reproduces by a mixture of self-fertilization and outcrossing, another form of AOD arises, because inbreeding produces between-individual correlations in homozygosity among loci. This identity disequilibrium (ID) occurs even between unlinked loci. The effects can be strong enough to be detected empirically, and were indeed observed in plants and molluscs, and even in mammals, in the pre-DNA-sequencing era, using genotypes of allozyme and microsatellite markers (Strauss 1986; Leberg et al. 1990; Bierne et al. 2000). For nearly 50 years, it was thought that the apparent heterozygote advantage generated by AOD would delay loss of variability at the neutral site, compared with the rate predicted by standard neutral theory. However, AOD induced by ID does not change allele frequencies at neutral loci (Charlesworth 1991), and the same has recently been shown for LD-induced AOD (Zhao and Charlesworth 2016); indeed, this is implicit in the equations in the original papers. At first sight, this suggests that AOD should not retard the loss of neutral variability. However, a re-analysis of LD-induced AOD has shown that this is incorrect (Zhao and Charlesworth 2016). Heterozygote advantage retards loss of variability under quite light conditions; with partially recessive deleterious mutations, retardation can occur if the product of the selection coefficient s against homozygotes and the effective population size, Ne, is of the order of 0.5 or less. Situations also exist when loss of variability is actually accelerated (see the discussion of background selection [BGS]). Unexpectedly, therefore, the pioneering theoretical work on AOD has taken on a new lease of life. The second type of effect of selection on variability at linked neutral sites, hitchhiking by positively selected mutations (selective sweeps), has attracted greater interest than AOD. The basic theory was developed by John Maynard Smith and John Haigh (1974), stimulated by Lewontin’s (1974) observation that populations' allozyme locus variability levels are only weakly related to their size. They proposed that the spread of a selectively favorable allele would reduce variability at a linked neutral site, if the mutation to the favored allele had originated on a single haplotype. In the extreme case of no recombination, the spread to fixation of such a mutation will “sweep” to fixation all variants present on the haplotype in question. With recombination, they showed that the reduction of variability at a neutral site declines rapidly with r/sa, where r is the frequency of recombination with the selected locus, and sa is the selection coefficient for the advantageous allele. They suggested that the overall variability in a species would depend more on the frequency of selective sweeps than the rate of genetic drift determined by its effective population size, resolving Lewontin’s paradox. This proposal was later elaborated by Gillespie (2001) in his theory of “genetic draft,” but the basic idea remains the same. Despite empirical evidence for the hitchhiking of restriction site variants associated with the spread of the human hemoglobin S mutation (Kan and Dozy 1978), relatively little attention was paid to selective sweeps until Drosophila population geneticists, led by Chuck Langley, noticed that levels of sequence variability (mostly detected using restriction site variants) tended to be unusually low in genome regions that recombine infrequently, notably telomere and centromere regions (Aguadé et al. 1989). This stimulated the development of more elaborate models of selective sweeps (Kaplan et al. 1989), especially after Begun and Aquadro (1992) demonstrated a significant correlation between the local recombination rate in Drosophilamelanogaster and the level of restriction site variability in genes. Their finding that no such correlation is seen between recombination rate and divergence between D. melanogaster and its close relative Drosophilasimulans ruled out potential confounding factors, such as a direct influence of recombination on the mutation rate, strongly indicating that variability at neutral or nearly neutral sites is influenced by selection at linked sites. Similar patterns have now been detected in many other taxa, including humans, and in further Drosophila population studies (Cutter and Payseur 2013). The literature on the theory of selective sweeps, and its application to statistical tools for detecting sweeps from patterns of variability at putatively neutral sites, has now developed to a large volume, and we cannot do it justice in this brief overview. We simply note the development of theory for recurrent selective sweeps by Wiehe and Stephan (1993), and improved theory concerning the effects of a single sweep (Barton 1998). Several methods to detect selective sweeps from their effects on patterns of variability, including their effects on the departure of the frequency distribution of segregating sites and on LD, have been developed and successfully applied (Nielsen et al. 2005). In addition, much interest has been aroused by the possibility that there may be “soft sweeps” that involve favorable mutations present initially as more than one copy, rather than single new mutations (Hermisson and Pennings 2005). Similarly, it is possible to detect introgression of a selectively favorable allele from another species or population using patterns of variability at linked loci (Bradburd et al. 2016). Selective sweeps are not the only possible mode of hitchhiking. It also occurs when deleterious mutations are eliminated from a population, taking linked variants with them. This process is commonly known as BGS. It was described verbally by R.A. Fisher in connection with the fate of a beneficial mutation arising on a background of deleterious mutations (Fisher 1930). The first model of this process (Birky and Walsh 1988) did not consider the effect of deleterious mutations on neutral variability at linked sites. A formal treatment of this aspect of BGS (Charlesworth et al. 1993) was quickly followed by more sophisticated theoretical work (Hudson and Kaplan 1995; Nordborg et al. 1996). BGS provides an additional possible explanation for the low neutral variability in genome regions with low local recombination rates mentioned above. With Nes ≫ 1, so that the deleterious alleles are held close to their equilibrium frequencies under mutation–selection balance, neutral variability at linked sites is reduced below the value in the absence of selection. However, as mentioned above, partially recessive mutations in a diploid organism with a sufficiently small Nes value can retard loss of neutral variability. The same basic equation for the effect of allele frequency change at a site under directional selection on the allele frequency at a linked neutral site underlies all three of the processes discussed so far (Zhao and Charlesworth 2016). They can all be regarded as different versions of hitchhiking. When sites subject to deleterious mutation are closely linked, the efficacy of selection on each site is reduced (McVean and Charlesworth 2000), through Hill–Robertson interference (Hill and Robertson 1966). This effect reduces the effects of BGS on neutral variability in the region, explaining why the observed variability in nonrecombining portions of the genome is higher than is predicted by the standard BGS model (Kaiser and Charlesworth 2009). In this interference selection limit, predictions of the effects of selection on linked variability require more complex models (Good et al. 2014). If variants are maintained by long term balancing selection within a single population, the equilibrium variability at linked sites can be higher than expected in the absence of selection (Hudson 1990). This situation is similar to a subdivided population: the alternative alleles at the selected site correspond to different subpopulations, and recombination between the neutral and selected sites act like migration. Given enough time, sites that are closely linked to a target of selection will accumulate sequence differences associated with the alternative alleles at the selected site; the population as a whole will show a peak in variability at neutral sites around the target of selection, which declines with the recombination distance from the selected site. In the region, neutral variants will have higher frequencies than expected in the absence of the selected site, which can allow such balancing selection to be detected (e.g., DeGiorgio et al. 2014). A similar pattern of enhanced variability can be produced by alleles maintained for a long time by differences in local selective pressures, because the selective elimination of the “wrong” type of allele reduces the effective migration rate of neutral variants closely linked to the target of selection (Petry 1983; Charlesworth et al. 1997). In this case, however, the size of the region affected is related to the ratio of the recombination rate to the selection coefficient against the locally nonadaptive allele, and will often be much larger than expected with balancing selection. Genome scans can potentially detect signatures of enhanced population subdivision at neutral markers, but it is difficult to distinguish between reduced variability within local populations caused by local selective sweeps from between-population differences in the direction of selection (Cruickshank and Hahn 2014). Overall, levels of neutral variability in genomes cannot be understood without taking the effects of selection at linked sites into account. In organisms with compact genomes, such as Drosophila, the combined actions of selective sweeps and BGS have probably more than halved the average nucleotide diversity at nearly neutral sites, relative to the value expected in the absence of selection (Elyashiv et al. 2016). The low diversity in highly inbreeding species and in asexual lineages (Cutter and Payseur 2013) must also reflect, at least in part, such effects. In addition to the positive correlations observed between variability and the local recombination rate, selection at linked sites can account for other genomic patterns, such as an increase in nucleotide diversity with increasing distance from coding sequences in mammals (e.g., Halligan et al. 2013). Much work remains to be done to distinguish between the effects of selective sweeps and BGS in causing these patterns, as well as between hard and soft sweeps. In addition, the extent to which variability is affected by loci under balancing selection remains to be determined. References Aguadé M, Miyashita N, Langley CH. 1989. Reduced variation in the yellow-achaete-scute region in natural populations of Drosophila melanogaster. Genetics 122( 3): 607– 615. Google Scholar PubMed Barton NH. 1998. The effect of hitch-hiking on neutral genealogies. Genet Res . 72( 2): 123– 134. Google Scholar CrossRef Search ADS Begun DJ, Aquadro CF. 1992. Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster. Nature 356( 6369): 519– 520. Google Scholar CrossRef Search ADS PubMed Bierne N, Launey S, Naciri-Graven Y, Bonhomme F. 2000. Early effect of inbreeding as revealed by microsatellite analyses on Ostrea edulis larvae. Genetics 148: 1893– 1906. Birky CW, Walsh JB. 1988. Effects of linkage on rates of molecular evolution. Proc Natl Acad Sci U S A. 85( 17): 6414– 6418. Google Scholar CrossRef Search ADS PubMed Bradburd GS, Ralph PL, Coop G. 2016. A spatial framework for understanding population structure and admixture. PLoS Genet. 12( 1): e1005703. Google Scholar CrossRef Search ADS PubMed Charlesworth B, Morgan MT, Charlesworth D. 1993. The effect of deleterious mutations on neutral molecular variation. Genetics 134( 4): 1289– 1303. Google Scholar PubMed Charlesworth B, Nordborg M, Charlesworth D. 1997. The effects of local selection, balanced polymorphism and background selection on equilibrium patterns of genetic diversity in subdivided inbreeding and outcrossing populations. Genet Res . 70( 2): 155– 174. Google Scholar CrossRef Search ADS PubMed Charlesworth D. 1991. The apparent selection on neutral marker loci in partially inbreeding populations. Genet Res Camb . 57( 02): 159– 195. Google Scholar CrossRef Search ADS Cruickshank T, Hahn MW. 2014. Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow. Mol Ecol . 23( 13): 3133– 3157. Google Scholar CrossRef Search ADS PubMed Cutter A, Payseur B. 2013. Genomic signatures of selection at linked sites: unifying the disparity among species. Nat Rev Genet . 14( 4): 262– 274. Google Scholar CrossRef Search ADS PubMed DeGiorgio M, Lohmueller KE, Nielsen R. 2014. A model-based approach for identifying signatures of ancient balancing selection in genetic data. PLoS Genet. 10( 8): e1004561. Google Scholar CrossRef Search ADS PubMed Elyashiv E, Sattath S, Hu TT, Strutsovsky A, McVicker G, Andolfatto P, Coop G, Sella G. 2016. A genomic map of the effects of linked selection in Drosophila. PLoS Genet. 12( 8): e1006130. Google Scholar CrossRef Search ADS PubMed Fisher RA. 1930. The genetical theory of natural selection . Oxford: Clarendon Press. Google Scholar CrossRef Search ADS Gillespie JH. 2001. Is the population size of a species relevant to its evolution? Evolution 55( 11): 2161– 2169. Google Scholar CrossRef Search ADS PubMed Good B, Walczak A, Neher R, Desai M. 2014. Genetic diversity in the interference selection limit. PLos Genet. 10( 3): e1004222. Google Scholar CrossRef Search ADS PubMed Halligan D, Kousathanas A, Ness R, Harr B, Eory L, Keane T, Adams D, Keightley P. 2013. Contributions of protein-coding and regulatory change to adaptive molecular evolution in murid rodents. PLoS Genet. 9( 12): e1003995. Google Scholar CrossRef Search ADS PubMed Hermisson J, Pennings PS. 2005. Soft sweeps—molecular population genetics of adaptation from standing genetic variation. Genetics 169( 4): 2335– 2352. Google Scholar CrossRef Search ADS PubMed Hill WG, Robertson A. 1966. The effect of linkage on limits to artificial selection. Genet Res . 8( 3): 269– 294. Google Scholar CrossRef Search ADS PubMed Hudson RR. 1990. Gene genealogies and the coalescent process. Oxf Surv Evol Biol. 7: 1– 45. Hudson RR, Kaplan NL. 1995. Deleterious background selection with recombination. Genetics 141( 4): 1605– 1617. Google Scholar PubMed Kaiser VB, Charlesworth B. 2009. The effects of deleterious mutations on evolution in non-recombining genomes. Trends Genet 25( 1): 9– 12. Google Scholar CrossRef Search ADS PubMed Kan W, Dozy AM. 1978. Polymorphism of DNA sequence adjacent to human beta-globin structural gene: relationship to sickle mutation. Proc Natl Acad Sci U S A . 75( 11): 5631– 5635. Google Scholar CrossRef Search ADS PubMed Kaplan NL, Hudson RR, Langley CH. 1989. The “hitchhiking effect” revisited. Genetics 123( 4): 887– 899. Google Scholar PubMed Leberg PL, Smith MH, Rhodes OE. 1990. The association between heterozygosity and growth of deer fetuses is not explained by the effects of the loci examined. Evolution 44( 2): 454– 458. Google Scholar CrossRef Search ADS PubMed Lewontin RC. 1974. The genetic basis of evolutionary change . New York: Columbia Univ. Press. Maynard Smith J, Haigh J. 1974. The hitch-hiking effect of a favourable gene. Genet Res . 22: 29– 35. McVean GAT, Charlesworth B. 2000. The effects of Hill–Robertson interference between weakly selected mutations on patterns of molecular evolution and variation. Genetics 155: 929– 944. Google Scholar PubMed Nielsen R, Bustamante C, Clark AG, Glanowski S, Sackton TB, Hubisz MJ, Fledel-Alon A, Tanenbaum DM, Civello D, White TJ, et al. 2005. A scan for positively selected genes in the genomes of humans and chimpanzees. PLoS Biol . 3( 6): e170. Google Scholar CrossRef Search ADS PubMed Nordborg M, Charlesworth B, Charlesworth D. 1996. The effect of recombination on background selection. Genet Res . 67( 2): 159– 174. Google Scholar CrossRef Search ADS PubMed Ohta T, Kimura M. 1970. Development of associative overdominance through linkage disequilibrium in finite populations. Genet Res . 18: 277– 286. Google Scholar CrossRef Search ADS Petry D. 1983. The effect on neutral gene flow of selection at a linked locus. Theor Pop Biol . 23( 3): 300– 313. Google Scholar CrossRef Search ADS Strauss SH. 1986. Heterosis at allozyme loci under inbreeding and crossbreeding in Pinus attenuata. Genetics 113( 1): 115– 134. Google Scholar PubMed Sved JA. 1968. The stability of linked systems of loci with a small population size. Genetics 59( 4): 543– 563. Google Scholar PubMed Wiehe THE, Stephan W. 1993. Analysis of a genetic hitchhiking model, and its application to DNA polymorphism data from Drosophila melanogaster. Mol Biol Evol . 10( 4): 842– 854. Google Scholar PubMed Zhao L, Charlesworth B. 2016. Resolving the conflict between associative overdominance and background selection. Genetics 203( 3): 1315– 1334. Google Scholar CrossRef Search ADS PubMed © The Author(s) 2018. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: email@example.com This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/about_us/legal/notices)
Molecular Biology and Evolution – Oxford University Press
Published: Apr 4, 2018
It’s your single place to instantly
discover and read the research
that matters to you.
Enjoy affordable access to
over 18 million articles from more than
15,000 peer-reviewed journals.
All for just $49/month
Query the DeepDyve database, plus search all of PubMed and Google Scholar seamlessly
Save any article or search result from DeepDyve, PubMed, and Google Scholar... all in one place.
Get unlimited, online access to over 18 million full-text articles from more than 15,000 scientific journals.
Read from thousands of the leading scholarly journals from SpringerNature, Elsevier, Wiley-Blackwell, Oxford University Press and more.
All the latest content is available, no embargo periods.
“Hi guys, I cannot tell you how much I love this resource. Incredible. I really believe you've hit the nail on the head with this site in regards to solving the research-purchase issue.”Daniel C.
“Whoa! It’s like Spotify but for academic articles.”@Phil_Robichaud
“I must say, @deepdyve is a fabulous solution to the independent researcher's problem of #access to #information.”@deepthiw
“My last article couldn't be possible without the platform @deepdyve that makes journal papers cheaper.”@JoseServera