Post by Admin on Dec 29, 2021 2:31:30 GMT
Functional Impact
A small proportion of SNVs have a predicted functional impact (Supplementary Figs. 20–23, Supplementary Tables 11–14, Supplementary Note, and Supplementary Data File). Among 60,555 SNVs, we observed two singleton premature stop-codons, one each in AMELY and USP9Y, and one splice-site SNV that affects all known transcripts of TBL1Y. Among 94 missense SNVs with SIFT19 scores, all 30 deleterious variants are singletons or doubletons, while 17/64 tolerated variants are present at higher frequency (p = 0.001), underscoring the impact of purifying selection on variation at protein-coding genes. No STRs overlapped protein-coding regions, but, in contrast to the SNVs, a high proportion of CNVs have a predicted functional impact.
Twenty of 100 CNVs in our final callset overlap with 27 protein-coding genes from 17 of the 33 Y-chromosome gene families. In our analysis of 1000 Genomes Project autosomal data, we observed that the ratio of the proportion of deletions overlapping protein-coding genes to the proportion of duplications overlapping protein-coding genes is 0.84. Whereas on the autosomes deletions are less likely to overlap protein-coding genes than duplications are, as others have also reported20, we found the reverse to be true for the Y chromosome. Despite its haploidy, we calculated its ratio of proportions to be 1.5, indicating a surprising increased tolerance of gene loss, as compared with the diploid genes on autosomes.
Diversity
Given observed diversity levels of the autosomes, the X chromosome, and the mitochondrial genome (mtDNA) (Supplementary Table 15, Supplementary Note, and Supplementary Data File), Y-chromosome diversity was reported to be lower than expected from simple population-genetic models that assume a Poisson-distributed number of offspring4, and the role of selection in this disparity is debated. We confirmed that Y-chromosome diversity in our sample is low (Supplementary Fig. 24) and found that positing extreme male-specific bottlenecks in the last few millennia can lead to a good fit between modeled and observed relative diversity levels of the autosomes, the X chromosome, the Y chromosome, and the mtDNA (Supplementary Figs. 25–28, Supplementary Table 16, and Supplementary Note). Therefore, we conclude that Y diversity may be shaped primarily by neutral demographic processes.
Haplogroup Expansions
To investigate punctuated bursts within the phylogeny and estimate growth rates, we modeled haplogroup growth as a rapid phase followed by a moderate phase and applied this model to lineages showing rapid expansions (Supplementary Figs. 29–31, Supplementary Tables 17–19, Supplementary Note, and Supplementary Data File), noting that such extreme expansions are seldom seen in the mtDNA phylogeny here or in other studies5. We examined 20 nodes of the tree whose branching patterns were well-fit by this model. These nodes were drawn from eight haplogroups and included at least one lineage from each of the five continental regions surveyed (Fig. 4). As the haplogroup expansions we report are among the most extreme yet observed in humans, we think it more likely than not that such events correspond to historical processes that have also left archaeological footprints. Therefore, in what follows, we propose links between genetic and historical or archaeological data. We caution that, especially in light of as yet imperfect calibration, these connections remain unproven. But they are testable, for example using aDNA.
Figure 4
Explosive male-lineage expansions of the last 15 thousand years. Each circle represents a phylogenetic node whose branching pattern suggests rapid expansion. The x-axis indicates the timings of the expansions, and circle radii reflect growth rates—the minimum number of sons per generation, as estimated by our two-phase growth model. Nodes are grouped by continental superpopulation (AFR, African; AMR, Admixed American; EAS, East Asian; EUR, European; SAS, South Asian) and colored by haplogroup. Line segments connect phylogenetically nested lineages.
First, in the Americas, we observed expansion of Q1a-M3 (Supplementary Figs. 14e and 17) at ~15 kya, the time of the initial colonization of the hemisphere21. This correspondence, based on one of the most thoroughly examined dates in human prehistory, attests to the suitability of the calibration we have chosen. Second, in sub-Saharan Africa, two independent E1b-M180 lineages expanded ~5 kya (Supplementary Figs. 14a), a period before the numerical and geographical expansions of Bantu speakers in whom E1b-M180 now predominates22. The presence of these lineages in non-Bantu speakers (e.g., Yoruba, Esan) indicates an expansion pre-dating the Bantu migrations, perhaps triggered by the development of ironworking23. Third, in Western Europe, related lineages within R1b-L11 expanded ~4.8–5.9 kya (Supplementary Figs. 14e), most markedly around 4.8 and 5.5 kya. The earlier of these times, 5.5 kya, is associated with the origin of the Bronze Age Yamnaya culture. The Yamnaya have been linked by aDNA evidence to a massive migration from the Steppe, which may have replaced much of the previous European population24,25, but the six Yamnaya with informative genotypes did not bear lineages descending from or ancestral to R1b-L11, so a Y-chromosome connection has not been established. The later time, 4.8 kya, coincides with the origins of the Corded Ware (Battle Axe) culture in Eastern Europe and the Bell-Beaker culture in Western Europe26.
Potential correspondences between genetics and archaeology in South and East Asia have received less investigation. In South Asia, we detect eight lineage expansions dating to ~4.0–7.3 kya and involving haplogroups H1-M52, L-M11, and R1a-Z93 (Supplementary Figs. 14b, 14d, and 14e). The most striking are expansions within R1a-Z93, ~4.0–4.5 kya. This time predates by a few centuries the collapse of the Indus Valley Civilization, associated by some with the historical migration of Indo-European speakers from the western steppes into the Indian sub-continent27. There is a notable parallel with events in Europe, and future aDNA evidence may prove to be as informative as it has been in Europe. Finally, East Asia stands out from the rest of the Old World for its paucity of sudden expansions, perhaps reflecting a larger starting population or the coexistence of multiple prehistoric cultures wherein one lineage could rarely dominate. We observed just one notable expansion within each of the O2b-M176 and O3-M122 clades (Supplementary Figs. 14d).
A small proportion of SNVs have a predicted functional impact (Supplementary Figs. 20–23, Supplementary Tables 11–14, Supplementary Note, and Supplementary Data File). Among 60,555 SNVs, we observed two singleton premature stop-codons, one each in AMELY and USP9Y, and one splice-site SNV that affects all known transcripts of TBL1Y. Among 94 missense SNVs with SIFT19 scores, all 30 deleterious variants are singletons or doubletons, while 17/64 tolerated variants are present at higher frequency (p = 0.001), underscoring the impact of purifying selection on variation at protein-coding genes. No STRs overlapped protein-coding regions, but, in contrast to the SNVs, a high proportion of CNVs have a predicted functional impact.
Twenty of 100 CNVs in our final callset overlap with 27 protein-coding genes from 17 of the 33 Y-chromosome gene families. In our analysis of 1000 Genomes Project autosomal data, we observed that the ratio of the proportion of deletions overlapping protein-coding genes to the proportion of duplications overlapping protein-coding genes is 0.84. Whereas on the autosomes deletions are less likely to overlap protein-coding genes than duplications are, as others have also reported20, we found the reverse to be true for the Y chromosome. Despite its haploidy, we calculated its ratio of proportions to be 1.5, indicating a surprising increased tolerance of gene loss, as compared with the diploid genes on autosomes.
Diversity
Given observed diversity levels of the autosomes, the X chromosome, and the mitochondrial genome (mtDNA) (Supplementary Table 15, Supplementary Note, and Supplementary Data File), Y-chromosome diversity was reported to be lower than expected from simple population-genetic models that assume a Poisson-distributed number of offspring4, and the role of selection in this disparity is debated. We confirmed that Y-chromosome diversity in our sample is low (Supplementary Fig. 24) and found that positing extreme male-specific bottlenecks in the last few millennia can lead to a good fit between modeled and observed relative diversity levels of the autosomes, the X chromosome, the Y chromosome, and the mtDNA (Supplementary Figs. 25–28, Supplementary Table 16, and Supplementary Note). Therefore, we conclude that Y diversity may be shaped primarily by neutral demographic processes.
Haplogroup Expansions
To investigate punctuated bursts within the phylogeny and estimate growth rates, we modeled haplogroup growth as a rapid phase followed by a moderate phase and applied this model to lineages showing rapid expansions (Supplementary Figs. 29–31, Supplementary Tables 17–19, Supplementary Note, and Supplementary Data File), noting that such extreme expansions are seldom seen in the mtDNA phylogeny here or in other studies5. We examined 20 nodes of the tree whose branching patterns were well-fit by this model. These nodes were drawn from eight haplogroups and included at least one lineage from each of the five continental regions surveyed (Fig. 4). As the haplogroup expansions we report are among the most extreme yet observed in humans, we think it more likely than not that such events correspond to historical processes that have also left archaeological footprints. Therefore, in what follows, we propose links between genetic and historical or archaeological data. We caution that, especially in light of as yet imperfect calibration, these connections remain unproven. But they are testable, for example using aDNA.
Figure 4
Explosive male-lineage expansions of the last 15 thousand years. Each circle represents a phylogenetic node whose branching pattern suggests rapid expansion. The x-axis indicates the timings of the expansions, and circle radii reflect growth rates—the minimum number of sons per generation, as estimated by our two-phase growth model. Nodes are grouped by continental superpopulation (AFR, African; AMR, Admixed American; EAS, East Asian; EUR, European; SAS, South Asian) and colored by haplogroup. Line segments connect phylogenetically nested lineages.
First, in the Americas, we observed expansion of Q1a-M3 (Supplementary Figs. 14e and 17) at ~15 kya, the time of the initial colonization of the hemisphere21. This correspondence, based on one of the most thoroughly examined dates in human prehistory, attests to the suitability of the calibration we have chosen. Second, in sub-Saharan Africa, two independent E1b-M180 lineages expanded ~5 kya (Supplementary Figs. 14a), a period before the numerical and geographical expansions of Bantu speakers in whom E1b-M180 now predominates22. The presence of these lineages in non-Bantu speakers (e.g., Yoruba, Esan) indicates an expansion pre-dating the Bantu migrations, perhaps triggered by the development of ironworking23. Third, in Western Europe, related lineages within R1b-L11 expanded ~4.8–5.9 kya (Supplementary Figs. 14e), most markedly around 4.8 and 5.5 kya. The earlier of these times, 5.5 kya, is associated with the origin of the Bronze Age Yamnaya culture. The Yamnaya have been linked by aDNA evidence to a massive migration from the Steppe, which may have replaced much of the previous European population24,25, but the six Yamnaya with informative genotypes did not bear lineages descending from or ancestral to R1b-L11, so a Y-chromosome connection has not been established. The later time, 4.8 kya, coincides with the origins of the Corded Ware (Battle Axe) culture in Eastern Europe and the Bell-Beaker culture in Western Europe26.
Potential correspondences between genetics and archaeology in South and East Asia have received less investigation. In South Asia, we detect eight lineage expansions dating to ~4.0–7.3 kya and involving haplogroups H1-M52, L-M11, and R1a-Z93 (Supplementary Figs. 14b, 14d, and 14e). The most striking are expansions within R1a-Z93, ~4.0–4.5 kya. This time predates by a few centuries the collapse of the Indus Valley Civilization, associated by some with the historical migration of Indo-European speakers from the western steppes into the Indian sub-continent27. There is a notable parallel with events in Europe, and future aDNA evidence may prove to be as informative as it has been in Europe. Finally, East Asia stands out from the rest of the Old World for its paucity of sudden expansions, perhaps reflecting a larger starting population or the coexistence of multiple prehistoric cultures wherein one lineage could rarely dominate. We observed just one notable expansion within each of the O2b-M176 and O3-M122 clades (Supplementary Figs. 14d).