|
Post by Admin on Aug 16, 2021 19:24:28 GMT
Until recently, scientists thought the modern humans with the highest proportion of Denisovan ancestry lived in Papua New Guinea and Australia. According to a new study published yesterday (August 12) in Current Biology, however, an Indigenous group in the Philippines called the Ayta Magbukon have 30 to 40 percent more Denisovan DNA than these other frontrunners, for a total of nearly 5 percent of their genomes. Denisovans were a group of archaic humans first identified from a single pinkie bone in a Siberian cave. They coexisted with modern humans and other archaic human species, such as Neanderthals, for hundreds of thousands of years, until they went extinct an estimated 30,000 to 50,000 years ago. According to Gizmodo, only Pacific Islanders and Southeast Asians have substantial Denisovan ancestry. By comparison, most people in other parts of mainland Asia have less than 0.05 percent Denisovan ancestry, and people of African and European descent don’t have any. “[The Ayta Magbukon] possess more Denisovan ancestry than anybody else on the planet today,” Uppsala University biologist and study coauthor Mattias Jakobsson tells Inverse. “So that was a surprise to us.” According to Gizmodo, the researchers were originally interested in studying the human history of the Philippines as part of a massive collaborative effort with Indigenous communities, local governments, the National Commission for Culture and the Arts of the Philippines, and researchers at Uppsala University. As a follow-up study to an earlier one studying human migrations to the Philippines, “we intended to look at the distant past by assessing the levels of archaic ancestry among the populations, especially that some populations in these regions were previously shown to have elevated levels of Denisovan ancestry and that Island Southeast Asia is known to be inhabited by various archaic species of Homo,” population geneticist and study coauthor Maximilian Larena tells Gizmodo. To do this, the researchers analyzed the genomes of 1,107 individuals belonging to 118 distinct ethnic groups in the Philippines—including 25 groups self-identifying as “Negritos,” who are regarded as the earliest modern human inhabitants of the Philippines, according to the study’s authors. By comparing these genomes to Denisovan and Neanderthal genomes, they found that while the degree of Neanderthal ancestry was fairly uniform in their study population (and comparable to modern humans in other parts of the world), the degree of Denisovan ancestry was highly variable, and substantially higher among Negritos than in other groups. These findings “are consistent with a model of an independent interbreeding event between Negritos and Denisovans within the Philippines, suggesting that Denisovans may have been in the islands long before the presence of any modern human ethnic group,” Larena tells Gizmodo. University of Tübingen paleogeneticist Cosimo Posth, who was not involved in the study, tells Science News the new report suggests that “still today there are populations that have not been fully genetically described and that Denisovans were geographically widespread.” Currently, the Denisovan fossil record is sparse, and according to Science News, Denisovan fossils can’t be identified by morphology alone. They have to be genetically sequenced, which can be difficult when extracting fossils from tropical climates where the ancient DNA degrades more quickly. See “Denisovan Fossil Identified in Tibetan Cave” The findings “further increase my suspicions that Denisovan fossils are hiding in plain sight,” among previously excavated discoveries on Southeast Asian islands, University of Adelaide population geneticist João Teixeira tells Science News. Teixeira was not involved with the current study. “When it comes to Southeast Asia and the Southeast Asian Islands, we have more questions than answers as we don’t have a good archaeological record,” University of Colorado Boulder population geneticist Fernando Villanea tells Inverse. Villanea, who was not involved with the study, adds, “Now we have these incredible genetic findings and we’re having a hard time putting together a cohesive story.” “By sequencing more genomes in the future, we will have better resolution in addressing multiple questions, including how the inherited archaic tracts influenced our biology and how it contributed to our adaptation as a species,” Larena says in the press release.
|
|
|
Post by Admin on Aug 16, 2021 22:08:43 GMT
Divergence Time between Negritos and Papuans To estimate the divergence time between Philippine Negritos versus Papuans, we utilized the ‘TT’ method, which is based on computing sample configurations in a population divergence model25,92. This approach estimates the number of generations since a population divergence for a pair of individuals (or populations), and the method produces direct estimates in generations that are unaffected by the effective population size of the population of each of the individuals in the comparison. We utilized the publicly available genome sequence data on Papuans23 and our newly generated sequence data on Ayta Magbukon. The estimated divergence time between Ayta Magbukon Negrito and Papuans is ∼85 kya (95% CI: 76 – 95 kya). This unusually old divergence time may be attributed to the deeply diverging Denisovan ancestries found in both Papuans and Ayta Magbukon. To correct the effect of archaic introgression, we filtered out all the archaic sequences (that were identified using the S’ method) in both Ayta Magbukon and Papuans. The estimated divergence time between Ayta Magbukon Negritos and Papuans, using the filtered data, is 53 kya (95% CI: 41-64 kya). This estimated date falls shortly after the divergence between Australasians and East Asians/West Eurasians, which was previously estimated to ∼58 kya (95% CI: 51 – 72 kya)24. Figure S3. Levels of Neandertal, Denisovan, East Asian and Australasian-related ancestries among Philippine ethnic groups, Related to Figures 2 and 3. Map of the Philippines showing the location of the populations with circular markers indicating their levels of East Asian (A) and Australasian (B), Neandertal (C), and Denisovan (D) ancestry. Neandertal ancestry and Denisovan ancestries are estimated by f4-ratio statistics, while East Asian and Australasian-related ancestries are estimated by qpAdm (see STAR Methods). An overlap between the distribution pattern of Australasian-related (B) and Denisovan (D) ancestry is observed, indicating correlation. Inset scatter plots (C,D) display stronger positive correlation between Australasian-related and Denisovan ancestry than between Australasian-related and Neandertal ancestry (R2: 0.86 vs 0.42, respectively). D and f4-ratio statistics on genotyped data All Philippine ethnic groups carry Neanderthal ancestry that is similar to the levels found in all worldwide populations outside of Africa (Figure S3C). This was determined using the test f4-ratio (Chimp;AltaiNeanderthal,Mbuti,X) / (Chimp;AltaiNeanderthal,Mbuti, VindijaNeanderthal). The analysis was restricted to the sites that are non-polymorphic or homozygous ancestral in the Denisovan genome (to correct for inflation of the estimate due to the presence of Denisovan ancestry in some populations). The comparable levels of Neanderthal ancestry in all populations are in line with previous findings indicating that Neanderthal introgression is likely to have been a single event that occurred when anatomically modern humans migrated out of Africa2,3. However, an alternative model of multiple episodes of Neanderthal gene flow into European and East Asians has been suggested93, which may explain the asymmetric pattern of Neanderthal allele frequencies among these populations. We performed the test D(Mbuti;Denisovan,Han,X) or D(Mbuti;Denisovan,Dai,X) (Figures S5A and S5B) to investigate whether any X population received any gene flow from Denisovans, using East Asians as the reference population for comparison (East Asians are known to have some minimal amount of Denisovan ancestry, but which is not significant by standard D tests)12,15. Almost all ethnic groups of the Philippines exhibited significant levels of Denisovan ancestry with the highest levels found among Negritos. The results are consistent when we use the tests D(Mbuti;Denisovan,French/Norwegian,X), where we use Europeans, who are known to lack Denisovan ancestry, as the alternative reference populations for comparison (Figures S5C and S5D). Among Negritos, Ayta Magbukon displayed the highest estimated levels of Denisovan ancestry in a worldwide panel of populations, with Ayta Magbukon exhibiting higher estimates than Papuans, Australians, or any Oceanian populations (Figures 2A, 2B, and S5A–S5D; Data S1A). These findings are consistent when we ran the D tests with different outgroups (Chimp, Juhoansi, or YRI), when we ran with Phil_HO or Phil_AsiaPacific_315K datasets, or with our Philippine dataset merged with the Estonian Genome Diversity Project dataset, or when we correct for Neanderthal ancestry in target populations with f4-ratio Mbuti;Neanderthal,EastAsian,X:Mbuti; Neanderthal,EastAsian,Denisova) (Figures S5F–S5L; Data S1B and S1C). The findings are also consistent when we restrict the Phil_1KGP_SGDP_1.92M dataset to a panel of SNPs that are polymorphic only or ascertained for Europeans or Africans or to the H3frica SNP panel (Figures S5M–S5O). Moreover, when we test individuals instead of populations, we find that most Ayta Magbukon Negritos possess the highest level of Denisovan ancestry in the world (Figure S5L). Some Ayta Ambala, Agta Lopez and Agta Manide individuals also exhibit higher Denisovan ancestry than any other Papuan individual (Figure S5L). RFMix-based masking of East Asian ancestry Considering that Philippine Negritos are admixed with Cordilleran-related ancestry (Figure 1D), we implemented RFMix to produce a new dataset where we removed the Cordilleran-related alleles (and hence retain only the Negrito-related genome-regions) among Philippine Negritos. RFMix64 was implemented on a shapeit-phased94,95 subset of the Phil_1KGP_SGDP_1.92M dataset using the flag -fb 1 -e 1 -n 10 -u 1, with 15 Cordillerans as a reference for least admixed East Asian and 15 Papuans as a reference for least admixed Australasian. We find that Northern Negritos of the Philippines consistently exhibit higher Denisovan ancestry, with non-overlapping confidence intervals, relative to Australians or Papuans or to a southern Negrito, Mamanwa (Figure S5E). There is no change in the levels of Denisovan ancestry in Papuans and Australians before and after masking (the data points are overlapping), which indicates that the increase in Denisovan ancestry among Philippine Negritos is solely related to removal of confounding East Asian admixture. We also performed the test D(Mbuti;Denisovan,Negrito,Papuan/Australian), both in the unmasked and masked versions of the dataset, with matched sample size between populations and inclusion of least admixed individuals (Figures 3B and 3C). We consistently find that Ayta Magbukon exhibit higher Denisovan ancestry relative to Papuans or Australians, and is more pronounced when we mask away East Asian-related ancestry. Correlation analysis A significant negative correlation is observed when we plot the proportion of East Asian ancestry versus Denisovan ancestry (R2: 0.8796, p value: < 0.0001) (Figure S4A). Likewise, a positive correlation between Negrito ancestry and Denisovan ancestry is observed when we plot the proportion of Australasian ancestry using D(Mbuti;Papuan,EastAsian,X) versus D(Mbuti;Denisovan,EastAsian,X) or D(Mbuti; Australian,EastAsian,X) versus D(Mbuti;Denisovan,EastAsian,X) (Figures S4B–S4E). Separate from the slope of Papuan-related ancestry, two additional slopes can be observed for Northern Negritos and Southern Negritos (Figures S4B and S4E). The linear clusters of Northern Negritos and Southern Negritos (together with Manobo-related populations with high Southern Negrito ancestry) form their distinct respective slopes that are also positively correlated with Denisovan ancestry. All populations with Papuan-related ancestry (that is distinctively post the Australopapuan divergence), such as Indonesians and Oceanians, form a distinct and separate slope, separate from the slopes formed by Negritos (Figures S4B–S4E). As expected, among the Philippine ethnic groups, Sangil is a distinct outlier from the slope formed by Northern Negritos or Southern Negritos (Figures S4B–S4E). Sangil aligns more with the slope for eastern Indonesians, Bougainville islander, and Papuans. This suggests that populations with more Papuan-like ancestry exhibit a Denisovan ancestry that is distinct from the Denisovan ancestry found in Northern or Southern Negritos. Altogether, these findings, that Denisovan ancestry in Negritos is not associated with Papuan-related ancestry, indicates that the Denisovan introgression event in Negritos is independent from that of Papuans.
|
|
|
Post by Admin on Aug 17, 2021 4:39:39 GMT
Sequencing Analyses of Ayta Magbukon Negritos We have sequenced and analyzed the genomes of five Ayta Magbukon individuals and present the results in Figure 4. In order to omit potential genome-sequence-processing biases, we started from the raw sequence data (FASTQs) for all individuals (including the comparative human sequence data), and processed all the sequence data in one batch, using standard mapping and filtering criteria (see Method details section for a full description). D and f4-ratio statistics on sequenced data Denisovan ancestry was estimated using the test D(Outgroup;Denisovan,EastAsian,X) or the f4-ratio statistic f4(Outgroup; Neanderthal,EastAsian,X) / f4(Outgroup;Neanderthal, EastAsian,Denisovan), with Altai and Vindija individuals representing the Neanderthals, and Kankanaey Cordilleran representing East Asians. Ayta Magbukon consistently demonstrate to have the highest level of Denisovan ancestry (Figure 4A; Data S2A and S2B). To assess whether Ayta Magbukon possess higher Denisovan ancestry than highland Papuans or Australians, we implemented the test D(Outgroup;Denisovan,Papuan/Australian,AytaMagbukon), with Chimp, Juhoansi, Biaka, Khomani San, or Yoruba used as outgroups. We confirm that the Ayta Magbukon possess higher levels of Denisovan ancestry relative to that of Australians or Papuans, with D values of 0.0168 - 0.0256 and all Z score of > 3 ((Figures S6A and S6B; Data S2C). To assess the percentage difference of Denisovan ancestry in Ayta Magbukon versus Australians or Papuans, we utilized the statistic D(Mbuti,Denisovan,EastAsian,AytaMagbukon) / D(Mbuti,Denisovan,East Asian,Papuan/Australian). We find that the Ayta Magbukon possess 29%–39% more Denisovan ancestry than do Australians and Papuans (Figure 4B; Data S2A). S’ method For identifying archaic introgressed sequences without the use of reference archaic genomes, we utilized the recently developed S-prime (S’) method15. The method is designed to detect divergent haplotypes that are in strong LD and that are of very low frequency or absent in outgroup African population (who are known not to harbor archaic ancestry). The S’ method was demonstrated to have better power relative to other archaic reference-free methods, and is appropriate for large-scale genomic data15. For our analysis, we utilized 15 SGDP Sub-Saharan African individuals as the outgroup population (which includes the Juhoansi, Mbuti, Khomani, Yoruba, and Biaka individuals), and 5 newly sequenced Ayta Magbukon and 15 SGDP Papuan individuals as the test populations. Following retrieval of archaic sequences using S’, we leveraged the sequences to the reference Altai Denisovan and Vindija Neanderthal genomes. First, using the masks downloaded from cdna.eva.mpg.de/neandertal/, we filtered out the following sites from the archaic genomes: indels, sites within tandem repeats, sites with poor mapability, depth of coverage of < 10x, or mapping quality of < 25. For each filtered site, we compared the genomes of the test individuals to the reference Denisovan and Neanderthal genomes, and report if they match or mismatch. The match rate per archaic segment was then estimated by dividing the total number of matches over the total number of sites compared. Contour densities of the Denisovan versus Neanderthal match rates were plotted using the kde2d function of the R v3.3.1 MASS package. We find that both Ayta Magbukon and Papuans display similar patterns of affinity to the reference Denisovan and Neanderthal genomes, moderate affinity or match rate of ∼50% to the Altai Denisovan and high affinity or match rate of ∼80% to the Vindija Neanderthal (Figures 5A and 5B). Following a previous approach15, we identify the segments as putatively Denisovan with the following criteria: presence of at least 30 introgressed alleles that can be compared with the reference archaic genomes; less than 30% match rate to the Vindija/Altai Neanderthal genome, and greater than 30% match rate to the Denisovan genome. Based on this criteria, we find that the Ayta Magbukon exhibit significantly higher magnitude of Denisovan ancestry relative to that of Papuans. The average amount of Denisovan sequence detected in Ayta Magbukon and Papuans are 100.38 Mb (95% CI: 89.66 – 111.11 Mb) and 79.35 Mb (95% CI: 76.89 – 81.79 Mb), respectively (p = 3.0x10-4); a difference of ∼27%. To classify the introgressed haplotypes into Denisovan versus Neanderthal source, we applied the criteria used in a previous work15. We classified segments as putatively Neanderthal with the following conditions: presence of at least at least 30 introgressed alleles that can be compared with the reference archaic genomes; greater than 60% match rate to the Vindija Neanderthal genome; and less than 40% match rate to the Denisovan genome. We then identified segments as putatively Denisovan with the following conditions: presence of at least at least 30 introgressed alleles that can be compared with the reference archaic genomes; greater than 40% match rate to the Denisovan genome; and less than 30% match rate to the Vindija Neanderthal genome. Any segment that indistinguishable from Neanderthal or Denisovan are categorized as ambiguous. The summary of archaic sequences per individual is presented in Figure S6C. Using the aforementioned criteria that distinguishes between different archaic sources, Ayta Magbukon and Papuans display similar amounts of Neanderthal sequences [(45.35 Mb (95% CI: 41.71 – 48.99 Mb) versus 46.47 Mb (95%CI: 44.96 – 47.97 Mb)] and ambiguous sequences [(7.52 Mb (95% CI: 5.47 – 9.57 Mb) versus 6.92 Mb (95%CI: 6.25 – 7.60 Mb)]. Consistently, we find Ayta Magbukon to present with significantly higher levels of Denisovan ancestry relative to Papuans (Figure 4D). The average amount of Denisovan sequences detected in Ayta Magbukon and Papuans are 51.94 Mb (95% CI: 44.62.66 – 59.25 Mb) and 41.96 Mb (95% CI: 39.54 – 44.37 Mb), respectively (p = 3.7x10-3); a difference of ∼24%. Correlation Between Denisovan and Australasian ancestry We plotted the correlation between Denisovan and Australasian-related ancestry among Negrito and Papuan-related groups. Denisovan ancestry was estimated using f4-ratio statistics, while Australasian-related ancestry was estimated using qpAdm. Consistent with the correlation analysis presented earlier, Negritos fall outside the slope formed by the Papuan-related populations. Furthermore, an important aspect of this study is the availability of Negritos with varying levels of admixture with East Asian-related populations. This allowed us for the first time to plot and extrapolate the level of Denisovan ancestry in an ‘unadmixed’ ancestral Negrito population (D = 4.47), which demonstrably have at least 46% more Denisovan ancestry than the highlander Papuans (Figure 3D). In addition, our analysis also demonstrated for the first time that Negritos have significantly higher levels of ‘Denisovan richness’ in their Australasian ancestry than Oceanians, which we define as the ratio of Denisovan ancestry over Australasian ancestry (Figure 3E). Altogether, the evidence presented here, that Denisovan ancestry in Negritos is not associated with Papuan-related ancestry and that Negritos have higher levels of Denisovan richness, provide support to a demographic model where Negritos receive an additional and/or an independent Denisovan introgression event. Evaluating Recent Admixture in Philippine Negritos versus Papuans The lower levels of Denisovan ancestry in Papuans could be explained by dilution via recent admixture with non-Papuan populations with no detectable or low levels of Denisovan ancestry. To determine the presence of gene flow from populations with little or no Denisovan ancestry into Papuans or Philippine Negritos, we implemented the test D(Outgroup;X,Papuan, Philippine Negrito); with Balangao, Han, or Dai as a surrogate for East Asian; Even and Yakut as a surrogate for Siberian or North Asian; Karitiana or Mixe as a surrogate for Native American; Irula as a surrogate for South Asian; French or Sardinian as a surrogate for European; and Luhya or Mbuti as a surrogate for African (Figures S7A–S7L). Based on the various combination of D tests listed above, we do not find any population that shares more alleles with Papuans relative to Negritos (Figures S7A–S7L). In fact, it is the Negritos that consistently displayed evidence for recent admixture, given the high levels of allele sharing between Negritos and East Asians, relative to Papuans. Admixture Graph Analysis We constructed admixture graphs using the qpGraph tool of the AdmixTools v5.0 software package to determine which population-history model best fits the data that accounts for the number of Denisovan introgression events into Philippine Negritos and Papuans. Using the Phil_1KGP_SGDP_Ancient_Transv_317K dataset, we incorporarted in the model the chimpanzee as an outgroup, French to represent West Eurasians, Balangao Cordilleran and Liangdao 2 to represent East Asians, and Australian, Papuan, and Ayta Magbukon to represent Australasians (Figures 6A–6C). The qpGraph tool was ran with the following parameters: outpop (NULL), blgsize (0.05), diag (0.0001), hires (YES), and lsqmode (YES). A model is rejected when the worst fitting f statistical test has a significant Z score of > | 3 |. A model with a single Denisovan introgression event shared between Philippine Negritos and Australopapuans is rejected (worst-fitting f score Z = −3.6). Models that allow independent Denisovan introgression events into Negritos and Papuans are not rejected (worst-fitting f score of Z = −1.18 for both models), providing support that Philippine Negritos likely admixed with Denisovans independently from the Denisovan introgression event that occurred in Australopapuans. Simulation of Demographic Scenarios The coalescent simulator msprime29 was utilized to evaluate whether a null model (Figure 7F) of single Denisovan introgression event into the common ancestor of Ayta Negritos and Papuans or alternative models (Figures 7K, 7P, 7U, and 7Z) of multiple Denisovan admixture events are consistent with our empirical data: higher levels of Denisovan ancestry in Ayta Negritos than Papuans (Figures 7A and 7B), similar Denisovan tract lengths between Ayta Negritos and Papuans (Figure 7C), higher ratio of Denisovan over Australasian ancestry in Ayta Negritos relative to Papuans (Figure 7D), and Ayta Negritos is an outlier to the Papuan-Denisovan slope formed after correlating Denisovan and Australasian ancestry (Figure 7E). For each of the models shown in Figures 7F, 7K, 7P, 7U, and 7Z, 500 replicate simulations were performed, with 10 samples of 100M sequence taken from each of 8 populations. The parameters fixed across all models are shown in Table S1, and parameters that differed between models are shown in Table S2. All scripts used to generate data are available at github.com/jammc313/Philippine_Denisovan_Ancestry. The null model (Figure 7F) includes an admixture event coming from an unsampled Denisovan population that diverged from the ‘Altai’ Denisovan 300 kya, into the shared ancestral population of Papuan and Ayta (4%, 48 kya). An alternative model (Alt1) follows an identical setup (Figure 7K), plus an additional admixture event (2%) coming from a distinct and unsampled Denisovan population, (also diverged from the Altai Denisovan 300 kya), into the (ancestors of) an Ayta population (40 kya), resulting in a total of 4% + 2% = 6% Denisovan admixture in the (ancestral) Ayta population. A second alternative model (Alt2) includes Denisovan admixture events private to both Papuans (4%, 45kya) and Ayta (6%, 35 kya) (Figure 7P). Again, the sources of Denisovan admixture are distinct Denisovan populations both diverged from the ‘Altai’ Denisovan 300 kya. A third alternative model (alt3) is similar to alt2, but with the Denisovan introgression event into Papuans occurring more recently at 25 kya (Figure 7U). The last alternative model (alt4) is similar to alt2, but with the independent Denisovan introgression events into Ayta and Papuans occurring around the same at 25 kya (Figure 7Z). The parameters of all models were kept as general as possible, with population sizes, admixture dates, and admixture proportions all chosen in accordance with the general consensus from previous studies24,27,28. For all the simulations, we plotted the levels of Denisovan ancestry based on f4-ratio (Africa;Neanderthal,EastAsian,X)/(Africa;Neanderthal,EastAsian,Denisovan), mean tract lengths of Denisovan ancestry, and the ratio of Denisovan ancestry over Australasian ancestry. All pairwise comparisons were tested for significance using the Mann-Whitney U test with Holm-Bonferroni correction. The correlation between Denisovan and Australasian ancestry were also plotted following additional simulations with incremental 10% East Asian admixture into Ayta and Papuans. The simulated models were leveraged against the empirical data using the following four parameters: level of Denisovan ancestry, mean Denisovan tract length, ratio of Denisovan ancestry over Australasian ancestry, and correlation between Denisovan ancestry and Australasian ancestry. Except for mean Denisovan tract length, the null model produced patterns of Denisovan ancestry that are not consistent with the observed data (Figures S7F–S7J). On the other hand, all the alternative models consistently replicate three pieces of empirical evidence: i) higher level of Denisovan ancestry in Ayta Negritos than Papuans, ii) higher ratio of Denisovan over Australasian ancestry in Ayta Negritos than Papuans, and iii) Ayta Negritos falls outside the slope formed by Papuan-related groups when plotting the correlation between Denisovan ancestry and Australasian ancestry (Figures S7K–S7Ad). Among the alternative models, only Alt4 model produced patterns of mean Denisovan tract length that is approximating similar levels in Ayta Negritos and Papuans, supporting a demographic scenario where the timing of the separate Denisovan introgression event into Ayta Negritos and Papuans happened around the same time, (Figures 7C and 7Ab).
|
|
|
Post by Admin on Aug 17, 2021 21:29:26 GMT
Discerning the Origins of the Negritos, First Sundaland People: Deep Divergence and Archaic Admixture Timothy A. Jinam, Maude E. Phipps, Farhang Aghakhanian, Partha P. Majumder, Francisco Datar, Mark Stoneking, Hiromi Sawai, Nao Nishida, Katsushi Tokunaga, Shoji Kawamura
Abstract Human presence in Southeast Asia dates back to at least 40,000 years ago, when the current islands formed a continental shelf called Sundaland. In the Philippine Islands, Peninsular Malaysia, and Andaman Islands, there exist indigenous groups collectively called Negritos whose ancestry can be traced to the “First Sundaland People.” To understand the relationship between these Negrito groups and their demographic histories, we generated genome-wide single nucleotide polymorphism data in the Philippine Negritos and compared them with existing data from other populations. Phylogenetic tree analyses show that Negritos are basal to other East and Southeast Asians, and that they diverged from West Eurasians at least 38,000 years ago. We also found relatively high traces of Denisovan admixture in the Philippine Negritos, but not in the Malaysian and Andamanese groups, suggesting independent introgression and/or parallel losses involving Denisovan introgressed regions. Shared genetic loci between all three Negrito groups could be related to skin pigmentation, height, facial morphology and malarial resistance. These results show the unique status of Negrito groups as descended from the First Sundaland People.
Introduction The question of how and when anatomically modern humans made the journey out of Africa and into all corners of the world has been of great interest. It is widely acknowledged that humans have been in Southeast Asia (SEA) at least 40–50 thousand years ago (ka). Indeed, human remains found in Niah Cave in Borneo (Barker et al. 2007), Callao Cave in the Philippines (Mijares et al. 2010) and Tam Pa Ling in Laos (Demeter et al. 2012) were dated to approximately that time period. Then, the current islands of Sumatra, Java and Borneo were connected with the Asian mainland, forming the landmass known as Sundaland. A cluster of islands separate Sundaland from another landmass called Sahul, made up of what is now New Guinea and Australia.
There are several human populations scattered throughout SEA that are thought to be descendants of the “First Sundaland People.” They are collectively known as Negritos and are currently found in the Andaman Islands, Malay Peninsula and several islands in the Philippines. They have been traditionally associated with a hunter-gathering lifestyle, and also exhibit physical features that are distinct from their non-Negrito neighbors, namely short stature, frizzy hair, and dark skin (Barrows 1910; Radcliffe-Brown 1922; Evans 1937). These observations led to the idea that the Negritos might be closely related to African Pygmies who also exhibit similar phenotypes (Howells 1973). Alternatively, the similar phenotypes may have arisen due to adaptation to relatively similar environmental conditions in Africa and Southeast Asia (Coon 1965), namely convergent evolution.
Early genetic studies that utilized various red blood cell enzymes and serum proteins, led by one of us (K.O.), found that Philippine Negritos have closer affinities to Asia-Pacific populations than to African Pygmies (Omoto et al. 1978, 1981; Matsumoto et al. 1979; Horai et al. 1981). Later studies of mitochondrial DNA (mtDNA) reported basal lineages in Andamanese and Malaysian Negritos that date back to the earliest migrations to SEA (Thangaraj et al. 2005; Hill et al. 2006; Jinam et al. 2012). Genome-wide Single Nucleotide Polymorphisms (SNPs) studies added further insight to the genetic diversity of Negritos. Using ∼50k SNPs, the HUGO Pan-Asian SNP Consortium proposed a single wave of migration into SEA, with the Malaysian and Philippine Negritos as the forerunners (The HUGO Pan-Asian SNP Consortium 2009). Subsequent studies reported various demographic and evolutionary factors that affected their genetic diversity. These included admixture (Jinam et al. 2013), long term isolation and bottlenecks (Deng et al. 2014; Aghakhanian et al. 2015), and adaptation to malaria (Liu et al. 2015). However, compared with Malaysian Negritos, genome-wide studies involving the Philippine Negritos are relatively limited.
Here we generated approximately 1 million genome-wide SNPs in four Negrito (Aeta, Agta, Batak, and Mamanwa) and three non-Negrito groups (Tagalog, Visayan, and Manobo) from the Philippines. By combining this newly generated SNP data with previously published data of Andamanese and Malaysian Negritos, as well as other neighboring populations in SEA, we sought to obtain a deeper view of the demographic events that shaped the genetic diversity of the Negritos. Specifically, we would like to find out when these populations diverged; are there any genetic similarities among the Negrito groups; and did admixture with archaic humans have any impact on their genetic diversity?
|
|
|
Post by Admin on Aug 18, 2021 4:13:55 GMT
Materials and Methods Sample Information Peripheral blood samples from the Philippines were collected by the Japanese–Philippine joint study team headed by one of us (K. O.), from 1975 to 1985. The DNA were extracted and purified using the phenol–chloroform method, and were preserved in freezers; now stored at Department of Integrated Biosciences, Graduate School of Frontier Sciences, the University of Tokyo. A total of 150 individual samples from four Negrito groups and three non-Negrito groups were selected for genome-wide SNP typing using the Affymetrix 6.0 array. The sample information is listed in supplementary table S1 in the Supplementary Material online. The geographical locations of these populations are depicted in supplementary figure S1 in the Supplementary Material online. This study has been approved by the ethics committee of the University of Tokyo (15-223) and the National Institute of Genetics, Japan. Data Quality Checks The SNP genotypes, typed at Department of Human Genetics, Graduate School of Medicine, the University of Tokyo, were called using Affymetrix Birdseed v2 algorithm, with a more stringent confidence score cutoff of 0.1 instead of the default 0.5. In total, genotypes for 868,261 autosomal SNPs were obtained. Further SNP quality filtering was done using PLINK software (Purcell et al. 2007) to omit SNPs with <95% genotyping call rate. SNPs that deviated from the Hardy–Weinberg equilibrium (P < 1×10−10) were also removed from each population separately, before merging the data again. Closely related pairs of individuals were identified in each population using KING software (Manichaikul et al. 2010). To keep as many individuals as possible, only one individual from each pair that had first degree relations (estimated kinship coefficient >0.25) were removed; in total 25 individuals were omitted. Merging with Other Samples The resulting 782,966 autosomal SNP from the Philippines were merged with five populations from Malaysia and the Philippines which were also genotyped using Affymetrix 6.0 arrays (Reich et al. 2011), including two Negrito groups (Jehai and Mamanwa) and three non-Negrito Austronesian populations (Manobo, Bidayuh, and Temuan). In addition, three HapMap (International HapMap Consortium 2005) populations (Han Chinese (CHB), Caucasians (CEU) and Yorubans (YRI)) that were genotyped using Affymetrix 6.0 arrays were merged with the Philippine and Malaysian data sets, resulting in a total of 519,832 SNPs. The above data set was further merged with two Andamanese groups (Jarawa and Onge) (Basu et al. 2015) and two Malaysian Negrito groups (Bateq and Kintaq) (Aghakhanian et al. 2015). The number of overlapping SNP loci was 112,845. We further merged the data with the Human Genome Diversity Panel (HGDP) SNP data generated with the Affymetrix Human Origins array (Patterson et al. 2012). The number of available SNPs after merging was 44,960. The various data sets used for subsequent analyses are listed in supplementary table S2 in the Supplementary Material online. Data Analysis Principal Component Analysis (PCA) was performed using the smartpca program from the EIGENSOFT package (Patterson et al. 2006) to infer relationships at the individual level. In addition, ADMIXTURE analysis (Alexander et al. 2009) was done to assess population substructure and admixture within individuals. We also performed population level phylogenetic analysis by constructing Neighbor Joining trees (Saitou and Nei 1987) from Nei’s standard genetic distances (Nei 1972) computed using PHYLIP (http://evolution.genetics.washington.edu/phylip.html). One thousand bootstrap replicates of the input data were generated to assess the robustness of the tree branching patterns. Networks were constructed using the Neighbor-Net algorithm (Bryant and Moulton 2004) implemented in Splitstree software (Huson and Bryant 2006). To assess possible geneflow events between populations, we used the treemix software (Pickrell and Pritchard 2012). The robustness of treemix estimates was tested using 1,000 bootstrap replicates of the input data, generated using perl scripts developed and used for Jomon ancient genome analysis (Kanzawa-Kiriyama et al. 2017). We also performed a formal test for admixture using the D-statistics method from the Admixtools software package (Patterson et al. 2012). The f4 ratio test from the same software package was used to estimate the proportion of Denisovan admixture in the various Negrito groups, assuming the tree topology in supplementary figure S2 in the Supplementary Material online. We further applied the RD(x) statistic (Qin and Stoneking 2015) to verify Denisovan introgression, taking into account Neanderthal ancestry in various Southeast Asian populations. The RD(x) statistics is defined as the ratio of two f4 (or D) statistics: f4(Yoruban, Denisovan; French, x)/f4(Yoruban, Neanderthal; French, x), where x is the test population. Values >1 imply Denisovan ancestry in population x. Pairwise allele sharing distances (ASD) (Gao and Martin 2009) were calculated from the genome sequences of Denisovan (Meyer et al. 2012), Altai Neanderthal (Prüfer et al. 2014), a Papuan individual (Green et al. 2010), an Australian Aborigine (Rasmussen et al. 2011), three Aeta (Philippine Negrito) individuals (Pagani et al. 2016) and five Han Chinese (The 1000 Genomes Project Consortium 2015), using a perl script. The position of the three Aeta individuals from Pagani et al. (2016) in the PCA plot (supplementary fig. S3, Supplementary Material online) suggests that they may be more admixed than some Aeta individuals from our data set. SNP loci with missing genotypes were omitted and the total number of SNPs used for this analysis was 753,276. We defined allele sharing (AS) as 1−ASD and calculated this statistic for nonoverlapping blocks of 1,000 SNPs. For each block, we took the ratio of ASDenisovan/ASNeanderthal in Papuan, Australian Aborigine, and Aeta separately. Blocks with ratios >1 are putatively enriched with Denisovan ancestry while the rest are set to zero AS with Denisovan. We further subtracted ASDenisovan-CHB values from each putative Denisovan shared block to get a clearer signal of Denisovan ancestry. To estimate divergence times between pairs of populations, we used the R-package NeON (Mezzavilla and Ghirotto 2015) which implements a method based on patterns of LD and allele frequencies in the genome (McEvoy et al. 2011). From the resulting pairwise estimates of divergence times, we constructed phylogenetic trees using the UPGMA method (Sokal and Michener 1958; see Saitou 2013 for the algorithm), because by definition divergence time estimates should follow a constant rate of evolution. In order to identify genetic loci that may be shared among Negrito groups, we calculated pairwise Fst (Weir and Cockerham 1984) between Andamanese, Malaysian and Philippine Negritos and Han Chinese. We then identified SNP sites that have low Fst among the three Negrito groups (Fst from 0 to 0.05) but high Fst between Negritos and Han Chinese (>10-fold difference in Fst). The possible functions of these SNPs were examined using the Panther Gene Ontology (Thomas et al. 2003).
|
|