Post by Admin on Jul 21, 2023 18:30:52 GMT
Human pigmentation varies considerably within and among populations and is a function of both variation in exposure to ultraviolet radiation (UVR) and the type and quantity of melanin produced in melanocytes and keratinocytes (1). While genome wide association (GWA) studies in European populations have yielded numerous insights into the genetic basis of pigmentation variation within Europeans (2), the relevance of these associations outside of Europe remains largely unexplored (but see (3)). Here, we focus on understanding the genetic basis of blond hair in the Solomon Islands, a population that breaks from the general trend of darker skin and hair pigmentation near the equator where there is higher UVR (1, 4). Strikingly, while individuals from the Solomon Islands and other locations in Oceania near the equator have both the darkest skin pigmentation outside of Africa, they also have the highest prevalence of blond hair (5–10%) outside of Europe (5). The earliest inhabitants of Near Oceania settled at least 40,000 years before present (YPB) and are thought to have remained in relative isolation on the archipelago for at least 25,000 years (6–9). There is clear evidence of at least one subsequent migration from Southeast Asia about 3,000–4,000 YBP, and waves of more recent gene flow from other islands in the Pacific and from Europe (10, 11). To understand whether sharing of this rare human phenotype is due to globally common genetic variants or arose independently, we investigated the genetic mechanism underlying hair color in Solomon Islanders.
Figure 1
PCA (A) and ADMIXTURE (B) plots demonstrating the genetic relationships between Solomon Islanders (blond and dark hair) genotyped in the present study and several other populations. In PC space, Solomon Islanders are found between New Guineans and Asians, which is consistent with their population history (7). Blond and dark-haired Solomon Islanders show no systematic differences in their ancestry suggesting that blond hair is unlikely to be due to gene flow from other populations (e.g. Europeans). (CEU = Europeans from HapMap; CHB = Han Chinese from HapMap; PNG = Papua New Guinea; MIC = Micronesians; POL = Polynesians; SI = Solomon Islanders).
We measured skin and hair pigmentation in 1,209 Solomon Islanders and selected 85 individuals from the extreme 10% tails of the hair pigmentation distribution (43 blond and 42 dark-haired individuals) for genotyping with the Affymetrix 6.0 array (fig. S1–2 and table S1) (12). Estimates of global ancestry proportion showed that individuals from the Solomon Islands are genetically distinct from nearby populations (Fig. 1A and 1B), as has been previously observed (5). Further, no systematic differences in ancestry between blond and dark-haired Solomon Islanders were found suggesting that the presence of blond hair in the Solomon Islands is not due to recent gene flow from other populations (Fig. 1A, 1B and S3).
A case-control GWA study for hair color, comparing blond to dark-haired individuals, revealed a single strong association signal on chromosome 9 (Fig. 2A). The GWA signal was driven by 151 single nucleotide polymorphisms (SNPs) at 9p23 flanked at each side by a recombination hotspot (Fig. 2B). The SNP with the strongest association (rs13289810; odds ratio = 29.5±0.485 SE; P=1.11×10−19) had a frequency of 0.93 and 0.31 in blond -and dark-haired individuals, respectively. The mapping interval contained a single known candidate gene, tyrosinase-related protein 1 (TYRP1), which encodes a melanosomal enzyme involved in mammalian pigmentation (13).
Figure 2
Mapping the blond hair locus in Solomon Islanders. (A) A Manhattan plot and QQ-plot (inset) of the association scores comparing blond and dark haired individuals. The grey line indicates the genome-wide threshold for statistical significance. The QQ-plot shows the distribution of p-values before (black points; λ=1.02) and after (grey points; λ=1.00) removal of the 151 SNPs in the associated 9p23 region. (B) A regional plot of the GWA including the R93C allele, where the signal for the top SNP on the GWA array (blue diamond) and R93C (large red diamond) are indicated. The degree of redness of all SNP’s in the region indicates linkage disequilibrium with R93C. Also shown is recombination rate (navy line) at the signal peak in a window of ± 500 kb around the top GWA SNP. Positions, recombination rates and gene annotations are according to NCBI build 37 (hg 19).
Figure 3
Hair and skin pigmentation variants in TYRP1 (A) Top; a schema of the protein domains of TYRP1 (signal peptide (SI); epidermal growth factor (EGF), copper-binding and transmembrane (TM) domains) indicating variants in humans and other vertebrates that result in lightened skin and/or hair/coat (in brown) or albinism (in grey). Bottom; homology at the first 100 amino acids of TYRP1 in humans and other vertebrates. The location of the mouse brownlight mutation (R38C) and the Solomon Islands blond hair mutation (R93C) are given. (B) Boxplot showing the effect of the R93C genotypes on hair pigmentation in 921 Solomon Islanders.
To verify that 93C is the single causal allele driving the 9p23 GWA peak, the variant was genotyped directly in the discovery GWA panel and analyses repeated including R93C. R93C was more strongly associated with blond hair (P=9.60×10−23) than the top GWA SNP (Fig. 2B). Furthermore, analysis conditioning on the top GWA SNP (rs13289810) failed to remove all association from this region, as the residual signal for R93C still attained borderline genome-wide significance (P=3.11×10−7) (fig. S4A). Conversely, conditioning on R93C abolished the signal for association across this locus (fig. S4B), confirming a primary role for the coding R93C change.
To further evaluate its effects on hair pigmentation in Solomon Islanders, 93C was genotyped in 921 Solomon Islanders for whom we had measured hair pigmentation with spectrometry (fig. S5). Adjusting for age and geography, females had lighter hair than males (Student’s T test, P = 4.09×10−35), consistent with a previous study of hair color in Melanesians (5). Further, adjusting for sex and geography, hair color for 93C homozygotes (i.e. blonds) was not seen to darken significantly with age (R2 = 0.013, P = 0.187), whereas darkening was observed in heterozygotes (R2 = 0.026, P = 0.0017) and 93R homozygotes (R2 = 0.031, P = 0.0004). A recessive model provides the best fit for the association between 93C and hair color and a model including genotype, age and sex accounts for 46.4% of the variance in hair color on the Solomon Islands (P = 2.19×10−90; fig. S6 and table S3). There was evidence, however, that hair color differs between 93R homozygotes and heterozygotes (One-sided T test, P = 1.27×10−9), which suggests that 93C may have some co-dominant effect (Fig. 3B). Finally, we also assessed the effect of 93C on skin pigmentation in the Solomon Islands and found a modest additive effect accounting for 5.3% of variance in skin color (P = 4.56×10−8; fig. S7 and table S4).
The location and molecular alteration of the 93C mutation in TYRP1 suggests some intriguing possibilities for its function. TYRP1 encodes a key enzymatic step in melanin biosynthesis, giving rise to black/brown pigmentation (eumelanin), which occurs at high levels in individuals of African and Oceanic descent. Mutations in TYRP1, which is highly conserved in vertebrates, have been shown to lighten skin and/or hair pigmentation in several species including pigs, dogs, mice and zebra fish (Fig. 3A; 15, 16–19). In humans, previously discovered rare mutations leading to complete loss of function for TYRP1 are known to cause OCA3 or rufous albinism (20, 21). The 93C blond hair mutation in human TYRP1 resides in an epidermal growth factor (EGF)-like repeat near the N-terminus and shows similarity to the TYRP1 allele in the brownlight mouse (Fig. 3A). The brownlight mouse exhibits reduced TYRP1 stability and catalytic function resulting in decreased melanin content in mouse hair (19). Further, hair in these mice progressively lightens with age due to premature melanocyte death (19). Strikingly, like blond hair in Melanesians, the brownlight phenotype in the mouse is caused by an ARG→CYS substitution, located 55 amino acids upstream of R93C (R38C). R38C likely acts to interfere with normal disulfide bridges formed by the 15-Cys EGF repeat (Fig. 3A). It seems likely that the human 93C mutation confers a similar phenotype, namely premature melanocyte death that is exacerbated with age and melanogenic activity, resulting in blond hair that does not darken with age as observed in the Solomon Islands (Fig. 3B). However, it is also possible that the mouse R38C and human R93C mutations confer different effects on TYRP1 function, or that the R93C mutation is not disruptive to TYRP1 function, but rather is in linkage disequilibrium with non-coding functional genetic variant(s) within the ~500kb mapping interval.
The frequency of the 93C allele in the Solomon Islands is 0.26. To determine the worldwide distribution of the 93C allele, we genotyped the R93C SNP in 941 individuals from the 52 populations in the CEPH-HGDP (22). All 941 individuals were homozygous for 93R (table S4) and furthermore, the 93C allele was found to be absent from standard variant repositories (i.e. dbSNP, 1000 Genomes). Three 93C homozygous individuals were found in our cohort who reported that one of their parents came from outside the Solomon Islands: two individuals each with a parent from Fiji and one individual with a parent from New Guinea. Together, these observations suggest that the 93C allele is restricted to the Pacific and that the unique genetic mechanism causing blond hair in Solomon Islanders is shared with neighboring populations with the blond hair phenotype (5).
The frequency of the 93C allele in Solomon Islanders relative to other populations and its robust association with a visible human phenotype suggest that the 93C allele may have been driven to intermediate frequency by positive selection. The TYRP1 locus was evaluated for evidence of past positive selection using statistics based on allele frequency differentiation and long-range linkage disequilibrium (12). No strong evidence of recent selection at the TYRP1 locus was found in our sample from the Solomon Islands (fig. S8–S10). This result should be interpreted with caution, however, since our sample has relatively low power to detect selection on variants that follow a recessive mode of inheritance (23, 24). Moreover, even if the 93C allele was driven to its present frequency by selection, it remains a possibility that a different phenotype was targeted by selection and that blond hair is the result of phenotypic hitchhiking (25). For example, skin pigmentation may have been targeted by selection as we find that the 93C allele is associated with a minor reduction in skin pigmentation (table S3). This locus does show evidence of selection in Europeans, but it remains unclear what functional variant(s) may have been the target of selection (26, 27).
Regardless of the mechanisms responsible for the intermediate frequency of the 93C allele in Melanesia, this study realizes some of the postulated benefits of mapping complex human traits in population isolates (28); namely, the strong influence of genetic drift (and/or local positive selection) in such populations means that, while many recessive and/or rare alleles will be lost, some have a chance to rise to an appreciable frequency. Thus, it is expected (and demonstrated in recent studies) that population isolates may harbor alleles at appreciable frequency that influence complex traits and that are otherwise rare or absent in other populations (29–31). In the present case, the 93C TYRP1 variant accounts for ~46% of the variation in hair color in Solomon Islanders, which to our knowledge is the strongest reported effect on a visible human phenotype attributable to a common polymorphism. More generally, this work strongly supports the growing notion that population-specific variation is important in accounting for the heritable variance in clinically relevant phenotypes, and underscores the importance of enabling medical genomics in diverse worldwide populations.
Figure 1
PCA (A) and ADMIXTURE (B) plots demonstrating the genetic relationships between Solomon Islanders (blond and dark hair) genotyped in the present study and several other populations. In PC space, Solomon Islanders are found between New Guineans and Asians, which is consistent with their population history (7). Blond and dark-haired Solomon Islanders show no systematic differences in their ancestry suggesting that blond hair is unlikely to be due to gene flow from other populations (e.g. Europeans). (CEU = Europeans from HapMap; CHB = Han Chinese from HapMap; PNG = Papua New Guinea; MIC = Micronesians; POL = Polynesians; SI = Solomon Islanders).
We measured skin and hair pigmentation in 1,209 Solomon Islanders and selected 85 individuals from the extreme 10% tails of the hair pigmentation distribution (43 blond and 42 dark-haired individuals) for genotyping with the Affymetrix 6.0 array (fig. S1–2 and table S1) (12). Estimates of global ancestry proportion showed that individuals from the Solomon Islands are genetically distinct from nearby populations (Fig. 1A and 1B), as has been previously observed (5). Further, no systematic differences in ancestry between blond and dark-haired Solomon Islanders were found suggesting that the presence of blond hair in the Solomon Islands is not due to recent gene flow from other populations (Fig. 1A, 1B and S3).
A case-control GWA study for hair color, comparing blond to dark-haired individuals, revealed a single strong association signal on chromosome 9 (Fig. 2A). The GWA signal was driven by 151 single nucleotide polymorphisms (SNPs) at 9p23 flanked at each side by a recombination hotspot (Fig. 2B). The SNP with the strongest association (rs13289810; odds ratio = 29.5±0.485 SE; P=1.11×10−19) had a frequency of 0.93 and 0.31 in blond -and dark-haired individuals, respectively. The mapping interval contained a single known candidate gene, tyrosinase-related protein 1 (TYRP1), which encodes a melanosomal enzyme involved in mammalian pigmentation (13).
Figure 2
Mapping the blond hair locus in Solomon Islanders. (A) A Manhattan plot and QQ-plot (inset) of the association scores comparing blond and dark haired individuals. The grey line indicates the genome-wide threshold for statistical significance. The QQ-plot shows the distribution of p-values before (black points; λ=1.02) and after (grey points; λ=1.00) removal of the 151 SNPs in the associated 9p23 region. (B) A regional plot of the GWA including the R93C allele, where the signal for the top SNP on the GWA array (blue diamond) and R93C (large red diamond) are indicated. The degree of redness of all SNP’s in the region indicates linkage disequilibrium with R93C. Also shown is recombination rate (navy line) at the signal peak in a window of ± 500 kb around the top GWA SNP. Positions, recombination rates and gene annotations are according to NCBI build 37 (hg 19).
Figure 3
Hair and skin pigmentation variants in TYRP1 (A) Top; a schema of the protein domains of TYRP1 (signal peptide (SI); epidermal growth factor (EGF), copper-binding and transmembrane (TM) domains) indicating variants in humans and other vertebrates that result in lightened skin and/or hair/coat (in brown) or albinism (in grey). Bottom; homology at the first 100 amino acids of TYRP1 in humans and other vertebrates. The location of the mouse brownlight mutation (R38C) and the Solomon Islands blond hair mutation (R93C) are given. (B) Boxplot showing the effect of the R93C genotypes on hair pigmentation in 921 Solomon Islanders.
To verify that 93C is the single causal allele driving the 9p23 GWA peak, the variant was genotyped directly in the discovery GWA panel and analyses repeated including R93C. R93C was more strongly associated with blond hair (P=9.60×10−23) than the top GWA SNP (Fig. 2B). Furthermore, analysis conditioning on the top GWA SNP (rs13289810) failed to remove all association from this region, as the residual signal for R93C still attained borderline genome-wide significance (P=3.11×10−7) (fig. S4A). Conversely, conditioning on R93C abolished the signal for association across this locus (fig. S4B), confirming a primary role for the coding R93C change.
To further evaluate its effects on hair pigmentation in Solomon Islanders, 93C was genotyped in 921 Solomon Islanders for whom we had measured hair pigmentation with spectrometry (fig. S5). Adjusting for age and geography, females had lighter hair than males (Student’s T test, P = 4.09×10−35), consistent with a previous study of hair color in Melanesians (5). Further, adjusting for sex and geography, hair color for 93C homozygotes (i.e. blonds) was not seen to darken significantly with age (R2 = 0.013, P = 0.187), whereas darkening was observed in heterozygotes (R2 = 0.026, P = 0.0017) and 93R homozygotes (R2 = 0.031, P = 0.0004). A recessive model provides the best fit for the association between 93C and hair color and a model including genotype, age and sex accounts for 46.4% of the variance in hair color on the Solomon Islands (P = 2.19×10−90; fig. S6 and table S3). There was evidence, however, that hair color differs between 93R homozygotes and heterozygotes (One-sided T test, P = 1.27×10−9), which suggests that 93C may have some co-dominant effect (Fig. 3B). Finally, we also assessed the effect of 93C on skin pigmentation in the Solomon Islands and found a modest additive effect accounting for 5.3% of variance in skin color (P = 4.56×10−8; fig. S7 and table S4).
The location and molecular alteration of the 93C mutation in TYRP1 suggests some intriguing possibilities for its function. TYRP1 encodes a key enzymatic step in melanin biosynthesis, giving rise to black/brown pigmentation (eumelanin), which occurs at high levels in individuals of African and Oceanic descent. Mutations in TYRP1, which is highly conserved in vertebrates, have been shown to lighten skin and/or hair pigmentation in several species including pigs, dogs, mice and zebra fish (Fig. 3A; 15, 16–19). In humans, previously discovered rare mutations leading to complete loss of function for TYRP1 are known to cause OCA3 or rufous albinism (20, 21). The 93C blond hair mutation in human TYRP1 resides in an epidermal growth factor (EGF)-like repeat near the N-terminus and shows similarity to the TYRP1 allele in the brownlight mouse (Fig. 3A). The brownlight mouse exhibits reduced TYRP1 stability and catalytic function resulting in decreased melanin content in mouse hair (19). Further, hair in these mice progressively lightens with age due to premature melanocyte death (19). Strikingly, like blond hair in Melanesians, the brownlight phenotype in the mouse is caused by an ARG→CYS substitution, located 55 amino acids upstream of R93C (R38C). R38C likely acts to interfere with normal disulfide bridges formed by the 15-Cys EGF repeat (Fig. 3A). It seems likely that the human 93C mutation confers a similar phenotype, namely premature melanocyte death that is exacerbated with age and melanogenic activity, resulting in blond hair that does not darken with age as observed in the Solomon Islands (Fig. 3B). However, it is also possible that the mouse R38C and human R93C mutations confer different effects on TYRP1 function, or that the R93C mutation is not disruptive to TYRP1 function, but rather is in linkage disequilibrium with non-coding functional genetic variant(s) within the ~500kb mapping interval.
The frequency of the 93C allele in the Solomon Islands is 0.26. To determine the worldwide distribution of the 93C allele, we genotyped the R93C SNP in 941 individuals from the 52 populations in the CEPH-HGDP (22). All 941 individuals were homozygous for 93R (table S4) and furthermore, the 93C allele was found to be absent from standard variant repositories (i.e. dbSNP, 1000 Genomes). Three 93C homozygous individuals were found in our cohort who reported that one of their parents came from outside the Solomon Islands: two individuals each with a parent from Fiji and one individual with a parent from New Guinea. Together, these observations suggest that the 93C allele is restricted to the Pacific and that the unique genetic mechanism causing blond hair in Solomon Islanders is shared with neighboring populations with the blond hair phenotype (5).
The frequency of the 93C allele in Solomon Islanders relative to other populations and its robust association with a visible human phenotype suggest that the 93C allele may have been driven to intermediate frequency by positive selection. The TYRP1 locus was evaluated for evidence of past positive selection using statistics based on allele frequency differentiation and long-range linkage disequilibrium (12). No strong evidence of recent selection at the TYRP1 locus was found in our sample from the Solomon Islands (fig. S8–S10). This result should be interpreted with caution, however, since our sample has relatively low power to detect selection on variants that follow a recessive mode of inheritance (23, 24). Moreover, even if the 93C allele was driven to its present frequency by selection, it remains a possibility that a different phenotype was targeted by selection and that blond hair is the result of phenotypic hitchhiking (25). For example, skin pigmentation may have been targeted by selection as we find that the 93C allele is associated with a minor reduction in skin pigmentation (table S3). This locus does show evidence of selection in Europeans, but it remains unclear what functional variant(s) may have been the target of selection (26, 27).
Regardless of the mechanisms responsible for the intermediate frequency of the 93C allele in Melanesia, this study realizes some of the postulated benefits of mapping complex human traits in population isolates (28); namely, the strong influence of genetic drift (and/or local positive selection) in such populations means that, while many recessive and/or rare alleles will be lost, some have a chance to rise to an appreciable frequency. Thus, it is expected (and demonstrated in recent studies) that population isolates may harbor alleles at appreciable frequency that influence complex traits and that are otherwise rare or absent in other populations (29–31). In the present case, the 93C TYRP1 variant accounts for ~46% of the variation in hair color in Solomon Islanders, which to our knowledge is the strongest reported effect on a visible human phenotype attributable to a common polymorphism. More generally, this work strongly supports the growing notion that population-specific variation is important in accounting for the heritable variance in clinically relevant phenotypes, and underscores the importance of enabling medical genomics in diverse worldwide populations.