Y-SNP miniplexes for East Asian Y-chromosomal haplogroup determination in degraded DNA

https://doi.org/10.1016/j.fsigen.2012.06.014Get rights and content

Abstract

Four multiplex PCR systems followed by single base extension reactions were developed to score 22 single nucleotide polymorphisms (SNPs) and identify the most frequent East Asian Y chromosome haplogroups. Select Y chromosome SNPs allowed hierarchical testing for almost all of the major East Asian haplogroups along the revised Y chromosome tree. The first multiplex consists of six SNPs defining world-wide major haplogroups (M145, RPS4Y711, M89, M9, M214, and M175). The second multiplex includes six SNPs of subhaplogroup O (M119, P31, M95, SRY465, 47z, and M122). The third multiplex contains six SNPs that subdivide the subhaplogroup O3 (M324, P201, M159, M7, M134, and M133). The fourth multiplex comprises four SNPs of subhaplogroup C (M217, M48, M407, and P53.1). The sizes of the PCR amplicons ranged from 70 to 100 bp to facilitate their application to degraded forensic and ancient samples. Validation experiments demonstrated that the multiplexes were optimized for analysis of low template DNA and highly degraded DNA. In a test using DNA samples from 300 Korean males, 16 different Y chromosome haplogroups were identified; haplogroup O2b* was the most frequently observed (29.3%), followed by haplogroups C3 (xC3c, C3d, C3e) (16.0%) and O3a3c1 (11.0%). These multiplex sets will be useful tools for Y-chromosomal haplogroup determination in anthropological and forensic studies of East Asian populations.

Introduction

Genetic variations in the non-recombining portion of the Y chromosome (NRY) are analyzed in diverse disciplines including anthropological, forensic and medical genetics [1], [2], [3], [4], [5]. Because of a lack of recombination and low mutation rates, Y chromosome single nucleotide polymorphisms (Y-SNPs) are the most useful genetic markers for reconstructing male lineages. Therefore, Y chromosome haplogroups, which are defined by combinations of allelic states at hierarchically arranged Y-SNPs and small indels, have been extensively studied to infer the origins, evolution, and histories of migrations of modern human populations [6], [7].

A number of changes have been made to the Y-chromosomal haplogroup tree, and a total of 311 distinct haplogroups have been defined with increased resolution [8]. The haplogroup O was considerably rearranged during this revision; the L1 retroposon insertion (LINE 1) polymorphism, which had conflicted with the N7 polymorphism, was excluded from the list of markers used to define subhaplogroup O3. Persistent commercial or in-house development efforts for Y-SNP typing protocols have been made, but most developed methods involve typing of European Y haplogroups or world-wide major haplogroups with low resolution [9], [10], [11], [12] and are not suitable to subdivide subhaplogroup O3, a major haplogroup in East Asians [13], [14], into internal derivatives following the revised Y haplogroup tree. In addition, as the demand for inferring geographic origin is increasing in forensic DNA analysis as well as in ancient DNA analysis (e.g., identification of Korean War and Vietnam War victims, and genetic characterization of ancient remains), the development of sensitive and efficient methods for the Y-chromosomal haplogroup determination in degraded DNA is required.

Therefore, in the current study, a multiplex single base extension (SBE) method was developed to score Y-SNPs of East Asian haplogroups following a small size amplicon strategy that is suitable for application to degraded DNA. Y-SNPs were selected from SNPs that are hierarchically located along the revised topology of Y chromosome haplogroups, focusing on East Asian-specific haplogroups. To assess the utility of our method for the analysis of highly degraded samples, the sensitivity and efficiency of the multiplex set were validated in samples of serially diluted DNA, artificially degraded DNA, and DNA extracted from 55-year-old skeletal remains. Finally, a Korean sample was analyzed using the newly developed multiplex systems, and the distribution of Y haplogroups was studied, since the geographic origins and history of migration of a population can be inferred from the distributions and ages of its haplogroups [13], [14], [15].

Section snippets

DNA samples

Our study protocol was approved by the Institutional Review Board of Severance Hospital, Yonsei University in Seoul, Korea. DNA samples from 300 unrelated Korean males were obtained from the National Biobank of Korea. DNA concentrations were measured using a NanoDrop® ND-1000 Spectrophotometer (NanoDrop Technologies, Wilmington, DE, USA) and the final sample concentrations were adjusted to 1.0 ng/μl. For the sensitivity test, 9948 male DNA (Promega, Madison, WI, USA) was serially diluted to

Y-SNP selection and primer design

The selected 22 Y-SNP markers were amplified with four multiplex PCR systems to explore East Asian Y chromosome haplogroups. Multiplex I was composed of the six Y-SNPs, M145, RPS4Y711, M89, M9, M214, and M175, which distinguish all world-wide major clades except for the African lineages A and B. Among the six major haplogroups in Multiplex I, clade O is the most common haplogroup in East Asians, and accordingly, a more detailed analysis of the haplogroup O lineage is necessary to differentiate

Conclusion

Four multiplex systems for scoring 22 Y-SNPs were developed to identify the most frequent East Asian Y-chromosomal haplogroups, and subsequent validation tests showed that the multiplexes are sensitive and efficient for analyzing low template and highly degraded DNA. The newly developed multiplexes allowed not only subdivision of haplogroup C, but also subdivision of haplogroup O and its derived subhaplogroup O3 according to the revised Y chromosome tree, thereby demonstrating their suitability

Acknowledgements

The biospecimens for this study were provided by National Biobank of Korea, Korea Centers for Disease Control and Prevention, supported by the Korean Ministry of Health and Welfare. This study was supported by a faculty research grant from Yonsei University College of Medicine in Seoul, Korea for 2008 (6-2008-0266).

References (30)

  • M.A. Jobling et al.

    The Y chromosome in forensic analysis and paternity testing

    Int. J. Legal Med.

    (1997)
  • P.A. Underhill et al.

    Y chromosome sequence variation and the history of human populations

    Nat. Genet.

    (2000)
  • M.F. Hammer et al.

    Hierarchical patterns of global human Y-chromosome diversity

    Mol. Biol. Evol.

    (2001)
  • M.A. Jobling et al.

    The human Y chromosome: an evolutionary marker comes of age

    Nat. Rev. Genet.

    (2003)
  • P.A. Underhill et al.

    The phylogeography of Y chromosome binary haplotypes and the origins of modern human populations

    Ann. Hum. Genet.

    (2001)
  • Cited by (13)

    • A custom hybridisation enrichment forensic intelligence panel to infer biogeographic ancestry, hair and eye colour, and Y chromosome lineage

      2023, Forensic Science International: Genetics
      Citation Excerpt :

      Subsequently, we selected 22 biogeographic ancestry SNPs, including two X-chromosome SNPs, from the PacifiPlex ancestry panel [53] (seven SNPs overlap with the GNano set), and 15 biogeographic ancestry SNPs based on their inclusion in other forensic ancestry panels, and with the highest remaining divergence values across continental groups from Phillips et al. [47], Kosoy et al. [29], Daya et al. [11], and Daca-Roszak et al. [10]. Thirty-five Y-chromosome SNPs were selected from Karafet et al. [26,27,25], Lao et al. [30], van Oven et al. [39,40], Valverde et al. [62], Park et al. [42], and Hudjashov et al. [21]. The Y-chromosome SNPs were chosen to infer major worldwide macrohaplogroups, and additional SNPs for dissecting Y-haplogroups with broad distributions (such as C, E, O and R) into sub-haplogroups with more restricted geographical affiliations (Supplementary Figure 1).

    • Title: Developmental validation of Y-SNP pedigree tagging system: A panel via quick ARMS PCR

      2020, Forensic Science International: Genetics
      Citation Excerpt :

      The advent of next-generation sequencing technology has made it possible to type hundreds of SNPs in a single batch. Apparently, many studies have attempted to sequence Y-SNPs using massively parallel sequencing (MPS) for forensic applications [11–13], while others developed new Y-SNP panels, which give more strength to SNaPshot technology for detecting degraded DNA [14] or shedding light on the genetic substructures [15]. According to the phylogenetic Y tree constructed by the International Society of Genetic Genealogy (ISOGG), all modern male individuals could be classified into 20 monophyletic or paraphyletic groups (aka Y Haplogroup A to T alphabetically), and their sub-haplogroups (https://isogg.org/tree/HaplogroupTr2019.html).

    • Y-chromosome haplogrouping for Asians using Y-SNP target sequencing

      2017, Forensic Science International: Genetics Supplement Series
      Citation Excerpt :

      In addition to Koreans and Pakistanis, polymorphic information of the Y-SNPs for several other Asian populations (Han and Dai Chinese, Japanese, and Vietnamese) could be obtained from the 1000 Genomes Browser (http://www.ncbi.nlm.nih.gov/variation/tools/1000genomes). Haplogroup distribution was similar with previous studies for Koreans and Japanese [4,5]. A total of 48 sub-haplogroups were observed when Y-haplogroup tree was constructed for 6 Asian populations (Fig. 1).

    • Confirmation of y haplogroup tree topologies with newly suggested Y-SNPs for the C2, O2b and O3a subhaplogroups

      2015, Forensic Science International: Genetics
      Citation Excerpt :

      The newly added Y-SNPs for the haplogroup designation are shown in Table 1. Polymerase chain reaction (PCR) primers for amplification of new SNPs were designed using the Primer3 program (http://frodo.wi.mit.edu/primer3/input.htm) and some primers were the same as those used previously [17–18]. Multiplex PCR reactions were performed in a final volume of 20.0 μl containing 1.0 ng of template DNA, 2.0 μl of Gold ST*R 10 × buffer (Promega, Madison, WI, USA), 1.5 U of AmpliTaq Gold® DNA polymerase (Applied Biosystems, Foster City, CA, USA) and appropriate concentrations of primers, as shown in Supplementary Material Table S1.

    • Analysis of 22 y chromosomal STR haplotypes and y haplogroup distribution in Pathans of Pakistan

      2014, Forensic Science International: Genetics
      Citation Excerpt :

      All but M172 marker that was analyzed by direct sequencing were analyzed using in-house multiplex PCR and SBE reaction. The detailed methods for determination of Y chromosomal haplogroup are described in the Supplementary Material. In addition, genealogical relationships among haplotypes of the Pathan population and Jewish populations [21,22] within haplogroups G and Q were reconstructed using the program Network 4.6.1.1 [23].

    View all citing articles on Scopus
    View full text