Targeted Genotyping of a Whole-Gene Repertoire by an Ultrahigh-Multiplex and Flexible HD-Marker Approach
Received date: 26 Jan 2021
Published date: 24 Jan 2022
Targeted genotyping is an extremely powerful approach for the detection of known genetic variations that are biologically or clinically important. However, for non-model organisms, large-scale target genotyping in a cost-effective manner remains a major challenge. To address this issue, we present an ultrahigh-multiplex, in-solution probe array-based high-throughput diverse marker genotyping (HDMarker) approach that is capable of targeted genotyping of up to 86 000 loci, with coverage of the whole gene repertoire, in what is a 27-fold and six-fold multiplex increase in comparison with the conventional Illumina GoldenGate and original HD-Marker assays, respectively. We perform extensive analyses of various ultrahigh-multiplex levels of HD-Marker (30 k-plex, 56 k-plex, and 86 k-plex) and show the power and excellent performance of the proposed method with an extremely high capture rate (about 96%) and genotyping accuracy (about 96%). With great advantages in terms of cost (as low as 0.0006 USD per genotype) and high technical flexibility, HD-Marker is a highly efficient and powerful tool with broad application potential for genetic, ecological, and evolutionary studies of non-model organisms.
Key words: HD-Marker; Targeted genotyping; Whole gene repertoire; Non-model organism
Pingping Liu , Jia Lv , Cen Ma , Tianqi Zhang , Xiaowen Huang , Zhihui Yang , Lingling Zhang , Jingjie Hu , Shi Wang , Zhenmin Bao . Targeted Genotyping of a Whole-Gene Repertoire by an Ultrahigh-Multiplex and Flexible HD-Marker Approach[J]. Engineering, 2022 , 13(6) : 186 -196 . DOI: 10.1016/j.eng.2021.07.027
[1] |
Stapley J, Reger J, Feulner PGD, Smadja C, Galindo J, Ekblom R, et al. Adaptation genomics: the next generation. Trends Ecol Evol 2010;25(12):705–12.
|
[2] |
Shafer ABA, Wolf JBW, Alves PC, Bergström L, Bruford MW, Brännström I, et al. Genomics and the challenging translation into conservation practice. Trends Ecol Evol 2015;30(2):78–87.
|
[3] |
Helyar SJ, Hemmer-Hansen J, Bekkevold D, Taylor MI, Ogden R, Limborg MT, et al. Application of SNPs for population genetics of nonmodel organisms: new opportunities and challenges. Mol Ecol Resour 2011;11(Suppl 1):123–36.
|
[4] |
Davey JW, Hohenlohe PA, Etter PD, Boone JQ, Catchen JM, Blaxter ML. Genomewide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet 2011;12(7):499–510.
|
[5] |
Jiang Z, Wang H, Michal JJ, Zhou X, Liu B, Woods LCS, et al. Genome wide sampling sequencing for SNP genotyping, methods: challenges and future development. Int J Biol Sci 2016;12(1):100–8.
|
[6] |
Andrews KR, Good JM, Miller MR, Luikart G, Hohenlohe PA. Harnessing the power of RADseq for ecological and evolutionary genomics. Nat Rev Genet 2016;17(2):81–92.
|
[7] |
Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, et al. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE 2008;3(10):e3376.
|
[8] |
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 2011;6(5):e19379.
|
[9] |
Wang S, Meyer E, McKay JK, Matz MV. 2b-RAD: a simple and flexible method for genome-wide genotyping. Nat Methods 2012;9(8):808–10.
|
[10] |
Wang S, Liu P, Lv J, Li Y, Cheng T, Zhang L, et al. Serial sequencing of isolength RAD tags for cost-efficient genome-wide profiling of genetic and epigenetic variations. Nat Protoc 2016;11(11):2189–200.
|
[11] |
De Wit P, Pespeni MH, Palumbi SR. SNP genotyping and population genomics from expressed sequences–current advances and future possibilities. Mol Ecol 2015;24(10):2310–23.
|
[12] |
Jiao W, Fu X, Li J, Li L, Feng L, Lv J, et al. Large-scale development of geneassociated single-nucleotide polymorphism markers for molluscan population genomic, comparative genomic, and genome-wide association studies. DNA Res 2014;21(2):183–93.
|
[13] |
Jones MR, Good JM. Targeted capture in evolutionary and ecological genomics. Mol Ecol 2016;25(1):185–202.
|
[14] |
Zenger KR, Khatkar MS, Jones DB, Khalilisamani N, Jerry DR, Raadsma HW. Genomic selection in aquaculture: application, limitations and opportunities with special reference to marine shrimp and pearl oysters. Front Genet 2019;9:693.
|
[15] |
Asan Y, Xu Y, Jiang H, Tyler-Smith C, Xue Y, Jiang T, et al. Comprehensive comparison of three commercial human whole-exome capture platforms. Genome Biol 2011;12(9):R95.
|
[16] |
Fan B, Du Z, Gorbach DM, Rothschild MF. Development and application of high-density SNP arrays in genomic studies of domestic animals. Asian Austral J Anim 2010;23(7):833–47.
|
[17] |
Rasheed A, Hao Y, Xia X, Khan A, Xu Y, Varshney RK, et al. Crop breeding chips and genotyping platforms: progress, challenges, and perspectives. Mol Plant 2017;10(8):1047–64.
|
[18] |
Mangal M, Bansal S, Sharma SK, Gupta RK. Molecular detection of foodborne pathogens: a rapid and accurate answer to food safety. Crit Rev Food Sci Nutr 2016;56(9):1568–84.
|
[19] |
Guppy JL, Jones DB, Jerry DR, Wade NM, Raadsma HW, Huerlimann R, et al. The state of ‘‘Omics” research for farmed penaeids: advances in research and impediments to industry utilization. Front Genet 2018;9:282.
|
[20] |
Albrechtsen A, Nielsen FC, Nielsen R. Ascertainment biases in SNP chips affect measures of population divergence. Mol Biol Evol 2010;27(11):2534–47.
|
[21] |
Mertes F, Elsharawy A, Sauer S, van Helvoort JMLM, van der Zaag PJ, Franke A, et al. Targeted enrichment of genomic DNA regions for next-generation sequencing. Brief Funct Genomics 2011;10(6):374–86.
|
[22] |
Mamanova L, Coffey AJ, Scott CE, Kozarewa I, Turner EH, Kumar A, et al. Targetenrichment strategies for next-generation sequencing. Nat Methods 2010;7 (2):111–8.
|
[23] |
Tewhey R, Warner JB, Nakano M, Libby B, Medkova M, David PH, et al. Microdroplet-based PCR enrichment for large-scale targeted sequencing. Nat Biotechnol 2009;27(11):1025–31.
|
[24] |
Damiati E, Borsani G, Giacopuzzi E. Amplicon-based semiconductor sequencing of human exomes: performance evaluation and optimization strategies. Hum Genet 2016;135(5):499–511.
|
[25] |
Kozarewa I, Armisen J, Gardner AF, Slatko BE, Hendrickson CL. Overview of target enrichment strategies. Curr Protoc Mol Biol 2015;112:7.21.1–23.
|
[26] |
Teer JK, Bonnycastle LL, Chines PS, Hansen NF, Aoyama N, Swift AJ, et al; the NISC Comparative Sequencing Program. Systematic comparison of three genomic enrichment methods for massively parallel DNA sequencing. Genome Res 2010;20(10):1420–31.
|
[27] |
Clark MJ, Chen R, Lam HYK, Karczewski KJ, Chen R, Euskirchen G, et al. Performance comparison of exome DNA sequencing technologies. Nat Biotechnol 2011;29(10):908–14.
|
[28] |
Schott RK, Panesar B, Card DC, Preston M, Castoe TA, Chang BSW. Targeted capture of complete coding regions across divergent species. Genome Biol Evol 2017;9(2):398–414.
|
[29] |
Sulonen AM, Ellonen P, Almusa H, Lepistö M, Eldfors S, Hannula S, et al. Comparison of solution-based exome capture methods for next generation sequencing. Genome Biol 2011;12(9):R94.
|
[30] |
Gasc C, Peyretaillade E, Peyret P. Sequence capture by hybridization to explore modern and ancient genomic diversity in model and nonmodel organisms. Nucleic Acids Res 2016;44(10):4504–18.
|
[31] |
Chung J, Son DS, Jeon HJ, Kim KM, Park G, Ryu GH, et al. The minimal amount of starting DNA for Agilent’s hybrid capture-based targeted massively parallel sequencing. Sci Rep 2016;6:26732.
|
[32] |
Zhang Y, Li B, Li C, Cai Q, Zheng W, Long J. Improved variant calling accuracy by merging replicates in whole-exome sequencing studies. BioMed Res Int 2014;2014:319534.
|
[33] |
Yi X, Liang Y, Huerta-Sanchez E, Jin X, Cuo ZXP, Pool JE, et al. Sequencing of 50 human exomes reveals adaptation to high altitude. Science 2010;329 (5987):75–8.
|
[34] |
Yigit E, Zhang Q, Xi L, Grilley D, Widom J, Wang J, et al. High-resolution nucleosome mapping of targeted regions using BAC-based enrichment. Nucleic Acids Res 2013;41(7):e87.
|
[35] |
Cao H, Wu J, Wang Y, Jiang H, Zhang T, Liu X, et al. An integrated tool to study MHC region: accurate SNV detection and HLA genes typing in human MHC region using targeted high-throughput sequencing. PLoS ONE 2013;8(7): e69388.
|
[36] |
Lv J, Jiao W, Guo H, Liu P, Wang R, Zhang L, et al. HD-Marker: a highly multiplexed and flexible approach for targeted genotyping of more than 10,000 genes in a single-tube assay. Genome Res 2018;28(12):1919–30.
|
[37] |
Sambrook J, Fritsch EF, Maniatis T. Molecular cloning, a laboratory manual. 2nd ed. Now York City: Cold Spring Harbor Laboratory Press; 1989.
|
[38] |
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009;25(14):1754–60.
|
[39] |
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al; the 1000 Genome Project Data Processing Subgroup. The sequence alignment/map format and SAMtools. Bioinformatics 2009;25(16):2078–9.
|
[40] |
Koboldt DC, Chen K, Wylie T, Larson DE, McLellan MD, Mardis ER, et al. VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics 2009;25(17):2283–5.
|
[41] |
Liu F, Li Y, Yu H, Zhang L, Hu J, Bao Z, et al. MolluscDB: an integrated functional and evolutionary genomics database for the hyper-diverse animal phylum Mollusca. Nucleic Acids Res 2021;49(D1):D1556.
|
[42] |
Yang Z, Zhang L, Hu J, Wang J, Bao Z, Wang S. The evo-devo of molluscs: insights from a genomic perspective. Evol Dev 2020;22(6):409–24.
|
[43] |
Hou R, Bao Z, Wang S, Su H, Li Y, Du H, et al. Transcriptome sequencing and de novo analysis for Yesso scallop (Patinopecten yessoensis) using 454 GS FLX. PLoS ONE 2011;6(6):e21560.
|
[44] |
Wang S, Hou R, Bao Z, Du H, He Y, Su H, et al. Transcriptome sequencing of Zhikong scallop (Chlamys farreri) and comparative transcriptomic analysis with Yesso scallop (Patinopecten yessoensis). PLoS ONE 2013;8(5): e63927.
|
[45] |
Wang S, Zhang J Jiao W, Li J, Xun X, Sun Y, et al. Scallop genome provides insights into evolution of bilaterian karyotype and development. Nat Ecol Evol 2017;1(5):0120.
|
[46] |
Thomson MJ. High-throughput SNP genotyping to accelerate crop improvement. Plant Breed Biotechnol 2014;2(3):195–212.
|
[47] |
Syvänen AC. Toward genome-wide SNP genotyping. Nat Genet 2005;37(S6 Suppl):S5–10.
|
[48] |
Fan JB, Chee MS, Gunderson KL. Highly parallel genomic assays. Nat Rev Genet 2006;7(8):632–44.
|
[49] |
Perkel J. SNP genotyping: six technologies that keyed a revolution. Nat Methods 2008;5(5):447–54.
|
[50] |
Paux E, Sourdille P, Mackay I, Feuillet C. Sequence-based marker development in wheat: advances and applications to breeding. Biotechnol Adv 2012;30 (5):1071–88.
|
[51] |
Hayes B, Goddard M. Genome-wide association and genomic selection in animal breeding. Genome 2010;53(11):876–83.
|
[52] |
Goddard ME, Hayes BJ, Meuwissen THE. Using the genomic relationship matrix to predict the accuracy of genomic selection. J Anim Breed Genet 2011;128 (6):409–21.
|
[53] |
Ballester LY, Luthra R, Kanagal-Shamanna R, Singh RR. Advances in clinical next-generation sequencing: target enrichment and sequencing technologies. Expert Rev Mol Diagn 2016;16(3):357–72.
|
[54] |
Robledo D, Palaiokostas C, Bargelloni L, Martínez P, Houston R. Applications of genotyping by sequencing in aquaculture breeding and genetics. Rev Aquacult 2018;10(3):670–82.
|
[55] |
de Oliveira AA, Guimaraes LJM, Guimaraes CT, de Oliveira Guimaraes PE, de Oliveira PM, Pastina MM, et al. Single nucleotide polymorphism calling and imputation strategies for cost-effective genotyping in a tropical maize breeding program. Crop Sci 2020;60(6):3066–82.
|
[56] |
Tsairidou S, Hamilton A, Robledo D, Bron JE, Houston RD. Optimizing low-cost genotyping and imputation strategies for genomic selection in Atlantic Salmon. G3-Genes Genom Genet 2020;10(2):581–90.
|
[57] |
Luo Z, Yu Y, Xiang J, Li F. Genomic selection using a subset of SNPs identified by genome-wide association analysis for disease resistance traits in aquaculture species. Aquaculture 2021;539:736620.
|
/
〈 | 〉 |