The Haplotype-Resolved Pentaploid Gynostemma pentaphyllum Genome Provides Insights into Chromosomal Evolution and the Convergent Evolution of Protopanaxadiol Synthases

Chuyi Zhang, Lingling Yun, Ziqin Li, Sijie Sun, Yini Niu, Li Qiu, Feng Cao, Xiaofeng Shen, Li Xiang, Ying Li, Baolin Guo, Vincent Courdavault, Chao Sun

Engineering: 202511022. DOI: 10.1016/j.eng.2025.11.022
Research article

Abstract

As an important natural source of dammarane-type triterpenoid saponins, Gynostemma pentaphyllum (G. pentaphyllum) holds significant potential for applications in the healthcare and pharmaceutical industries. In this study, we assembled a high-quality, haplotype-resolved pentaploid genome of G. pentaphyllum, which is rich in protopanaxadiol (PPD)-type saponins. By incorporating genomic data from G. pentaphyllum and other species positioned near the evolutionary base of Cucurbitaceae, we reconstructed and updated the ancestral karyotype of Cucurbitaceae to include 14 chromosomes. Comparative genomic analyses among G. pentaphyllum accessions of different ploidy levels revealed extensive chromosomal inversions and notable sequence variation in centromeric regions. Transposable elements (TEs) are hypothesized to have played a key role in shaping chromosomal structure and centromere evolution, potentially contributing to ploidy diversification in G. pentaphyllum. Notably, two PPD synthases (PPDSs) from the CYP88 family of cytochrome P450s (CYPs) were characterized in G. pentaphyllum. Molecular docking analysis revealed that, compared with the PPDS of Panax ginseng (P. ginseng), the isozyme in G. pentaphyllum orients its substrate in the opposite direction during catalysis because of distinct amino acid interactions. Phylogenetic analysis further indicated that the PPDSs in G. pentaphyllum and P. ginseng independently recruited different key residues, highlighting a case of convergent evolution. Overall, the high-quality genome assembled in this study provides new insights into chromosome evolution and the mechanisms underlying ploidy diversification while also establishing a foundation for advancing our understanding of triterpene saponin biosynthesis in this species.

Keywords

Gynostemma pentaphyllum / Dammarane-type saponins / Convergent evolution / Centromeres / Ancestral karyotype

Cite this article

Chuyi Zhang, Lingling Yun, Ziqin Li, Sijie Sun, Yini Niu, Li Qiu, Feng Cao, Xiaofeng Shen, Li Xiang, Ying Li, Baolin Guo, Vincent Courdavault, Chao Sun. The Haplotype-Resolved Pentaploid Gynostemma pentaphyllum Genome Provides Insights into Chromosomal Evolution and the Convergent Evolution of Protopanaxadiol Synthases. Engineering: 202511022. DOI: 10.1016/j.eng.2025.11.022


