NYAMGEREL, BAASANMUNKH, MUNKHTULGA, TUGSBILGUUN, OYUNTSETSEG, XIANG, and CHOI: Characterization of the complete chloroplast genome of Dracocephalum ruyschiana (Lamiaceae) and its phylogenetic analysis
Abstract
The northern dragonhead, Dracocephalum ruyschiana L., is a perennial plant distributed from Europe to Mongolia. This study presents the first complete chloroplast genome sequence of D. ruyschiana from Mongolia based on high-throughput sequencing. The chloroplast genome displays a typical quadripartite structure and is 150,896bp long, with a GC content of 37.7%. It consists of a large single-copy region of 82,420bp, a small single-copy region of 17,730 bp, and two inverted repeat regions of 25,373bp each. The genome encodes 114 unique genes, which include 4 rRNA genes, 30 tRNA genes, and 80 protein-coding genes. A total of 30 simple sequence repeats were identified, primarily located in intergenic spacer regions. A phylogenetic analysis indicated that Dracocephalum species form a monophyletic group, with D. ruyschiana being closely related to D. argunense.
Keywords: Dracocephalum, genome annotation, medicinal plant, Nepetoideae, phylogenetic analysis, plastid genome
INTRODUCTION
Dracocephalum L. (Lamiaceae) is the second-largest genus within the subtribe Nepetinae, comprising more than 80 species worldwide ( Chen et al., 2022; Rose et al., 2023). Morphologically, Dracocephalum is most similar to Nepeta L. but can be readily distinguished by its calyces with a thickened sinus-like fold between the bases of the calyx lobes ( Harley et al., 2004). Many species of Dracocephalum are widely used in traditional medicine and herbal remedies around the world ( Fathiazad and Hamedeyazdan, 2011).
The northern dragonhead, Dracocephalum ruyschiana L., is a perennial species distributed from Europe to Mongolia. In Europe, it is listed on Red Lists in several countries due to its declining population ( Kleven et al., 2019). In Mongolia, this species is distributed in the northern forest areas ( Baasanmunkh et al., 2022). As a member of the polyphenol-rich genus Dracocephalum, the aerial parts of D. ruyschiana, collected in Mongolia, are known to contain flavone tetra-glycosides and benzyl alcohol glycosides, which have antioxidant, antimicrobial activities, and anti-inflammatory effects ( Selenge et al., 2013).
The complete chloroplast genomes (plastomes) of land plants have increasingly been sequenced in recent years. Plastids are essential organelles in plant cells, playing critical roles in growth and development ( Howe et al., 2003). The plastomes in plants have a typical tetrad structure, consisting of a large single-copy (LSC), a small single-copy (SSC), and two copies of inverted repeats (IRa and IRb) ( Wicke et al., 2011). Additionally, plastome sequences have been widely used in plant phylogenetic studies (e.g., Liu et al., 2023; Nyamgerel et al., 2024; Oyuntsetseg et al., 2024; Wang et al., 2024; Yuan et al., 2024). To date, plastome data from 11 Dracocephalum species have been sequenced and are available in the National Center for Biotechnology Information (NCBI) database ( Yao et al., 2020; Zhao et al., 2021; Zhang et al., 2024).
This study presents the complete plastome sequence of D. ruyschiana from Mongolia and investigates its phylogenetic relationships within Dracocephalum, providing valuable resources for future research on this genus.
MATERIALS AND METHODS
Fresh leaves of D. ruyschiana were collected from Bugant, Selenge province, Mongolia (49°24′47.5″N, 107°15′59.0″E). The specimens were deposited in the herbarium of the National University of Mongolia (UBU0038422). Detailed illustration photos of species were taken in the field surveys by S. Baasanmunkh and D. Munkhutlga.
Genomic DNA was extracted from silica gel-dried leaf material using the CTAB method ( Doyle and Doyle, 1987). The sequencing library was constructed from the extracted DNA using the TruSeq DNA Nano Kit and the NextSeq 500 platform (Illumina, San Diego, CA, USA), following the manufacturer’s protocol. Trimmomatic v.0.36 ( Bolger et al., 2014) was used to remove adapter sequences and low-quality reads to reduce bias. A base quality plot generated using FastQC v.0.11.5 ( Antil et al., 2023) was used to check the overall quality of the data and show the range of quality values for each cycle. NOVOplasty v.4.1.0 was used to perform de novo assembly using various k-mers ( Dierckxsens et al., 2016).
Genome annotation of D. ruyschiana was performed using the GeSeq web server ( https://chlorobox.mpimp-golm.mpg.de/geseq.html) ( Tillich et al., 2017) to predict gene locations. Protein-coding sequences were acurated using BLAST and tRNA was identified using tRNAscan-SE ( Chan and Lowe, 2019). A circular map was visualized using the CPGAVAS2 web server ( Liu et al., 2012). Long tandem repeats were identified with the TRF online tool ( Benson, 1999), with a minimum alignment score of 50 and a maximum period size of 500; the identity of repeats was set to ≥ 90%. Simple sequence repeats (SSRs) were identified using the Microsatellite Identification Tool (MISA) web server ( Beier et al., 2017), with minimum repeat thresholds set to 10, 5, 4, 3, 3, and 3 for mono-, di-, tri-, tetra-, penta-, and hexanucleotides, respectively.
The chloroplast genomes of 11 Dracocephalum species and two outgroups were retrieved from the NCBI database for phylogenetic reconstruction. The genomes were aligned using MAFFT v.7.490 ( Katoh et al., 2002) as implemented in Geneious Prime 2024.0.5 ( http://www.geneious.com). Phylogenetic relationships were determined through a maximum likelihood analysis performed in RaxML v.8.2.11 ( Stamatakis, 2014) with 1,000 bootstrap replicates. The best-fitting model for nucleotide substitutions was determined using the Akaike information criterion in jModelTest v.2.1.1073 ( Darriba et al., 2012), where the GTR+G+I model was selected. The resulting phylogenetic trees were visualized using FigTree v.1.4.2 ( Rambaut, 2012).
RESULTS AND DISCUSSION
The complete chloroplast genome of D. ruyschiana from Mongolia ( Fig. 1) was sequenced for the first time. A total of 9.6 Gb of paired-end (150 bp) bases, comprising 63,838,854 reads, was obtained. After trimming, 8.5 Gb of high-quality bases and 56,528,706 reads were retained for assembly. The assembled chloroplast genome was 158,469 bp long and displayed a typical quadripartite structure ( Fig. 2). It comprises an LSC region of 82,420bp, an SSC region of 17,830 bp, and two IRs of 25,373bp each ( Fig. 2). The overall GC content is 37.7%, distributed as 43.1% in the IRs, 35.8% in the LSC, and 31.6% in the SSC regions. The genome sequence data was submitted to GenBank (NCBI) under the accession number PQ963003.
The chloroplast genome of D. ruyschiana encodes a total of 131 genes, including 8 rRNA, 37 tRNA, and 86 protein-coding genes. Among these, 4 rRNA, 7 tRNA, and 6 protein-coding genes are duplicated in the IR regions ( Table 1). The predicted gene count in Dracocephalum species ranges from 125 ( D. palmatum) to 133 ( D. heterophyllum) ( Zhao et al., 2021; Fu et al., 2022; Zhang et al., 2024). The plastome of D. ruyschiana contains 10 cis-splicing genes with introns, two of which have two introns ( Fig. 3) and one trans-splicing gene with three exons ( Fig. 4). The junction analysis revealed that the rps19 and ycf1 genes span LSC/IRa and IRa/SSC junctions, respectively.
Chloroplast microsatellites can serve as valuable markers for ecological and evolutionary studies due to the non-recombinant, uniparentally inherited nature of organelle genomes ( Provan et al., 2001). A total of 34 SSRs were identified, primarily consisting of mononucleotide motifs (18), followed by di-nucleotide (6), tetra-nucleotide (5), tri-nucleotide (2), penta-nucleotide (2), and hexa-nucleotide (1) motif repeats, mostly found in the intergenic spacer region ( Fig. 2). Additionally, we identified 24 tandem repeats that generally ranged from 9 to 31 bp in length. The number of SSRs in D. ruyschiana is lower than that in other Dracocephalum species, with the highest count (91 SSRs) found in D. moldavica ( Fu et al., 2022). These SSR markers can be utilized to assess genetic variation in population genetic studies, which are essential for the conservation of endangered plants.
The genome alignment included available Dracocephalum species from the NCBI database, encompassing 158,397 bp sites, of which 6,653 were variable. Phylogenetic analysis revealed that Dracocephalum species form a monophyletic group, with D. ruyschiana closely related to D. arguments, supported by strong bootstrap values ( Fig. 5). A previous phylogenetic study using nuclear ITS and external transcribed spacer regions, plastid rpl32-trnL, trnL-trnF, ycf1, and ycf1- rps15, and two low-copy nuclear markers ( Chen et al., 2022) also clustered Dracocephalum ruyschiana and D. arguments together, suggesting a recent common ancestry for these two species. Additionally, the morphological diagnosis of D. ruyschiana is quite similar to that of D. argunense but differs in its stem being sparsely minute hairy toward vs. stem subglabrous, calyx minutely hairy toward base vs. calyx minutely hairy throughout, and the size of the corolla ( Li and Hedge, 1994). Further studies will compare the morphological, physiological, and genetic characteristics of these two species to better understand their evolutionary history.
This study provides valuable genomic resources to enhance research on the Dracocephalum genus, particularly in understanding phylogenetic relationships and chloroplast genome evolution. To gain deeper insights into the evolutionary history of this genus, future studies should include comparative genomic analyses with a broader sampling of Dracocephalum species and closely related genera.
Fig. 1.
Photographs of Dracocephalum ruyschiana in Mongolia. A. General habitats. B. Upper parts of plant. C. Habit. D. Inflorescence. E. Flower in lateral view. F. Calyx. G. Leaves. H. Seeds (photo credit: S. Baasanmunkh and D. Munkhutlga).
Fig. 2.
Schematic representation of the complete chloroplast genome of Dracocephalum ruyschiana. The map contains four rings. From the center going outward, the first circle shows the forward and reverse repeats connected with red and green arcs, respectively. The next circle shows the tandem repeats marked with short bars. The third circle shows the microsatellite sequences identified using Microsatellite Identification Tool (MISA). The fourth circle is drawn using drawgenemap and shows the gene structure on the plastome. The genes were colored based on their functional categories.
Fig. 3.
Schematic map of the cis-spliced genes in the complete chloroplast genome of Dracocephalum ruyschiana.
Fig. 4.
Schematic map of the trans-spliced genes in the complete chloroplast genome of Dracocephalum ruyschiana.
Fig. 5.
Phylogenetic tree of Dracocephalum species based on the whole chloroplast genome sequence using the maximum likelihood tree. Bootstrap values are indicated at branch level. Newly sequenced Dracocephalum ruyschiana is represented by red color.
Table 1.
Genes of the chloroplast genome of Dracocephalum ruyschiana.
|
Group of genes |
Name of genes |
|
RNA genes |
Ribosomal RNA |
rrn4.5a, rrn5a, rrn16a, rrn23a
|
|
Transfer RNA |
trnA-UGCa, trnC-GCA, trnD-GUC, trnE-UUCa, trnF-GAA, trnfM-CAU, trnG-GCC, trnG-UCC, trnH-GUG, trnI-CAUa, trnI-GAUa, trnK-UUU, trnL-CAAa, trnL-UAA, trnL-UAG, trnM-CAUa, trnN-GUUa, trnP-UGG, trnQ-UUG, trnR-ACGa, trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GACa, trnV-UAC, trnW-CCA, trnY-GUA
|
|
Ribosomal proteins |
Small subunit |
rps2, rps3, rps4, rps7a, rps8, rps11, rps12a, rps14, rps15, rps16, rps18, rps19
|
|
Large subunit |
rpl2a, rpl14, rpl16, rpl20, rpl22, rpl23a, rpl32, rpl33, rpl36
|
|
Transcription |
RNA polymerase |
rpoA, rpoB, rpoC1, rpoC2
|
|
Protein genes |
Photosystem I |
psaA, psaB, psaC, psaI, psaJ, ycf3, ycf4
|
|
Photosystem II |
psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
|
|
Cytochrome b6/f |
petA, petB, petD, petG, petL, petN
|
|
ATP synthase |
atpA, atpB, atpE, atpF, atpH, atpI
|
|
Rubisco |
rbcL
|
|
NADH dehydrogenase |
ndhA, ndhB, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
|
|
ATP-dependent protease subunit P |
clpP
|
|
Chloroplast envelope membrane protein |
cemA
|
|
Transitional initiation factor |
infA
|
|
Maturase |
|
matK
|
|
Subunit acetyl-coA carboxylase |
|
accD
|
|
C-type cytochrome synthesis |
|
ccsA
|
|
Hypothetical proteins |
|
ycf1, ycf2a, ycf15a
|
|
Component of TIC complex |
|
ycf3
|
LITERATURE CITED
Antil, S., Abraham, J. S. Sripoorna, S. Maurya, S. Dagar, J. Makhija, S. Bhagat, P. Gupta, R. Sood, U. Lal, R. and Toteja, R. 2023. DNA barcoding, an effective tool for species identification: A review. Molecular Biology Reports 50: 761-775.
Baasanmunkh, S., Urgamal, M. Oyuntsetseg, B. Sukhorukov, A. P. Tsegmed, Z. Son, D. C. Erst, A. Oyundelger, K. Kechaykin, A. A. Norris, J. Kosachev, P. Ma, J.-S. Chang, K. S. and Choi, H. J. 2022. Flora of Mongolia: Annotated checklist of native vascular plants. PhytoKeys 192: 63-169.
Chan, P. P. and Lowe, T. M. 2019. tRNAscan-SE: Searching for tRNA genes in genomic sequences. Methods in Molecular Biology 1962. 1-14.
Chen, Y.-P, Turdimatovich, T. O. Nuraliev, M. S. Lazarević, P. Drew, B. T. and Xiang, C.-L. 2022. Phylogeny and biogeography of the northern temperate genus Dracocephalum s.l. (Lamiaceae). Cladistics 38: 429-451.
Dierckxsens, N., Mardulyn, P. and Smits, G. 2016. NOVOPlasty: D e novo assembly of organelle genomes from whole genome data. Nucleic Acids Research 45: e18.
Doyle, J. J. and Doyle, J. L. 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bulletin 19: 11-15.
Fathiazad, F. and Hamedeyazdan, S. 2011. A reviewon Hyssopus officinalis L.: Composition and biological activities. African Journal of Pharmacy and Pharmacology 5: 1959-1966.
Fu, G., Liu, Y. Caraballo-Ortiz, M. A. Zheng, C. Liu, T. Xu, Y. and Su, X. 2022. Characterization of the complete chloroplast genome of the dragonhead herb, Dracocephalum heterophyllum (Lamiaceae), and comparative analyses with related species. Diversity 14: 110.
Harley, R. M., Atkins, S. Budantsev, A. L. Cantino, P. D. Conn, B. J. Grayer, R. Harley, M. M. Kok, R. Krestovskaja, T. Morales, R. Paton, A. J. Ryding, O. and Upson, T. 2004. Labiatae. In The Families and Genera of Vascular Plants. 7: Flowering Plants. Dicotyledons. Kadereit, J. W. (ed.), Springer, Berlin. Pp. 167-275.
Kleven, O., Endrestøl, A. Evju, M. Stabbetorp, O. E. and Westergaard, K. B. 2019. SNP discovery in the northern dragonhead Dracocephalum ruyschiana. Conservation Genetics Resources 11: 431-435.
Li, H. W. and Hedge, I. C. 1994. Lamiaceae. In Flora of China. 17: Verbenaceae through Solanaceae. Wu, C. Y. and Raven, P. H. (eds.), Science Press, Beijing and Missouri Botanical Garden Press, St Louis, MO. Pp. 269-291.
Liu, C., Shi, L. Zhu, Y. Chen, H. Zhang, J. Lin, X. and Guan, X. 2012. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences. BMC Genomics 13: 715.
Nyamgerel, N., Baasanmunkh, S. Oyuntsetseg, B. Tsegmed, Z. Bayarmaa, G.- A Lazkov, G. Pyak, E. Gil, H.-Y Park, I. and Choi, H. J. 2024. Comparative plastome analysis and taxonomic classification of snow lotus species ( Saussurea, Asteraceae) in Central Asia and Southern Siberia. Functional and Integrative Genomics 24: 42.
Oyuntsetseg, D., Nyamgerel, N. Baasanmunkh, S. Oyuntsetseg, B. Urgamal, M. Yoon, J. W. Bayarmaa, G.-A and Choi, H. J. 2024. The complete chloroplast genome and phylogenetic results support the species position of Swertia banzragczii and Swertia marginata (Gentianaceae) in Mongolia. Botanical Studies 65: 11.
Provan, J., Powell, W. and Hollingsworth, P. M. 2001. Chloroplast microsatellites: New tools for studies in plant ecology and evolution. Trends in Ecology and Evolution 16: 142-147.
Rambaut, A. 2012. FigTree v1. 4. Molecular evolution, phylogenetics and epidemiology. University of Edinburgh, Institute of Evolutionary Biology, Edinburgh.
Rose, J. P., Wiese, J. Pauley, N. Dirmenci, T. Celep, F. Xiang, C.-L and Drew, B. T. 2023. East Asian-North American disjunctions and phylogenetic relationships within subtribe Nepetinae (Lamiaceae). Molecular Phylogenetics and Evolution 187: 107873.
Selenge, E., Murata, T. Kobayashi, K. Batkhuu, J. and Yoshizaki, F. 2013. Flavone tetraglycosides and benzyl alcohol glycosides from the Mongolian medicinal plant Dracocephalum ruyschiana. Journal of Natural Products 76: 186-193.
Wang, X., Guo, L. Ding, L. Medina, L. Wang, R. and Li, P. 2024. Comparative plastome analyses and evolutionary relationships of 25 East Asian species within the medicinal plant genus Scrophularia (Scrophulariaceae). Frontiers in Plant Science 15: 1439206.
Yao, J., Zhao, F. Xu, Y. Zhao, K. Quan, H. Su, Y. Hao, P. Liu, J. Yu, B. Yao, M. Ma, X. Liao, Z. and Lan, X. 2020. Complete chloroplast genome sequencing and phylogenetic analysis of two Dracocephalum plants. BioMed Research International 2020: 4374801.
Yuan, J.-C., Liu, A. Takano, A. Maki, M. Hodel, R. G. J. Chen, Y.-P. and Xiang, C.-L. 2024. Plastid phylogenomics with broad taxon sampling provides insights into the generic delimitation of Paraphlomideae (Lamiaceae). Taxon 73: 1016-1029.
Zhang, R.-Q., Ma, X.-L. Chen, Y.-P. and Xiang, C.-L. 2024. Complete chloroplast genome sequences of Dracocephalum argunense and D. integrifolium (Lamiaceae: Nepetinae). Journal of Asia-Pacific Biodiversity 17: 586-589.
Zhao, F., Chen, Y.-P. Salmaki, Y. Drew, B. T. Wilson, T. C. Scheen, A.-C. Celep, F. Bräuchler, C. Bendiksby, M. Wang, Q. Min, D.-Z. Peng, H. Olmstead, R. G. Li, B. and Xiang, C.-L. 2021. An updated tribal classification of Lamiaceae based on plastome phylogenomics. BMC Biology 19: 2.
|
|