Characterization of the complete chloroplast genome of Dracocephalum ruyschiana (Lamiaceae) and its phylogenetic analysis

Article information

Korean J. Pl. Taxon. 2025;55(1):44-51
Publication date (electronic) : 2025 March 31
doi : https://doi.org/10.11110/kjpt.2025.55.1.44
Department of Biology and Microbiology, Changwon National University, Changwon 51140, Korea
1Department of Biology, School of Arts and Sciences, National University of Mongolia, Ulaanbaatar 14201, Mongolia
2CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
Corresponding author: Hyeok Jae CHOI, E-mail: hjchoi1975@changwon.ac.kr
Editor: Sang-Tae KIM
Received 2025 January 17; Revised 2025 February 19; Accepted 2025 February 26.

Abstract

The northern dragonhead, Dracocephalum ruyschiana L., is a perennial plant distributed from Europe to Mongolia. This study presents the first complete chloroplast genome sequence of D. ruyschiana from Mongolia based on high-throughput sequencing. The chloroplast genome displays a typical quadripartite structure and is 150,896bp long, with a GC content of 37.7%. It consists of a large single-copy region of 82,420bp, a small single-copy region of 17,730 bp, and two inverted repeat regions of 25,373bp each. The genome encodes 114 unique genes, which include 4 rRNA genes, 30 tRNA genes, and 80 protein-coding genes. A total of 30 simple sequence repeats were identified, primarily located in intergenic spacer regions. A phylogenetic analysis indicated that Dracocephalum species form a monophyletic group, with D. ruyschiana being closely related to D. argunense.

INTRODUCTION

Dracocephalum L. (Lamiaceae) is the second-largest genus within the subtribe Nepetinae, comprising more than 80 species worldwide (Chen et al., 2022; Rose et al., 2023). Morphologically, Dracocephalum is most similar to Nepeta L. but can be readily distinguished by its calyces with a thickened sinus-like fold between the bases of the calyx lobes (Harley et al., 2004). Many species of Dracocephalum are widely used in traditional medicine and herbal remedies around the world (Fathiazad and Hamedeyazdan, 2011).

The northern dragonhead, Dracocephalum ruyschiana L., is a perennial species distributed from Europe to Mongolia. In Europe, it is listed on Red Lists in several countries due to its declining population (Kleven et al., 2019). In Mongolia, this species is distributed in the northern forest areas (Baasanmunkh et al., 2022). As a member of the polyphenol-rich genus Dracocephalum, the aerial parts of D. ruyschiana, collected in Mongolia, are known to contain flavone tetra-glycosides and benzyl alcohol glycosides, which have antioxidant, antimicrobial activities, and anti-inflammatory effects (Selenge et al., 2013).

The complete chloroplast genomes (plastomes) of land plants have increasingly been sequenced in recent years. Plastids are essential organelles in plant cells, playing critical roles in growth and development (Howe et al., 2003). The plastomes in plants have a typical tetrad structure, consisting of a large single-copy (LSC), a small single-copy (SSC), and two copies of inverted repeats (IRa and IRb) (Wicke et al., 2011). Additionally, plastome sequences have been widely used in plant phylogenetic studies (e.g., Liu et al., 2023; Nyamgerel et al., 2024; Oyuntsetseg et al., 2024; Wang et al., 2024; Yuan et al., 2024). To date, plastome data from 11 Dracocephalum species have been sequenced and are available in the National Center for Biotechnology Information (NCBI) database (Yao et al., 2020; Zhao et al., 2021; Zhang et al., 2024).

This study presents the complete plastome sequence of D. ruyschiana from Mongolia and investigates its phylogenetic relationships within Dracocephalum, providing valuable resources for future research on this genus.

MATERIALS AND METHODS

Fresh leaves of D. ruyschiana were collected from Bugant, Selenge province, Mongolia (49°24′47.5″N, 107°15′59.0″E). The specimens were deposited in the herbarium of the National University of Mongolia (UBU0038422). Detailed illustration photos of species were taken in the field surveys by S. Baasanmunkh and D. Munkhutlga.

Genomic DNA was extracted from silica gel-dried leaf material using the CTAB method (Doyle and Doyle, 1987). The sequencing library was constructed from the extracted DNA using the TruSeq DNA Nano Kit and the NextSeq 500 platform (Illumina, San Diego, CA, USA), following the manufacturer’s protocol. Trimmomatic v.0.36 (Bolger et al., 2014) was used to remove adapter sequences and low-quality reads to reduce bias. A base quality plot generated using FastQC v.0.11.5 (Antil et al., 2023) was used to check the overall quality of the data and show the range of quality values for each cycle. NOVOplasty v.4.1.0 was used to perform de novo assembly using various k-mers (Dierckxsens et al., 2016).

Genome annotation of D. ruyschiana was performed using the GeSeq web server (https://chlorobox.mpimp-golm.mpg.de/geseq.html) (Tillich et al., 2017) to predict gene locations. Protein-coding sequences were acurated using BLAST and tRNA was identified using tRNAscan-SE (Chan and Lowe, 2019). A circular map was visualized using the CPGAVAS2 web server (Liu et al., 2012). Long tandem repeats were identified with the TRF online tool (Benson, 1999), with a minimum alignment score of 50 and a maximum period size of 500; the identity of repeats was set to ≥ 90%. Simple sequence repeats (SSRs) were identified using the Microsatellite Identification Tool (MISA) web server (Beier et al., 2017), with minimum repeat thresholds set to 10, 5, 4, 3, 3, and 3 for mono-, di-, tri-, tetra-, penta-, and hexanucleotides, respectively.

The chloroplast genomes of 11 Dracocephalum species and two outgroups were retrieved from the NCBI database for phylogenetic reconstruction. The genomes were aligned using MAFFT v.7.490 (Katoh et al., 2002) as implemented in Geneious Prime 2024.0.5 (http://www.geneious.com). Phylogenetic relationships were determined through a maximum likelihood analysis performed in RaxML v.8.2.11 (Stamatakis, 2014) with 1,000 bootstrap replicates. The best-fitting model for nucleotide substitutions was determined using the Akaike information criterion in jModelTest v.2.1.1073 (Darriba et al., 2012), where the GTR+G+I model was selected. The resulting phylogenetic trees were visualized using FigTree v.1.4.2 (Rambaut, 2012).

RESULTS AND DISCUSSION

The complete chloroplast genome of D. ruyschiana from Mongolia (Fig. 1) was sequenced for the first time. A total of 9.6 Gb of paired-end (150 bp) bases, comprising 63,838,854 reads, was obtained. After trimming, 8.5 Gb of high-quality bases and 56,528,706 reads were retained for assembly. The assembled chloroplast genome was 158,469 bp long and displayed a typical quadripartite structure (Fig. 2). It comprises an LSC region of 82,420bp, an SSC region of 17,830 bp, and two IRs of 25,373bp each (Fig. 2). The overall GC content is 37.7%, distributed as 43.1% in the IRs, 35.8% in the LSC, and 31.6% in the SSC regions. The genome sequence data was submitted to GenBank (NCBI) under the accession number PQ963003.

Fig. 1.

Photographs of Dracocephalum ruyschiana in Mongolia. A. General habitats. B. Upper parts of plant. C. Habit. D. Inflorescence. E. Flower in lateral view. F. Calyx. G. Leaves. H. Seeds (photo credit: S. Baasanmunkh and D. Munkhutlga).

Fig. 2.

Schematic representation of the complete chloroplast genome of Dracocephalum ruyschiana. The map contains four rings. From the center going outward, the first circle shows the forward and reverse repeats connected with red and green arcs, respectively. The next circle shows the tandem repeats marked with short bars. The third circle shows the microsatellite sequences identified using Microsatellite Identification Tool (MISA). The fourth circle is drawn using drawgenemap and shows the gene structure on the plastome. The genes were colored based on their functional categories.

The chloroplast genome of D. ruyschiana encodes a total of 131 genes, including 8 rRNA, 37 tRNA, and 86 protein-coding genes. Among these, 4 rRNA, 7 tRNA, and 6 protein-coding genes are duplicated in the IR regions (Table 1). The predicted gene count in Dracocephalum species ranges from 125 (D. palmatum) to 133 (D. heterophyllum) (Zhao et al., 2021; Fu et al., 2022; Zhang et al., 2024). The plastome of D. ruyschiana contains 10 cis-splicing genes with introns, two of which have two introns (Fig. 3) and one trans-splicing gene with three exons (Fig. 4). The junction analysis revealed that the rps19 and ycf1 genes span LSC/IRa and IRa/SSC junctions, respectively.

Genes of the chloroplast genome of Dracocephalum ruyschiana.

Fig. 3.

Schematic map of the cis-spliced genes in the complete chloroplast genome of Dracocephalum ruyschiana.

Fig. 4.

Schematic map of the trans-spliced genes in the complete chloroplast genome of Dracocephalum ruyschiana.

Chloroplast microsatellites can serve as valuable markers for ecological and evolutionary studies due to the non-recombinant, uniparentally inherited nature of organelle genomes (Provan et al., 2001). A total of 34 SSRs were identified, primarily consisting of mononucleotide motifs (18), followed by di-nucleotide (6), tetra-nucleotide (5), tri-nucleotide (2), penta-nucleotide (2), and hexa-nucleotide (1) motif repeats, mostly found in the intergenic spacer region (Fig. 2). Additionally, we identified 24 tandem repeats that generally ranged from 9 to 31 bp in length. The number of SSRs in D. ruyschiana is lower than that in other Dracocephalum species, with the highest count (91 SSRs) found in D. moldavica (Fu et al., 2022). These SSR markers can be utilized to assess genetic variation in population genetic studies, which are essential for the conservation of endangered plants.

The genome alignment included available Dracocephalum species from the NCBI database, encompassing 158,397 bp sites, of which 6,653 were variable. Phylogenetic analysis revealed that Dracocephalum species form a monophyletic group, with D. ruyschiana closely related to D. arguments, supported by strong bootstrap values (Fig. 5). A previous phylogenetic study using nuclear ITS and external transcribed spacer regions, plastid rpl32-trnL, trnL-trnF, ycf1, and ycf1-rps15, and two low-copy nuclear markers (Chen et al., 2022) also clustered Dracocephalum ruyschiana and D. arguments together, suggesting a recent common ancestry for these two species. Additionally, the morphological diagnosis of D. ruyschiana is quite similar to that of D. argunense but differs in its stem being sparsely minute hairy toward vs. stem subglabrous, calyx minutely hairy toward base vs. calyx minutely hairy throughout, and the size of the corolla (Li and Hedge, 1994). Further studies will compare the morphological, physiological, and genetic characteristics of these two species to better understand their evolutionary history.

Fig. 5.

Phylogenetic tree of Dracocephalum species based on the whole chloroplast genome sequence using the maximum likelihood tree. Bootstrap values are indicated at branch level. Newly sequenced Dracocephalum ruyschiana is represented by red color.

This study provides valuable genomic resources to enhance research on the Dracocephalum genus, particularly in understanding phylogenetic relationships and chloroplast genome evolution. To gain deeper insights into the evolutionary history of this genus, future studies should include comparative genomic analyses with a broader sampling of Dracocephalum species and closely related genera.

Notes

ACKNOWLEDGMENTS

This study was supported by Changwon National University in 2025–2026.

CONFLICT OF INTEREST

The authors declare that there are no conflicts of interest.

References

Antil S., Abraham J. S., Sripoorna S., Maurya S., Dagar J., Makhija S., Bhagat P., Gupta R., Sood U., Lal R., Toteja R.. 2023;DNA barcoding, an effective tool for species identification: A review. Molecular Biology Reports 50:761–775.
Baasanmunkh S., Urgamal M., Oyuntsetseg B., Sukhorukov A. P., Tsegmed Z., Son D. C., Erst A., Oyundelger K., Kechaykin A. A., Norris J., Kosachev P., Ma J.-S., Chang K. S., Choi H. J.. 2022;Flora of Mongolia: Annotated checklist of native vascular plants. PhytoKeys 192:63–169.
Beier S., Thiel T., Münch T., Scholz U., Mascher M.. 2017;MISA-web: A web server for microsatellite prediction. Bioinformatics 33:2583–2585.
Benson G.. 1999;Tandem repeats finder: A program to analyze DNA sequences. Nucleic Acids Research 27:573–580.
Bolger A. M., Lohse M., Usadel B.. 2014;Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120.
Chan P. P., Lowe T. M.. 2019;tRNAscan-SE: Searching for tRNA genes in genomic sequences. Methods in Molecular Biology 1962;:1–14.
Chen Y.-P, Turdimatovich T. O., Nuraliev M. S., Lazarević P., Drew B. T., Xiang C.-L.. 2022;Phylogeny and biogeography of the northern temperate genus Dracocephalum s.l. (Lamiaceae). Cladistics 38:429–451.
Darriba D., Taboada G. L., Doallo R., Posada D.. 2012;jModelTest 2: More models, new heuristics and parallel computing. Nature Methods 9:772.
Dierckxsens N., Mardulyn P., Smits G.. 2016;NOVOPlasty: De novo assembly of organelle genomes from whole genome data. Nucleic Acids Research 45:e18.
Doyle J. J., Doyle J. L.. 1987;A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bulletin 19:11–15.
Fathiazad F., Hamedeyazdan S.. 2011;A reviewon Hyssopus officinalis L.: Composition and biological activities. African Journal of Pharmacy and Pharmacology 5:1959–1966.
Fu G., Liu Y., Caraballo-Ortiz M. A., Zheng C., Liu T., Xu Y., Su X.. 2022;Characterization of the complete chloroplast genome of the dragonhead herb, Dracocephalum heterophyllum (Lamiaceae), and comparative analyses with related species. Diversity 14:110.
Harley R. M., Atkins S., Budantsev A. L., Cantino P. D., Conn B. J., Grayer R., Harley M. M., Kok R., Krestovskaja T., Morales R., Paton A. J., Ryding O., Upson T.. 2004. Labiatae. In The Families and Genera of Vascular Plants. 7Flowering Plants. Dicotyledons In : Kadereit J. W., ed. Springer. Berlin: p. 167–275.
Howe C. J., Barbrook A. C., Koumandou V. L., Nisbet R. E. R., Symingtoon H. A., Wightman T. F.. 2003;Evolution of the chloroplast genome. Philosophical Transactions of the Royal Society of London, Series B: Biological Sciences 358:99–107.
Katoh K., Misawa K., Kuma K.-I., Miyata T.. 2002;MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research 30:3059–3066.
Kleven O., Endrestøl A., Evju M., Stabbetorp O. E., Westergaard K. B.. 2019;SNP discovery in the northern dragonhead Dracocephalum ruyschiana. Conservation Genetics Resources 11:431–435.
Li H. W., Hedge I. C.. 1994. Lamiaceae. In Flora of China. 17Verbenaceae through Solanaceae In : Wu C. Y., Raven P. H., eds. Science Press, Beijing and Missouri Botanical Garden Press. St Louis, MO: p. 269–291.
Liu C., Shi L., Zhu Y., Chen H., Zhang J., Lin X., Guan X.. 2012;CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences. BMC Genomics 13:715.
Liu T.-J., Zhang S.-Y, Wei L., Lin W., Yan H.-F., Hao G., Ge X.-J.. 2023;Plastome evolution and phylogenomic insights into the evolution of Lysimachia (Primulaceae: Myrsinoideae). BMC Plant Biology 23:359.
Nyamgerel N., Baasanmunkh S., Oyuntsetseg B., Tsegmed Z., Bayarmaa G.- A, Lazkov G., Pyak E., Gil H.-Y, Park I., Choi H. J.. 2024;Comparative plastome analysis and taxonomic classification of snow lotus species (Saussurea, Asteraceae) in Central Asia and Southern Siberia. Functional and Integrative Genomics 24:42.
Oyuntsetseg D., Nyamgerel N., Baasanmunkh S., Oyuntsetseg B., Urgamal M., Yoon J. W., Bayarmaa G.-A, Choi H. J.. 2024;The complete chloroplast genome and phylogenetic results support the species position of Swertia banzragczii and Swertia marginata (Gentianaceae) in Mongolia. Botanical Studies 65:11.
Provan J., Powell W., Hollingsworth P. M.. 2001;Chloroplast microsatellites: New tools for studies in plant ecology and evolution. Trends in Ecology and Evolution 16:142–147.
Rambaut A.. 2012. FigTree v1. 4. Molecular evolution, phylogenetics and epidemiology University of Edinburgh, Institute of Evolutionary Biology. Edinburgh:
Rose J. P., Wiese J., Pauley N., Dirmenci T., Celep F., Xiang C.-L, Drew B. T.. 2023;East Asian-North American disjunctions and phylogenetic relationships within subtribe Nepetinae (Lamiaceae). Molecular Phylogenetics and Evolution 187:107873.
Selenge E., Murata T., Kobayashi K., Batkhuu J., Yoshizaki F.. 2013;Flavone tetraglycosides and benzyl alcohol glycosides from the Mongolian medicinal plant Dracocephalum ruyschiana. Journal of Natural Products 76:186–193.
Stamatakis A.. 2014;RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313.
Tillich M., Lehwark P., Pellizzer T., Ulbricht-Jones E. S., Fischer A., Bock R., Greiner S.. 2017;GeSeq: Versatile and accurate annotation of organelle genomes. Nucleic Acids Research 45:W6–W11.
Wang X., Guo L., Ding L., Medina L., Wang R., Li P.. 2024;Comparative plastome analyses and evolutionary relationships of 25 East Asian species within the medicinal plant genus Scrophularia (Scrophulariaceae). Frontiers in Plant Science 15:1439206.
Wicke S., Schneeweiss G. M., dePamphilis C. W., Müller K. F., Quandt D.. 2011;The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Molecular Biology 76:273–297.
Yao J., Zhao F., Xu Y., Zhao K., Quan H., Su Y., Hao P., Liu J., Yu B., Yao M., Ma X., Liao Z., Lan X.. 2020;Complete chloroplast genome sequencing and phylogenetic analysis of two Dracocephalum plants. BioMed Research International 2020:4374801.
Yuan J.-C., Liu A., Takano A., Maki M., Hodel R. G. J., Chen Y.-P., Xiang C.-L.. 2024;Plastid phylogenomics with broad taxon sampling provides insights into the generic delimitation of Paraphlomideae (Lamiaceae). Taxon 73:1016–1029.
Zhang R.-Q., Ma X.-L., Chen Y.-P., Xiang C.-L.. 2024;Complete chloroplast genome sequences of Dracocephalum argunense and D. integrifolium (Lamiaceae: Nepetinae). Journal of Asia-Pacific Biodiversity 17:586–589.
Zhao F., Chen Y.-P., Salmaki Y., Drew B. T., Wilson T. C., Scheen A.-C., Celep F., Bräuchler C., Bendiksby M., Wang Q., Min D.-Z., Peng H., Olmstead R. G., Li B., Xiang C.-L.. 2021;An updated tribal classification of Lamiaceae based on plastome phylogenomics. BMC Biology 19:2.

Article information Continued

Fig. 1.

Photographs of Dracocephalum ruyschiana in Mongolia. A. General habitats. B. Upper parts of plant. C. Habit. D. Inflorescence. E. Flower in lateral view. F. Calyx. G. Leaves. H. Seeds (photo credit: S. Baasanmunkh and D. Munkhutlga).

Fig. 2.

Schematic representation of the complete chloroplast genome of Dracocephalum ruyschiana. The map contains four rings. From the center going outward, the first circle shows the forward and reverse repeats connected with red and green arcs, respectively. The next circle shows the tandem repeats marked with short bars. The third circle shows the microsatellite sequences identified using Microsatellite Identification Tool (MISA). The fourth circle is drawn using drawgenemap and shows the gene structure on the plastome. The genes were colored based on their functional categories.

Fig. 3.

Schematic map of the cis-spliced genes in the complete chloroplast genome of Dracocephalum ruyschiana.

Fig. 4.

Schematic map of the trans-spliced genes in the complete chloroplast genome of Dracocephalum ruyschiana.

Fig. 5.

Phylogenetic tree of Dracocephalum species based on the whole chloroplast genome sequence using the maximum likelihood tree. Bootstrap values are indicated at branch level. Newly sequenced Dracocephalum ruyschiana is represented by red color.

Table 1.

Genes of the chloroplast genome of Dracocephalum ruyschiana.

Group of genes Name of genes
RNA genes Ribosomal RNA rrn4.5a, rrn5a, rrn16a, rrn23a
Transfer RNA trnA-UGCa, trnC-GCA, trnD-GUC, trnE-UUCa, trnF-GAA, trnfM-CAU, trnG-GCC, trnG-UCC, trnH-GUG, trnI-CAUa, trnI-GAUa, trnK-UUU, trnL-CAAa, trnL-UAA, trnL-UAG, trnM-CAUa, trnN-GUUa, trnP-UGG, trnQ-UUG, trnR-ACGa, trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GACa, trnV-UAC, trnW-CCA, trnY-GUA
Ribosomal proteins Small subunit rps2, rps3, rps4, rps7a, rps8, rps11, rps12a, rps14, rps15, rps16, rps18, rps19
Large subunit rpl2a, rpl14, rpl16, rpl20, rpl22, rpl23a, rpl32, rpl33, rpl36
Transcription RNA polymerase rpoA, rpoB, rpoC1, rpoC2
Protein genes Photosystem I psaA, psaB, psaC, psaI, psaJ, ycf3, ycf4
Photosystem II psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
Cytochrome b6/f petA, petB, petD, petG, petL, petN
ATP synthase atpA, atpB, atpE, atpF, atpH, atpI
Rubisco rbcL
NADH dehydrogenase ndhA, ndhB, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
ATP-dependent protease subunit P clpP
Chloroplast envelope membrane protein cemA
Transitional initiation factor infA
Maturase matK
Subunit acetyl-coA carboxylase accD
C-type cytochrome synthesis ccsA
Hypothetical proteins ycf1, ycf2a, ycf15a
Component of TIC complex ycf3
a

Gene with copies.