The complete chloroplast genome sequence of Viola labradorica (Violaceae)

Article information

Korean J. Pl. Taxon. 2024;54(2):121-125
Publication date (electronic) : 2024 June 30
doi : https://doi.org/10.11110/kjpt.2024.54.2.121
Department of Biological Sciences, Kangwon National University, Chuncheon 24341, South Korea
Corresponding author Ki-Oug YOO E-mail: yooko@kangwon.ac.kr
Received 2024 February 2; Revised 2024 March 11; Accepted 2024 March 15.

Abstract

Viola labradorica is commonly known as Alpine violet, American dog violet, Dog violet, and Labrador violet. This species is distributed throughout eastern Canada and North America. In the present study, we sequenced the chloroplast genome of V. labradorica for the first time and performed a phylogenetic analysis of the genus Viola. The length of the chloroplast genome of V. labradorica was 158,751 bp. A large single-copy region (87,211 bp), a small single-copy region (17,344 bp), and two inverted repeat regions (27,098 bp each) were identified. A phylogenetic analysis was conducted using 77 protein-coding genes from the chloroplast genomes of 34 Viola. Salix koriyanagi (Salicaceae) was used as an outgroup. The genus Viola forms a monophyletic clade. Among the clades for the genus Viola, the sect. Viola formed a clade and was divided into two subclades: subsect. Viola and subsect. Rostratae. Within the subsect. Rostratae, V. labradorica was placed in a basal position. These results contribute to a clear identification of the phylogenetic position of V. labradorica in the subsect. Rostratae.

INTRODUCTION

Viola L. is the largest genus of Violaceae Batsch, and the genus Viola contains 658 species (Marcussen et al., 2022). Viola labradorica Schrank has several common names, such as Alpine violet, American dog violet, Dog violet, and Labrador violet. V. labradorica is distributed throughout eastern Canada and North America (GBIF Secretariat, 2022). According to Platt (1950), V. labradorica and V. walteri are morphologically similar in the Appalachians; they present stipular teeth, pubescent adaxial leaves, petioles, and peduncles. However, V. labradorica has small spar stipular teeth and scattered hairs on its upper leaves. Therefore, this species differs from V. walteri (Platt, 1950). The colors of the petals are lavender violet to violet, and the two lateral petals are sparsely bearded (Little and McKinney, 2015). Previous studies of this species included morphology, scanning electron microscope observations of the stigmas, and a flavonoid analysis (Platt, 1950; Ballard, 1992). Systematic studies of chloroplast DNA sequences were conducted by Shooner et al. (2015). However, V. labradorica has never been compared within the section or genus categories. Here, we report the chloroplast genome of V. labradorica for the first time and reconstruct the phylogeny of the genus Viola. These results will serve as a valuable resource for comprehensive taxonomic studies of the subsection Rostratae.

MATERIALS AND METHODS

Fresh leaves of V. labradorica were obtained from the greenhouse of Kangwon National University in Korea, and DNA was extracted. The seeds were purchased from Rareplants.eu (http://www.rareplants.de), and seed-germinating individuals were identified by referring to the original description (Schrank, 1818), lectotype specimen (K000327814 [digital image]), and a description by Little and McKinney (2015). A voucher specimen (KWNU 100770) was deposited into the Kangwon National University Herbarium. DNA was extracted using the DNA Plant Mini Kit (Qiagen Inc., Valencia, CA, USA), with sequencing done on the Illumina MiSeq platform (Illumina Inc., San Diego, CA, USA). We obtained paired-end reads with an average read length of 301 bp from 5,493,560 raw reads; de novo assembly was conducted using Geneious 7.1.9 (Biomatters Ltd., Auckland, New Zealand), and 628,897 contigs were aligned. The complete chloroplast sequence was annotated based on the online program GeSeq (Tillich et al., 2017) and was manually edited through a comparison with the caulescent plants Viola: V. acuminata, V. collina, and V. grypoceras (GenBank accession Nos. MW802528, OM177181, and OM055663, respectively). A circular genome map of V. labradorica was drawn using CPGView (Liu et al., 2023). The cis-splicing genes and the trans-splicing gene of rps12 were drawn using CPGView (Liu et al., 2023). For a phylogenetic analysis of V. labradorica, 77 protein-coding genes from 34 Viola species (Go and Yoo, 2023) and one outgroup (Salix koriyanagi; GenBank accession No. MK120982) were aligned using MAFFT v7.017 (Katoh et al., 2002). Phylogenetic analyses were conducted by means of the maximum likelihood method using RAxML v8.2.12 (Stamatakis, 2014) under the GTR + I model with 1,000 bootstrap replicates and with Bayesian inference using MrBayes 3.2.2 (Ronquist et al., 2012) on the CIPRES Science Gateway (http://www.phylo.org) (Miller et al., 2010). We used the intraspecific classification system of Marcussen et al. (2022).

RESULTS AND DISCUSSION

The complete chloroplast genome of V. labradorica (GenBank accession No. OR003906) has a total length of 158,751 bp (GC content: 36.1%). This genome exhibits a quadripartite structure, consisting of a large single-copy region with a length of 87,211 bp, a small single-copy region with a length of 17,344 bp, and two inverted repeats (IRs) of 27,098 bp each (Fig. 1). The chloroplast genome of V. labradorica contains 111 genes, with 77 proteincoding genes, 29 tRNA genes, four rRNA genes, and one pseudogene. In total, the IR regions contained the following 19 duplicated genes: ndhB, rpl2, rpl23, rps7, rps12, ycf1, ycf2, ycf15, rrn16, rrn23, rrn4.5, rrn5, trnA-UGC, trnICAU, trnI-GAU, trnL-CAA, trnN-GUU, trnR-ACG, and trnV-GAC (Table 1). In the chloroplast genome of V. labradorica, eight protein-coding genes (atpF, ndhA, ndhB, petB, petD, rpl2, rpl16, and rpoC1) and six tRNA genes (trnA-UGC, trnG-UCC, trnI-GAU, trnK-UUU, trnL-UAA, and trnV-UAC) contained one intron, and three genes (clpP, rps12, and ycf3) contained two introns. The protein-coding genes with introns had the structures of 13 genes, including 12 cis-splicing genes (atpF, rpoC1, ycf3, clpP, petB, petD, rpl16, and ndhA, with reverse duplications for rpl2 and ndhB), and one trans-splicing gene (rps12).

Fig. 1.

Chloroplast genomic map of the V. labradorica chloroplast genome. This circular map was drawn using CPGView (http://www.1kmpg.cn/cpgview/) and contains seven tracks. From the center, the first track shows dispersed repeats, with direct repeats in red and palindromic repeats in green. The second track presents long tandem repeats as short blue bars. Short tandem repeats or microsatellite sequences are depicted in the third track as short bars in various colors. The fourth track delineates regions in different colors, showing small single-copy (light blue), inverted repeats (grey), and large single-copy (light green) regions. The GC content of the genome is depicted in the fifth track. Genes are shown in the sixth track, with their corresponding functional classifications indicated. The transcription directions for the inner and outer genes are clockwise and counterclockwise, respectively. The outermost track shows quadripartite structures in different line patterns, with small single-copy (thick dotted line), inverted repeats (solid lines), and large single-copy (dotted line) cases shown. Functional gene classifications are displayed in the bottom left corner.

Genes Contained in the Chloroplast Genome of Viola labradorica (111 gene species)

The gene order and gene direction of V. labradorica were very similar to another Viola, allowing us to confirm that the chloroplast genome of Viola is highly conserved, as found in previous reports (Cheon et al., 2019; Cao et al., 2022). We conducted a phylogenetic analysis, with the results showing that Viola strongly indicated monophyly (BS = 100). A phylogenetic tree was distinguished for each section according to the classification system (Marcussen et al., 2022), except for the section Chamaemelanium, for which conflicting opinions exist in the literature (Becker, 1925; Clausen, 1929, 1964). Specifically, the section Viola formed a monophyletic clade and then was divided into two well-supported subclades, denoted as subsection Viola and subsection Rostratae (Fig. 2). In these two subsections, V. labradorica belongs to the subsection Rostratae and forms a sister group with five other species. The subsection Rostratae subclade supports the classification of Viola (Marcussen et al., 2022). Here, we report the first complete chloroplast genome sequence of V. labradorica and a phylogenetic analysis of Viola. Our results will assist in the effort to construct the comprehensive classification system of the Viola subsection Rostratae.

Fig. 2.

Maximum likelihood tree based on 77 protein-coding genes from the complete chloroplast genomes of 34 Viola and one Salix as an outgroup (MK120982). The position of V. labradorica is highlighted in bold. Bootstrap (BP) values and Bayesian Posterior probabilities (PP) values are displayed above the branch nodes, with asterisks indicating node BS = 100% and PP = 1.00.

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MIST) (NRF-2022R1F1A1067342). The authors are grateful to anonymous reviewers who provided invaluable comments on the early version of the manuscript.

Notes

CONFLICTS OF INTEREST

The authors declare that there are no conflicts of interest.

References

Ballard H. E.. 1992. Systematics of Viola, section Viola in North America North of Mexico. MS thesis,. Graduate School Central Michigan University, Mount Pleasant, MI, USA; 255.
Becker W.. 1925. Viola L. Die natürlichen Pflanzenfamilien 21. Parietales und Opuntiales In : Englar A., ed. Wilhelm Engelmann. Leipzig: p. 363–376.
Cao D.-L., Zhang X.-J., Fan S.-Q., Qu X.-J.. 2022;Application of chloroplast genome in the identification of traditional Chinese medicine Viola philippica . BMC Genomics 23:540.
Cheon K.-S., Kim K.-A., Kwak M., Lee B., Yoo K.-O. 2019;The complete chloroplast genome sequences of four Viola species (Violaceae) and comparative analyses with its congeneric species. PLoS ONE 14:e0214162.
Clausen J.. 1929;Chromosome number and relationship of some North American species of Viola . Annals of Botany 43:741–764.
Clausen J. 1964;Cytotaxonomy and distributional ecology of western North American violets. Madroño 17:173–197.
Secretariat GBIF. 2022;GBIF Backbone Taxonomy. Retrieved Jun, 6 2023, available from https://doi.org/10.15468/39omei.
Go A., Yoo K.-O.. 2023;Comparative genomics of Viola selkirkii and V. ulleungdoensis (Violaceae). Korean Journal of Plant Taxonomy 53:38–46.
Katoh K., Misawa K., Kuma K., Miyata T.. 2002;MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Research 30:3059–3066.
Little R. J., McKinney L. E.. 2015. Violaceae. Flora of North America North of Mexico In : Flora of North America Editorial Committee, ed. 6Oxford University Press. New York, Oxford: p. 106–164.
Liu S., Ni Y., Li J., Zhang X., Yang H., Chen H., Liu C.. 2023;CPGView: A package for visualizing detailed chloroplast genome structures. Molecular Ecology Resources 23:694–704.
Marcussen T., Ballard H. E., Danihelka J., Flores A. R., Nicola M. V., Watson J. M.. 2022;A revised phylogenetic classification for Viola (Violaceae). Plants 11:2224.
Miller M. A., Pfeiffer W., Schwartz T.. 2010. Creating the CIPRES Science Gateway for inference of large phylogenetic trees. Proceedings of the Gateway Computational Environment Workshop (GCE) Institute of Electrical and Electronics Engineers. New York: p. 1–8.
Platt R. B.. 1950;Two Mid-Appalachian Violets. Castanea 15:126–129.
Ronquist F, Teslenko M., van der Mark P., Ayres D. L., Darling A., Höhna S., Larget B., Liu L., Suchard M. A., Huelsenbeck J. P.. 2012;MrBayes 3.2: Efficient Bayesian phylogenetic inference and model choice across a large model space. Systematic Biology 61:539–542.
Schrank F. P.. 1818;Aufzählung einiger Pflanzen aus Labrador, mit Anmerkungen. Denkschriften der Königlich-Baierischen Botanischen Gesellschaft in Regensburg 1:1–30.
Shooner S., Chisholm C., Davies T. J.. 2015;The phylogenetics of succession can guide restoration: An example from abandoned mine sites in the subarctic. Journal of Applied Ecology 52:1509–1517.
Stamatakis A. 2014;RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313.
Tillich M., Lehwark P., Pellizzer T., Ulbricht-Jones E. S., Fisher A., Bock R., Greiner S.. 2017;GeSeq: Versatile and accurate annotation of organelle genomes. Nucleic Acids Research 45:W6–W11.

Article information Continued

Fig. 1.

Chloroplast genomic map of the V. labradorica chloroplast genome. This circular map was drawn using CPGView (http://www.1kmpg.cn/cpgview/) and contains seven tracks. From the center, the first track shows dispersed repeats, with direct repeats in red and palindromic repeats in green. The second track presents long tandem repeats as short blue bars. Short tandem repeats or microsatellite sequences are depicted in the third track as short bars in various colors. The fourth track delineates regions in different colors, showing small single-copy (light blue), inverted repeats (grey), and large single-copy (light green) regions. The GC content of the genome is depicted in the fifth track. Genes are shown in the sixth track, with their corresponding functional classifications indicated. The transcription directions for the inner and outer genes are clockwise and counterclockwise, respectively. The outermost track shows quadripartite structures in different line patterns, with small single-copy (thick dotted line), inverted repeats (solid lines), and large single-copy (dotted line) cases shown. Functional gene classifications are displayed in the bottom left corner.

Fig. 2.

Maximum likelihood tree based on 77 protein-coding genes from the complete chloroplast genomes of 34 Viola and one Salix as an outgroup (MK120982). The position of V. labradorica is highlighted in bold. Bootstrap (BP) values and Bayesian Posterior probabilities (PP) values are displayed above the branch nodes, with asterisks indicating node BS = 100% and PP = 1.00.

Table 1.

Genes Contained in the Chloroplast Genome of Viola labradorica (111 gene species)

Category Gene group Gene names
Self-replication Large subunit ribosomal proteins rpl2 (×2)*, rpl14, rpl16*, rpl20, rpl22, rpl23 (×2), rpl33, rpl36
Small subunit ribosomal proteins rps2, rps3, rps4, rps7 (×2), rps8, rps11, rps12 (×2)**T, rps14, rps15, rps18, rps19
DNA-dependent RNA polymerase rpoA, rpoB, rpoC1*, rpoC2
Ribosomal RNAs rrn4.5 (×2), rrn5 (×2), rrn16 (×2), rrn23 (×2)
Transfer RNAs trnA-UGC (×2)*, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnG-GCC, trnG-UCC*, trnH-GUG, trnI-CAU (×2), trnI-GAU (×2)*, trnK-UUU*, trnL-CAA (×2), trnL-UAA*, trnL-UAG, trnM-CAU (×2), trnN-GUU (×2), trnP-UGG, trnQ-UUG, trnR-ACG (×2), trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GAC (x2), trnV-UAC*, trnW-CCA, trnY-GUA
Photosynthesis Subunits of ATP synthase atpA, atpB, atpE, atpF*, atpH, atpI
Subunits of NADH-dehydrogenase ndhA*, ndhB (×2)*, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
Subunits of cytochrome b/f complex petA, petB*, petD*, petG, petL, petN
Subunits of photosystem I psaA, psaB, psaC, psaI, psaJ
Subunits of photosystem II psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ
Subunit of Rubisco rbcL
Other genes Subunit of acetyl-CoA-carboxylase accD
C-type cytochrome synthesis gene ccsA
Envelop membrane protein cemA
ATP-dependent protease subunit P clpP**
Maturase matK
Unknown function Conserved open reading frames ycf1, ycf2 (×2), ycf3**, ycf4, ycf15(x2)
Pseudogenes Pseudogene Ψycf1
*

denotes a gene containing one intron,

**

denotes a gene containing two introns, ψ denotes pseudogene, ×2 denotes a duplicate gene, and T denotes a trans-spliced gene