The complete chloroplast genome of Aruncus aethusifolius (Rosaceae), a species endemic to Korea

Article information

Korean J. Pl. Taxon. 2022;52(2):118-122
Publication date (electronic) : 2022 June 30
doi : https://doi.org/10.11110/kjpt.2022.52.2.118
1InfoBoss Inc., Seoul 07766, Korea
2InfoBoss Research Center, Seoul 07766, Korea
3Department of Biology, Daejeon University, Daejeon 34520, Korea
Corresponding author Jongsun PARK and Sang-Hun OH, E-mail: starflr@infoboss.co.kr soh42@dju.kr
Editor Tae-Soo JANG
Received 2022 February 25; Revised 2022 March 13; Accepted 2022 March 14.

Abstract

Aruncus aethusifolius (H. Lév.) Nakai is an endemic species in Korea and is economically important as an ornamental herb. The complete chloroplast genome of A. aethusifolius is 157,217 bp long with four subregions consisting of 85,207 bp of large single-copy and 19,222 bp of small single-copy regions separated by 26,394 bp of inverted repeat regions. The genome includes 131 genes (86 protein-coding genes, eight rRNAs, and 37 tRNAs). Phylogenetic analyses demonstrates that the chloroplast genome of A. aethusifolius was sister to A. dioicus var. kamtschaticus, forming the strongly supported clade of Aruncus. This is the first report of the chloroplast genome of A. aethusifolius.

INTRODUCTION

Aruncus aethusifolius (H. Lév.) Nakai (Rosaceae; Spiraeoideae; Spiraeeae) is an endemic species of Korea, distributed only on Jejudo Island. Plants of A. aethusifolius are found at the outcrops of volcanic rocks at elevations ranging from 1,500 to 1,950 m, and they also grow in moist parts of lowlands on the island (Yun et al., 2017). Aruncus aethusifolius is a small perennial herb up to 30 cm tall with estipulate, pinnately compound leaves; compact panicles with many small, unisexual or rarely bisexual flowers; and erect pedicels and follicles. It is morphologically similar to A. dioicus var. kamtschaticus (Maxim.) H. Hara and A. dioicus var. astilboides (Maxim.) H. Hara, but easily distinguished from these by having highly dissected leaflets (Nakai, 1912; Lee, 1980; Lee, 2006; Yun et al., 2017). Aruncus aethusifolius is widely cultivated as an ornamental, known as dwarf goat’s beard, for its attractive inflorescence arising above the foliage mound and because it requires little care to grow. As a part of biogeographic and population genetic studies of the species, the complete chloroplast genome was determined to understand the origin of this economically important plant.

MATERIALS AND METHODS

The sample was collected on Georinjokeun Oreum on Jejudo Island, Korea (33º21′8.20″N, 126º39′48.50″E). A specimen was deposited at the Daejeon University Herbarium (TUT) under the voucher number Suh 8681. It is locally common. No permission is required for the collection. Total DNA was extracted from 100 mg of fresh leaves using a DNeasy Plant Mini Kit (QIAGEN, Hilden, Germany), and visualized in 1% agarose gel. An amount of 1.2 micrograms of DNA, measured with an Invitrogen Qubit fluorometer, was used to construct the library. The sequencing library was constructed using an Illumina TruSeq Nano DNA Library Preparation Kit (Illumina, San Diego, CA, USA) following the manufacturer’s recommendations with approximately 350-bp DNA fragments. 1.99-Gbp raw sequences were obtained using HiSeqX at Macrogen Inc. (Seoul, Korea), and were filtered by Trimmomatic v0.33 (Bolger et al., 2014). The chloroplast genome was de novo assembled with Velvet v1.2.10 (Zerbino and Birney, 2008), and gaps were closed using GapCloser v1.12 (Zhao et al., 2011). The genome sequence was confirmed by aligning all raw reads against the assembled genome using BWA v0.7.17 and SAMtools v1.9 (Li, 2013; Li et al., 2009).

All processes were conducted in the Genome Information System utilized in previous studies (Park et al., 2019c; Baek et al., 2021; Park et al., 2021). Geneious Prime v2020.2.4 (Biomatters Ltd., Auckland, New Zealand) was used for annotation based on the A. dioicus var. kamtschaticus chloroplast (MW115132) (Suh et al., 2021). A circular map of A. aethusifolius chloroplast genome was drawn using OGDRAW v1.31 (Greiner et al., 2019). Large single-copy (LSC), small single-copy (SSC), and inverted repeat (IR) regions were determined by bl2seq (Tatusova and Madden, 1999), which provides a dot plot to show exact IR regions.

Sixteen whole chloroplast genomes to represent the major lineages of the tribe Spiraeeae (Potter et al., 2007) were included in phylogenetic analyses using the maximum likelihood (ML), neighbor-joining, and Bayesian inference (BI) methods. Heuristic searches were employed to reconstruct the ML phylogenetic tree using MEGA X (Kumar et al., 2018) with the following options: nearest-neighbor interchange branch swapping, the Tamura-Nei model, uniform rates among sites, and default values for other options. Bootstrap analyses with 1,000 pseudoreplicates were also conducted. The BI tree was constructed using MrBayes v3.2.6 (Ronquist et al., 2012). The GTR model with gamma rates was used, determined in jModelTest v2.1.5 (Darriba et al., 2012). A Markov-chain Monte Carlo algorithm was employed for 1,000,000 generations, sampling trees every 200 generations, with four chains running simultaneously.

RESULTS AND DISCUSSION

The chloroplast genome of A. aethusifolius (GenBank accession: MZ882398) (Fig. 1) is 157,217 bp long (GC ratio is 36.5%) and has four subregions consisting of 85,207 bp of LSC (34.3%) and 19,222 bp of SSC (30.0%) regions separated by 26,394 bp of IR regions (42.4%). It contains 131 genes (86 protein-coding genes, eight rRNAs, and 37 tRNAs); 19 genes (eight protein-coding genes, four rRNAs, and seven tRNAs) are duplicated in IR regions (Fig. 1). The chloroplast genome sequence can be accessed via accession number MZ882398 and SRA numbers are PRJNA722681, SAMN18789871, and SRR14267045.

Fig. 1.

Circular map of Aruncus aethusifolius complete chloroplast genome. The genes located outside of the circle are transcribed clockwise, while those located inside are transcribed counterclockwise. The dark gray plot in the inner circle corresponds to GC content. Large single-copy, small single-copy, and inverted repeat are indicated with LSC, SSC, and IR (IRA and IRB), respectively.

Interspecific variations between A. aethusifolius and A. dioicus var. kamtschaticus were investigated with the same method used in the previous studies (Kim et al., 2021; Lee and Park, 2021; Yoo et al., 2021). In total, there were 206 single nucleotide polymorphisms and 62 regions of insertion/deletion (INDEL regions, 907 bp in total). Remarkably, a 687-bp deletion was found in an intergenic region between trnT-GGU and psbD between the chloroplast genomes of A. dioicus var. kamtschaticus and A. aethusifolius, which contributes to the major difference in the genome size between the two (642 bp). These kinds of long INDELs were also found in the chloroplast genomes of Arabidopsis thaliana L. (378 bp) (Park et al., 2020a) and Coffea arabica L. (84 bp) (Park et al., 2019b) and mitochondrial genomes of Liriodendron tulipifera L. (1,980 bp) (Park et al., 2019a) and Rosa rugosa Thunb. ex Murray (607 bp) (Park et al., 2020b). Moreover, two 26-bp insertions found in the SSC region and 17 INDEL regions, of which length is above 5-bp were also identified. These interspecific variations are similar to the chloroplast genomes of four Viburnum species (Park et al., 2020c).

The phylogenetic tree showed that A. aethusifolius formed a strongly supported clade of Aruncus with A. dioicus and A. dioicus var. kamtschaticus, sister to Holodiscus (K. Koch) Maxim. (Fig. 2). The results of phylogenetic analyses are consistent with previous data, in which the clade of Aruncus and monotypic Leutkea Bong. is sister to Holodiscus (Potter et al., 2007; Zhang et al., 2017). The precise phylogenetic position of A. aethusifolius within Aruncus is important to gain insights into the origin of the species. But, chloroplast genome data of other varieties of A. dioicus (Hara, 1955) could not be included in this analysis because they are unavailable. Aruncus dioicus var. astilboides, a rare species in northern Honshu, Japan, is of particular interest because erect follicles found in A. aethusifolius are also characteristics of the Japanese variety. The information of organisms for the two genome data of A. dioicus from GenBank (GenBank accessions: KY419926 and KY419932) were not provided to the varietal level. Further studies with more taxa are needed to infer the origin of A. aethusifolius and to understand the biogeographic history of Aruncus. This is the first complete chloroplast genome sequence of A. aethusifolius, which will be helpful to develop species-specific markers.

Fig. 2.

Maximum likelihood phylogenetic tree of 14 chloroplast genomes of 12 species of the tribe Spiraeeae with two Gillenia species as outgroups. The numbers above the branches correspond to the bootstrap support values from the maximum likelihood and neighbor-joining methods, as well as posterior probabilities from the Bayesian inference.

Acknowledgements

This work was supported, in part, by research grants from the National Institute of Biological Resources of Korea (NIBR-202104201).

Notes

CONFLICT OF INTEREST

Sang-Hun OH, the Editor-in-Chief of the Korean Journal of Plant Taxonomy, was not involved in the editorial evaluation or decision to publish this article. The authors have declared no conflicts of interest.

References

Baek J, Park S, Lee J, Min J, Park J, Lee GW. 2021;The complete chloroplast genome of Chrysanthemum zawadskii Herbich (Asteraceae) isolated in Korea. Mitochondrial DNA Part B Resources 6:1956–1958.
Bolger AM, Lohse M, Usadel B. 2014;Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120.
Darriba D, Taboada G. L., Doallo R, Posada D. 2012;jModelTest 2: More models, new heuristics and parallel computing. Nature Methods 9:772.
Greiner S., Lehwark P, Bock R. 2019;OrganellarGenome-DRAW (OGDRAW) version 1.3.1: Expanded toolkit for thegraphical visualization of organellar genomes. Nucleic Acids Research 47:W59–W64.
Kim M.-H, Park S, Lee J, Baek J, Park J, Lee GW. 2021;The complete chloroplast genome of Glycyrrhiza uralensis Fisch. isolated in Korea (Fabaceae). Korean Journal of Plant Taxonomy 51:353–362.
Kumar S., Stecher G, Li M, Knyaz C, Tamura K. 2018;MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms. Molecular Biology and Evolution 35:1547–1549.
Hara H. 1955;Critical notes on some type specimens of East-Asiatic plants in foreign herbaria (2). Journal of Japanese Botany 30:65–72.
Lee B, Park J. 2021;The complete chloroplast genome of Zoysia japonica Steud. isolated in Korea (Poaceae): Investigation of potential molecular markers on Z. japonica chloroplast genomes. Plant Biotechnology Reports 15:707–715.
Lee TB. 1980. Illustrated Flora of Korea Hyang-mun Pub. Co.. Seoul: p. 990.
Lee YN. 2006. New Flora of Korea Kyo-Hak Publishing Co. Seoul: 1p. 975. 2p. 885. (in Korean).
Li H. 2013. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv preprint at: https://arxiv.org/abs/1303.3997.
Li H., Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. 2009;The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079.
Nakai T. 1912;Notulae ad plantas Japoniae et Coreae VIII. Botanical Magazine (Tokyo) 26:321–328.
Park J., Kim Y, Kwon M. 2019a;The complete mitochondrial genome of tulip tree, Liriodendron tulipifera L. (Magnoliaceae): Intra-species variations on mitochondrial genome. Mitochondrial DNA Part B Resources 4:1308–1309.
Park J., Kim Y, Kwon W, Xi H, Park C-H. 2021;The complete chloroplast genome sequence of new species candidate of Plantago depressa Willd. in Korea (Plantaginaceae). Mitochondrial DNA Part B Resources 6:1961–1963.
Park J, Kim Y, Xi H, Heo K-I. 2019b;The complete chloroplast genome of ornamental coffee tree, Coffea arabica L. (Rubiaceae). Mitochondrial DNA Part B Resources 4:1059–1060.
Park J, Xi H, Kim Y. 2020a;The complete chloroplast genome of Arabidopsis thaliana isolated in Korea (Brassicaceae): An investigation of intraspecific variations of the chloroplast genome of Korean A. thaliana . International Journal of Genomics 3236461
Park J., Xi H, Kim Y, Nam S, Heo K-I. 2020b;The complete mitochondrial genome of new species candidate of Rosa rugosa (Rosaceae). Mitochondrial DNA Part B Resources 5:3435–3437.
Park J., Xi H, Oh S-H. 2020c;Comparative chloroplast genomics and phylogenetic analysis of the Viburnum dilatatum complex (Adoxaceae) in Korea. Korean Journal of Plant Taxonomy 50:8–16.
Park J., Yun N, Oh S-H. 2019c;The complete chloroplast genome of an endangered species in Korea, Halenia corniculata (L.) Cornaz (Gentianaceae). Mitochondrial DNA Part B Resources 4:1539–1540.
Potter D, Eriksson T, Evans RC, Oh S, Smedmark J. E. F., Morgan DR, Kerr M, Robertson KM, Arsenault M, Dickinson TA, Campbell C. 2007;Phylogeny and classification of Rosaceae. Plant Systematics and Evolution 266:5–43.
Ronquist F, Teslenko M, Van Der Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP. 2012;MrBayes 3.2: Efficient Bayesian phylogenetic inference and model choice across a large model space. Systematic Biology 61:539–542.
Suh H.-J., Min J, Park J, Oh S-H. 2021;The complete chloroplast genome of Aruncus dioicus var. kamtschaticus (Rosaceae). Mitochondrial DNA Part B Resources 6:1256–1258.
Tatusova TA, Madden TL. 1999;BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiology Letters 174:247–250.
Yoo S.-C., Oh S-H, Park J. 2021;Phylogenetic position of Daphne genkwa (Thymelaeaceae) inferred from complete chloroplast data. Korean Journal of Plant Taxonomy 51:171–175.
Yun N., Suh H-J, Oh S-H. 2017;Sexuality of Aruncus aethusifolius (Rosaceae). Korean Journal of Plant Taxonomy 47:189–195.
Zerbino DR, Birney E. 2008;Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Research 18:821–829.
Zhang S.-D, Jin J.-J., Chen S.-Y, Chase MW, Soltis DE, Li H.-T, Yang J.-B, Li D.-Z, Yi T-S. 2017;Diversification of Rosaceae since the Late Cretaceous based on plastid phylogenomics. New Phytologist 214:1355–1367.
Zhao Q.-Y., Wang Y, Kong Y-M, Luo D, Li X, Hao P. 2011;Optimizing de novo transcriptome assembly from short-read RNA-Seq data: A comparative study. BMC Bioinformatics 12(Suppl 14):S2.

Article information Continued

Fig. 1.

Circular map of Aruncus aethusifolius complete chloroplast genome. The genes located outside of the circle are transcribed clockwise, while those located inside are transcribed counterclockwise. The dark gray plot in the inner circle corresponds to GC content. Large single-copy, small single-copy, and inverted repeat are indicated with LSC, SSC, and IR (IRA and IRB), respectively.

Fig. 2.

Maximum likelihood phylogenetic tree of 14 chloroplast genomes of 12 species of the tribe Spiraeeae with two Gillenia species as outgroups. The numbers above the branches correspond to the bootstrap support values from the maximum likelihood and neighbor-joining methods, as well as posterior probabilities from the Bayesian inference.