Isolation and characterization of EST-SSR markers for Astilboides tabularis (Saxifragaceae), endangered species in Korea
Article information
Abstract
Genetic assessments of rare and endangered species are among the first steps necessary to establish the proper management of natural populations. Transcriptome-derived single-sequence repeat markers were developed for the Korean endangered species Astilboides tabularis (Saxifragaceae) to assess its genetic diversity. A total of 96 candidate microsatellite loci were isolated based on transcriptome data using Illumina pair end sequencing. Of these, 26 were polymorphic, with one to five alleles per locus in 60 individuals from three populations of A. tabularis. The observed and expected heterozygosity per locus ranged from 0.000 to 0.950 and from 0.000 to 0.741, respectively. These polymorphic transcriptome-derived simple sequence repeat markers would be invaluable for future studies of population genetics and for ecological conservation of the endangered species A. tabularis.
Astilboides tabularis (Hemsl.) Engler is the only species in Astilboides (Saxifragaceae) known to be distributed in a cluster in the forests of river valleys of northeastern Korea and China (Jintang and Cullen, 2001; The Angiosperm Phylogeny Group et al., 2016). This species is a protected wild plant classified as endangered wildlife grade II by the Ministry of the Environment due to the possibility of the extinction of the population and/or reduction in the number of individuals by climate change (Ministry of the Environment of Korea, 2014). A. tabularis is a potential horticultural plant as an ornamental species given its large leaves (approximately 1 m in diameter) and beautiful panicles (Belyaeva and Butenkova, 2016; Choi et al., 2016). It also has a long history of usage as a medicinal plant for diabetes (Liu et al., 2016). Due to biological conservation efforts and given the ecological importance of A. tabularis, genetic diversity analysis studies have been conducted using AFLP and isozymes (Ku et al., 2006; Lee, 2008).
To the best of our knowledge, no microsatellite markers have been developed thus far for A. tabularis for population studies. Population genetics research provides insight into conservation and management plans for rare and threatened species (Ottewell et al., 2016). To assess the genetic diversity of A. tabularis, we developed expressed sequence tag–simple sequence repeat (EST-SSR) markers. These have been used as a powerful molecular tool for genetic diversity studies of many plant species (Yan et al., 2016; Wang et al., 2017).
Materials and Methods
For the construction of the RNA library, the total RNA was extracted from leaves of individuals representing A. tabularis from three populations (Voucher No. NIBRVP0000655607) (Table 1). RNA was extracted using RNeasy kits, version 2.2 (Illumina, San Diego, CA, USA) following the manufacturer’s instructions, and was subsequently used for TruSeq cDNA library preparation and high-throughput Illumina HiSeq 100 bp paired-end de novo transcriptome sequencing. The analysis results reads were obtained and assembled into 102,884 unigenes with 7,476,378,742 read numbers. The de novo transcriptome assembly of these reads was performed using the short read assembling program Trinity r20140717 (Haas et al., 2013) with the default parameters. Microsatellites were detected using MIcroSAtellite (MISA) version 1.0.0 (Thiel et al., 2003) with thresholds of ten repeat units for dinucleotide and five repeat units for tri-, tetra-, penta-, and hexanucleotides. MISA identified 38,598 simple sequence repeats (SSRs), of which 96 loci were selected depending on (1) the number of SSR repeats, (2) a PCR product size of 150–500 bp, (3) an annealing temperature range of 55–60°C, and (4) a minimum GC content of 50% for further testing of A. tabularis. The primer sets were designed to flank the microsatellite-rich regions with a minimum of eight repeats using the Primer3 program (Rozen and Skaletsky, 1999).
We sampled 60 A. tabularis individuals from three wild populations (Table 1). All samples included in this study were collected in accordance with the requirements of permission and support from relevant authorities. To avoid collecting clones, we specified a distance of at least 2 m among individuals within each population. The voucher specimens were deposited in the National Institute of Biological Resources Herbarium (KB) and in the Herbarium of Hallym University (HHU) in Korea (Table 1). The locations of the sites have been withheld to prevent illegal collection. The utility of the 96 microsatellite markers was confirmed by PCR from each population of A. tabularis in a total volume 25 μL, containing 2.5 μL of 10× Ex Taq buffer (TaKaRa Bio Inc., Otsu, Japan), 2 μL of 2.5 mM dNTPs, 0.01 μM each of a forward and reverse primer, 0.1 μL of TaKaRa Ex Taq DNA polymerase (5 units/μL) (TaKaRa Bio Inc.), and 5–10 ng of template DNA. All PCRs were performed in a GeneAmp PCR System 9700 thermocycler (Applied Biosystems, Carlsbad, CA, USA) using the following program: initial denaturation at 98°C for 5 min followed by 30 cycles of denaturation at 95°C for 1 min, annealing at annealing at 59°C for 1 min with an extension at 72°C for 1.5 min, and a final extension step at 72°C for 10 min. Fluorescently labeled (HEX, FAM) PCR products were analyzed by an automated sequencer (ABI 3730XL) with the GeneScan 500 LIZ Size Standard (Applied Biosystems). Genotyping was performed using GeneMapper 3.7 (Applied Biosystems), and peaks were scored manually by visual inspections. Finally, we identified 26 polymorphic markers based on genotyping data, and functional annotations for these markers were performed on a subset of ESTs with BLASTX scores (E-value < 1 × 10−10) using the gene ontology database (Table 2). The genetic parameters of polymorphic loci were assessed by calculating the number of alleles (A), the observed heterozygosity (Ho), and the expected heterozygosity (He) using GenAlEx 6.5 (Peakall and Smouse, 2012). Degrees of deviation from the Hardy-Weinberg equilibrium (HWE) were estimated with ARLEQUIN 3.5 (Excoffier and Lischer, 2010).
Results and Discussion
The results of the genetic diversity of 26 polymorphic markers are shown in Table 3. Overall, the 26 microsatellite loci were polymorphic, with the number of alleles per locus ranging from one to five (average 2.218). The Ho and He values ranged from 0.000 to 0.950 and from 0.000 to 0.741, respectively (Table 3). Some polymorphic loci significantly deviated from HWE, though this was not consistent across populations.
This study describes the first assembly and characterization of the leaf transcriptome of A. tabularis using the Illumina paired-end sequencing method. Twenty-six polymorphic markers were successfully amplified, revealing polymorphisms in A. tabularis. This work can serve as a basis for further studies on the genetic diversity and structure of A. tabularis and can provide essential information for devising effective conservation strategies.
Acknowledgements
This research was supported by Grant No. NIBR201703201 from the National Institute of Biological Resources under the Ministry of Environment, Republic of Korea.
Notes
Conflict of Interest
The authors declare that there are no conflicts of interest.