Next-generation sequencing based Cyperus niveus (Cyperaceae) complete chloroplast genome: A comparative analysis and phylogeny

Article information

Korean J. Pl. Taxon. 2024;54(2):99-109
Publication date (electronic) : 2024 June 30
doi : https://doi.org/10.11110/kjpt.2024.54.2.99
1Plant Systematics and Evolution Laboratory (PSEL), Department of Agricultural Sciences and Technology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad 44000, Pakistan
2Alpha Genomics Private Limited, 45710, Islamabad, Pakistan
3Group of Biometrology, Korea Research Institute of Standards and Science (KRISS), Daejeon 34113, Korea
Corresponding author Muhammad Qasim HAYAT E-mail: m.qasim@asab.nust.edu.pk
Received 2024 March 15; Revised 2024 April 6; Accepted 2024 May 8.

Abstract

Cyperus niveus, a perennial herb, holds significant traditional and scientific medicinal value. In this study, the complete chloroplast genome sequence of C. niveus was assembled and comparisons were conducted among species in the genus Cyperus. Molecular phylogenetic analysis was also conducted in Cyperaceae. The chloroplast genome of C. niveus exhibits a quadripartite structure, spanning a length of 185,454 base pairs (bp). The genome contains a large single-copy region (99,339 bp), flanked by two inverted repeats (IRs) (IRA and IRB: each 38,021 bp) separated by a small single-copy (SSC) region (10,073 bp). The chloroplast genome of C. niveus was rich in adenine-thymine (AT) at 66.7% and guanine-cytosine (GC) at 33.3%. A total of 140 unique genes were identified in the chloroplast genome of C. niveus, including 83 protein-coding genes, 39 tRNA genes, and 8 rRNA genes. An amino acid frequency analysis uncovered the prevalence of leucine (10.90%) and isoleucine (9.20%). One hundred and forty-six simple sequence repeats and 49 oligonucleotide repeats were found in the chloroplast genome of C. niveus. A comparative analysis highlighted greater divergence within the IR and SSC regions. A phylogenetic assessment revealed that species within the genus Cyperus display phylogenetic conservation compared to other genera. Notably, the genus Carex exhibited substantial divergence, indicative of rapid evolutionary changes. C. niveus showed close resemblance with C. iria and C. exaltatus. In essence, this study and of chloroplast genome of C. niveus represents a valuable genomic asset that holds significance in species identification and comparative genomic research, presenting the first evolutionary relationship in Cyperaceae on the basis of chloroplast genome.

INTRODUCTION

Cyperusniveus is perennial herb having dark brown, woody rhizomes which are globular in shape (Zafar et al., 2011). Cyperusniveus belongs to family Cyperaceae, which is wind pollinated wild family comprising around 5500 species with cosmopolitan distribution (Ball and Reznicek, 2002). Cyperusniveus commonly known as snow white sedge, is found in cold dry habitats along roadsides and rock crevices. Stem of C. niveus is erect, triquetrous while leaves are basal. The inflorescence is capitate, sessile spikelets and 2–3 involucre bracts. Flowering period of C. niveus is from April to October. It is a native species of Asia; mainly Central, South, and Southeast. In Pakistan widely distributed in Sindh, Balochistan, Thal, Kurram, Peshawar, Kohat, Chitral, Swat, Barikot, Mingora, Hazara, Siran Valley, Nathia Gali, Sakesar, Kahuta, Saidpur, Murree Road, Jhelum Valley, Azad Kashmir, Poonch, etc. (Marwat and Khan, 2008; Zafaret al., 2011; Parvez et al., 2022). Cyperusniveus is used as traditional herb medicine and fodder (Amjad et al., 2013). It is effective in various medical conditions like stomachic, used as antidiarrheal and anti-cancerous, for increasing appetite, and decreasing inflammation (Agarwal and Ghosh, 1989). It is effective in treating piles, edema, eye sore, scorpion sting and lumbago. This herb also used for common colds, stomach worms in goats and cows in crushed form (Ahmad, 2007; Tariq et al., 2014). The presence of flavanols, phenolic acid and aurone was confirmed in TLC finger printing under UV rays (Zafaret al., 2011). Crude extract of C. niveus has antidiarrheal and antispasmodic effects which work through dual blockade of muscarinic receptors and Ca2+ channels and, also has anti-emetic properties (Aleem and Janbaz, 2018). In some countries C. niveus grows as agricultural weed but in Pakistan, it is rare and under great pressure of grazing due to high nutritional values and palatability. With that extent of medicinal importance, C. niveus is currently under the radar as a locally endangered species (Amjad et al., 2015; Bano et al., 2018). Its ability to thrive in xeric environments and tolerate extreme conditions in mountainous regions makes it a candidate for restoration efforts (Parvez et al., 2022).

Chloroplasts, which are photosynthetic organelles found in the plant cells, are crucial for photosynthesis and manufacture of pigments, amino acids, starches, and fatty acids (Neuhaus and Emes, 2000; Rodríguez-Ezpeleta et al., 2005). It is believed that free-living endosymbiotic cyanobacteria are the originator of chloroplasts (Raven and Allen, 2003). Angiosperm chloroplasts have their own circular genomes that are between 120 and 160 kb long and contain 110 to 130 distinct genes (Yan et al., 2019). Angiosperm chloroplast typically hasa quadripartite pattern with two replicas of inverted repeats (IR) regions separated by one large single copy (LSC) region and one small single copy (SSC) region (Yan et al., 2019). Angiosperm chloroplast genomes are often well preserved in terms of gene content and genome structure in comparison to nuclear and mitochondrial genomes (Wicke et al., 2011). However, some angiosperm lineages have been identified to exhibit gene deletions and genomic rearrangements (Lee et al., 2007). Additionally, chloroplast genome nucleotide substitution rates are slower than nuclear genome nucleotide substitution rates (Duchene and Bromham, 2013; Smith, 2015). Chloroplast genomes are appropriate for comparative genomic and molecular evolution research because of their short genome size, simpler structure, and preserved gene content (Huang et al., 2014; Walker et al., 2014). Chloroplast transformation also has some benefits over nuclear gene transformation, including high levels of transgene expression, the absence of post-transcriptional gene silencing, site-specific transgene integration, and environmental friendliness (Maliga, 2004; Bock, 2007). The availability of the entire chloroplast genome would streamline chloroplast transformation methods in scientific research (Yan et al., 2019).

The aim of this study was assembly of C.niveus complete chloroplast genome, exploration of genomic features, and comparative analysis with nine other available chloroplast genomes of genus Cyperus. Additionally, this research aims to establish the first family-level phylogeny of Cyperaceae using the complete chloroplast genome.

MATERIALS AND METHODS

Sample collection, DNA extraction, and sequencing

Fresh sample of C. niveus was collected on 12 September 2022 from hills of the National University of Sciences and Technology, Islamabad, Pakistan. The sample was brought to the Pakistan Museum of Natural History for confirmation and voucher number PMNH-041726 was assigned for future reference. Fresh leaves were silica dried and DNeasy Plant Mini Kit (Qiagen, Inc., Hilden, Germany) was used for whole genome DNA extraction. For quality and quantity checking 1% agarose gel electrophoresis and Multiskan GO (Thermo Scientific, Inc., Waltham, MA, USA) wereused. For sequencing, high-quality whole genome DNA was sent to Novogene, Hong Kong and sequenced from paired ends with 150 bp short reads and a 350 bp insert size using HiSeq2500 (Steemers and Gunderson, 2005). All other species used in this study are provided in Table 1 along with the accession number.

List of Cyperaceae species along with accession number.

Genome assembly and annotation

The chloroplast genome of C. niveus was de novo assembled according to methods of Abdulla et al. (2020) with Kmer values of 141, 131, 121, 111, 101, and 91 using Velvet 1.2.10 (Zerbino and Birney, 2008). The resulted contigs were then mapped to reference using Cyperusiria (accession No. MW123056.1) in Geneious R8.1 (Kearse et al., 2012) which generated whole chloroplast genome of C. niveus. The coverage depth analysis was performed using the Burrow-Wheeler aligner (Li and Durbin, 2010) and the boundaries of LSC, SSC, and IR (IRA and IRB) were determined using careful observation of overlapping contigs in Geneious R8.1 (Kearse et al., 2012).

Chloroplast genome of C. niveus was annotated using online server of GeSeq (Tillich et al., 2017) and presence of transfer RNA genes was further confirmed using tRNAscan-SE version 2.03 (Schattner et al., 2005) The circular map of chloroplast genome of C. niveus was drawn using OGDRAW (Organellar Genome Draw) (Greiner et al., 2019). For GenBank submission, the final annotated file was converted into GenBank format by using GB2sequin and after submission accession number OR418070 was assigned.

Genomic features of Cyperus niveus and comparative analyses

Genomic features of C. niveuswas analyzed in Geneious R8.1 (Kearse et al., 2012) and was compared with 10 other species of genus Cyperus. To access the degree of divergence and nucleotide variability (Pi) within the chloroplast genome, a group of 11 Cyperus species were aligned together using MAFFT v7.490 (Katoh and Standley, 2013) and manual adjustments were made using Geneious R8.1 (Kearse et al., 2012). An analysis using the sliding window approach was conducted in DnaSP v6.12.03 (Rozas et al., 2017) with the following settings: window length of 800 and step size of 100. The contraction and expansion of repeats regions in closely related four members of genus Cyperus (C.niveus, C. exaltatus, C. difformis, and C. aromaticus) was determined using IRscope (Amiryousefi et al., 2018).

Codon usage, amino acids frequency, and RNA editing sites

Amino acids frequency and codon usage of protein-coding genes of C. niveusweredetermined using Geneious R8.1 (Kearse et al., 2012). RNA editing sites weredetermined using Plant RNA Editing-Prediction & Analysis Computer Tool (PREPACT) version: 3.12.0 using NCBI BLAST 2.2.26+ (http://www.prepact.de/prepact-main.php) and Arabidopsisthaliana was selected as reference protein database (Lenz and Knoop, 2013).

Simple sequence repeats and oligonucleotide repeats

For simple sequence repeat (SSR) analysis Perl script MIcroSAtellite identification tools (MISA) v2.1, 2020-08-25 (Beier et al., 2017) was used at the following parameters; 10 for mono-, 5 for di-, 4 for tri-, 3 for tetra-, penta-, and hexanucleotide repeats. For oligonucleotide repeats REPuter (Kurtz et al., 2001) was used to detect forward (F), reverse (R), palindromic (P), and complementary (C) repeats at minimum repeat size set to 30 with 3 mismatches and minimum similarity set to 90%.

Substitution and InDel analysis

For substitution analysis, each region (LSC, SSC, IR) of C. niveus chloroplast genome was pairwise aligned with C. iria taking it as reference. using MAFFT v7.490 (Katoh and Standley, 2013) and then visualized in Geneious R8.1 (Kearse et al., 2012). For InDel events DnaSP (Rozas et al., 2017) was used.

Phylogenetic relationship family Cyperaceae

For phylogenetic relationship, chloroplast genome of 28 Cyperaceae species and 1 Poaceae species, Brachypodium-distachyon (accession No. NC_011032) which was used as outgroups. Complete chloroplast genome of 29 species was multiple aligned using MAFFT v7.490 (Katoh and Standley, 2013). All types of InDels were removed from alignment and only substitutions were used to construct Maximum Likelihood tree using online server of IQ-TREE at default parameters and substitution model GTR+FO was adapted automatically on basis of data type (https://www.hiv.lanl.gov/ content/sequence/IQTREE/iqtree.html) (Nguyen et al., 2014; Trifinopoulos et al., 2016). The ultrafast bootstrap value of 1,000 replicates wasapplied todataset (Hoang et al., 2018). The resulted Newick tree was brought into MEGA version 11.0.13 for visualization and beautification (Tamura et al., 2021). Different marker identifications were applied to differentiate species of each genus and outgroup.

RESULTS AND DISCUSSION

Chloroplast genome features of C. niveus

Hiseq2500 paired end run generated 5.3 GB sequencing raw data that contained 17.86 million reads. The whole genome sequence data was used to de novo assemble the complete chloroplast genome of C. niveus with average coverage depth of 100×. The chloroplast genome of C.niveus had quadripartite structure with total length of 185,454 base pairs (bp). The chloroplast genome contained an LSC region of 99,339 bp, two IRs (IRA and IRB: each 38,021 bp) separated by SSC of 10,073 bp. The chloroplast genome of C. niveus was rich in adenine-thymine (AT) content constituting 66.7% and guanine-cytosine (GC) content was 33.3%. As a pivotal indicator of the evolutionary relatedness between distinct species, the GC content of C. niveus was similar to other species of genus Cyperus (Zhu, 2019; Ren et al., 2021; Yang et al., 2021). The GC content of three regions of chloroplast genome varies; the IR regions contain a high GC content (37.3%), compared to LSC (30.9%) and SSC (25.5%). One possible reason of higher GC content in IR regions can be the presence of rRNA and tRNA genes in these regions (Guo et al., 2018; Biju et al., 2019). Comparatively, the GC content of C. niveus was similar to C. rotundus while other species of Cyperus had lower GC content (Zhu, 2019; Ren et al., 2021; Yang et al., 2021). The detailed genomic features of C. niveus have been provided in Table 2. The complete annotated circular chloroplast genome of C. niveus is provided in Fig. 1.

Genomic features of Cyperusniveus.

Fig. 1.

Circular representation of the complete chloroplast genome of Cyperusniveus. Genes located within the circular structure undergo transcription in a clockwise direction, while those positioned outside the circle experience counterclockwise transcription. Genes possessing diverse functions are assigned distinct color codes. The inner region of the circle is shaded in a deeper gray to represent the GC content, while the lighter gray hue indicates the AT content. The circular illustration designates specific segments as LSC (large single copy), SSC (small single copy), and features inverted repeats known as IRA and IRB.

The chloroplast genome contained 140 unique genes, including 83 protein-coding genes, 39 tRNA genes, and 8 rRNA genes. Among these genes, 26 protein-coding genes, 8 rRNA genes, and 16 tRNA genes were present and duplicated in the IR regions, whereas 52 protein-coding genes and 21 tRNA genes were present in LSC region, and 5 protein-coding genes and 2 tRNA genes were present in SSC region. Furthermore, in chloroplast genes, 17 protein-coding genes and 8 tRNA genes containing introns were identified. Among these introns containing genes, clpP1 and pafl contained two introns while all other genes contain one intron. Seven introns containing genes (ndhA, ndhB, rpl2, rpl23, rps12, trnA-UGC, and trnl-GAU) were present in IR region and thus were duplicated. The other 11 introns containing genes were present in LSC and no intron containing gene were present in SSC. In these intron containing genes rps12 wasidentified as transsplicing gene as its 5′-end existed in LSC region as single copy while 3′-end existed in IR region as duplicate. The complete details of genes with their functions are given in Table 3.

Detail of genes, their types and function in chloroplast genome of Cyperusniveus.

Codon usage, amino acid frequency, and RNA editing sites

Deviation in codon usage holds significant importance in the modulation of gene expression and cellular processes. This phenomenon not only serves as a critical regulatory mechanism but also offers an extra avenue for investigating speciation and evolutionary dynamics at the molecular scale (Plotkin and Kudla, 2011). Codon usage analyses revealed that protein-coding sequence in C. niveus codes for a total of 20,997 codons. Of these codons, ATT showed maximum frequency (970) that codes for isoleucine while TGA showed minimum frequency which is a stop codon (Online Supplementary Material S1). The amino acid frequency analyses showed the abundance of leucine (10.90%) and isoleucine (9.20%) while cysteine frequency (1.10%) was in least. These observations were found consistent with other species of angiosperms (Biju et al., 2019). The PREPACT successfully anticipated 171 post-transcriptional RNA editing alterations within 83 protein-coding genes of C. niveus. Among these modifications, the matK and rpoB genes showed the highest RNA editing site predictions, with 15 sites each. Additionally, the ndhF gene displayed 13 predicted RNA editing sites, while ccsA, rpoC2, and ycf2 genes exhibited 9 RNA editing sites each. Moreover, the ndhD gene contained six predicted RNA editing sites. Notably, all these modifications occurred at the initial and secondary nucleotides of codons. The conversion rate at the second nucleotide was 1.5 times greater compared to the first nucleotide. The maximum number of amino acid shift was threonine to isoleucine, and leucine to phenylalanine. The detailed events of RNA editing sites are given in Online Supplementary Material S2.

SSR and oligonucleotide repeats

Dispersed repeats sequences are recognized for their pivotal role in driving chloroplast genome rearrangement and recombination processes (Weng et al., 2014; Wang et al., 2018). Furthermore, long repetitive sequences have proven to be valuable markers, extensively employed in the realms of plant evolution, comparative genomics, and phylogenetics research (Park et al., 2017). Perl script MIcroSAtellite identification tools (MISA) detected 146 total SSR and 28 compound SSR in chloroplast genome of C. niveus. The distribution of SSR was as follows; 93 in LSC, 21 in IR, and 9 in SSC. All compound SSR were distributed in LSC. These SSR include 90 mononucleotides (61.64%), 21 dinucleotides (14.38%), 11 trinucleotides (7.53%), 20 tetranucleotides (13.70%), 4 pentanucleotides (2.74%), and no hexanucleotides. Mononucleotides were made of 10–24 repeat units, dinucleotides were made of 5–13 repeat units, trinucleotides were made of 4–5 repeat units, tetranucleotides andpentanucleotides were made of three repeat units. In the case of complementary most of repeats belongs to A/T types (90), while AATCT/AGATT (1) were least (Online Supplementary Material S3). In closely related species C.esculentus 396 SSRs were identified and these SSRs encompassed 339 mononucleotides (constituting 85.6%), 22 dinucleotide (making up 5.56%), 10 trinucleotides (comprising 2.53%), 23 tetranucleotides (accounting for 5.81%), two pentanucleotide repeats (representing 0.51%), with no hexanucleotides detected (Ren et al., 2021). The SSRs pattern in C. niveus and C. esculentus were consistent with previous studies (Biju et al., 2019).

Two types of oligonucleotide repeats were found in chloroplast genome of C. niveus, which include 28 forward repeats and 21 palindromic repeats, whereas no inverse or complementary repeats were found. Most of repeats (91.83%) varied from 82–182 bp in length, and only four repeats were above 700 bp in length. In 49 oligonucleotide repeats, 13 repeats were only found in IR regions, five repeats were found only in LSC region, and one repeat wasfound only in SSC region. All other 24 oligonucleotide repeats were found in more than one region, either both in LSC and IR regions or both in SSC and IR regions. Furthermore, six forward oligonucleotide repeats were found both in IRA and IRB, and thus were duplicated. Among those 49 oligonucleotides repeats most (81.63%) were in intergenic spaces. In protein-coding genes, 6 oligonucleotide repeats were found (3 palindromic, 3 forward) in rpoB, while one forward repeat in each of following three genes; rpl32, rpoC2, and rps12 (Online Supplementary Material S4). In a preceding investigation, two categories of extended repeats were pinpointed within the chloroplast genome of C. esculentus. This encompassed 15 forward repeats and 34 palindromic repeats, with no detection of inverse repeats or complementary repeats. The majority of these repeats (93.8%) ranged from 64 to 385 base pairs in length, with only three exceeding 600 base pairs (Ren et al., 2021).

Substitution and InDel analysis

The different regions of chloroplast genome of C. niveus were pairwise aligned with C. iria taking it as reference. The 2,000 substitution was observed in complete chloroplast genome, 841 in LSC, 146 in SSC, and 1,013 in IR. A total of 290 InDel events were recorded; 183 in LSC, 25 in SSC, and 82 in IR (Table 4). Prior research findings indicate a greater prevalence of InDels and substitutions events within the LSC and SSC regions in comparison to the IR regions (Ahmed et al., 2012; Abdullah et al., 2019; Liu et al., 2019; Shahzadi et al., 2020). In the current study, InDels events were higher in LSC but substitutions events were more in IR as compared to LSC and SSC.

Type and number of substitution and InDel events in chloroplast genome of Cyperusniveus.

Out of these 2,000 substitutions 962 were transition and 1,038 were transversion type. The transition to transversion ratio (Ts/Tv) was 0.92 of complete chloroplast, while 0.95 in LSC, 0.92 in SSC, and 0.90 in IR. The current study’s Ts/Tv ratio wasconsistent with previous studiesin which authors reported Ts/Tv<1 (Ahmed et al., 2012; Liu et al., 2019; Abdullahet al., 2019).

Comparative analysis in genus

The chloroplast genome of C. niveus was compared with nine other Cyperus species whose chloroplast genome was already assembled. C. difformis has the largest genome size (187,808 bp), while C. rotundus has the smallest genome (182,986 bp). In the case of LSC individual size, C. glomeratus has the largest size (104,040 bp), and C. niveus has the smallest size (99,339 bp). The SSC and IRs have almost similar sizes in all species (Online Supplementary Material S5). The expansion and contraction of the IRs at their boundaries play crucial roles in the evolutionary processes, leading to variations in the length of plant chloroplast genomes (Kim and Lee, 2004). A thorough comparison was performed between the junctions of the IRs and two distinct single copy regions in the chloroplast genomes of four Cyperus species. This analysis encompassed regions such as LSC/IRA (JLA), LSC/IRB (JLB), SSC/IRA (JSA), and SSC/IRB (JSB), alongside the rearrangement of neighboring genes. Six genes (rps3 rps8, rpl14, rpl16, clpP1, and trnH) were present at junction of LSC and IRs. In C. niveus, clpP1 was entirely located in LSC region 124 bp closer to JLB, ndhA in IRB region 25 bp away from JSB, ndhl entirely in SSC, ndhF at JSA 926 bp in SSC and 1,279 bp in IRA, rps3 entirely in IRB, trnH in LSC 31 bp toward JLA. In C. exaltatus rps8, rpl14 was entirely in LSC region, rpl16 at the JLB 551 bp in LSC region and 905 bp in IRB region, ndhG in IRB region 97 bp toward JSB, ndhF entirely in SSC, ndhG in IRA 96 bp away from JSA, ndhl entirely in IRA, trnH in LSC 31 bp toward JLA. In C. difformis and C. aromaticus rps8, rpl14 both were in LSC, while rpl16 also in LSC but in C. difformis it was 124 toward JLB and 140 bp toward JLB in C. aromaticus. In both species ndhG were in IRB 87 bp toward JSB, ndhG in IRA 86 bp away from JSA, ndhl entirely in IRA. In C. difformis ndhF was entirely in SSC and trnH in LSC 31 bp toward JLA. In C. aromaticus psaC was entirely in SSC and while trnH was in LSC 29 bp toward JLA (Fig. 2). In genus Cyperus, variation was observed in genes presence and absence. In total 19 genes showed variation. In C. niveus, five genes were found unique as compared to other 10 Cyperus species. These include clpP1 gene, pafI gene, pafII gene, pbf1 gene, and rpl23 gene. The IR and SSC region were more divergent than LSC. The nucleotide variability of IR and SSC was higher (Pi > 0.3) than set threshold (Pi ≥ 0.2) The regions which showed greater divergence include tRNA (trnN-GUG, trnT-UGU, and trn-GUU) and genes (ccsA, ndhD, ndhE, ndhF, ndhG, and psac) (Fig. 3).

Fig. 2.

Comparative analysis of contraction and expansion of regions in Cyperus species. Colored boxes were employed above or below a central line to indicate adjacent border genes. The extent between these genes and the respective boundaries was quantified in terms of base lengths (bp). JSA (SSC/IRa), JSB (IRb/SSC), JLA (IRa/LSC), and JLB (IRb/LSC) serve as indicators of the specific sites where two corresponding regions of a genome meet and connect.LSC, large single copy; SSC, small single copy; IR, inverted repeats.

Fig. 3.

The sliding window analysis of complete chloroplast genome of Cyperusniveus. X-axis denotes the position of nucleotide and Y-axis denotes nucleotide variability (Pi) with threshold set to 0.2. The colored line below X-axis denotes different regions of chloroplast genome.

Phylogenetic relationship in family Cyperaceae

Complete chloroplast sequences hold significant value in unraveling phylogenetic relationships, particularly among closely related taxa. This is particularly relevant in cases involving recent divergence, rapid speciation, or slow genome evolution, where the extent of sequence variation may be limited (Daniell et al., 2016; Williams et al., 2016; Tonti-Filippini et al., 2017). Phylogenetic trees were constructed between C. niveus and 27 other species of Cyperaceae and 1 of Poaceae, Brachypodium distachyon as outgroup. The Maximum Likelihood tree showed that all species of the genus Cyperus are phylogenetically conserved as compared to other genera. The genus Carex showed maximum divergence which means that they are evolving rapidly and moving evolutionarily away from each other. The C. niveus in the family tree showed close resemblance with C. iria and C. exaltatus (Fig. 4). In this study, Cyperus emerged as single monophyletic group who has C4 photosynthetic cycle due to xeric conditions. A study was carried out to define phylogenetic relationship of giant genus Cyperuson the basis of nuclear ribosomal DNA (ETS1f) and plastid DNA (rpl32-trnL, trnH-psbA) reported that giant genus Cyperus consists of paraphyletic C3 genera and monophyletic C4 genera. The monophyletic C4 genera have nine nested segregate genera (Alinula, Ascolepis, Lipocarpha, Kyllinga, Pycreus, Queenslandiella, Remirea, Sphaerocyperus, and Voikiella) and they suggest these segregate genera should be incorporated as part of broad C4 genus Cyperus (Larridon et al., 2013). Our results wereconsistent with study as Cyperus emerged as single C4 monophyletic group. These findings were also congruent with the positions of other Cyperus species (Ren et al., 2021). Overall, these results are also consistent with classification of Cyperaceae as monophyletic, divided into two clades Mapanioideae and Cyperoideae on the basis of plastid rbcL and trnL-F (Muasya et al., 2009).

Fig. 4.

Maximum Likelihood (ML) tree of family Cyperaceae. The tree was constructed between 28 Cyperaceae and 1 Poaceae species Brachypodium distachyon (outgroup) using IQ-TREE at ultrafast bootstrap value of 1,000. Branch labels denote bootstrap confidence and colored identifier denotes different genus.

ONLINE SUPPLEMENTARY MATERIALS

S1–S5 are available at https://doi.org/10.11110/kjpt.2024.54.2.99

S1. Codon usage in chloroplast genome of Cyperus niveus.

S2. Amino acids frequency in chloroplast genome of Cyperus niveus.

kjpt-54-2-99-Supplementary-Table-1,2.pdf

S3. Simple sequence repeats in chloroplast genome of Cyperus niveus.

kjpt-54-2-99-Supplementary-Table-3.pdf

S4. Oligonucleotide repeats in chloroplast genome of Cyperus niveus.

kjpt-54-2-99-Supplementary-Table-4.pdf

S5. Genomic features comparison in genus Cyperus.

kjpt-54-2-99-Supplementary-Table-5.pdf

Acknowledgements

Plant Systematics and Evolution Laboratory, Department of Plant Biotechnology, Atta-ur-Rahman School of Applied Biosciences, National University of Sciences and Technology, Islamabad, and National Research Foundation of Korea (NRF) are acknowledged for providing funds, support, and all necessary lab work facilities for this research.

Notes

CONFLICTS OF INTEREST

The author declare that there are no conflicts of interest.

References

Abdulla F., Mehmood I., Shahzadi S., Waseem B., Mirza I. Ahmed, Waheed M. T.. 2020;Chloroplast genome of Hibiscus rosa-sinensis (Malvaceae): Comparative analyses and identification of mutational hotspots. Genomics 112:581–591.
Abdullah I., Shahzadi F., Mehmood Z., Ali M. S., Malik S., Waseem B., Mirza I. Ahmed, Waheed M. T.. 2019;Comparative analyses of chloroplast genomes among three Firmiana species: Identification of mutational hotspots and phylogenetic relationship with other species of Malvaceae. Plant Gene 19:100199.
Agarwal V. S., Ghosh B.. 1989. Drug Plants of India Kalyani Publishers. New Delhi: p. 330.
Ahmad S. S.. 2007;Medicinal wild plants from Lahore-Islamabad motorway (m-2). Pakistan Journal of Botany 39:355–375.
Ahmed I., Biggs P. J., Matthews P. J., Collins L. J., Hendy M. D., Lockhart P. J.. 2012;Mutational dynamics of aroid chloroplast genomes. Genome Biology and Evolution 4:1316–1323.
Aleem A., Janbaz K. H.. 2018;Dual mechanisms of anti-muscarinic and Ca++ antagonistic activities to validate the folkloric uses of Cyperus niveus Retz. as antispasmodic and antidiarrheal. Journal of Ethnopharmacology 213:138–148.
Amiryousefi A., Hyvönen J., Poczai P.. 2018;Irscope: An online program to visualize the junction sites of chloroplast genomes. Bioinformatics 34:3030–3031.
Amjad M. S., Arshad M., Qureshi R.. 2015;Ethnobotanical inventory and folk uses of indigenous plants from Pir Nasoora National Park, Azad Jammu and Kashmir. Asian Pacific Journal of Tropical Biomedicine 5:234–241.
Amjad M. S., Arshad M., Hussain M. Z.. 2013;An overview of human-plant interaction in Nikyal rangeland district Kotli Azad Jammu and Kashmir. African Journal of Plant Science 7:571–576.
Ball P. W., Reznicek A.. 2002. Carex L. Flora of North America, North of Mexico 23 In : Flora of North America Editorial Committee, ed. Oxford University Press. New York: p. 254–273.
Bano S., Khan S. M., Alam J., Alqarawi A. A., Abd_Allah E. F., Ahmad Z., Rahman I. U., Ahmad H., Aldubise A., Hashem A.. 2018;Eco-floristic studies of native plants of the Beer Hills along the Indus River in the districts Haripur and Abbottabad, Pakistan. Saudi Journal of Biological Sciences 25:801–810.
Beier S., Thiel T., Münch T., Scholz U., Mascher M.. 2017;Misa-web: A web server for microsatellite prediction. Bioinformatics 33:2583–2585.
Biju V. C., S P. R, Vijayan S., Rajan V. S., Sasi A., Janardhanan A., Nair A. S.. 2019;The complete chloroplast genome of Trichopus zeylanicus, and phylogenetic analysis with. Dioscoreales Plant Genome 12:190032.
Bock R.. 2007;Plastid biotechnology: Prospects for herbicide and insect resistance, metabolic engineering and molecular farming. Current Opinion in Biotechnology 18:100–106.
Daniell H., Lin C.-S., Yu M., Chang W.-J.. 2016;Chloroplast genomes: Diversity, evolution, and applications in genetic engineering. Genome Biology 17:134.
Duchene D., Bromham L.. 2013;Rates of molecular evolution and diversification in plants: Chloroplast substitution rates correlate with species-richness in the Proteaceae. BMC Evolutionary Biology 13:65.
Greiner S., Lehwark P., Bock R.. 2019;OrganellarGenome-DRAW (OGDRAW) version 1.3. 1: Expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Research 47:W59–W64.
Guo S., Guo L., Zhao W., Xu J., Li Y., Zhang X., Shen X., Wu M., Hou X.. 2018;Complete chloroplast genome sequence and phylogenetic analysis of Paeonia ostii . Molecules 23:246.
Hoang D. T., Chernomor O., von Haeseler A., Minh B. Q., Vinh L. S.. 2018;UFBoot2: Improving the ultrafast bootstrap approximation. Molecular Biology and Evolution 35:518–522.
Huang H., Shi C., Liu Y., Mao S.-Y., Gao L.-Z.. 2014;Thirteen Camelliachloroplast genome sequences determined by high-throughput sequencing: Genome structure and phylogenetic relationships. BMC Evolutionary Biology 14:151.
Katoh K., Standley D. M.. 2013;Mafft multiple sequence alignment software version 7: Improvements in performance and usability. Molecular Biology and Evolution 30:772–780.
Kearse M., Moir R., Wilson A., Stones-Havas S., Cheung M., Sturrock S., Buxton S., Cooper A., Markowitz S., Duran C., Thierer T., Ashton B., Meintjes P., Drummond A.. 2012;Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647–1649.
Kim K.-J., Lee H.-L.. 2004;Complete chloroplast genome sequences from Korean ginseng (Panax schinseng Nees) and comparative analysis of sequence evolution among 17 vascular plants. DNA Research 11:247–261.
Kurtz S., Choudhuri J. V., Ohlebusch E., Schleiermacher C., Stoye J., Giegerich R.. 2001;Reputer: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Research 29:4633–4642.
Larridon I., Bauters K., Reynders M., Huygh W., Muasya A. M., Simpson D. A., Goetghebeur P.. 2013;Towards a new classification of the giant paraphyletic genus Cyperus (cyperaceae): Phylogenetic relationships and generic delimitation in C4 Cyperus . Botanical Journal of the Linnean Society 172:106–126.
Lee H.-L., Jansen R. K., Chumley T. W., Kim K.-J.. 2007;Gene relocations within chloroplast genomes of Jasminum and Menodora (Oleaceae) are due to multiple, overlapping inversions. Molecular Biology and Evolution 24:1161–1180.
Lenz H., Knoop V.. 2013;PREPACT2.0: Predicting C-to-U and U-to-CRNA editing in organelle genome sequences with multiple references and curated rna editing annotation. Bioinformatics and Biology Insights 7:1–19.
Li H., Durbin R.. 2010;Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics 26:589–595.
Liu E., Yang C., Liu J., Jin S., Harijati N., Hu Z., Diao Y., Zhao L.. 2019;Comparative analysis of complete chloroplast genome sequences of four major Amorphophallus species. Scientific Reports 9:809.
Maliga P.. 2004;Plastid transformation in higher plants. Annual Reviewof Plant Biology 55:289–313.
Marwat S. K., Khan M. A.. 2008;Biodiversity of Sedges in Dera Ismail Khan district, NWFPPakistan. Sarhad Journal of Agriculture 24:293–303.
Muasya AM., Simpson D. A., Verboom G. A., Goetghebeur P., Naczi R. F. C., Chase M. W., Smets E.. 2009;Phylogeny of Cyperaceae based on DNA sequence data: Current progress and future prospects. Botanical Review 75:2–21.
Neuhaus H. E., Emes M. J.. 2000;Nonphotosynthetic metabolism in plastids. Annual Review of Plant Physiology and Plant Molecular Biology 51:111–140.
Nguyen L.-T., Schmidt H. A., von Haeseler A., Minh B. Q.. 2014;IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolugion 32:268–274.
Park I., Yang S., Choi G., Kim W. J., Moon B. C.. 2017;The complete chloroplast genome sequences of Aconitum pseudolaeve and Aconitum longecassidatum, and development of molecular markers for distinguishing species in the Aconitum subgenus Lycoctonum . Molecules 222012;
Parvez I., Ahmad I., Hameed M., Ahmad M. S. A.. 2022;Plasticity in anatomical traits of Cyperus niveus Retz. along elevational gradient of dry sub-Himalayan mountains. Arid Land Research and Management 37:197–221.
Plotkin J. B., Kudla G.. 2011;Synonymous but not the same: The causes and consequences of codon bias. Nature Reviews Genetics 12:32–42.
Raven J. A., Allen J. F.. 2003;Genomics and chloroplast evolution: What did cyanobacteria do for plants? Genome Biology 4:209.
Ren W., Guo D., Xing G., Yang C., Zhang Y., Yang J., Niu L., Zhong X., Zhao Q., Cui Y., Zhao Y., Yang X.. 2021;Complete chloroplast genome sequence and comparative and phylogenetic analyses of the cultivated Cyperus esculentus . Diversity 13:405.
Rodríguez-Ezpeleta N., Brinkmann H., Burey S. C., Roure B., Burger G., Löffelhardt W., Bohnert H. J., Philippe H., Lang B. F.. 2005;Monophyly of primary photosynthetic eukaryotes: Green plants, red algae, and glaucophytes. Current Biology 15:1325–1330.
Rozas J., Ferrer-Mata A., Sánchez-DelBarrio J. C., Guirao-Rico S., Librado P., Ramos-Onsins S. E., Sánchez-Gracia A.. 2017;DnaSP6: DNA sequence polymorphism analysis of large data sets. Molecular Biology and Evolution 34:3299–3302.
Schattner P., Brooks A. N., Lowe T. M.. 2005;The trRNAscan-SE, snoscan and snoGPSweb servers for the detection of tRNAs and snoRNAs. Nucleic Acids Research 33:W686–W689.
Shahzadi I., Abdullah F., Mehmood Z., Ali I. Ahmed, Mirza B.. 2020;Chloroplast genome sequences of Artemisia maritima and Artemisia absinthium: Comparative analyses, mutational hotspots in genus Artemisia and phylogeny in family Asteraceae. Genomics 112:1454–1463.
Smith D. R.. 2015;Mutation rates in plastid genomes: They are lower than you might think. Genome Biology and Evolution 7:1227–1234.
Steemers F. J., Gunderson K. L. 2005;Illumina, inc. Pharmacogenomics 6:777–782.
Tamura K., Stecher G., Kumar S.. 2021;Mega11: Molecular evolutionary genetics analysis version 11. Molecular Biology and Evolution 38:3022–3027.
Tariq A., Mussarat S., Adnan M., AbdElsalam N. M., Ullah R., Khan A. L.. 2014;Ethnoveterinary study of medicinal plants in a tribal society of Sulaiman range. ScientificWorld-Journal 2014:127526.
Tillich M., Lehwark P., Pellizzer T., Ulbricht-Jones E. S., Fischer A., Bock R., Greiner S.. 2017;Geseq: Versatile and accurate annotation of organelle genomes. Nucleic Acids Research 45:W6–W11.
Tonti-Filippini J., Nevill P. G., Dixon K., Small I.. 2017;What can we do with 1000 plastid genomes? Plant Journal 90:808–818.
Trifinopoulos J., Nguyen L.-T., von Haeseler A., Minh B. Q.. 2016;W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Research 44:W232–W235.
Walker J. F., Zanis M. J., Emery N. C.. 2014;Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae). American Journal of Botany 101:722–729.
Wang X., Zhou T., Bai G., Zhao Y.. 2018;Complete chloroplast genome sequence of Fagopyrum dibotrys: Genome features, comparative analysis and phylogenetic relationships. Scientific Reports 8:12379.
Weng M.-L., Blazier J. C., Govindu M., Jansen R. K.. 2014;Reconstruction of the ancestral plastid genome in Geraniaceae reveals a correlation between genome rearrangements, repeats, and nucleotide substitution rates. Molecular Biology and Evolution 31:645–659.
Wicke S., Schneeweiss G. M., dePamphilis C. W., Müller K. F., Quandt D.. 2011;The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Molecular Biology 76:273–297.
Williams A. V., Miller J. T., Small I., Nevill P. G., Boykin L. M.. 2016;Integration of complete chloroplast genome sequences with small amplicon datasets improves phylogenetic resolution in Acacia . Molecular Phylogenetics and Evolution 96:1–8.
Yan C., Du J., Gao L., Li Y., Hou X.. 2019;The complete chloroplast genome sequence of watercress (Nasturtium officinale R. Br.): Genome organization, adaptive evolution and phylogenetic relationships in Cardamineae. Gene 699:24–36.
Yang L.-L., Niu J.-Q., Tang W.-W.. 2021;The complete chloroplast genome of pioneering plant Cyperus iriaL.(Cyperaceae) in ecological restoration. Mitochondrial DNA Part B Resources 6:1335–1336.
Zafar M., Ahmad M., Khan M. A., Sultana S., Jan G., Ahmad F., Jabeen A., Shah G. M., Shaheen S., Shah A., Nazir A., Marwat S. K.. 2011;Chemotaxonomic clarification of pharmaceutically important species of Cyperus L. African Journal of Pharmacy and Pharmacology 5:67–75.
Zerbino D. R., Birney E.. 2008;Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Research1 8:821–829.
Zhu Z.. 2019;The complete chloroplast genome of pioneering plant Cyperus difformis (Cyperaceae) in ecological restoration. Mitochondrial DNA Part B Resources 4:1988–1989.

Article information Continued

Fig. 1.

Circular representation of the complete chloroplast genome of Cyperusniveus. Genes located within the circular structure undergo transcription in a clockwise direction, while those positioned outside the circle experience counterclockwise transcription. Genes possessing diverse functions are assigned distinct color codes. The inner region of the circle is shaded in a deeper gray to represent the GC content, while the lighter gray hue indicates the AT content. The circular illustration designates specific segments as LSC (large single copy), SSC (small single copy), and features inverted repeats known as IRA and IRB.

Fig. 2.

Comparative analysis of contraction and expansion of regions in Cyperus species. Colored boxes were employed above or below a central line to indicate adjacent border genes. The extent between these genes and the respective boundaries was quantified in terms of base lengths (bp). JSA (SSC/IRa), JSB (IRb/SSC), JLA (IRa/LSC), and JLB (IRb/LSC) serve as indicators of the specific sites where two corresponding regions of a genome meet and connect.LSC, large single copy; SSC, small single copy; IR, inverted repeats.

Fig. 3.

The sliding window analysis of complete chloroplast genome of Cyperusniveus. X-axis denotes the position of nucleotide and Y-axis denotes nucleotide variability (Pi) with threshold set to 0.2. The colored line below X-axis denotes different regions of chloroplast genome.

Fig. 4.

Maximum Likelihood (ML) tree of family Cyperaceae. The tree was constructed between 28 Cyperaceae and 1 Poaceae species Brachypodium distachyon (outgroup) using IQ-TREE at ultrafast bootstrap value of 1,000. Branch labels denote bootstrap confidence and colored identifier denotes different genus.

Table 1.

List of Cyperaceae species along with accession number.

Sr. No. Species name Accession No.
1 Carex agglomerate MT795185
2 Carex alatauensis OL674104
3 Carex capillifolia OL674105
4 Carex gibba OK539703
5 Carex kokanica OL674106
6 Carex laevissima MZ846224
7 Carex lithophila ON920464
8 Carex littledalei ON920463
9 Carex microglochin OL674107
10 Carex myosuroides MZ962720
11 Carex neurocarpa KU238086
12 Carex sargentiana OL674108
13 Carex siderosticta ON920465
14 Cyperus aromaticus MT937178
15 Cyperus difformis OK539707
16 Cyperus exaltatus MW123055
17 Cyperus flavidus OK539740
18 Cyperus glomeratus OK539708
19 Cyperus iria MW123056
20 Cyperus michelianus OK539709
21 Cyperus niveus OR418070
22 Cyperus rotundus MT937176
23 Cyperus simaoensis OK539710
24 Eleocharis dulcis MN967018
25 Gahnia tristis OK539723
26 Hypolytrum nemorum KU207098
27 Isolepis setacea OK539727
28 Scleria parvula OK539742
29 Brachypodium distachyon (Outgroup) NC_011032

Table 2.

Genomic features of Cyperusniveus.

Characteristics Cyperusniveus
Size (bp) 184,454
LSC length (bp) 99,339
SSC length (bp) 10,073
IR length (bp) 38,021
Total No.of genes 140
Protein-coding genes 83
tRNA genes 39
rRNA genes 8
Duplicate genes 50
GC%
  Total 33.3
  LSC 30.9
  SSC 25.5
  IR 37.3

LSC, large single copy; SSC, small single copy; IR, inverted repeats.

Table 3.

Detail of genes, their types and function in chloroplast genome of Cyperusniveus.

Category Function Genes
Photosynthesis ATP synthase atpA, atpB, atpE, atpF*, atpH, atpI
Calvin cycle rbcL
Cytochrome complex petA, petB, petD, petG, petL, petN
NADH dehydrogenase ndhA*, ndhB*, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhK
Photosystem I pafI**, psaA, psaB, psaC, psaI, psaJ
Photosystem II pafII, psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbT, psbZ
RNA genes Ribosomal RNA rrn16, rrn23, rrn4.5, rrn5, rrn5
Transfer RNA trnA-UGC*, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnG-GCC*, trnG-UCC, trnH-GUG, trnI-GAU*, trnK-UUU*, trnL-CAA, trnL-UAA*, trnL-UAG, trnM-CAU, trnN-GUU, trnP-UGG, trnQ-UUG, trnR-ACG, trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-UGU, trnV-GAC, trnV-UAC*, trnW-CCA, trnY-GUA
Transcription and translation related genes Transcription and splicing pbf1, rpoA, rpoB, rpoC1*, rpoC2
Translation, ribosomal proteins rpl2*, rpl20, rpl22, rpl23*, rpl32, rpl33, rpl36, rps11, rps12*, rps14, rps15, rps16*, rps18, rps19, rps2, rps3, rps4, rps7, rps8
Other genes Conserved reading frames accD, ccsA, cemA, clpP1**, infA, matK, ycf2
*

indicates genes containing one intron,

**

indicates genes containing two introns.

Table 4.

Type and number of substitution and InDel events in chloroplast genome of Cyperusniveus.

Type LSC SSC IR
A/G 195 33 225
C/T 216 37 256
A/T 151 27 227
A/C 112 20 113
G/T 103 20 132
C/G 64 9 60
Total 841 146 1,013
No. of InDels 183 25 82
Average size 8.230 43.360 7.707

LSC, large single copy; SSC, small single copy; IR, inverted repeats.