Comparative analyses of 3654 chloroplast genomes unraveled new insights into the evolutionary mechanism of green plants
Background: Chloroplast are believed to arise from a cyanobacterium through endosymbiosis and they played vital roles in photosynthesis, oxygen release and metabolites synthesis for the plant. With the advent of next-generation sequencing technologies, until December 2018, about 3,654 complete chloroplast genome sequences have been made available. It is possible to compare the chloroplast genome structure to elucidate the evolutionary history of the green plants. Results: We compared the 3654 chloroplast genomes of the green plants and found extreme conservation of gene orders and gene blocks in the green plant such as ATP synthase cluster, Phytosystem, Cytochrome cluster, and Ribosomal cluster. For the chloroplast-based phylogenomics, we used three different data sets to recover the relationships within green plants which accounted for biased GC content and could mitigate the bias in molecular data sets by increasing taxon sampling. The main topology results include: I) Chlorokybales + Mesostigmatales as the earliest-branching lineage and a clade comprising Zygnematales+ Desmidiales formed a grade as the sister group to the land plants, II) Based on matrix AA data, Bryophytes was strongly supported as monophyletic but for matrix nt123 data, hornworts, mosses and liverworts were placed as successive sister lineages of Tracheophytes with strong support, III) Magnoliids were placed in the outside of Monocots using the matrix nt123 data and the matrix AA data, IV) Ceratophyllales + Chloranthales as sister to the Eudicots using matrix nt123 data, but when using matrix nt12 data and AA data, only Ceratophyllales sister to the Eudicots. Conclusion: We present the first of its kind large scale comparative analyses of the chloroplast coding gene constitution for 3654 green plants. Some important genes likely showed co-occurrence and formed gene cluster and gene blocks in Streptophyta. We found a clear expansion of IRs (Inverted Repeats) among seed plants. The comprehensive taxon sampling and different data sets recovered a strong relationship for green plants. Keywords: Chloroplast genome; Phylogenetics; Evolution; Viridiplantae; Inverted Repeats; Gene expansion
- Downloaded 639 times
- Download rankings, all-time:
- Site-wide: 34,744
- In plant biology: 740
- Year to date:
- Site-wide: 14,155
- Since beginning of last month:
- Site-wide: 46,082
Downloads over time
Distribution of downloads per paper, site-wide
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!