Phylogenomics from low‐coverage whole‐genome sequencing

Publication Type:Journal Article
Year of Publication:2019
Authors:F. Zhang, Ding, Y., Zhu, C. ‐D., Zhou, X., Orr, M. C., Scheu, S., Luan, Y. ‐X.
Journal:Methods in Ecology and Evolution
Volume:10
Issue:4
Pagination:507 - 517
Date Published:04-2019
ISSN:2041-210X
Keywords:desktop PC, genome assembly, hybrid enrichment, single-copy orthologs, ultraconserved elements
Abstract:

1.Phylogenetic studies are increasingly reliant on next‐generation sequencing (NGS). Transcriptomic and hybrid‐enrichment sequencing techniques remain the most prevalent methods for phylogenomic data collection due to their relatively low demands for computing powers and sequencing prices, compared to whole genome shotgun sequencing (WGS). However, the transcriptome‐based method is constrained by the availability of fresh materials and hybrid enrichment is limited by genomic resources necessary in probe designs, especially for non‐model organisms.

2. We present a novel WGS‐based pipeline for extracting essential phylogenomic markers through rapid de novo genome assembling from low‐coverage genome data, employing a series of computationally efficient bioinformatic tools. We tested the pipeline on a Hexapoda dataset and a more focused Phthiraptera dataset (genome sizes 0.1–2 Gbp), and further investigated the effects of sequencing depth on target assembly success rate based on raw data of six insect genomes (0.1–1 Gbp).

3. Each genome assembly was completed in 2–24 hours on desktop PCs. We extracted 872–1,615 near‐universal single‐copy orthologs (BUSCOs) per species. This method also enables development of ultraconserved element (UCE) probe sets; we generated probes for Phthiraptera based on our WGS assemblies, containing 55,030 baits targeting 2,832 loci, from which we extracted 2,125–2,272 UCEs. Resulting phylogenetic trees all agreed with currently‐accepted topologies, indicating that markers produced in our methods were valid for phylogenomic studies. We also showed that 10–20× sequencing coverage was sufficient to produce hundreds to thousands of targeted loci from BUSCO sets, and even lower coverage (5×) was required for UCEs.

4. Our study demonstrates the feasibility of conducting phylogenomics from low‐coverage WGS for a wide range of organisms without reference genomes. This new approach has major advantages in data collection, particularly in reducing sequencing cost and computing consumption, while expanding loci choices.

URL:https://besjournals.onlinelibrary.wiley.com/doi/epdf/10.1111/2041-210X.13145
DOI:10.1111/2041-210X.13145
Short Title:Methods Ecol Evol
File attachments: 
Wed, 2021-02-03 15:35 -- Yokb
Scratchpads developed and conceived by (alphabetical): Ed Baker, Katherine Bouton Alice Heaton Dimitris Koureas, Laurence Livermore, Dave Roberts, Simon Rycroft, Ben Scott, Vince Smith