FALCON-meta: a method to infer metagenomic composition of ancient DNA

D. Pratas, A.J. Pinho, Raquel M. Silva, J.M.O.S. Rodrigues, M. Hosseini, T. Caetano, P.J.S.G. Ferreira

Research output: Working paperPreprint

Abstract

The general approaches to detect and quantify metagenomic sample composition are based on the alignment of the reads, according to an existing database containing reference microbial sequences. However, without proper parameterization, these methods are not suitable for ancient DNA. Quantifying somewhat dissimilar sequences by alignment methods is problematic, due to the need of fine-tuned thresholds, considering relaxed edit distances and the consequent increase of computational cost. Additionally, the choice of the thresholds poses the problem of how to quantify similarity without producing overestimated measures. We propose FALCON-meta, a compression-based method to infer metagenomic composition of next-generation sequencing samples. This unsupervised alignment-free method runs efficiently on FASTQ samples. FALCON-meta quickly learns how to give importance to the models that cooperate to predict similarity, incorporating parallelism and flexibility for multiple hardware characteristics. It shows substantial identification capabilities in ancient DNA without overestimation. In one of the examples, we found and authenticated an ancient Pseudomonas bacteria in a Mammoth mitogenome. The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
Original languageEnglish
DOIs
Publication statusPublished - 18 Feb 2018
Externally publishedYes

Fingerprint

Dive into the research topics of 'FALCON-meta: a method to infer metagenomic composition of ancient DNA'. Together they form a unique fingerprint.

Cite this