National Repository of Grey Literature 1 records found  Search took 0.01 seconds. 
Tree of life in a gappy genomic era
Martínková, Natália
Increasing volume of publicly available DNA sequence data enables comprehensive studies that address integrative questions. For these projects, bioinformatic analysis requires advanced methods and computational infrestructure. I present the character of DNA sequence matrices for multilocus datasets, which contain large portions of missing data. A condition critical for analysis of multilocus data is that datasets for all loci or genes need to have partially overlapping taxon sets. The work-flow for analysing such data differs between supermatrix and supertree estimation of species trees. In the supermatrix approach, aligned sequences for all genes are concatenated and the species tree is estimated directly from a partitioned matrix. In the supertree approach, gene sequence alignments are used for inference of gene trees. Those are then combined into a species supertree. Smaller projects could benefit from utilising all available information in the supermatrix. Larger projects should rely on supertree methods for computational optimisation.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.