VICTOR: genome-based phylogeny and classification of prokaryotic viruses

  • Motivation Bacterial and archaeal viruses are crucial for global biogeochemical cycles and might well be game-changing therapeutic agents in the fight against multi-resistant pathogens. Nevertheless, it is still unclear how to best use genome sequence data for a fast, universal and accurate taxonomic classification of such viruses. Results We here present a novel in silico framework for phylogeny and classification of prokaryotic viruses, in line with the principles of phylogenetic systematics, and using a large reference dataset of officially classified viruses. The resulting trees revealed a high agreement with the classification. Except for low resolution at the family level, the majority of taxa was well supported as monophyletic. Clusters obtained with distance thresholds chosen for maximizing taxonomic agreement appeared phylogenetically reasonable, too. Analysis of an expanded dataset, containing >4000 genomes from public databases, revealed a large number of novelMotivation Bacterial and archaeal viruses are crucial for global biogeochemical cycles and might well be game-changing therapeutic agents in the fight against multi-resistant pathogens. Nevertheless, it is still unclear how to best use genome sequence data for a fast, universal and accurate taxonomic classification of such viruses. Results We here present a novel in silico framework for phylogeny and classification of prokaryotic viruses, in line with the principles of phylogenetic systematics, and using a large reference dataset of officially classified viruses. The resulting trees revealed a high agreement with the classification. Except for low resolution at the family level, the majority of taxa was well supported as monophyletic. Clusters obtained with distance thresholds chosen for maximizing taxonomic agreement appeared phylogenetically reasonable, too. Analysis of an expanded dataset, containing >4000 genomes from public databases, revealed a large number of novel species, genera, subfamilies and families. Availability and implementation The selected methods are available as the easy-to-use web service ‘VICTOR’ at https://victor.dsmz.de. Supplementary information Supplementary data are available at Bioinformatics onlineshow moreshow less

Download full text files

Export metadata

Statistics

Number of document requests

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Jan P. Meier-KolthoffORCiDGND, Markus Göker
URN:urn:nbn:de:bvb:384-opus4-1067383
Frontdoor URLhttps://opus.bibliothek.uni-augsburg.de/opus4/106738
ISSN:1367-4803OPAC
ISSN:1367-4811OPAC
Parent Title (English):Bioinformatics
Publisher:Oxford University Press (OUP)
Place of publication:Oxford
Type:Article
Language:English
Year of first Publication:2017
Publishing Institution:Universität Augsburg
Release Date:2023/08/14
Tag:Computational Mathematics; Computational Theory and Mathematics; Computer Science Applications; Molecular Biology; Biochemistry; Statistics and Probability
Volume:33
Issue:21
First Page:3396
Last Page:3404
DOI:https://doi.org/10.1093/bioinformatics/btx440
Institutes:Fakultät für Angewandte Informatik
Fakultät für Angewandte Informatik / Institut für Informatik
Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Biomedizinische Informatik, Data Mining und Data Analytics
Dewey Decimal Classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
Licence (German):CC-BY 4.0: Creative Commons: Namensnennung (mit Print on Demand)