• search hit 6 of 14
Back to Result List

Pipeline Olympics: continuable benchmarking of computational workflows for DNA methylation sequencing data against an experimental gold standard

  • DNA methylation is a widely studied epigenetic mark and a powerful biomarker of cell type, age, environmental exposures, and disease. Whole-genome sequencing following selective conversion of unmethylated cytosines into thymines via bisulfite treatment or enzymatic methods remains the reference method for DNA methylation profiling genome-wide. While numerous software tools facilitate processing of DNA methylation sequencing reads, a comprehensive benchmarking study has been lacking. In this study, we systematically compared complete computational workflows for processing DNA methylation sequencing data using a dedicated benchmarking dataset generated with five whole-genome profiling protocols. As an evaluation reference, we employed accurate locus-specific measurements from our previous benchmark of targeted DNA methylation assays. Based on this experimental gold-standard assessment and multiple performance metrics, we identified workflows that consistently demonstrated superiorDNA methylation is a widely studied epigenetic mark and a powerful biomarker of cell type, age, environmental exposures, and disease. Whole-genome sequencing following selective conversion of unmethylated cytosines into thymines via bisulfite treatment or enzymatic methods remains the reference method for DNA methylation profiling genome-wide. While numerous software tools facilitate processing of DNA methylation sequencing reads, a comprehensive benchmarking study has been lacking. In this study, we systematically compared complete computational workflows for processing DNA methylation sequencing data using a dedicated benchmarking dataset generated with five whole-genome profiling protocols. As an evaluation reference, we employed accurate locus-specific measurements from our previous benchmark of targeted DNA methylation assays. Based on this experimental gold-standard assessment and multiple performance metrics, we identified workflows that consistently demonstrated superior performance and revealed major workflow development trends. To ensure the long-term utility of our benchmark, we implemented an interactive workflow execution and data presentation platform, adaptable to user-defined criteria and readily expandable to future software.show moreshow less

Download full text files

Export metadata

Statistics

Number of document requests

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Yu-Yu Lin, Kersten Breuer, Dieter Weichenhan, Pascal Lafrenz, Antonella Sarnataro, Agata Wilk, Maryna Chepeleva, Oliver Mücke, Maximilian Schönung, Franziska Petermann, Philip Reiner Kensche, Lena Weiser, Frank Thommen, Gideon Giacomelli, Karl Nordstroem, Edahi Gonzalez-Avalos, Angelika Merkel, Helene Kretzmer, Jonas Fischer, Stephen KrämerORCiDGND, Murat Iskar, Stephan Wolf, Ivo Buchhalter, Manel Esteller, Christian Lawerenz, Sven Twardziok, Marc Zapatka, Volker Hovestadt, Matthias SchlesnerORCiDGND, Marcel H. Schulz, Steve Hoffmann, Clarissa Gerhauser, Jörn Walter, Mark Hartmann, Daniel B. Lipka, Yassen Assenov, Christoph Bock, Christoph Plass, Reka Toth, Pavlo Lutsik
URN:urn:nbn:de:bvb:384-opus4-1260451
Frontdoor URLhttps://opus.bibliothek.uni-augsburg.de/opus4/126045
ISSN:0305-1048OPAC
ISSN:1362-4962OPAC
Parent Title (English):Nucleic Acids Research
Publisher:Oxford University Press (OUP)
Place of publication:Oxford
Type:Article
Language:English
Year of first Publication:2025
Publishing Institution:Universität Augsburg
Release Date:2025/10/31
Volume:53
Issue:19
First Page:gkaf970
DOI:https://doi.org/10.1093/nar/gkaf970
Institutes:Fakultät für Angewandte Informatik
Fakultät für Angewandte Informatik / Institut für Informatik
Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Biomedizinische Informatik, Data Mining und Data Analytics
Dewey Decimal Classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
Licence (German):CC-BY 4.0: Creative Commons: Namensnennung