Detecting Foldback Artifacts in Long-reads.

bioRxiv : the preprint server for biology
Authors
Keywords
Abstract

Long-read sequencing data is useful for detecting large and complex structural variations; however, technical artifacts can lead to false structural variant calls. In our analyses, we became aware of a foldback artifact in long-read data. Therefore, we developed the open-source Breakinator tool to flag putative foldback artifact reads, as well as previously known chimeric artifacts. Through an alignment-based approach, Breakinator can detect artifacts missed by existing quality control tools. We profiled the occurrences of foldbacks and chimeric reads in both nanopore and single-molecule real-time sequences across a range of specimens, library types, sequencing chemistries, sequencing machines, and base-calling software.

Year of Publication
2025
Journal
bioRxiv : the preprint server for biology
Date Published
09/2025
ISSN
2692-8205
DOI
10.1101/2025.07.15.664946
PubMed ID
40791372
Links