DOAJ Open Access 2024

Maximum-scoring path sets on pangenome graphs of constant treewidth

Broňa Brejová Travis Gagie Eva Herencsárová Tomáš Vinař

Abstrak

We generalize a problem of finding maximum-scoring segment sets, previously studied by Csűrös (IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2004, 1, 139–150), from sequences to graphs. Namely, given a vertex-weighted graph G and a non-negative startup penalty c, we can find a set of vertex-disjoint paths in G with maximum total score when each path’s score is its vertices’ total weight minus c. We call this new problem maximum-scoring path sets (MSPS). We present an algorithm that has a linear-time complexity for graphs with a constant treewidth. Generalization from sequences to graphs allows the algorithm to be used on pangenome graphs representing several related genomes and can be seen as a common abstraction for several biological problems on pangenomes, including searching for CpG islands, ChIP-seq data analysis, analysis of region enrichment for functional elements, or simple chaining problems.

Penulis (4)

B

Broňa Brejová

T

Travis Gagie

E

Eva Herencsárová

T

Tomáš Vinař

Format Sitasi

Brejová, B., Gagie, T., Herencsárová, E., Vinař, T. (2024). Maximum-scoring path sets on pangenome graphs of constant treewidth. https://doi.org/10.3389/fbinf.2024.1391086

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.3389/fbinf.2024.1391086
Informasi Jurnal
Tahun Terbit
2024
Sumber Database
DOAJ
DOI
10.3389/fbinf.2024.1391086
Akses
Open Access ✓