arXiv Open Access 2022

Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins

Carlos A. Gandarilla-Perez, Sergio Pinilla, Anne-Florence Bitbol, Martin Weigt

Abstract

Predicting protein-protein interactions from sequences is an important goal of computational biology. Various sources of information can be used to this end. Starting from the sequences of two interacting protein families, one can use phylogeny or residue coevolution to infer which paralogs are specific interaction partners within each species. We show that these two signals can be combined to improve the performance of the inference of interaction partners among paralogs. For this, we first align the sequence-similarity graphs of the two families through simulated annealing, yielding a robust partial pairing. We next use this partial pairing to seed a coevolution-based iterative pairing algorithm. This combined method improves performance over either separate method. The improvement obtained is striking in the difficult cases where the average number of paralogs per species is large or where the total number of sequences is modest.
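
The first step described above, pairing paralogs within each species by aligning the similarity structure of the two families through simulated annealing, can be illustrated with a minimal sketch. This is not the authors' implementation: the cost function (squared mismatch between two distance matrices), the geometric cooling schedule, the toy data, and all names (`mismatch`, `anneal_pairing`, `steps`, `t0`, `t1`) are assumptions made for illustration. The sketch also assumes that both families have the same number of paralogs in every species and share one species labelling.

```python
import math
import random


def mismatch(dA, dB, perm):
    """Squared disagreement between the two distance matrices under a pairing.

    perm[i] is the index in family B paired with sequence i of family A.
    """
    n = len(perm)
    return sum((dA[i][j] - dB[perm[i]][perm[j]]) ** 2
               for i in range(n) for j in range(i + 1, n))


def anneal_pairing(dA, dB, species, steps=5000, t0=1.0, t1=1e-3, seed=0):
    """Search for a within-species pairing that minimises `mismatch`.

    species[i] is the species label of index i in both families (assumption).
    Returns the best pairing visited and its cost.
    """
    rng = random.Random(seed)
    groups = {}
    for idx, label in enumerate(species):
        groups.setdefault(label, []).append(idx)
    # Only species with at least two paralogs offer a choice of partner.
    swappable = [g for g in groups.values() if len(g) > 1]

    # Random starting point: shuffle partners inside each species.
    perm = list(range(len(species)))
    for g in groups.values():
        shuffled = g[:]
        rng.shuffle(shuffled)
        for i, j in zip(g, shuffled):
            perm[i] = j

    energy = mismatch(dA, dB, perm)
    best, best_energy = perm[:], energy
    if not swappable:
        return best, best_energy

    for step in range(steps):
        # Geometric cooling from t0 down to t1.
        t = t0 * (t1 / t0) ** (step / max(1, steps - 1))
        # Propose exchanging the partners of two paralogs of one species.
        i, j = rng.sample(rng.choice(swappable), 2)
        perm[i], perm[j] = perm[j], perm[i]
        new_energy = mismatch(dA, dB, perm)
        delta = new_energy - energy
        # Metropolis rule: always accept improvements, sometimes accept worse.
        if delta <= 0 or rng.random() < math.exp(-delta / t):
            energy = new_energy
            if energy < best_energy:
                best, best_energy = perm[:], energy
        else:
            perm[i], perm[j] = perm[j], perm[i]  # reject: undo the swap
    return best, best_energy


# Toy data (hypothetical): 3 species with 3 paralogs each. Family B has the
# same geometry as family A, with partners hidden by a within-species shuffle.
data_rng = random.Random(1)
species = [s for s in range(3) for _ in range(3)]
points = [(data_rng.random(), data_rng.random()) for _ in species]
truth = [1, 2, 0, 3, 5, 4, 8, 6, 7]  # hidden partner of each A sequence
n = len(species)
dA = [[math.dist(points[i], points[j]) for j in range(n)] for i in range(n)]
dB = [[0.0] * n for _ in range(n)]
for i in range(n):
    for j in range(n):
        dB[truth[i]][truth[j]] = dA[i][j]

pairing, energy = anneal_pairing(dA, dB, species)
```

In this toy setting the hidden pairing has zero cost by construction, so the returned `energy` indicates how close the search came to it. According to the abstract, only a robust partial pairing from this step is then used to seed the coevolution-based iterative pairing; selecting that robust subset (for example, by keeping pairs that recur across several annealing runs) is left out of the sketch.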

Citation Format

Gandarilla-Perez, C.A., Pinilla, S., Bitbol, A.-F., Weigt, M. (2022). Combining phylogeny and coevolution improves the inference of interaction partners among paralogous proteins. https://arxiv.org/abs/2208.11626

Journal Information
Year of publication: 2022
Language: en
Source database: arXiv
Access: Open Access ✓