Algorithms for matching partially labelled sequence graphs


Abstract

To find correlated pairs of positions between proteins, which are useful in predicting interactions, two large multiple sequence alignments must be concatenated such that each pair of joined sequences comes from proteins that interact in their species of origin. When each protein is unique within a species, the species name is sufficient to guide this match; however, when each species contains multiple related sequences (paralogs), the pairing is more difficult. In bacteria, genome co-location provides a good guide, as interacting proteins tend to lie in a common operon, but in eukaryotes this simple principle is not sufficient.
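The species-name match described in the abstract can be illustrated with a minimal sketch. It is not the method of the paper: the function name `pair_by_species`, the `(species, sequence_id)` input format, and the example identifiers are all assumptions made for illustration. The sketch pairs sequences only where each family has exactly one sequence in a species, and reports species with paralogs as ambiguous rather than guessing a pairing.

```python
from collections import defaultdict


def pair_by_species(family_a, family_b):
    """Pair sequences from two protein families by species name.

    family_a, family_b: lists of (species, sequence_id) tuples
    (a hypothetical input format chosen for this sketch).

    Returns (pairs, ambiguous):
      pairs     maps species -> (id_a, id_b) when each family has
                exactly one sequence for that species;
      ambiguous maps species -> (ids_a, ids_b) when paralogs leave
                the pairing undetermined by species name alone.
    """
    by_species_a = defaultdict(list)
    by_species_b = defaultdict(list)
    for species, seq_id in family_a:
        by_species_a[species].append(seq_id)
    for species, seq_id in family_b:
        by_species_b[species].append(seq_id)

    pairs, ambiguous = {}, {}
    # Only species present in both families can contribute a joined row.
    for species in sorted(by_species_a.keys() & by_species_b.keys()):
        ids_a, ids_b = by_species_a[species], by_species_b[species]
        if len(ids_a) == 1 and len(ids_b) == 1:
            pairs[species] = (ids_a[0], ids_b[0])
        else:
            ambiguous[species] = (ids_a, ids_b)
    return pairs, ambiguous


# Illustrative data only: one unique protein per family in E. coli,
# two paralogs per family in H. sapiens.
family_a = [("E. coli", "A1"), ("H. sapiens", "A2"), ("H. sapiens", "A3")]
family_b = [("E. coli", "B1"), ("H. sapiens", "B2"), ("H. sapiens", "B3")]
pairs, ambiguous = pair_by_species(family_a, family_b)
```

In this toy input the E. coli sequences are paired directly, while the H. sapiens paralogs are left unresolved, which is the case where species name alone no longer determines the match.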

Journal details

Volume 12
Issue number 1
Pages 24
Available online
Publication date

Crick labs/facilities

Acknowledged team