Reduction, alignment and visualisation of large diverse sequence families

More about Open Access at the Crick


Current volumes of sequence data can lead to large numbers of hits identified on a search, typically in the range of 10s to 100s of thousands. It is often quite difficult to tell from these raw results whether the search has been a success or has picked-up sequences with little or no relationship to the query. The best approach to this problem is to cluster and align the resulting families, however, existing methods concentrate on fast clustering and either do not align the sequences or only perform a limited alignment.

Journal details

Volume 17
Issue number 1
Pages 300
Available online
Publication date

Crick labs/facilities