Sequence context-specific profiles for homology searching.

Biegert, A.; Söding, J.

doi:10.1073/pnas.0810767106

アイテム詳細

登録内容を編集ファイル形式で保存

一時保存へ追加

タグ情報を表示リリース履歴を表示詳細要約

公開

学術論文

Sequence context-specific profiles for homology searching.

MPS-Authors

/persons/resource/persons128572

Söding, J.
Research Group of Computational Biology, MPI for Biophysical Chemistry, Max Planck Society;

External Resource

http://www.pnas.org/content/106/10/3770.full.pdf+html
(出版社版)

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

フルテキスト (公開)

1944232.pdf
(出版社版), 994KB

付随資料 (公開)

1944232_Suppl.pdf
(付録資料), 563KB

引用

Biegert, A., & Söding, J. (2009). Sequence context-specific profiles for homology searching. Proceedings of the National Academy of Sciences of the United States of America, 106(10), 3770-3775. doi:10.1073/pnas.0810767106.

引用: https://hdl.handle.net/11858/00-001M-0000-0017-D4D3-2

要旨

Sequence alignment and database searching are essential tools in biology because a protein's function can often be inferred from homologous proteins. Standard sequence comparison methods use substitution matrices to find the alignment with the best sum of similarity scores between aligned residues. These similarity scores do not take the local sequence context into account. Here, we present an approach that derives context-specific amino acid similarities from short windows centered on each query sequence residue. Our results demonstrate that the sequence context contains much more information about the expected mutations than just the residue itself. By employing our context-specific similarities (CS-BLAST) in combination with NCBI BLAST, we increase the sensitivity more than 2-fold on a difficult benchmark set, without loss of speed. Alignment quality is likewise improved significantly. Furthermore, we demonstrate considerable improvements when applying this paradigm to sequence profiles: Two iterations of CSI-BLAST, our context-specific version of PSI-BLAST, are more sensitive than 5 iterations of PSI-BLAST. The paradigm for biological sequence comparison presented here is very general. It can replace substitution matrices in sequence- and profile-based alignment and search methods for both protein and nucleotide sequences.