Loading [MathJax]/extensions/MathZoom.js
Pairwise Statistical Significance of Local Sequence Alignment Using Substitution Matrices with Sequence-Pair-Specific Distance | IEEE Conference Publication | IEEE Xplore

Pairwise Statistical Significance of Local Sequence Alignment Using Substitution Matrices with Sequence-Pair-Specific Distance


Abstract:

Pairwise sequence alignment forms the basis of numerous other applications in bioinformatics. The quality of an alignment is gauged by statistical significance rather tha...Show More

Abstract:

Pairwise sequence alignment forms the basis of numerous other applications in bioinformatics. The quality of an alignment is gauged by statistical significance rather than by alignment score alone. Therefore, accurate estimation of statistical significance of a pairwise alignment is an important problem in sequence comparison. Recently, it was shown that pairwise statistical significance does better in practice than database statistical significance, and also provides quicker individual pairwise estimates of statistical significance without having to perform time-consuming database search. Under an evolutionary model, a substitution matrix can be derived using a rate matrix and a fixed distance. Although the commonly used substitution matrices like BLOSUM62, etc. were not originally derived from a rate matrix under an evolutionary model, the corresponding rate matrices can be back calculated. Many researchers have derived different rate matrices using different methods and data. In this paper, we show that pairwise statistical significance using rate matrices with sequence-pair-specific distance performs significantly better compared to using a fixed distance. Pairwise statistical significance using sequence-pair-specific distanced substitution matrices also outperforms database statistical significance reported by BLAST.
Date of Conference: 17-20 December 2008
Date Added to IEEE Xplore: 30 December 2008
ISBN Information:
Conference Location: Bhubaneswar, India

1. Introduction

Sequence alignment is a very important underlying application in the analysis of DNA and protein sequences [33], [7], [8], which forms the basic step of many other applications like homology detection, finding protein function, protein structure, deciphering evolutionary relationships, etc. Currently, there exist several programs for sequence alignment that use popular algorithms [40], [37] or their heuristic versions [8], [30], [32]. Recently, there have been many enhancements in alignment program features [15], [14] using difference blocks and multiple scoring matrices, which attempt to capture some more biological features in the alignment algorithm.

Contact IEEE to Subscribe

References

References is not available for this document.