1. INTRODUCTION
Modern molecular biology is being revolutionized by the development of new technology. With the complete sequencing of several model organisms, a pressing problem is how to describe and understand the huge amount of the biological data which is accumulating. Fortunately, the marriage of biology and computer science makes the analysis of genomic data much more tractable. To annotate genomes automatically, splice site identification is of vital importance, because it is an essential step for gene prediction. As the central dogma [1] says: “DNA makes RNA makes Protein”, when most genes of eukaryotic organisms are translated into proteins, intervening sequences known as introns are removed from the pre-mRNA in the nucleus before the matured mRNA is transported into the cytoplasm and translated. The regions of genes that are ultimately translated into proteins are often referred to as exons, and a splicesite is just the boundary of an intron and an exon. Fig. 1 presents a visual description of splice junctions in the gene structure. Hence, if we could identify splice sites accurately, a specific gene structure will become much clearer. In this sense, a good method for characterizing and predicting splice junctions would surely improve automatic genome annotation. A visual description of splice junctions in the gene structure