1. Introduction
Almost as if no one was looking the character of problem solving in bioinformatics has changed from established methods suited to a scarcity of data to the development of new methods suited to a wealth of diverse data. Originally, when data was scarce, the foremost biological problem associated with sequence data was identifying unknowns. Today the complete genomes of over 600 organisms have been sequenced [1]. Although existing methods can always be improved, they do an admirable job of identifying the location of genes and associating them with known, or at least putative, functional details [2]. The next step requires understanding how individual genes and their protein products function in a much more systematic way. Where once scientists asserted that 90% of the genome was “junk”, the importance of noncoding RNAs and repeat regions that make up the junk may actually transcend the importance of protein coding genes [3]–[5].