1. Introduction
The task of classifying proteins into various categories is a major challenge in the field of bioinformatics and effective models contribute to greater insight into the fundamentals of molecular biology. Such tasks may involve predicting whether or not any two proteins are likely to interact [1], predicting protein solubility [2], classification of functional families of proteins [3], or predicting what strain of HIV-1 a given protein may belong to [4]. Common to each of these cited examples is the use of an increasingly popular means of encoding a given protein sequence into a machine-readable format: the Chaos Game Representation (CGR).