I. Introduction
One of the most popular methods which has been frequently used is for image classification in different applications is Convolutional Neural Networks (CNNs). Most CNNs require training time within an acceptable range. Also, there are other CNNs such as Res-Net, VGG, Alex net. [9] [10] [11] that can also be used for image classification; however, the training time required for such networks is significantly more than that of CNN. In this paper, we examine how adding a transform layer at the beginning of a CNN and different training strategies can result in performance improvement. The improved performance can be traded off for network size reduction. Therefore, we can make the network smaller and maintain almost the same performance.