I. Introduction
Artificial Intelligence and Internet of Things (AIoT) is the product of the integration of two technologies: Artificial Intelligence (AI) and Internet of Things (IoT) [1]. The widespread adoption of IoT devices and the explosive growth of data generated by them provide AI with a vast amount of real-world data. AI technology supports the intelligent analysis and empowers processing of data in IoT systems, what’s more, it provides the objective demand and abundant opportunities for the practical application of by the use of corpus [1]. The close relationship between AI and IoT reveal the common needs of corpus, especially comparable corpus. Corpus, a set of well-sampled and processed electronic texts, is the basic resource that is necessary for linguistics theoretical study and natural language processing, especially language engineering of the applications or smart devices of AIoT [2]. As one of the foundational part, comparable corpus have been widely used in smart devices.