I. Introduction
Traceability recovery is used to discover relationships between thousands of software artifacts to facilitate the efficient retrieval of relevant information in large-scale industrial projects [1]. Complete and accurate traceability links can ensure each related elements will be considered when changing requirements and ensure every requirement is implemented, therefore traceability recovery play important roles in software maintenance [1], bug localizations [11], [35], [36] and etc. Traditional methods of recovering traceability include building requirement traceability matrices (RTMs), building requirement traceability graphs. However, these methods are difficult to extend and error-prone with the evolution of software [2]. Hence, many researchers put forward approaches to solve this problem with information retrieval (IR) techniques, and these methods are mainly based on text retrieval, e.g. VSM [3]–[5], LSA [3]–[5]. As highlighted in [2], text analysis techniques are used to solve more and more problems in software engineering.