I. Introduction
Developing systems that can reason over explicit knowledge has attracted substantial attention in current AI research [1]. Complex Question Answering (Complex QA) tasks provide a comprehensive and quantitative way to measure this ability, with evidence drawn from either structured knowledge bases (e.g., WikiData) or natural language texts (e.g., Wikipedia). Because structured knowledge bases are costly to construct, this paper focuses on Complex QA over textual evidence.