I. Introduction
With the wide deployment of wireless networks and rapid development of communication technology and Internet of Things (IoT), localization has attracted increasing interest in industry and research community and been widely used for a wide range of intelligent applications [1], [2], such as smart home, health management, and smart guiding. In practice, accurately determining the locations of a user in a home has the potential to significantly enhance in-home experience. In addition, the home assistant can personalize the responses based on where a user is within sensing area and the temperature, for example, can be automatically controlled, thereby resulting in an energy saving. Future home robots would be more intelligent if they know where a user is in the house, providing high-level analytic service for sensing applications such as responsive smart home and health monitoring. Owing to its importance, many indoor localization systems have been proposed with various advanced techniques in recent years and these approaches can be divided into computer vision-based [3]–[6], dedicated device-based [7]–[10] and received signal strength (RSS)-based [11]–[14]. Specifically, these schemes employing dedicated devices worn or held by the users are inconvenient and cumbersome, especially the elderly often forget to charge them or leave them on tables. In addition, these cameras-based schemes cannot work in non-line-of-sight (NLoS) situation and also incur the users’ privacy concerns. In reality, most people are uncomfortable with being filmed all the time, especially in their own homes. Those RSS-based schemes are vulnerable to the channel dynamics and background noises since it is a superposition of multiple signal propagation paths. Therefore, a device-free, passive and robust indoor localization needs to be developed to overcome these remained limitations of current schemes.