I. Introduction
In recent years, contact-free sensing technology using wireless signals (e.g., Wi-Fi [7], [12], [37], RFID [6], [11], [17], [18], and acoustic [4], [5], [13], [40]) has attracted extensive attention from both the academic and industrial sectors. Wireless sensing technology enables low-cost, sensor-free human sensing applications such as activity tracking, gesture recognition, respiration monitoring, and motion tracking. Wi-Fi-based wireless sensing has a considerable advantage in deployment cost due to the widely deployed Wi-Fi communication infrastructure. Although significant progress has been made in Wi-Fi sensing systems, it is worth noting that existing Wi-Fi sensing typically uses static transmitters and receivers. However, smartphones, smartwatches, and other sensing devices close to the human body will move with the hand. Besides, limited Wi-Fi bandwidth hinders obtaining sufficient range resolution to distinguish multiple sensing objects during realistic multi-person sensing. Therefore, a critical aspect missing in Wi-Fi sensing is the ability to perform sensing in multi-person scenarios using a handheld device in a moving state.