I. Introduction
One of the main goals of traffic management is to provide a safe traffic infrastructure, where most of the potential traffic congestion scenarios and car collisions are avoided. This issue can be addressed by modeling the traffic behavior of the road network and by developing predictive methods that allow forecasting metrics that characterize traffic, such as flow, speed, or travel time [1]. This process enables the implementation of operational measures for decision making (e.g., redirecting traffic flow to other alternative routes). These traffic forecasting models are usually built from past traffic observations. However, in practice traffic data acquisition systems cannot be deployed over every link of a road, mainly due to the high costs of deployment and maintenance of such sensing equipment. On many occasions this issue has been addressed by deploying temporal sensors that provide measurements for certain locations of interest during a limited period of time. Nevertheless, a proper characterization of the traffic behavior under a variety of circumstances (e.g., events or holidays) requires real traffic measurements over more dilated periods [2]. Thus, real traffic data are not available for every road link, nor are they collected for the time needed for a thorough characterization. Indeed, in some cases placing a sensor is not feasible due to manifold reasons. If possible, another concern arises from the long latency needed to collect sufficient data for the subsequent model training. Since the performance of traffic forecasting models is constrained to both data quality and quantity, data availability is arguably the practical bottleneck for a successful traffic network characterization.