I. Introduction
The competitive race toward the mass deployment of self-driving cars driving side-by-side with human-driven vehicles on public roads advocates for an accurate and highly precise safety evaluation framework to ensure safe driving. However, achieving meaningful precision is a challenging task when the safety-critical events under study are rare in naturalistic situations. A recent method has been developed which which adopts Monte Carlo method empowered by the Importance Sampling technique as a variance reduction scheme; this has produced appealing results. In [1], it is shown that the efficiency is enhanced by ten thousand times with the incorporation of large-scale driving data sets and the employed statistical models. This improved efficiency is highly appealing for autonomous vehicle (AV) researchers as the required testing effort is overly demanding, an estimate of 8.8 billion driving miles required to provide ‘sufficient’ evidence to compare the safety of AV driving and human driving from logged data [2].