I. Introduction
Automated driving (AD) of autonomous vehicles (AVs) is considered as the main future trend in intelligent road mobility to reduce human labor and costs and enhance safety and reliability [1], [2], [3]. Recent academic publications have demonstrated the success of DRL in training AI agents to drive better than experienced human racers [4] or in developing intelligent environments (background vehicles) to accelerate testing and evaluation processes [5]. Despite recent technological advancements, the successful deployment of DRL-based AD techniques in real-world environments depends on users’ acceptance and trust. Due to reports on AV accident cases [6], [7] and the black box nature of DRL neural networks (NNs) [8], people are reluctant to accept the use of AVs. Therefore, effective approaches to increasing public confidence in DRL-based decision-making and control applications are necessary.