I. Introduction
The area of Computational Bioacoustic Scene Analysis has received increasing attention by the scientific community in the last decades [1], [2], [3], [4]. Such interest is motivated by the potential benefits that can be acquired towards addressing major environmental challenges including invasive species, infectious diseases, climate and land-use change, etc. Availability of accurate information regarding range, population size and trends is crucial for quantifying the conservation status of the species of interest. Such information can be obtained via classical observer-based survey techniques; however these are becoming inadequate since they are a) expensive, b) subject to weather conditions, c) cover a limited amount of time and space, etc. To this end, autonomous recording units (ARUs) are extensively employed by biologists [5], [6]. An ARU which could be useful for the specific application is available at https://www.wildlifeacoustics.com/products/song-meter-sm4. This is also motivated by the cost of the involved acoustic sensors which is constantly decreasing due to the advancements in the field of electronics.