I. Introduction
The critical requirement for precise species identification and tracking has never been greater in this era of rapidly declining biodiversity [1]. For ecologists and conservationists, the balance of rapidly increasing rates of extinction and slowly progressing species documentation and monitoring presents an enormous challenge. With tools for quick, precise, and non-intrusive species monitoring, computer vision has the potential to revolutionise the conservation of biodiversity [2]. But computer vision's traditional supervised learning techniques heavily depend on large, labelled datasets for every species, which is a requirement that is frequently unachievable given the vast diversity and quantity of undiscovered species [3]. This disparity calls for a paradigm change in favour of approaches that can function well with little to no data for classes [4]. ZSL, a promising method designed to identify objects and entities that were not seen during the model's training.Unlike traditional methods, ZSL is an advanced machine learning technique that does not require direct examples of every class it needs to identify [5]. Rather, it makes use of common characteristics and semantic connections between known and unknown classes to make it possible to identify new entities by connecting them to previously acquired information. Figure 1 illustrates the ZSL based on embedding based method.