I. Introduction
English Vocabulary Learning is worldwide great significance to many non-native English speakers is foundation of language learning and it is improving the ability to speak, read and write [1]. There are two type of people English and non-English speaker which leads to performance gap in non-English languages and negatively affect user experiences. This includes longer response times, shorter context lengths and difficulties for higher-level professional. Expanding the tokenizer vocabulary by introduces frequently used long words as additional tokens help mitigate these issues [2–3]. In Machine Learning is attempted by developing structure of English vocabulary learning. However, developing such as structures was labour intensive and does not achieve acceptable performance or required domain restriction [4–5]. To make vocabulary selection and teaching more systematic, the English lexicon has been categorized. It is argued that high coverage and essential words should be the focus, particularly for beginner English language learners, who concentrate on English word families [6]. The ML based method tackles these issues by dynamically assigning weights to various modalities depend on their task relevance selecting the most beneficial and allow to capture the gap of text in efficiently achieve better outcome in English vocabulary [7].