I. Introduction
Humans can experience various emotions during communication. There may be rapid changes in the types and intensity of emotions experienced in a collaborative context. The pandemic has brought a substantial change in the medium of teaching, with online learning gaining popularity in many educational institutions. Affective computing specializes in developing systems that can identify and reproduce human emotions. Among the variety of emotions users may experience during online interactions, we focus on confusion. For the context of this study, we define confusion as a state of mind where an individual is uncertain about the information communicated to them. Compared to happiness, sadness, or anger, confusion is a more subtle affective state and poses a challenge [1]. This study seeks to identify this ambiguous emotion using multimodal data.