I. Introduction
Multimodal Sentiment Analysis (MSA) is widely used in all areas of software engineering and is used throughout the software engineering lifecycle. For example, as one of the most important parts of the software engineering lifecycle, software development is a highly collaborative activity that is highly influenced by the emotional state of developers. Negative affective states can make developers less productive in software projects and can easily introduce software defects [33], while positive affect is expected to increase developer productivity [1]. In addition, obtaining valid sentiment status from APP reviews, technical Q&A sites such as Stack Overflow, and developers' comments on APIs is crucial for subsequent product improvement and service refinement. For example, the comment "I am not able to deploy my App Engine project locally." on the Java API predicts a negative sentiment, which in turn leads to the possibility of optimizing the mentioned "App Engine" service. This shows that it is necessary and essential to conduct research on sentiment analysis in software engineering.