Yaxian Wang - IEEE Xplore Author Profile

Showing 1-5 of 5 results

Results

Geometry Problem Solving has drawn growing attention recently due to its application prospects in the intelligent education field. However, existing methods are still inadequate for practical application and suffer from the following limitations: 1) explainability, which is essential in real teaching scenarios, is not ensured; 2) the small scale and incomplete annotation of existing ...
Diagram Question Answering (DQA) aims to correctly answer questions about given diagrams, which demands an interplay of good diagram understanding and effective reasoning. However, objects with the same appearance in diagrams can express different semantics. This visual semantic ambiguity makes it challenging to represent diagrams well enough for better understanding. Moreover, sinc...
The Textbook Question Answering (TQA) task requires answering questions by reasoning over both the given diagrams and the text context. The task poses two main challenges. First, diagrams differ from natural images: similar shapes or color blocks may express different semantics, and there is also large intra-topic variation among diagrams. Hence, the characteristics of visual s...
Visual question answering (VQA) is a task in which a machine must provide an accurate natural-language answer given an image and a question about that image. Many studies have found that current VQA methods are heavily driven by surface correlations or statistical bias in the training data and lack sufficient image grounding. To address this issue, we devise a novel end-to-end architecture tha...
Textbook question answering (TQA) is a task that requires answering both non-diagram and diagram questions accurately, given a large context consisting of abundant diagrams and essays. Although many studies have made significant progress in natural-image question answering (QA), their methods are not applicable to comprehending diagrams and reasoning over a long multimodal context. To address the ab...