A Survey on Truth Discovery: Concepts, Methods, Applications, and Opportunities | IEEE Journals & Magazine | IEEE Xplore

A Survey on Truth Discovery: Concepts, Methods, Applications, and Opportunities


Abstract:

In the era of data information explosion, there are different observations on an object (e.g., the height of the Himalayas) from different sources on the web, social sens...Show More

Abstract:

In the era of data information explosion, there are different observations on an object (e.g., the height of the Himalayas) from different sources on the web, social sensing, crowd sensing, and data sensing applications. Observations from different sources on an object can conflict with each other due to errors, missing records, typos, outdated data, etc. How to discover truth facts for objects from various sources is essential and urgent. In this paper, we aim to deliver a comprehensive and exhaustive survey on truth discovery problems from the perspectives of concepts, methods, applications, and opportunities. We first systematically review and compare problems from objects, sources, and observations. Based on these problem properties, different methods are analyzed and compared in depth from observation with single or multiple values, independent or dependent sources, static or dynamic sources, and supervised or unsupervised learning, followed by the surveyed applications in various scenarios. For future studies in truth discovery fields, we summarize the code sources and datasets used in above methods. Finally, we point out the potential challenges and opportunities on truth discovery, with the goal of shedding light and promoting further investigation in this area.
Published in: IEEE Transactions on Big Data ( Volume: 11, Issue: 2, April 2025)
Page(s): 314 - 332
Date of Publication: 05 July 2024

ISSN Information:

Funding Agency:


I. Introduction

In the past few decades, the amount of helpful information available on the Web has been proliferating, which has brought dramatic changes to human society [1]. People are more dependent on the Web to fulfill their information needs than ever [2]. However, a huge amount of disinformation, outdated data, and factual errors are filled on the Web [3]. It is difficult for users to distinguish the truth from various information [4], [5], [6], [7]. When searching for the birthplace of Adolf Hitler on the web, the answers include “Austria”, “Braunau”, “Germany”. It is difficult for users to choose the correct information among these conflicting answers. Since the collected information may be informed, incomplete, outdated, or existing factual errors, it is crucial to discover truths from various information which improves accuracy in information extraction. To solve the problem, truth discovery has attracted researchers’ attention recently [8], [9], [10] in many real-world application scenarios including web, social sensing, crowd sensing, privacy sensing, and deep neural network applications. For example, true facts are found from a large amount of conflicting information on many objects provided by various websites [11]. The sensory data collected from various mobile devices are usually unreliable in mobile cloud computing. Truthful information is extracted from unreliable sensory data in mobile crowd sensing [12]. It is necessary to aggregate noisy information on the objects, entities, or events collected from various sources.

Contact IEEE to Subscribe

References

References is not available for this document.