Showing 1-13 of 13 results

In this article, we present a platform for aggregating journalistic content, based on an intelligent crawler of online press articles, an inclusive fact-checking approach, and an opinion mining method. More than 300,000 press articles from more than 145 Senegalese online information websites have been collected, processed, analyzed, aggregated, stored, and classified by category and theme, ...
Information websites, known as the online press, contain a tremendous amount of potentially valuable data. All this information is available in real time. Because of the velocity of information broadcasting and the volume of available data, traditional data extraction methods appear unsuited to extracting the right information from the right pages. It is in this context that we propose a ne...
POS tagging (part-of-speech tagging) is considered a basic step in automatic natural language processing. It consists of identifying the grammatical category of each word in a text. In some languages, including French, it is difficult for a machine to determine the grammatical form of words. These difficulties are related to the labeling of unknown words, the ambiguity of certain words, the under...
Opinion mining is the processing of textual content from online discussions in order to highlight the opinions of internet users. In this process, identifying the comments through a model and a formalism that encompass the methods and the data is a critical step. Unfortunately, this step is generally neglected in most sentiment analysis studies. The objective of this article is to addre...
A press article is a collection of statements (opinions and/or allegations) that are a priori non-factual (the facts), i.e. an opinion on a specific subject. This article seeks to address the challenging issue of modeling news articles and facts. The authors also discuss the possibilities of modeling networks of factual articles based on the RED Model and Linked Data.
The complexity of comments in the Senegalese online press is mainly due to ambiguity. This has led to customized abbreviations and a strong presence of local languages. These factors make current opinion mining tools ineffective. In view of this complexity, our objective is to propose an opinion lexicon for processing this type of data. This lexicon will take into account Wolof, French and ur...
The main objective of web scraping is to extract information from one or many websites and process it into simple structures such as spreadsheets, databases, or CSV files. However, in addition to being a very complicated task, web scraping is resource- and time-consuming, especially when it is carried out manually. Previous studies have developed several automated solutions. The purpose of this article is to rev...
Nowadays, the study of the online press has become a highly active research topic. From article collection and merging, opinion mining, artificial intelligence, and automatic classification to fact-checking, research is developing and opening new perspectives. The main objective is to pave the way for the journalistic consumption of the future. However, though the literature review mentions previo...
In recent years, automatic natural language processing (NLP) has made considerable progress in terms of performance. Nevertheless, undertaking a linguistic analysis of facts in French remains a real problem today. On the one hand, current taggers do not match the definition of fact in fact-checking and, on the other hand, the complexity of the French language considerably decreases their perf...
Nowadays the web is very rich, which is why it is the principal corpus of fact checkers. In their quest for the truth, these journalists manually scan many sources every day in search of relevant information. But in the data-journalism context, where data are constantly increasing in real time, this manual exploration has become increasingly laborious, even impossible. One of the techniques currently used for thi...
In this paper, we present a fact-checking algorithm named SnVera, based on the confrontation of opinions. By confronting opinions, we mean using the identical and the controversial points of view of the sources in the corpus as essential elements in the quest for truth. The main objective is to propose a new formalization of the truth detection problem by introducing two new sco...
The pluralism of media and the unbridled race to produce have resulted in a considerable increase in the volume of processed and produced data, but also in the proliferation of false information. The internet user is thus confronted with two major problems: identifying reliable sources and assessing the reliability of information. Indeed, there is no guarantee of the accuracy of the given infor...
Today, the main problem in data journalism is quality. Indeed, even if the web is a broad data source, not everything in it should be taken at face value, and it is always necessary to check the quality and relevance of data and the reliability of sources. In this paper, we present a state of the art of fact-checking automation as well as a diagnosis of the obstacles to its auto...