Abstract:
The web is now so rich that it has become the principal corpus of fact-checkers. In their quest for the truth, these journalists manually scan many sources every day in search of relevant information. But in the data-journalism context, where data grow continuously in real time, this manual exploration has become increasingly laborious and ultimately impractical. One of the techniques currently used to address this problem is the automatic extraction of information from web pages, better known as "web scraping". The main purpose of web scraping is to extract specific, highly structured data from a web page with reduced human effort. In this paper, we present an automatic extractor of articles and journalistic claims implemented on 15 Senegalese news websites.
Published in: 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS)
Date of Conference: 15-18 October 2018
Date Added to IEEE Xplore: 02 December 2018