OnPerDis: Ontology-Based Personal Name Disambiguation on the Web | IEEE Conference Publication | IEEE Xplore

Abstract:

With the growth of web documents, personal name ambiguity has become more common and degrades the performance of web search. Identifying the correct personal entity from part of a document or from the whole document is still a very challenging problem, especially for Chinese websites. In this paper, we propose a novel ontology-based approach for personal name disambiguation (named "OnPerDis"). The approach has two main steps. First, we construct a person ontology (PO) with rich conceptual modeling as well as a large set of supporting instances. Second, for a given personal name on the web, we create a temporary instance, extract its features from the web documents, and calculate the similarity between this temporary instance and the instances in the PO. The instance with the highest similarity score is chosen as the appropriate person for the name. Our extensive evaluations with two rich real-life datasets (CIPS-SIGHAN 2012 NERD and Chinese web documents) show OnPerDis's efficacy for personal name disambiguation on the Web.
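
The matching step described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the `PersonInstance` class, the attribute names, the toy ontology entries, and in particular the similarity measure (a mean Jaccard overlap per attribute), since the abstract does not say how similarity is computed.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class PersonInstance:
    """Hypothetical PO entry: an identifier plus attribute -> set of feature values."""
    person_id: str
    features: dict[str, set[str]] = field(default_factory=dict)


def jaccard(a: set[str], b: set[str]) -> float:
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity(temp: PersonInstance, candidate: PersonInstance) -> float:
    """Assumed measure: mean Jaccard overlap over the temporary instance's attributes."""
    if not temp.features:
        return 0.0
    scores = [
        jaccard(values, candidate.features.get(attr, set()))
        for attr, values in temp.features.items()
    ]
    return sum(scores) / len(scores)


def disambiguate(temp: PersonInstance, ontology: list[PersonInstance]):
    """Return the highest-scoring PO instance and its score, or (None, 0.0) if the PO is empty."""
    if not ontology:
        return None, 0.0
    best = max(ontology, key=lambda inst: similarity(temp, inst))
    return best, similarity(temp, best)


# Toy data, invented for illustration only.
ontology = [
    PersonInstance("yao_ming_athlete", {
        "occupation": {"basketball player"},
        "organization": {"Houston Rockets", "NBA"},
    }),
    PersonInstance("yao_ming_other", {
        "occupation": {"professor"},
        "organization": {"university"},
    }),
]

# Temporary instance built from features extracted from a web document.
temp = PersonInstance("mention", {
    "occupation": {"basketball player"},
    "organization": {"NBA"},
})

best, score = disambiguate(temp, ontology)
print(best.person_id, score)
```

A set-overlap score is used only because it is simple to follow; a weighted or ontology-aware measure would plug into `similarity` without changing the selection logic in `disambiguate`.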
Date of Conference: 17-20 November 2013
Date Added to IEEE Xplore: 23 December 2013
Conference Location: Atlanta, GA, USA

I. Introduction

Web search engines have recently become vital in people's daily lives and are widely used to retrieve information about real-world entities, including people. In such cases, users enter the name of the target entity in a search engine to obtain a set of Web pages that contain the name. However, name ambiguity (many entities share the same name, or one entity has several names) typically produces search results that mix Web pages about several different entities. Such ambiguity is more common in Chinese names. For example, when searching for "Yao Ming", the results are dominated by the well-known basketball player, and users have to manually filter out these Web pages to find the less famous person they are looking for who shares the same name. This is the personal name ambiguity problem.
