Loading [MathJax]/extensions/MathMenu.js
2009 International Conference on Natural Language Processing and Knowledge Engineering - Conference Table of Contents | IEEE Xplore
IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE)

2009 International Conference on Natural Language Processing and Knowledge Engineering

DOI: 10.1109/NLP-KE15698.2009

24-27 Sept. 2009

Proceedings

The proceedings of this conference will be available for purchase through Curran Associates.

Natural Language Processing and Knowledge Engineering (NLP-KE), 2009 International Conference on

The similarity between sentences is a theoretical basis and key technology to the question answering system. The method presented in this paper is as follows. Firstly, the dependency question sets are obtained and the key words are extracted from the major components of the question sentences and the target question form the related libraries, and then the candidate question sets are obtained thro...Show More
This paper presents a conditional mutual information based selectional association to measure the selectional preference between two words in the same sentence. This selectional association is integrated conditional mutual information and a syntactic knowledge called link grammar. The selectional association is applied to indicative words selection for target words disambiguation. The experimental...Show More
Rule based machine translation systems face different challenges in building the translation model in a form of transfer rules. Some of these problems require enormous human effort to state rules and their consistency. This is where different human linguists make different rules for the same sentence. A human linguist states rules to be understood by human rather than machines. The proposed transl...Show More
Most of efficient computational approaches in NLP tasks are supervised methods which need annotated corpora. But the lack of supervised data in Persian encourages researchers to increase their interests and efforts on unsupervised and semi-supervised approaches. This paper presents a novel semi-supervised approach which called Genetic-based inside-outside (GIO), for Persian grammar inference for i...Show More
The rapid growth of online information services causes the problem of information explosion. Automatic text summarization techniques are essential for dealing with this problem. The process of compacting a source document to reduce complexity and length, retaining the most important information is called text summarization. This paper introduces PARSUMIST; a text summarization system for Persian d...Show More
Latent Dirichlet Allocation (LDA) is a generative model employing the symmetry Dirichlet distribution as prior of the topic-words' distributions to implement model smoothing. When LDA is applied to text classification, smoothing is essential to classification performance. In this paper, we propose a feature-enhanced smoothing method in the idea that words not appeared in the training corpus can he...Show More
This paper describes how the Google Web 1T 5-gram data set, contributed by Google Inc., can be stored so that it can be used efficiently with respect to time. We present an efficient way of accessing all the 5-grams for a specific word of interest from the stored files. We measure the maximum access and processing efficiency achievable for any word of interest. We also compare results (access time...Show More
The focus of data integration systems is on producing a comprehensive global schema successfully integrating data from heterogeneous data sources (heterogeneous in format and in structure). Starting from the ldquomeaningsrdquo associated to schema elements (i.e. class/attribute labels) and exploiting the structural knowledge of sources, it is possible to discover relationships among the elements o...Show More
With high increasing documents and electronic texts in Persian language, the use of fast methods to achieve texts through huge sets of documents is highly crucial. Persian text summarization which shows the main concept of a text in minimum size is an effective solution. One of the steps in Persian text summarization is to stem and eliminate common words. The aim of this research is to stem words ...Show More
Gene mention tagging is a critical step for biomedical text mining. Only when gene and gene product mentions are correctly identified could other more complex tasks, such as, gene normalization and gene-gene interaction extraction, be performed effectively. In this paper, six divergent models are implemented with different machine learning algorithms and dissimilar feature sets. We integrate these...Show More
Emotion recognition on text has wide applications. In this study we propose a method of emotion recognition at sentence level based on a relative large emotion annotation corpus (Ren-CECps). From this corpus, we get the emotion lexicons for the eight basic emotions (expect, joy, love, surprise, anxiety, sorrow, angry and hate). Statistics show that the emotion lexicons derived from Ren-CECps are u...Show More
In order to investigate the picture superiority effect, we compared the ERP between picture combined word (picture-word) and pure word (word) at study and test phase. During encoding, the FN400 was more negative and lasted longer for picture-words than for words. The late positive component (LPC) was more positive and distributed broadly for words compared to picture-words. During retrieval, the o...Show More
Emotion plays a significant role in human communications in our daily life. With progress in human-machine interface technology, recent research has placed more emphasis on the recognition of emotion reaction. Comparing to some other ideal experimental settings, blog posts online would be respond more to real-world events. And a huge resource of text-based emotion can be found from the World Wide ...Show More

Hyponymy acquisition from Chinese text by SVM

Fang Tian;Fuji Ren

Publication Year: 2009,Page(s):1 - 6
Cited by: Papers (1)
Hyponymy as one of semantic relation taxonomies provides a fundamental knowledge for natural language processing applications. In this paper, we propose a method for automatically learning hyponymy terms by machine learning technique from text for Chinese. Our method relies on hand-crafted hyponymy patterns, and uses the syntactic features to build a multiple classifier to identify novel hyponymy ...Show More
Analysis of emotions in texts has wide-ranging applications. In the analysis of emotional expressions, degree words are important for expressing emotion intensity of emotions. With the support of a large Chinese emotion corpus (Ren-CECps), in this paper, we present analysis on degree words for Chinese emotion expressions based on syntactic parse and rules. At first, Ren-CECps is used to extract th...Show More
There is a widely held belief in the NLP and computational linguistics communities that identifying and defining roles of predicate arguments in a sentence has a lot of potential for and is a significant step toward improving important applications such as document retrieval, machine translation, question answering and information extraction. In this paper, we present an semantic role labeling (SR...Show More
We present a method for correcting real-word spelling errors using the Google Web 1T n-gram data set and a normalized and modified version of the longest common subsequence (LCS) string matching algorithm. Our method is focused mainly on how to improve the correction recall (the fraction of errors corrected) while keeping the correction precision (the fraction of suggestions that are correct) as h...Show More
We perform a research about Japanese sightseeing guidance question answering (QA) system. In this domain Web information retrieval approach is always being used. However for the Web information retrieval, Japanese unknown place word detection needs to be considered. Some famous places are known very well, such as "Kinkaku-ji", it can be recognized by morphological analysis. However, the recognitio...Show More
Research on machine translation has a long history and many methods and techniques have been proposed and developed. However, low quality of translation is still a major problem and many related problems remain unresolved. Super function based machine translation was proposed to perform translation without going through syntactic and semantic analysis as many machine translation systems usually do...Show More
There are lots of negative sentences in both Japanese and Chinese, which are varing from meanings and phrasings. Especially the negative sentences always express complex relationships in Chinese language. So it is easy to make ambiguity when translating Japanese into Chinese. At present, it's very common that the translation softwares sold on the market make errors due to wrong translation of the ...Show More
In order to recognize human emotion, a three-dimensional (3D) emotion space model was established and used to identify the text emotional orientation visually and intuitively. An emotion dictionary contains informations of eight basic emotions, was built based on the emotion corpus. By using this emotion dictionary, emotion keywords which behave distributions of 3D structure can be projected into ...Show More
In this paper, the purpose is to arrange information to understand at one view. The proposed summarization frame technology is a system to hierarchically arrange and classify information by targeting content and level of importance in sentences. Moreover, the technique in which the Concept Base, the Degree of Association Algorithm, the Time Judgment system and the Place judgment system are used to...Show More
In this paper, we propose an approach on answer generation for cooking question answering system. We first review previous work of question analysis. Then, we give annotation scheme for knowledge database. Finally, we present the answer planning based approach for generate an exact answer in natural language. An evaluation has been conducted on natural language questions and the result shows that ...Show More
In recent years, researchers have begun to pay more attention to the emotion recognition in natural language processing. In order to help this pursuit, this paper proposes a semi-automatic approach to create a Chinese emotion thesaurus with tag of emotion intensity based on two kinds of language resources HowNet and Tongyici Cilin. As a basic emotion resource, the emotion thesaurus should be used ...Show More
In this work, we use Hidden Markov Models (HMM), Conditional Random Field (CRF), Gaussian Mixture Models (GMM) and Mathematical Methods of Statistics (MMS) for Chinese and Japanese text summarization. The purpose of this work is to study the applicability of mentioned three trainable models for cross-language text summarization. For model training, we use several training features such as sentence...Show More

Proceedings

The proceedings of this conference will be available for purchase through Curran Associates.

Natural Language Processing and Knowledge Engineering (NLP-KE), 2009 International Conference on