I. Introduction
Nowadays we are faced with a vast amount of online digitalized information. As the amount continues to grow, it becomes more and more difficult to find what we are looking for but, it will be way more facile if we could look for our needed information by exploring based on thematic data instead of raw data. Probabilistic topic modeling introduces methods which can extract thematic structure of documents. The basic idea of these methods is that a document is a mixture of latent topics and each topic is a distribution over words. Suppose we have documents where each document consists of words and such that there are topics and unique words . The topic assigned to each word is denoted by • Based on this view we can approach the problem of extracting topics of a corpus like this: each topic is a distribution over words where the words are exchangeable i.e. each document is a bag of words. Documents are also exchangeable. Each word in each document is extracted from the distribution of its assigned topic. For each document there is a distribution over topics which shows how the topics have been mixed to produce the document. Then there are two parameters in model; distribution of words in topics and distribution of topics in documents .