• More details about the paper
    • Presentation date: 1391/01/01
    • Publication date on TPBin: 1391/01/01
Clustering is a data mining task that aims to divide a set of objects into groups so that similar objects fall into the same group and objects with different features are placed in separate groups. This paper presents a technique for semantic word clustering, an application of data mining techniques to natural language processing. Word clustering is used in various areas of text mining, such as word disambiguation, information retrieval, language modelling, and text classification. This paper proposes a graph-based method for clustering Persian words. The proposed method is a type of pattern-based clustering and consists of two parts. In the first part, a word co-occurrence graph is built using statistical similarity measures such as chi-square, pointwise mutual information (PMI), and cosine. In the second part, the graph is divided into appropriate clusters by Newman's graph clustering algorithm. Our research shows that chi-square is the best of these measures for clustering Persian words.
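The two-part pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: co-occurrence is counted at sentence level, only positively associated pairs become edges, cosine is left out for brevity, and "Newman's graph clustering algorithm" is read as Newman's greedy modularity merging, since the abstract does not say which variant was used. All function names are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations

def cooccurrence_counts(sentences):
    """Count the sentences containing each word and each unordered word pair."""
    word_count, pair_count = Counter(), Counter()
    for sent in sentences:
        words = sorted(set(sent))
        word_count.update(words)
        pair_count.update(combinations(words, 2))
    return word_count, pair_count

def pmi(n_ab, n_a, n_b, n):
    """Pointwise mutual information: log2(P(a,b) / (P(a) * P(b)))."""
    return math.log2((n_ab * n) / (n_a * n_b))

def chi_square(n_ab, n_a, n_b, n):
    """Pearson chi-square statistic for the 2x2 table of words a and b."""
    o11, o12 = n_ab, n_a - n_ab
    o21, o22 = n_b - n_ab, n - n_a - n_b + n_ab
    denom = n_a * n_b * (n - n_a) * (n - n_b)
    return n * (o11 * o22 - o12 * o21) ** 2 / denom if denom else 0.0

def word_graph(sentences, measure, threshold=0.0):
    """Part 1: weighted co-occurrence graph as {(word_a, word_b): score}."""
    n = len(sentences)
    word_count, pair_count = cooccurrence_counts(sentences)
    edges = {}
    for (a, b), n_ab in pair_count.items():
        # Assumption: drop pairs that co-occur no more often than chance.
        if n_ab * n <= word_count[a] * word_count[b]:
            continue
        score = measure(n_ab, word_count[a], word_count[b], n)
        if score >= threshold:
            edges[(a, b)] = score
    return edges

def greedy_modularity(edges):
    """Part 2: merge communities greedily while modularity increases."""
    two_m = 2 * sum(edges.values())
    comm, a = {}, Counter()
    for (u, v), w in edges.items():
        comm.setdefault(u, u)
        comm.setdefault(v, v)
        a[u] += w / two_m  # a[c]: fraction of edge ends attached to community c
        a[v] += w / two_m
    members = {c: {c} for c in comm}
    while True:
        between = Counter()  # e_ij for each pair of connected communities
        for (u, v), w in edges.items():
            cu, cv = comm[u], comm[v]
            if cu != cv:
                between[tuple(sorted((cu, cv)))] += w / two_m
        best, best_gain = None, 0.0
        for (ci, cj), e_ij in between.items():
            gain = 2 * (e_ij - a[ci] * a[cj])  # modularity change of the merge
            if gain > best_gain:
                best, best_gain = (ci, cj), gain
        if best is None:
            return list(members.values())
        ci, cj = best
        for node in members[cj]:
            comm[node] = ci
        members[ci] |= members.pop(cj)
        a[ci] += a.pop(cj)
```

Calling `greedy_modularity(word_graph(sentences, chi_square))` on a list of tokenized sentences returns a list of word sets. Passing `pmi` instead of `chi_square` swaps the similarity measure, which is the comparison the abstract reports. The merge loop recomputes all inter-community weights on every pass, which keeps the sketch short but would be too slow for a realistic vocabulary.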
