Exploiting Turkish Wikipedia as a semantic resource for text classification

Yükleniyor...
Küçük Resim

Tarih

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

IEEE

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Majority of the existing text classification algorithms are based on the "bag of words" (BOW) approach, in which the documents are represented as weighted occurrence frequencies of individual terms. However, semantic relations between terms are ignored in this representation. There are several studies which address this problem by integrating background knowledge such as WordNet, ODP or Wikipedia as a semantic source. However, vast majority of these studies are applied to English texts and to the date there are no similar studies on classification of Turkish documents. We empirically analyze the effect of using Turkish Wikipedia (Vikipedi) as a semantic resource in classification of Turkish documents. Our results demonstrate that performance of classification algorithms can be improved by exploiting Vikipedi concepts. Additionally, we show that Vikipedi concepts have surprisingly large coverage in our datasets which mostly consist of Turkish newspaper articles.

Açıklama

Ganiz, Murat Can (Dogus Author), Akyokuş, Selim (Dogus Author) -- Full conference title: INISTA 2012: International Symposium on Innovations in Intelligent Systems and Applications: 2-4 July, 2012: Trabzon, Turkey

Anahtar Kelimeler

Textual Data Mining, Text Classification, Turkish Text Classification, Wikipedia, Vikipedi, Semantic Algorithms

Kaynak

INISTA 2012: International Symposium on Innovations in Intelligent Systems and Applications

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Poyraz, M., Ganiz, M. C., Akyokuş, S., Görener, B., & Kilimci, Z. H. (2012). Exploiting Turkish Wikipedia as a semantic resource for text classification. In INISTA 2012: International Symposium on Innovations in Intelligent Systems and Applications (5p.). [Piscataway, NJ]: IEEE. https://dx.doi.org/10.1109/INISTA.2012.6246996

Onay

İnceleme

Ekleyen

Referans Veren