A simple semantic kernel approach for SVM using higher-order paths
Citation
Altınel, B., Ganiz, M. C., & Diri, B. (2014). A simple semantic kernel approach for SVM using higher-order paths. In 2014 IEEE International Symposium on Innovations in Intelligent Systems and Applications (INISTA) Proceedings (pp. 431-435). Piscataway, NJ: IEEE. https://dx.doi.org/10.1109/INISTA.2014.6873656.Abstract
The bag of words (BOW) representation of documents is very common in text classification systems. However, the BOW approach ignores the position of the words in the document and more importantly, the semantic relations between the words. In this study, we present a simple semantic kernel for Support Vector Machines (SVM) algorithm. This kernel uses higher-order relations between terms in order to incorporate semantic information into the SVM. This is an easy to implement algorithm which forms a basis for future improvements. We perform a serious of experiments on different well known textual datasets. Experiment results show that classification performance improves over the traditional kernels used in SVM such as linear kernel which is commonly used in text classification.
Source
IEEE International Symposium on Innovations in Intelligent Systems and Applications (INISTA) ProceedingsCollections
Related items
Showing items related by title, author, creator and subject.
-
A novel semantic smoothing kernel for text classification with class-based weighting
Altınel, Berna; Diri, Banu; Ganiz, Murat Can (Elsevier, 2014-12-24)In this study, we propose a novel methodology to build a semantic smoothing kernel to use with Support Vector Machines (SVM) for text classification. The suggested approach is based on two key concepts; class-based term ... -
Exploiting Turkish Wikipedia as a semantic resource for text classification
Poyraz, Mitat; Ganiz, Murat Can; Akyokuş, Selim; Görener, Burak; Kilimci, Zeynep Hilal (IEEE, 2012)Majority of the existing text classification algorithms are based on the "bag of words" (BOW) approach, in which the documents are represented as weighted occurrence frequencies of individual terms. However, semantic ... -
Wikipedia based semantic smoothing for Twitter sentiment classification
Torunoğlu, Dilara (Doğuş Üniversitesi Fen Bilimleri Enstitüsü, 2013-06)Sentiment classification is one of the important and popular application areas of text classification in which texts are labeled as positive and negative. Moreover, Naive Bayes (NB) is one of the mostly used algorithms in ...