The effectiveness of homogenous ensemble classifiers for Turkish and English texts

Yükleniyor...
Küçük Resim

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

IEEE

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Text categorization has become more and more popular and important problem day by day because of the large proliferation of documents in many fields. To come up with this problem, several machine learning techniques are used for categorization such as naïve Bayes, support vector machines, artificial neural networks, etc. In this study, we concentrate on ensemble of multiple classifiers instead of using only a single one. We perform a comparative analysis of the impact of the ensemble techniques for text categorization domain. To carry out this, the same type of base classifiers but diversified training sets are used which is referred as homogenous ensembles. In order to diversify the training dataset, various ensemble algorithms are utilized such as Bagging, Boosting, Random Subspace and Random Forest. Multivariate Bernoulli Naïve Bayes is preferred as a base classifier due to its superior classification performance compared to the success of the other single classifiers. A wide range of comparative and extensive empirical studies are conducted on four widely-used datasets in text categorization domain in both Turkish and English. Finally, the effectiveness of ensemble algorithms is discussed.

Açıklama

Kilimci, Zeynep Hilal (Dogus Author) -- Akyokuş, Selim (Dogus Author) -- Conference full title: 2016 International Symposium on INnovations in Intelligent SysTems and Applications, INISTA 2016; Sinaia; Romania; 2 August 2016 through 5 August 2016.

Anahtar Kelimeler

Homogeneous Ensembles, Bagging, Random Subspace, Random Forest, Text Categorization, Ensemble Learning

Kaynak

2016 International Symposium on INnovations in Intelligent SysTems and Applications (INISTA)

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Kilimci, Z. H., Akyokus, S., & Omurca, S. İ. (2016). The effectiveness of homogenous ensemble classifiers for Turkish and English texts. In 2016 International Symposium on INnovations in Intelligent SysTems and Applications (INISTA), (pp. 1-7). Piscataway, NJ: IEEE. https://dx.doi.org/10.1109/INISTA.2016.7571854

Onay

İnceleme

Ekleyen

Referans Veren