Comparing alternative classifiers for database marketing: the case of imbalanced datasets

Yükleniyor...
Küçük Resim

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Elsevier

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

There are various algorithms used for binary classification where the cases are classified into one of two non-overlapping classes. The area under the receiver operating characteristic (ROC) curve is the most widely used metric to evaluate the performance of alternative binary classifiers. In this study, for the application domains where the high degree of imbalance is the main characteristic and the identification of the minority class is more important, we show that hit rate based measures are more correct to assess model performances and that they should be measured on out of time samples. We also try to identify the optimum composition of the training set. Logistic regression, neural network and CHAID algorithms are implemented for a real marketing problem of a bank and the performances are compared.

Açıklama

Anahtar Kelimeler

Database Marketing, Imbalance Datasets, Propensity Modeling, Performance Measures

Kaynak

Expert Systems with Applications

WoS Q Değeri

Scopus Q Değeri

Cilt

39

Sayı

1

Künye

DUMAN, E., EKİNCİ, Y., TANRIVERDİ, A. (2012). Comparing alternative classifiers for database marketing: the case of imbalanced datasets, Expert Systems with Applications, 39 (1), pp. 48-53. https://dx.doi.org/10.1016/j.eswa.2011.06.048.

Onay

İnceleme

Ekleyen

Referans Veren