Performance Comparison of Logistic Regression and Classification Regression tree Models for Binary Dependent Variable

Document Type : Scientific Research

Author

Abstract

This paper describes the performance analysis of two classifier models common in statistics and data mining on binary dependent variable, binary Logistic Regression (B.LR) and Classification Regression Tree (CART). The evaluation method is using all data in training stage. The using data set is from “Evaluation of patients with Jaundice on children” report. Data set is collection of categorical and continues independent variables. The classification performance of two classifiers is presented by using statistical performance measures like accuracy, specificity and sensitivity. Experimental result showed that accuracy of LR is more than 83% and CLASSIFICATION AND REGRESSION TREE is nearly 73%. So the sensitivity measure for BINARY LOGISTIC REGRESSION is nearby 77% and 66% for CLASSIFICATION AND REGRESSION TREE as well the specificity scale is 85% for BINARY LOGISTIC REGRESSION and 76% for CLASSIFICATION AND REGRESSION TREE. The result shows the performance of BINARY LOGISTIC REGRESSION classifier is found to be better than CLASSIFICATION AND REGRESSION TREE.

Keywords


[1] Jiwaei Han, Kamber Micheline, Jian Pei Data mining: Concepts and Techniques, Morgam Kaufmann Publishers (Mar 2006).
[2] Pakgohar, Alireza. Statistical applications in data mining: special view in logistic regression. Islamic Azad University, branch of Mashad. department of Science. M.A degree thesis. 2006. [Persian language].
[3] Pakgohar, Alireza. Evaluation of patients with gastroenteritis, Pneumonia and Jaundice on children, Payame Noor University, Report. 2012. [Persian Language].
[4] SPSS 18(PASW) help file. http//www-.spss.com
[5] Pakgohar, Alireza. Tabrizi, Reza Sigari. Khalili, Mohadeseh. Esmaeili, Alireza. The role of human factor in incidence and severity of road crashes based on the CART and LR regression: a data mining approach, Procedia Computer Science, Volume 3, 2011, Pages 764-769, ISSN 1877-0509, 0.1016/j.procs.2010.12.126.
[6] Alaa M. Elsayad “Predicting the severity of breast masses with ensemble of Bayesian classifiers” journal of computer science 6 (5): 576-584, 2010, ISSN 1549-3636.