Using Bayesian networks for bankruptcy prediction: Some methodological issues |
| |
Authors: | Lili Sun Prakash P. Shenoy |
| |
Affiliation: | 1. Accounting and Information Systems, Rutgers, The State University of New Jersey, 180 University Ave, Newark, NJ 07102-1897, USA;2. School of Business, University of Kansas, 1300 Sunnyside Ave, Summerfield Hall, Lawrence, KS 66045-7585, USA |
| |
Abstract: | This study provides operational guidance for building naïve Bayes Bayesian network (BN) models for bankruptcy prediction. First, we suggest a heuristic method that guides the selection of bankruptcy predictors. Based on the correlations and partial correlations among variables, the method aims at eliminating redundant and less relevant variables. A naïve Bayes model is developed using the proposed heuristic method and is found to perform well based on a 10-fold validation analysis. The developed naïve Bayes model consists of eight first-order variables, six of which are continuous. We also provide guidance on building a cascaded model by selecting second-order variables to compensate for missing values of first-order variables. Second, we analyze whether the number of states into which the six continuous variables are discretized has an impact on the model’s performance. Our results show that the model’s performance is the best when the number of states for discretization is either two or three. Starting from four states, the performance starts to deteriorate, probably due to over-fitting. Finally, we experiment whether modeling continuous variables with continuous distributions instead of discretizing them can improve the model’s performance. Our finding suggests that this is not true. One possible reason is that continuous distributions tested by the study do not represent well the underlying distributions of empirical data. Finally, the results of this study could also be applicable to business decision-making contexts other than bankruptcy prediction. |
| |
Keywords: | Bankruptcy prediction Bayesian networks Naï ve Bayes Variable selection Discretization of continuous variables |
本文献已被 ScienceDirect 等数据库收录! |
|