Abstract:
Purpose – Loan default risk or credit risk evaluation is important to financial institutions which provide loans to businesses and individuals. Loans carry the risk of being defaulted. To understand the risk levels of credit users (corporations and individuals), credit providers (bankers) normally collect vast amounts of information on borrowers. Statistical predictive analytic techniques can be used to analyse or to determine the risk levels involved in loans. This paper aims to address the question of default prediction of short-term loans for a Tunisian commercial bank.
Design/methodology/approach – The authors have used a database of 924 files of credits granted to industrial Tunisian companies by a commercial bank in the years 2003, 2004, 2005 and 2006. The naive Bayesian classifier algorithm was used, and the results show that the good classification rate is of the order of 63.85 per cent. The default probability is explained by the variables measuring working capital, leverage, solvency, profitability and cash flow indicators.
Findings – The results of the validation test show that the good classification rate is of the order of 58.66 per cent; nevertheless, the error types I and II remain relatively high at 42.42 and 40.47 per cent, respectively. A receiver operating characteristic curve is plotted to evaluate the performance of the model. The result shows that the area under the curve criterion is of the order of 69 per cent.
Originality/value – The paper highlights the fact that the Tunisian central bank obliged all commercial banks to conduct a survey study to collect qualitative data for better credit notation of the borrowers.
Paper type: Research paper
Keywords:ROC curveROC curve, Risk assessment Risk assessment, Default risk Default risk, Banking sector Banking sector, Bayesian classifier algorithm Bayesian classifier algorithm.
Resumen:
Propósito – El riesgo de incumplimiento de préstamos o la evaluación del riesgo de crédito es importante para las instituciones financieras que otorgan préstamos a empresas e individuos. Existe el riesgo de que el pago de préstamos no se cumpla. Para entender los niveles de riesgo de los usuarios de crédito (corporaciones e individuos), los proveedores de crédito (banqueros) normalmente recogen gran cantidad de información sobre los prestatarios. Las técnicas analíticas predictivas estadísticas pueden utilizarse para analizar o determinar los niveles de riesgo involucrados en los préstamos. En este artículo abordamos la cuestión de la predicción por defecto de los préstamos a corto plazo para un banco comercial tunecino.
Diseño/metodología/enfoque – Utilizamos una base de datos de 924 archivos de créditos concedidos a empresas industriales tunecinas por un banco comercial en 2003, 2004, 2005 y 2006. El algoritmo bayesiano de clasificadores se llevó a cabo y los resultados muestran que la tasa de clasificación buena es del orden del 63.85%. La probabilidad de incumplimiento se explica por las variables que miden el capital de trabajo, el apalancamiento, la solvencia, la rentabilidad y los indicadores de flujo de efectivo.
Hallazgos – Los resultados de la prueba de validación muestran que la buena tasa de clasificación es del orden de 58.66% ; sin embargo, los errores tipo I y II permanecen relativamente altos, siendo de 42.42% y 40.47%, respectivamente. Se traza una curva ROC para evaluar el rendimiento del modelo. El resultado muestra que el criterio de área bajo curva (AUC, por sus siglas en inglés) es del orden del 69%.
Originalidad/valor – El documento destaca el hecho de que el Banco Central tunecino obligó a todas las entidades del sector llevar a cabo un estudio de encuesta para recopilar datos cualitativos para un mejor registro de crédito de los prestatarios.
Tipo de artículo – Artículo de investigación 1. Introduction
Palabras clave: Curva ROC, Evaluación de riesgos, Riesgo de incumplimiento, Sector bancario, Algoritmo clasificador bayesiano.
Using a naive Bayesian classifier methodology for loan risk assessment. Evidence from a Tunisian commercial bank
Received: 07 February 2016
Accepted: 02 February 2017
Bank credit risk assessment is widely used at banks around the world. As credit risk evaluation is very crucial, a variety of techniques are used for risk level calculation. In addition, credit risk is one of the main functions of the banking community.
According to the Basel Committee on Banking Supervision, credit risk is most simply defined as the potential that a bank borrower or counterparty will fail to meet its obligations in accordance with the agreed terms (Okan, 2007).
Banks classify clients according to their profiles. While classifying, the financial background of the customers and subjective factors related to them are evaluated. Financial ratios play an important role for risk level calculation, as per Berk et al. (2011). These ratios are objective and indicate the financial statement of the business. Balance sheet, income statement and cash flows are some financial statements used for collecting information to calculate objective financial ratios. There are many other subjective factors too; these depend on bank decision strategy and its mission, according to Berk et al. (2011).
In February 2006, the Basel Committee on Banking Supervision issued a consultative document for comment. This document was intended to provide banks and supervisors with guidance on sound credit risk assessment and valuation policies and practices for loans regardless of the accounting framework applied. Principle 3 of thedocumentstatesthat"Abank’s policies should appropriately address validation of any internal credit risk assessment models"[1].
The implementation of this principle turns out to be a daily decision based on a binary classification problem, distinguishing good payers from bad payers. Certainly, assessing the insolvency plays an important role, as a good estimate (related to a borrower) can help to decide whether to grant the requested loans or not. The Basel Committee proposes a choice between two broad methodologies for calculating their capital requirements for credit risk, either external mapping approach or internal rating system.
Although the external mapping approach is difficult to apply because of the unavailability of external rating grades, the internal rating system is easy to implement, as numerous methods have been proposed in the literature to develop credit-risk evaluation models[2]. In fact, credit scoring methods are used to evaluate both objective and subjective factors. These techniques spread all around the world in the 1950s (Abramowicz et al., 2003). Through these methods, information collection from a customer is formalized. Besides, the scoring system forms a basis for loan approval. These models include traditional statistical methods [e.g. logistic regression (Steenackers and Goovaerts, 1989)], nonparametric statistical models [e.g. k-nearest neighbour (Henley and Hand, 1997)] and classification trees (Davis et al., 1992) and neural network (NN) models (Desai et al., 1996; Matoussi and Krichène, 2010). The NN models have served as versatile tools for data analysis in a variety of complex environments. In finance, they have been successfully applied to bankruptcy and loan-default prediction and credit evaluation (West, 2000; Wu and Wang, 2000; Atiya, 2001; Pang et al., 2002; Odom and Sharda, 1990). Recent contributions have proposed using Bayesian classification rules with naive Bayes classifiers. The results of these studies demonstrated their frequent ability to do better than most existing techniques. In this context, Sarkar and Sriram (2001) and Sun and Shenoy (2007) have successfully applied the Bayesian model to bankruptcy prediction. Moreover, Stibor (2010) noted that the naive Bayesian classifier is one kind of Bayesian classifier which is actually known as a simple and effective probability classification method (Jie and Bo, 2011) and works based on applying the theorem of Bayes with strong (naive) independence assumptions. The naive Bayes classifier is particularly appropriate when the dimensionality of the inputs is high. Despite its simplicity, naive Bayes can often outperform more sophisticated classification methods (Hill and Lewicki, 2007). That is why we choose to use them in this study.
Our research question is how banks in Tunisia can develop fairly accurate quantitative prediction models that can serve as very early warning signals for counterparty defaults. Previous works look at business failure prediction from the mid-term and long-term prospects (failure vs non-failure). In our paper, we look at the short-term prospect (payment vs non-payment of the short-term credit at maturity). We also consider the case of a bank that wants to use a prediction model to assess its credit risk (Matoussi et al., 1999; Abid and Zouari, 2000; El-Shazly, 2002; Davutyan and Özar, 2006).
Specifically, we use a naive Bayes classifier model to help the credit-risk manager in explaining why a particular applicant is classified as either bad or good. The NN parameters will be set using an optimization procedure analogous to the gradient – in classical topology –and a feed-forward NN with ad hoc connections.
The remainder of this paper is organized as follows: In Section 2, we provide the conceptual framework and empirical modelling supporting our research question and research design, respectively. In Section 3, we describe the data and methodology. In Section 4, we present our results and their interpretations. Finally, Section 5 concludes the paper and presents some limits.
One of the most fundamental applications of the agency theory to the lender–borrower problem is the derivation of the optimal form of the lending contract. In a debt market, the borrower usually has better information about the project to be financed and its potential returns and risk. The lender, however, does not have sufficient and reliable information concerning the project to be financed. This lack of information in quantity and quality creates problems before and after the transaction takes place. The presence of asymmetric information normally leads to moral hazard and adverse selection problems. This situation illustrates a classical principal–agent problem.
The principal–agent models of the agency theory may be divided into three classes according to the nature of information asymmetry. First, there are moral hazard models, where the agent receives some private information after signing the contract. Moral hazard refers to a situation in which the asymmetric information problem is created after the transaction occurs. As the borrower has relevant information about the project that the lender does not have, the lender runs the risk that the borrower will engage in activities that are undesirable from the lender’s point of view because they make it less likely that the loan will be paid back. These models are qualified as models with ex-post asymmetric information.
Second, we find adverse selection models, where the agent has private information already before signing the contract. Adverse selection refers to a situation in which the borrower has relevant information that the lender lacks (or vice versa) about the quality of the project before the transaction occurs. This happens when the potential borrowers who are the most likely to produce an undesirable (adverse) outcome (bad credit risks) are the ones who are most active to get a loan and are thus most likely to be selected. In the simplest case, lenders’ price cannot discriminate between good and bad borrowers, because the riskiness of projects is unobservable. These models are known as models with ex-ante asymmetric information.
Finally, there are signalling models, in which the informed agent may reveal his private information through a signal which he sends to the principal.
This problem is traditionally considered in the framework of costly state verification, introduced by Townsend (1979). The essence of the model is that the agent, who has no endowment, borrows money from the principal to run a one-shot investment project. The agent is faced with a moral hazard problem. Should he announce the true value or should he lower the outcome of the project? This situation describes ex-post moral hazard. We can also face a situation of ex-ante moral hazard, where the unobservable effort by the agent during project realization may influence the result of the project. Townsend (1979) showed that the optimal contract which solves this problem is the so-called standard (or simple) debt contract. This standard debt contract is characterized by its face value, which should be repaid by the agent when the project is finished. Another theoretical justification for simple debt contract was considered by Diamond (1984), where the costly state verification was replaced by a costly punishment. Hellwig (2000, 2001) showed that the two models are equivalent only under the risk neutrality assumption. However, when we consider the introduction of risk aversion, the costly state verification model still works, but the costly punishment model does not survive.
To overcome the asymmetric information problem and its consequences on credit risk assessment in the real world, banks use either collateral or bankruptcy prediction modelling or both. The next subsection will deal with this aspect.
After the high number of profile bank failures in Asia, research activity on credit risk took a step further. As a result, the regulators recognize the need and urge banks to utilize cutting-edge technology to assess the credit risk in their portfolios. Measuring the credit risk accurately also allows banks to engineer future lending transactions, so as to achieve targeted return/risk characteristics. The assessment of credit risk requires the development of fairly accurate quantitative prediction models that can serve as very early warning signals for counterparty defaults (Atiya, 2001)[3].
Many researchers proposed two main approaches to deal with credit scoring in the literature. In the first approach, i.e. the structural or market-based models, the default probability derivation is based on modelling the underlying dynamics of interest rates and firm characteristics. This approach is based on the asset value model originally proposed by Merton (1974), where the default process is endogenous, and relates to the capital structure of the firm. Default occurs when the value of the firm’s assets falls below some critical level. In the second approach, i.e. the empirical or accounting-based models, instead of modelling the relationship of default with the characteristics of a firm, this relationship is learned from the data. Raymond (2007), Thomas et al. (2002) and Galindo and Tamayo (2000) synthesized some methods used in this context. In this regard, we can cite the work of Beaver (1963) and Altman (1968). Bankruptcy prediction has been studied actively by academics and practitioners. Many models have been proposed and tested empirically. Altman’s popular Z-score (Altman, 1968) is an example, which was based on linear discriminant analysis and was used to predict the probability of default of firms. Ohlson’s O-score (Ohlson, 1980) is based on generalized linear models. Generalized linear models or multiple logistic regression models have been used either to identify the best determinants of bankruptcy or the predictive accuracy rate of their occurrence. NN models were adapted and used in bankruptcy prediction. Their high power of prediction makes them a popular alternative with the ability to incorporate a very large number of features in an adaptive nonlinear model (Kay and Titterington, 2000).
A lot of research studies have focused on the nonparametric methods class [e.g. k-nearest neighbour (Henley and Hand, 1996), decision trees (Quinlan, 1992) and NNs (Mcculloch and Pitts, 1943)], which also have been largely applied in the field of credit scoring. There are also some other approaches that combine several techniques to create a classification model, such as the Support Vector Machine (Lee and Chen, 2005; Lee et al., 2002). Antonietta and Paolo (2003) developed a Bayesian regression model to predict the credit risk of companies classified in different sectors. Maltritz and Molchanov (2008) proposed a Bayesian model to find the variables which are most likely to determine country default risk in emerging markets. The collection provided by Bocker (2010) also includes several studies on Bayesian credit risk modelling. For example, Jacobs and Kiefer (2010) present a step-by-step guide to Bayesian analysis in the default setting, including details on elicitation of expert information. According to Miguéis et al. (2012), despite the intense study of credit scoring, there is no consensus on the most appropriate classification technique to use’. Baesens et al.(2003) revealed that some conflicts can occur when comparing the findings of different studies. However, Thomas et al. (2002) also suggested that most methods applied in credit scoring have similar levels of performance. In fact, for banks and financial institutions, the reasons that may motivate the preference for certain methods are their interpretability and the transparency (Martens et al., 2009). According to Miguéis et al. (2012), "two aspects of methods for credit scoring are very important: that is the predictive performance, as well as the insights or interpretations that are revealed by the model".
Banks are in a very competitive environment; therefore, the service quality during credit risk assessment is very important. When a customer demands credit from a bank, the bank should evaluate the credit demand as soon as possible (Berk et al., 2011) to gain competitive advantage. Additionally, for each credit demand, the same process is repeated and constitutes a cost for the bank. Due to the importance of credit risk analysis, most techniques and models are developed by financial institutions to help them decide whether to grant or not to grant credit (Çinko, 2006).
In this section, we briefly review the implementations of the binary classification using the naive Bayes algorithm. In fact, the naive Bayes algorithm is a classification algorithm based on Bayes rule, which assumes the attributes X1...Xn are all conditionally independent of one another, given Y. The value of this assumption is that it dramatically simplifies the representation of P(X/Y), and the problem of estimating it from the training data, according to Mitchell (2010).
A Bayesian network (BN) represents a joint probability distribution over a set of continuous inputs (attributes) Xi. In this case, designing a naive Bayes classifier is based on the use of equation (1):
(1)Given a new occurrence Xnew = (X1 . . . Xn), equation (1) shows how to estimate the probability that Y will take on any given value, given the observed input values of Xnew and given the distributions P(Y) and P(Xi/Y) estimated fromthe training data. Ifwe are interested only in the most probable value of Y, then we have the naive Bayes classification rule as shown in equation (2):
(2)However, according toMitchell (2010), "when the Xi are continuous we must choose some other way to represent the distributions P(Xi/Y)". One common approach is to assume that for each possible discrete value yk of Y, the distribution of each continuous Xi is Gaussian, and is defined by a mean and standard deviation specific to Xi and yk. To train such a naive Bayes classifier, we must, therefore, estimate the mean and standard deviation of each of these Gaussians:
(3)
(4)For each input Xi and each possible value yk of Y, note that there are 2nK of these parameters, all of which must be estimated independently. Certainly, we have to estimate the priors on Y as well:
(5)The above model summarizes a naive Bayes classifier, which assumes that the data X are generated by a mixture of class-conditional (i.e. dependent on the value of the class variable Y) Gaussians. Furthermore, the naive Bayes assumption introduces the additional constraint that the input values Xi are independent of one another within each of these mixture components.
A receiver operating characteristic (ROC) curve is generally a useful performance-graphing method. In other words, an ROC graph is a method for visualizing, organizing and selecting classifiers based on their performance (Fawcett, 2006). Spackman (1989) was the earliest adopters of ROC graphs in machine learning. He demonstrated the value of ROC curves in evaluating and comparing algorithms (Fawcett, 2006). In fact, the use of ROC graphs in the machine-learning community has increased in recent years, as simple classification accuracy is often a poor metric for measuring performance (Provost and Fawcett, 1997; Provost et al., 1998). Besides, they have properties that make them especially useful for domains with skewed class distribution and unequal classification error costs (Fawcett, 2006; Figure 1).
An ROC curve is a two-dimensional representation of classifier performance. According to Fawcett (2006), "to compare classifiers we may want to reduce ROC performance to a single scalar value representing expected performance". To do so, many researchers, such as Bradley (1997) and Hanley and McNeil (1982), recommend the use of a common method – which is to calculate the area under the ROC curve, abbreviated AUC. The AUC is defined as a portion of the area of the unit square; its value will always be between 0 and 1.0. However, because random guessing produces the diagonal line between (0, 0) and (1, 1), which has an area of 0.5, no realistic classifier should have an AUC less than 0.5 (Fawcett, 2006).
The need for models that predict defaults correctly is very high because in commercial banks, credit risk measurement is crucial to discriminate reliable clients from the non-reliable ones. Among the quantitative methods for solving credit risk evaluation problems, the simple Bayesian classifier was applied for estimating the posterior probabilities of default. In fact, Antonakis and Sfakianakis (2009) showed that the posterior probability of an event is the probability of an event after collecting some empirical data. Rosner (2006) demonstrated that the posterior probability is obtained by integrating information from the prior probability with additional data related to the event in question. According to Mileris (2010):
[. . .] often analysis begins with initial or prior probability estimates for specific events of interest. Then from sources such as a sample we obtain additional information about the events. Given this new information the prior probability values can be updated by calculating revised probabilities, referred to as posterior probabilities.
Anderson et al. (2007) demonstrated that the Bayesian theoremprovides a means for making these probability calculations.
In our experiment, we use a sample of bank credit files and divide them into two subsamples. The first sub-sample is composed of 924 files of short-term loans granted to Tunisian companies in the years 2003, 2004, 2005 and 2006.
Let’s recall that our objective is to use the naive Bayes classifier methodology for default prediction of a bank’s commercial loans. However, to solve a problem using the Bayesian algorithm, we need to gather data for training purposes. The training data set includes a number of cases, each containing values for a range of input and output variables. The first decision we need to make is which variables to use. The second one concerns the subjects whose behaviour we want to predict. For our case, the variables are indicators of default risk and the subjects are borrowers. The data collected for our investigation came from a large private commercial bank (BIAT). We chose a private bank to avoid the potential inefficiency of the public banking sector, whose decisions are sometimes dictated by government choices. We also chose to work with short-term commercial loans because they represent the largest part of loans and are subject to renewal every year[4].
We have used a database of 924 files of credits granted to industrial Tunisian companies by a commercial bank in the years 2003, 2004, 2005 and 2006. This period was chosen because it corresponds to a central bank instruction, in which it asks banks to provide credit risk classes for their borrowers. In the case of the BIAT, by the end of every quarter, it classifies these files into five clusters, each one corresponding to a risk class. Files without delay of payment correspond to the healthy firms. The four remaining classes correspond to four riskier classes of firms with three months, six months, nine months and one year (or more) delay of payment, respectively. We group these four classes in one class: risky companies.
Our dependant variable is the probability of default. We use a dummy variable, Y, which equals 0 if the firmis classified as healthy and 1 otherwise. Hence:
Y = 0 if no delay of payment
Y = 1 if more there is more than a three-month delay
Default risk prediction relies, in general, on a good appraisal of the couple risk-return of a company. Financial ratios drawn from financial statements (balance sheet, income and cash flow statement) are usually used. Financial ratio analysis groups the ratios into categories which tell us about different facets of a company's finances and operations (liquidity, activity or operational, leverage and profitability).
In our experiment, we retain 24 financial and nonfinancial indicators, 22 of them are financial ratios and 2 are not. The financial indicators are inspired from Altman’s popular Zscore and recommended textbooks in the field of financial statement analysis and valuation (Berstein and Wild, 1998; Revsine et al., 1999; and Palepu et al., 2000). The financial indicators measure liquidity (working capital, operating activity and cash flow), leverage, long-term solvency and profitability. The nonfinancial variables used in this research are firm size and collateral (Table I).
To get a better idea about our data before running the naive Bayes classifier models, we will perform a test of mean differences between the two risk classes defined above (Table II). The summary statistics and the mean differences can be seen as an analysis similar to that of Beaver (1963). Table II presents the descriptive statistics of our data. When we run the mean difference analysis between the two risk classes (healthy and risky groups), this analysis can give us a flavour of our data, as such an analysis allows us to verify if there is a difference between the two classes in terms of financial ratios. Table II recalculates some summary statistics for the two risk classes.
Table II shows significant mean differences between the two groups for some ratios (R2, R4, R6, R8, R11, R12, R14, R15, R18, R21 and R22) and no significant differences for others (R1, R3, R5, R7, R9, R10, R11, R13, R16, R17, R19 and R20). Globally, they tell us that the liquidity risk does not differentiate the two groups. The leverage and solvency ratios do better in differentiating the two groups. For other indicators (coverage and profitability), the results are mitigated. For example, while return on equity (R18) shows a significant difference, gross profit margin (R16) and return on invested capital (R17) do not.



When we look at the relevance of the mean differences, we realize that, globally, the good indicators are superior in the healthy group, while the bad indicators are higher in the risky group. For example, the mean of cash flow (R6), working capital requirement (R2) and leverage and solvency (R11, R12, R14 and R8) ratios is bigger in the healthy group. Current ratio (R4) and profitability ratios (R18 and R15) have a higher mean in the risky group.
Let us see now if Bayes models do a better job in predicting default risk.
Bayesian classifiers can predict class membership probabilities, such as the probability that a given sample belongs to a particular class. Bayesian classifier is based on Bayes’ theorem. Naive Bayesian classifiers assume that the effect of an attribute value on a given class is independent of the values of the other attributes. This assumption is called class-conditional independence.
Initially, we used a database composed of 32 independent variables. We eliminated eight ratios which posed the problem of multi-collinearity. These ratios are detected by the software. The database used in this work is composed of 24 variables which do not pose the multi-collinearity problem.
Panels 1 and 2 of Tables III and IV show the results for the first and second Bayesian models.
We can see from these results (Panels 1 and 2) that the global good classification rate is getting better when we introduce indicators relating to cash flow, firm size and guarantee. In fact, the best good classification rates are of the order of 59.63 and 63.85 per cent, respectively, for the two models (non-cash flow and full information). A lot of research has examined the criterion of type I and II errors.
According to Yang (2002), type I error rate is also called a rate or credit risk; it is the rate of bad customers being categorized as good. When this happens, the misclassified bad customers will become default. Therefore, if a credit institution has a high rate, which means the credit granting policy is too generous, it is exposed to credit risk.


Type II error rate is also called a commercial risk; it is the rate of good clients being classified as bad. When this happens, the misclassified good clients are rejected, and the bank, therefore, supports (endures) an opportunity cost caused by the loss of good customers. Bogess (1967) showed that if a credit institution has a high type II error for a long period, which means it follows a restrictive credit granting policy for a long time (Yang, 2002), it may lose its share in the market. The credit institution is, thus, exposed to commercial risk.
In this study, we find that error type I is very high for the firstmodel of the first panel (non-cash flow Bayesmodel). In fact, this rate is of the order of 52.79 per cent. Table V shows these results. The introduction of cash flow variables (Panel 2) improved the results. The good classification rate got better. In addition, the model based on the full information indicator reduced error type I to 39.27 per cent (cf. Table VI) but error type II increased to 32.97 per cent.
In this research, we would like to assess credit risk using a selection of financial ratios recommended in debt contracts. The predictions on the selection of financial ratios illustrate the relation between financial ratios and credit risk. This evidence is well known in the practitioner and academic literature (Demerjian, 2007). In fact, textbooks emphasize the role of ratios in evaluating credit quality (Lundholm and Sloan, 2004), while academic studies conclude that financial ratios serve to provide signals about borrower credit risk when used as covenants (Smith and Warner, 1979; and Dichev and Skinner, 2002).
Table VI shows the working capital requirement (R2), account receivable liquidity(R3), cash flow ratio(R6), debt cash flow coverage ratio (R8), gross profit margin(R16), fixed asset to debt ratio (R19) and guarantee (V02).
In other words, ratios R2, R3, R6, R8, R16, R19 and V02 capture five aspects of credit risk:



Moreover, the evidence also shows that liquidity, cash flow and profitability leverage are the most informative (enlightening) about the failure of corporate borrowers, with a p-value of 0.
The inventory turnover (R7) and interest coverage (R14) ratios, related, respectively, to liquidity and solvency, contribute less than the other ratios cited previously in the prediction of credit risk, with a p-value of the order of 0.014 and 0.038, respectively, for R7 and R14.
Finally, R1, related to long-term financing of working capital, is less informative with a p-value of 0.09.
Regarding the other variables used in this study, R4, R5, R9, R10, R11, R12, R13, R15, R17, R18, R20, R21 and V01, the results are not significant with a very high p-value, varying from0.82 (R18) to 0.13 (R5 and R20).
An ROC curve for the perfect classifier, which orders all bad cases before good cases, is the curve that follows the two axes. It would classify 100 per cent bad cases into class bad and 0 per cent good cases into class bad for some value of the sill. According to Yang (2002):
[. . .] a classifier with a ROC curve which follows the 45° line would be useless. It would classify the same proportion of the bad cases and good cases into the class bad at each value of the threshold; it would not separate the classes at all. Real-life classifiers produce ROC curves which lie between these two extremes.

To evaluate the performance of the curve, we have to use a measure given by the AUC (Hand, 1997). The curve that has a larger AUC is better than the one that has a smaller AUC (Figure 2).
We can note that the criterion of AUC is of the order of 69 per cent. This score is superior to 50 per cent, but it is not considered a very good score.
By comparing ROC curves, we can learn the difference in classification precision between two or more classifiers. The higher curve will be nearer the perfect classifier and will have more accuracy. In this context, by using the same data and carrying out an NN model with hidden layers, we can compare the classification accuracy between the two models – naive Bayes and the NN model (Figure 3).
We can conclude that the NN model outperforms the naive Bayes model because the criterion of AUC for the NN model is 83.2 per cent (yellow curve) and that of the Bayes model is 69 per cent (green curve).
We note that the out-of-sample validation will be done on a second sub-sample, which contains data on 150 files of short-term loans granted to industrial Tunisian companies in 2006. Table VII displays the sub-sample repartition (Table VIII).



Table IX presents the results of the validation test; we can see the corresponding good classification rate of the order of 58.66 per cent. The error types I and II remain relatively high at 42.42 and 40.47 per cent, respectively (Table X).
Commercial banks that grant client borrower loans need consistent models that can correctly detect and predict defaults. Moonasar (2007) emphasized that one of the basic tasks which any bank has to deal with, in the current competitive and turbulent business environment, is to reduce its credit risk. Traditionally, we used scoring methods to estimate the creditworthiness of a credit applicant. In fact, the quantitative method known as credit scoring has been developed for the credit assessment problem (Yang, 2002). Credit scoring is basically an application of classification methods, which classify borrowers into different risk groups. The objective of scoring methods is to predict the probability that an applicant or existing borrower will default (Komor¨ad, 2002). In credit risk evaluation, credit scoring is a key method that helps financial institutions to make a decision whether or not to grant credit to a customer (Thomas, 2002). According to Moonasar (2007):
[. . .] a common approach of credit scoring is to apply a classification technique on data of previous customers (both good credit customers and delinquent customers) in order to find a relationship between the customers’ characteristics and potential failure to service their debt. Institutions use credit scoring techniques (utilizing information from the consumers’ past credit history and current economic conditions) to determine which applicants will pay back their liabilities.
An accurate classifier is necessary to discriminate between new potential good and bad credit applicants.
In this paper, we tried to assess the credit risk for a Tunisian bank by modelling the default risk of its commercial loans. We used a database of 924 credit files during the years 2003, 2004, 2005 and 2006. Input variables were classified into two categories: non-cash flow ratios and cash flow ratios.
The main results show that the introduction of cash flow variables improves the prediction quality, and the classification rates passed from 59.63 to 63.85 per cent, respectively, in the non-cash flow and cash flow models. Moreover, collateral played an important role in default risk prediction. In fact, this indicator has an explanatory capacity to predict the credit risk. To evaluate the performance of the model, an ROC curve was plotted. The result shows that the AUC criterion is of the order of 69 per cent. Using the same data, this criterion is improved and passed to 83 per cent when we used the NN methodology.
Our study is, however, incomplete in the sense that we used only quantitative variables and it did not show the importance of qualitative variables based on strategic data in completing the financial analysis and assessing the solidity of a borrower. We note that the Tunisian central bank obliged all commercial banks to conduct a survey study to collect qualitative data for better credit notation of the borrowers.















