An official website of the United States government
Here's how you know
Official websites use .gov
A
.gov website belongs to an official
government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you've safely
connected to the .gov website. Share sensitive
information only on official, secure websites.
As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsement of, or agreement with,
the contents by NLM or the National Institutes of Health.
Learn more:
PMC Disclaimer
|
PMC Copyright Notice
Bankruptcy prediction is a long-standing issue that receives significant attention of academic researchers and industry practitioners. Most of the papers on bankruptcy prediction focus on companies that are listed on the stock market, and there are only limited data for the rest of the companies. These companies, not indexed at any stock market, represent a significant part of the economy. The presented dataset consists of financial ratios of Slovak companies. There are 21 distinctive financial ratios which are available for three consecutive years prior to evaluation year in which companies may have filed for bankruptcy or not. The companies come from four different industries - agriculture, construction, manufacture, retail. We provide data for four consecutive years 2013–2016 for each industry. All companies are categorized as small-medium enterprises according to EU classification. Prediction performance results on this dataset are published in the research paper “Bankruptcy prediction for small- and medium-sized companies using severely imbalanced datasets” (Zoričák et al., 2019).
Data were divided into four categories – agriculture, construction, manufacture, and retail All four categories were split into bankrupt and non-bankrupt companies For all industries are provided four years in which are data evaluated
Experimental features
Machine learning methods were used to identify bankrupt companies
The dataset provides financial ratios of an exhaustive set of companies in four sectors: agriculture, construction, manufacture, and retail
•
The data can be used to propose or to benchmark statistical models or machine learning algorithms for bankruptcy prediction
•
The dataset can be used to investigate markers of upcoming bankruptcy
•
The data can be used to benchmark and validate methods for imbalanced learning, since distribution of the bankrupt and non-bankrupt companies is strongly imbalanced
•
Financial ratios of companies are provided for three years prior to the year when the company is evaluated as bankrupt or non-bankrupt.
The dataset is accessible on Data Mendeley [2] and provides financial ratios of limited liability companies. There are three possible views on the data as depicted in Table 1:
1)
Companies are divided into four different industries: agriculture, construction, manufacture, and retail,
2)
Four different evaluation years are considered: 2013, 2014, 2015 and 2016
3)
Two classes are defined (bankrupt (B) and non-bankrupt (NB)) for each evaluation year and industry
Table 1.
Number of bankrupt and non-bankrupt companies per industry and evaluation year.
The distribution of missing values is visualized in a heatmap in the Fig. 1. Missing values for individual datasets are displayed one, two and three years prior to evaluation. The most missing values are for the financial variable Labor Productivity (LP). From the industry perspective, the most missing values are for retail with the year of evaluation 2016.
In order to provide an overview of individual variables, we provide descriptive statistics in the form of boxplots in Fig. 2. All variables include outliers for almost all years. The interquartile range is relatively stable for all variables for all industries.
Financial ratios per industry for three years prior evaluation.
2. Experimental design, materials, and methods
We extracted values from the financial statements of each company for all available years. Financial statements consist of balance sheet and the income statement. A balance sheet provides detailed information regarding assets, equity, and liabilities. The income statement covers revenues, costs, and profit/loss for a given accounting period. Financial statements are publicly accessible on the Register of Financial Statements [3], which is database of financial statements of all business entities operated by Ministry of Finance of Slovak Republic. We used extracted values to calculate the financial ratios listed in Table 1 using Equation (1), (2), (3), (4), (5), (6), (7), (8), (9), (10), (11), (12), (13), (14), (15), (16), (17), (18), (19), (20), (21). Based on the available data, we identified four evaluation years – 2013, 2014, 2015 and 2016. Companies were evaluated and divided into two categories: bankrupt and non-bankrupt. Companies were evaluated based on [4] with two distinctive proceedings defined for companies in financial difficulties. It is either bankruptcy procedure or restructuring. A company which begins the restructuring process may recover its financial health but, nevertheless, poses risk for its creditors. Thus, we classify companies in both the bankruptcy procedure and restructuring process as bankrupt. After classification, we selected only companies with available data for three years prior to the evaluation year. For example, for all companies evaluated in the year 2013 financial ratios are available for years 2012 (R-1), 2011 (R-2), and 2010 (R-3). There are four domains in which the investigated companies operate: agriculture, construction, manufacture and retail. Companies were included in mentioned categories based on their prevailing activity. This information is included in the balance sheet in the form of SK NACE classification.
(1)
(2)
(3)
(4)
(5)
Ldenominator = Short-term Liabilities + Short-term Financial Assistance + Current Bank Loans + Accruals
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)
(18)
(19)
(20)
(21)
Acknowledgments
This work was supported by the Slovak Research and Development Agency, project number APVV-15-0358 and APVV-18-0368.
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
1.Zoričák M., Gnip P., Drotár P., Gazda V. Bankruptcy prediction for small- and medium-sized companies using severely imbalanced datasets. Econ. Modell. 2019 [Google Scholar]
2.Drotár P., Gnip P., Zoričak M., Gazda V. vol. 1. May 2019. (Dataset of Financial Ratios of Slovak Companies). [Google Scholar]