Skip to main content
F1000Research logoLink to F1000Research
. 2023 Apr 14;12:405. [Version 1] doi: 10.12688/f1000research.132949.1

Dataset of companies’ profitability, government debt, financial statements' key indicators and earnings in an emerging market: Developing a panel and time series database of value-added tax rate increase impacts

Mahfoudh Mgammal 1,2,a, Ebrahim Al-Matari 1,2
PMCID: PMC10905151  PMID: 38434636

Abstract

Company profitability is a crucial indicator that can be used for developing and sustaining trust in accounting information and, thus, inefficient capital markets. Companies with good financial statements’ key indicators have a more extensive customer base and can diversify their revenue streams, making them more resilient to economic downturns. Assembling and managing taxes is a critical underpinning to protecting a country’s financial intensity and developing a country’s tax-system. VAT is a primary source of financial gain in developing nations, which differs from economic income in developed countries, where economic income is primarily derived from tax income. In emerging economies, the existing practice requires firms to effectively and efficiently publish annual-reports and indicators on market-websites, as users rely heavily on timely-information and need it to make decisions. However, these practices fell short of expectations, requiring more research. These variables are crucial for most accounting/economics/taxation research models and the lack of easily attainable data in well-known databases (e.g., ARGAAM; DataStream). This article is primarily a dataset for analysing taxation, performance variables, and key financial-statement indicators. The data describes the raw, combined, and filtered information at the company level, such as company profit and government debt in Saudi Arabia. It combines a firm-level panel dataset sample of company profit that its measures scaled by total assets and include: earnings before interest, taxes, decrease and amortisation, earnings before interest and taxes, earnings after taxes and earnings before taxes—moreover, the time series dataset sample of 11 financial statements’ key indicators. The dataset results from 494 company-year observations (226-panel data sample and 268-time series data sample) from 2019 to 2020. Data has been collected from taxation reports, corporate annual reports, ARGAAM database, FinBox database, the Trading Economics database and the Tadawul-market website in Saudi Arabia.

Keywords: Earnings; Government Debt; Performance; Taxation; Shareholders' Equity; Profitability

Introduction

The reasoning and setting behind the creation of this dataset are to help the researcher to perform a comparative analysis across different disciplines to study the role of these variables in a specific phenomenon or the factors enlightening value-added tax (VAT) usefulness and effectiveness. For example, estimate the consequences of a VAT rate increase on profitability, government debt etc. This dataset helps complete two of the few studies investigating the VAT effect in the Kingdom of Saudi Arabia (KSA). There was an encouragement to observe the effect of imposing the new 15% VAT on the profitability of nonfinancial Saudi-listed companies. This dataset is in two types: a firm-level panel dataset sample and a time series sample ( Mgammal, 2021; Mgammal, Al-Matari, and Alruwaili, 2023).

Methods

This dataset article contains proxies to measure the economic impacts of VAT as used by several prior researchers ( Mgammal, 2021). The representatives for measuring the economic effects of VAT were manually collected and built using secondary data obtained from publicly available data from 2019 to 2020. In Table 1, ProFtEBITDA means company profit measured by earnings before interest, taxes, devaluation, and amortisation (EBITDA) and scaled by total assets and data collected from FinBox database tools. Data were hand collected from companies’ tax reports. We specify companies and identify them with inclusion/exclusion criteria, as mentioned in Table 3. The inclusion/exclusion criteria of all data in this article are as follows: we included nonfinancial companies and excluded finance firms. Then we filtered the sample by excluding companies with annual reports unavailable for two years. The fiscal year-end date is not 31/12/of each year, and the accounting period is over 12 months. Consequently, the final dataset is 494 company-year observations (226-panel data sample and 268-time series data sample) from 2019 to 2020. This final sample is the foundation that can be used for analysis in future research. SIZE2020 and SIZE2019 are the mean company size in 2020 and 2019, measured by the natural logarithm of total assets and data gathered from companies’ annual reports. EAT2020 and EAT2019 are the mean company earnings after tax in 2020 and 2019 and were measured by deducting all expenditures and revenue taxes from the business’s revenues. Following prior research, two variables were added, GvD2020 and GvD2019, which meant government debt in 2020 and 2019 and was measured as government debt over the gross domestic product (GDP). Data for this were extracted from the Trading Economics ( https://tradingeconomics.com/) database. Regarding the process for accessing the data, the data was hand collated directly from the Trading Economics website using many Saudi Arabia Indicators reports, especially GDP Indicators reports. In this context, the data collected from the Trading dataset are the same inclusion/exclusion criteria as mentioned above, and we started data collection on 23/01/2021 to 30/03/2021.

Table 1. Definitions of the variables.

Variables Definition
ProFtEBITDA Company Profit
SIZE2020 Company Size in 2020
EAT2020 Company Earnings after tax in 2020
GvD2020 Government Debt 2020
EAT2019 Company Earnings after tax in 2019
SIZE2019 Company Size in 2019
GvD2019 Government Debt 2019

Table 3. Descriptive for the samples’ characteristics.

Sectors VAT 5% implementation period * VAT 15% Implementation period * Panel data sample by companies Time-series sample by quarterly observations
Energy From Jan. 1, 2018, to July 1, 2020 From July 1, 2020, ongoing 5 10
Materials 42 84
Commercial and Professional Services 3 6
Transportation 5 10
Consumer durables and Apparel 6 12
Consumer services 10 20
Media and entertainment 2 4
Retailing 8 16
Food and staples retailing 4 4
Food and beverages 12 24
Health care equipment and Service 7 14
Pharma, Biotech and Life Science 1 2
Software and services 2 4
Telecommunication services 4 8
Utilities 2 4
Capital goods - 24
Real estate management and development - 22
Total 113*2
= 226 Obs
268 Obs
*

In the Kingdom of Saudi Arabia (KSA), value-added tax (VAT) was first introduced in all industries as a 5% VAT on goods and services as of Jan. 1, 2018, and, because of COVID-19, the Kingdom of Saudi Arabia (KSA) increased the VAT from 5% to 15% on July 1, 2020.

As in Table 2 below, an index containing 11 items was included: BALANCE SHEET: total assets, total equity, and liabilities-equity. INCOME STATEMENT: total income, total revenues, total expenses, and net income. CASH FLOW: changes in operation. Activity, changes in investing act, changes in the financing act, and cash at the end of the period. These factors help control for potential impacts when analysing how a VAT increase will affect a firm. We take each into account as each has a component of the likely effect of a VAT increase. Collectively, these classifications were based on data available on the Tadawul market website as we clarify in inclusion/exclusion criteria above. It is a periodic data set of most Saudi registered companies from before the introduction of the new VAT rate of 15% in 2019 to after the introduction of the new VAT rate in 2020. The dataset framework was chosen when the new 15% VAT was introduced due to public access to VAT information for nonfinancial companies. Tadawul requires all public companies to publish their financial statements on the Tadawul website quarterly and annually ( https://m5.gs/OXFqam) (Tadawul, 2019-2020). These data in the file can help build additional variables, such as measuring the effect of VAT before and after increasing its rate using unique techniques such as the difference-in-difference (DID) approach and the autoregressive integrated moving average (ARIMA) modelling approach. Nevertheless, since the literature does not provide common definitions or metrics, we leave the creation of these additional variables at the discretion of potential users. Thus, users can use this dataset to create these measurements from their perspective ( Baatwah and Aljaaidi, 2021; Mgammal, 2021).

Table 2. Financial statements’ key indicators definitions.

No Variables abbreviations Before (b) in 2019 After (a) in 2020
Balance sheet Balance sheet
1 TAb & TAa 1. Total Assets (TAb) 1. Total Assets (TAa)
2 SEb & SEa 2. Shareholders’ Equity (SEb) 2. Shareholders’ Equity (SEa)
3 TLSEb & TLSEa 3. Total Liabilities and Shareholder Equity (TLSEb) 3. Total Liabilities and Shareholder Equity (TLSEa)
Statement of income Statement of income
4 TIb & TIa 4. Total Income (TIb) 4. Total Income (TIa)
5 TRb & TRa 5. Total Revenues (TRb) 5. Total Revenues (TRa)
6 TEb & TEa 6. Total Expenses (TEb) 6. Total Expenses (TEa)
7 NIb & NIa 7. Net Income (NIb) 7. Net Income (NIa)
Cash flow Cash flow
8 COAb & COAa 8. Other Changes in Oper. Activity (COAb) 8. Other Changes in Oper. Activity (COAa)
9 CIAb & CIAa 9. Other Changes in Investing Act. (CIAb) 9. Other Changes in Investing Act. (CIAa)
10 CFAb & CFAa 10. Other Changes in Financing Act. (CFAb) 10. Other Changes in Financing Act. (CFAa)
11 CEPb & CEPa 11. Cash at End of Period (CEPb) 11. Cash at End of Period (CEPa)

Data description

The dataset included in this article contains three files describing and defining the sample and variables. Excel file 1 consists of all raw and filtered data for the variables for the panel data sample. Excel file 2 depicts time-series and cross-sectional data for nonfinancial firms listed on the Saudi market for the second and third quarters of 2019 and the third and fourth quarters of 2020. Excel file 3 presents the raw material of variables used in measuring the company’s profitability of the panel data sample. The period of this data is selected from an extensive section of registered companies in Saudi Arabia from 2019 before imposing the new VAT rate, 15%, to 2020 after setting the new VAT rate. The major segments are consumer discretionary, information technology, energy, consumer staples, materials, health care, industrials, communication services, real estate, utilities, and financials. The sample framework was chosen due to the time of implementation new 15% VAT rate and the public access to information about VAT if nonfinancial companies, where Tadawul forces all listed companies to publish their financial statements publicly quarterly and annually on the Tadawul website. Financial companies were excluded from the sample framework as they have unique treatments, and some previous studies investigated the effects of VAT on KSA Banks. The final panel data sample framework is 131 listed companies and 268 observations for the time sires’ sample, as depicted in Table 3 below.

We utilise balanced panel data as it is a more sensitive measurement of the modifications that could occur between points in time ( Cavana, Delahaye, & Sekeran, 2001). Additionally, the outcomes created are more robust, consistent, and stable, enabling a generalisation of the population to be more meaningful and representative. Therefore, the final dataset sample is 226 observations were specified to be eligible for implication in the analyses. It is the basis for the research, i.e. multivariate, bivariate, additional tests and descriptive.

Descriptive statistics

To recognise and determine the situation of every concept, descriptive statistics were utilised to clarify. Table 4 displays the statistics of descriptive (standard deviation, median, mean, minimum, maximum values and degrees of freedom) for 113*2 = 226 observations of all variables of the panel data sample and 268 observations for the time series sample.

Table 4. Descriptive statistics.

(A) Penal data sample (Observations = 226)
Variable Obs Mean Std. Dev. Min Max
ProFtEBITDA 113 0.0000475 0.0000871 -0.0002105 0.000412
ProFtEBIT 113 0.0000227 .0000751 -0.000222 0.0001892
ProFtEBTUnu 113 0.0000344 .0001461 -0.0002215 0.0013329
2019 (Before Tax Reform)
EAT2019 113 0.0000842 0.0007884 -0.0006502 0.008303
SIZE2019 113 6.402416 0.7762833 4.799299 9.600352
GvD2019 113 2.224271 8.346027 0.0095256 7.217121
2020 (After Tax Reform)
EAT2020 113 0.0000171 0.0000853 -0.0002951 0.0002637
SIZE2020 113 6.401014 0.7646444 4.63827 9.274225
GvD2020 113 1.27e+14 7.60e+14 1.79e+11 7.75e+15
Chg (ERN) 113 2.973124 25.42851 -16.4361 264.1
(B) Time series data sample (Observations = 268)
Before (b) in 2019 After (a) in 2020
Variable Mean Std. Dev. Min Max Variable Mean Std. Dev. Min Max
OTAb 1.28E+07 5.08E+07 62994 4.77E+08 TAa 1.28E+07 5.17E+07 43478 5.03E+08
SEb 4788402 1.69E+07 55457 1.69E+08 SEa 4652292 1.66E+07 14494 1.65E+08
TLSEb 1.28E+07 5.08E+07 62994 4.77E+08 TLSEa 1.28E+07 5.17E+07 43478 5.03E+08
TIb 331645.6 1109441 -3407 8375361 TIa 318024.6 1206166 -242725 8970955
TRb 347839.8 1140208 -105027 8699826 TRa 338827.1 1254699 -178025 9083262
TEb 233783.7 980947.3 -1374325 7746953 β17TEa 225897.5 996286.7 -1199752 7762214
NIb 78985.93 356338.2 -724512 2679387 NIa 62371.49 342734.2 -609803 2765537
COAb 55342.46 970096.6 -4252574 9451664 COAa 86007.48 600282.1 -1329988 6184696
CIAb 191045.9 1078954 -455858 7620294 CIAa 15994.71 268631.9 -1140654 1589257
CFAb -642943 2665571 -2.03E+07 517795 CFAa -439313 1907536 -1.57E+ 278246
CEPb 698759 3357006 -4764 3.74E+07 CEPa 619081.7 2733571 -34220 3.02E+07

Value of the data

  • This dataset is essential because it covers data on variables rarely overlooked in accounting, taxation and business performance research models, collectively or individually, but appeal to a wide range of stakeholders. For example, it enables capital markets regulators, standard setters, practitioners and users of financial reporting to easily access long-term data to assess the effectiveness and efficiency of VAT in controlling the risk of tax evasion in fast-growth markets.

  • This dataset is valuable for interdisciplinary studies investigating the role of these variables in specific phenomena or factors enlightening VAT and company performance effectiveness.

  • This dataset is beneficial because it contains data on profitability collected to measure company profit using earnings before interest, taxes, depreciation, and amortisation (EBITDA) and scaled by the total assets data set. Further, the dataset has been arranged into individual and multiple measurements.

  • The data allows researchers to scrutinise the influence of VAT increase on various accounting matters, such as corporate governance mechanisms, the performance of companies and the quality of financial reporting. Furthermore, this dataset is valuable to related parties, e.g., investors, stakeholders, tax authorities, decision-makers, managers and market regulators in assessing and reviewing the tax system in Saudi Arabia. This assessment will give them assurance in making different decisions.

  • These data can be analysed and/or compared to other emerging economies and G20 countries ( https://www.g20.org/en/). It can be used in discussing the tax system and VAT rate regarding which parties are responsible for adding some VAT incentives in the tax system and updating the tax system sideways with VAT implementation to advantage from the effectiveness of VAT in the KSA. Because KSA could be similar to other countries in G20 and Gulf Cooperation Council (GCC).

  • The data is also beneficial for studies on VAT incentives efficiency using data in the long term.

Funding Statement

The author(s) declared that no grants were involved in supporting this work.

[version 1; peer review: 2 approved]

Data availability

Underlying data

Harvard Dataverse: Dataset of companies’ profitability, government debt, Financial Statements’ Key Indicators and earnings in an emerging market: Developing a panel and time series database of value-added tax rate increase impacts. https://doi.org/10.7910/DVN/HEL3YG ( Mgammal and Al-Matari, 2023).

The project contains the following underlying data:

  • Final Data - Panel dataset- DiB-2.xlsx. (Excel file 1 consists of all raw and filtered data for the variables for the panel data sample).

  • VAT-Final-Time seires dataset DIB-2.xlsx (Excel file 2 depicts time-series and cross-sectional data for nonfinancial firms listed on the Saudi market for the second and third quarters of 2019 and the third and fourth quarters of 2020).

  • Raw matiral – DiB-2.xlsx. (Excel file 3 presents the raw material of variables used in measuring the company’s profitability of the panel data sample).

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

References

  1. Baatwah SR, Aljaaidi KS: Dataset for audit dimensions in an emerging market: Developing a panel database of audit effectiveness and efficiency. Data Brief. 2021;36:107061. 10.1016/j.dib.2021.107061 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Cavana R, Delahaye BL, Sekeran U: Applied business research: Qualitative and quantitative methods. Australia: John Wiley & Sons;2001. [Google Scholar]
  3. Mgammal MH: Does a Value-Added Tax Rate Increase Influence Company Profitability? An Empirical Study in the Saudi Stock Market. J. Australas. Tax Teach. Assoc. 2021;16(1):92–128. [Google Scholar]
  4. Mgammal MH, Al-Matari E: Dataset of companies’ profitability, government debt, Financial Statements’ Key Indicators and earnings in an emerging market: Developing a panel and time series database of value-added tax rate increase impacts [Developing a panel and time series database]. 2023. 10.7910/DVN/HEL3YG [DOI] [PMC free article] [PubMed]
  5. Mgammal MH, Al-Matari EM, Alruwaili TF: Value-added-tax rate increases: a comparative study using difference-in-difference with an ARIMA modeling approach. Humanit. Soc. Sci. Commun. 2023;10:121. Accepted. (In press). 10.1057/s41599-023-01608-y [DOI] [PMC free article] [PubMed] [Google Scholar]
F1000Res. 2024 Feb 8. doi: 10.5256/f1000research.145908.r230650

Reviewer response for version 1

Faozi Almaqtari 1

Thank you for giving me chance review of this intriguing data article. I believe that by utilizing this dataset, researchers can explore compelling issues within the realm of value-added tax (VAT) in the specific context of Saudi Arabia. I would kindly suggest that the authors carefully review the paper, as there are a few minor typos that could be addressed. In summary, I anticipate significant value emanating from this dataset, foreseeing its potential to contribute valuable insights and implications for economic considerations.

Are sufficient details of methods and materials provided to allow replication by others?

Yes

Is the rationale for creating the dataset(s) clearly described?

Yes

Are the datasets clearly presented in a useable and accessible format?

Yes

Are the protocols appropriate and is the work technically sound?

Yes

Reviewer Expertise:

Governance, Financial Performance, Auditing

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2023 May 24. doi: 10.5256/f1000research.145908.r172936

Reviewer response for version 1

Saeed Rabea Baatwah 1

Thank you for reviewing this interesting data article. I think that researchers can test interesting issues in the context of VAT and in Saudi Arabia using this dataset. I would suggest to the authors that it will be more reflective of the dataset if the title is updated to "Dataset of value-added tax, companies’ profitability, government debt, financial statements' key indicators, and earnings in an emerging market: Developing a panel and time series dataset". Furthermore, I would suggest the authors review the paper, as there are some minor typos. Overall. I expect greater value from this dataset and its potential to help draw economic implications.

Are sufficient details of methods and materials provided to allow replication by others?

Yes

Is the rationale for creating the dataset(s) clearly described?

Yes

Are the datasets clearly presented in a useable and accessible format?

Yes

Are the protocols appropriate and is the work technically sound?

Yes

Reviewer Expertise:

I am interested in auditing, financial reporting, CSR, taxation, and emerging countries.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2023 May 31.
Mahfoudh Mgammal 1

I would like to thank you for this great feedback and suggestions. We agree with you that updating the title to "Dataset of value-added tax, companies’ profitability, government debt, financial statements' key indicators, and earnings in an emerging market: Developing a panel and time series dataset" would be more reflective of the dataset. It's also good to hear that the dataset has the potential to draw economic implications. Moreover, we confirmed that we reviewed and corrected the paper for minor typos as per requested. In this regard, we looking forward to seeing the greater value that this dataset can bring to researchers studying VAT and Saudi Arabia.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Availability Statement

    Underlying data

    Harvard Dataverse: Dataset of companies’ profitability, government debt, Financial Statements’ Key Indicators and earnings in an emerging market: Developing a panel and time series database of value-added tax rate increase impacts. https://doi.org/10.7910/DVN/HEL3YG ( Mgammal and Al-Matari, 2023).

    The project contains the following underlying data:

    • Final Data - Panel dataset- DiB-2.xlsx. (Excel file 1 consists of all raw and filtered data for the variables for the panel data sample).

    • VAT-Final-Time seires dataset DIB-2.xlsx (Excel file 2 depicts time-series and cross-sectional data for nonfinancial firms listed on the Saudi market for the second and third quarters of 2019 and the third and fourth quarters of 2020).

    • Raw matiral – DiB-2.xlsx. (Excel file 3 presents the raw material of variables used in measuring the company’s profitability of the panel data sample).

    Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).


    Articles from F1000Research are provided here courtesy of F1000 Research Ltd

    RESOURCES