Systematic literature review of the performance characteristics of Chebyshev polynomials in machine learning applications for economic forecasting in low-income communities in sub-Saharan Africa

Darrold Cordes; Shahram Latifi; Gregory M Morrison

doi:10.1007/s43546-022-00328-w

. 2022 Nov 10;2(12):184. doi: 10.1007/s43546-022-00328-w

Systematic literature review of the performance characteristics of Chebyshev polynomials in machine learning applications for economic forecasting in low-income communities in sub-Saharan Africa

Darrold Cordes ^1,^✉, Shahram Latifi ¹, Gregory M Morrison ²

PMCID: PMC9647249 PMID: 36407751

Abstract

Chebyshev polynomials have unique properties that place them in a class of functions that are highly efficient in the approximation of non-linear functions. Machine learning techniques are being applied to solve complex non-linear problems in the financial markets where there is a proliferation of financial products. The techniques for valuing diverse portfolios of these products can be time consuming and expensive. Formal research has been conducted to determine how machine learning can considerably reduce the computational effort without losing accuracy. The objective of this systematic literature review is to discover evidence of research on the optimal use of Chebyshev polynomials in machine learning and neural networks that may be used for the estimation of generalized financial outcomes of large clusters of small economic units in low-income communities in sub-Saharan Africa. Scopus, ProQuest, and Web of Science databases were queried with search criteria designed to recover peer-reviewed research articles that addressed this objective. Many articles discussing broader applications in engineering, computer science, and applied mathematics were found. Several articles provided insights into the challenges of forecasting stock price outcomes from unpredictable market activities, and in investment portfolio valuations. One article addressed specific environmental issues relating to energy, biology, and ecological situations, and presented encouraging results. While the literature search did not find any similar articles that address economic forecasting for low-income communities, the applications and techniques used in stock market forecasting and portfolio valuations can contribute to formative theory on sustainable development. There is currently no theoretical underpinning of sustainable development initiatives in developing countries. A framework for small business structures, data collection, and near real-time processing is proposed as a potential data-driven approach to guide policy decisions and private sector involvement.

Supplementary Information

The online version contains supplementary material available at 10.1007/s43546-022-00328-w.

Introduction

The World Bank (2021) estimated that in 2017 approximately 689 million people globally were living below their poverty threshold of US$1.90 per day. This situation is now far worse because of climate change, armed conflict, population growth, and disease. The World Bank (2021) have estimated that the number of people living under the poverty line could increase by up to 150 million in 2021 due to COVID-19, and by as many as 132 million by 2030 due to climate change. Sub-Saharan Africa (S-SA) and South Asia are the most vulnerable of all global regions (World Bank 2021).

Small to medium enterprises (SMEs) are community-based initiatives with a relatively small service area operating in diverse sectors with wide geographic diversity (Abisuga-Oyekunle et al. 2020; Azunu and Mensah 2019; GSMA 2020). Well-organized and managed SMEs could improve the wellbeing of these communities and reduce poverty in S-SA (Abisuga-Oyekunle et al. 2020; Anetor et al. 2020). This would give rise to potentially millions of SMEs in S-SA requiring a systematic approach to determining the collective economic, social, and environmental value of these SMEs, potential risks, and generalization characteristics. However, the research literature does not provide the depth and breadth of ideas and techniques required for the design, implementation, analysis, and visualization of generalizable programs for economic, social, and environmental wellbeing (Aftab and Ismail 2015; Khavul and Bruton 2013; Manzoor et al. 2019; Mazambani and Mutambara 2018). There is an absence of research relating to theoretical frameworks from which strategies for sustainable development leading to poverty alleviation can be derived. A well-developed business approach to poverty alleviation does not exist (Khavul and Bruton 2013). The absence of performance evaluation frameworks that inform economic and social performance initiatives precludes any systematic strategic approach to sustainable development (Mazambani and Mutambara 2018). There is no consensus in the field of development studies nor any established theory on how sustainable development should proceed (Aftab and Ismail 2015). Further, there is no systematic approach to bridging the digital and information divides with equitable access to the Internet (Urquhart et al. 2008).

The modelling of large numbers of SMEs requires the continuous collection of data covering a wide range of social, economic, and environmental measures, and sophisticated data analysis techniques to estimate local, regional, and national growth, prosperity, and wellbeing. The computational cost of such analysis could become burdensome. Some insights to this cost can be drawn from recent developments in financial market forecasting and portfolio valuations (Laris and Ruiz 2018). The complexity of investment portfolios has resulted in significant computational burdens when undertaking risk assessments and portfolio revaluations (Laris and Ruiz 2018). The challenge of modelling large numbers of SMEs under different risk scenarios is similar. Data analytics techniques and machine learning (ML) have made it possible to study sustainability phenomena on a large scale. As well as drawing analytical conclusions of historical performance, it is possible to develop predictive and even prescriptive assessments of social, economic, and environmental development initiatives with high levels of granularity.

The purpose of this systematic literature review (SLR) was to discover formal research into the development of efficient ML algorithmic models that may be suitable for the processing of large non-linear datasets expected from SMEs in S-SA, and the role of Chebyshev polynomials in reducing computational effort and hence costs. The search results were very limited with a few articles that focused on time series forecasting of financial market prices. A 2020 literature review of the application of ML to financial market forecasting revealed more than 150 articles (Ryll and Seidens 2019). This survey concluded that ML techniques generally outperformed traditional financial market forecasting models. A 2019 literature review reported that deep learning models significantly outperformed other techniques in ML but there was a lack of published research in the application of deep learning in time series forecasting in finance (Sezer et al. 2020). Examples of financial and stock market forecasting using various ML techniques claim more accurate outcomes compared with “traditional” models and techniques for more computationally efficient valuations of investment portfolios are being developed (Maciel and Ballini 2010; Mohapatra et al. 2018; Parida et al. 2017; Rout et al. 2017; Siddique et al. 2017; Vlasenko et al. 2019). As financial products become more complex the methods for undertaking comprehensive near real-time risk assessment are becoming computationally intensive (Mohapatra et al. 2018; Ryll and Seidens 2019). Chebyshev interpolation techniques have been shown to exhibit exponential convergence for many non-linear functions (Boyd and Petschek 2014), potentially leading to faster computation of investment portfolios and financial forecasting. Research into the application of Chebyshev polynomials in a neural network trained by differential evolution in a specific ecological/environmental case study produced encouraging results compared with other ML techniques (Troumbis et al. 2020). In this case study the highly non-linear nature of environmental data was recognized, and a Chebyshev series expansion was found suitable for approximation of this data. Further, the research acknowledged the complex nature of environmental data which included climate, economics, environment management, ecology, and biology datasets (Troumbis et al. 2020).

Neural networks with fast optimization may provide a useful tool for developing multiple “what-if?” scenarios and hence lead to more informed policy and resource planning decisions. Chebyshev, Legendre, and Jacobian polynomials are used in many fields of engineering, mathematics, and computer science (Boyd and Petschek 2014, Table 1). With the increased use of big data analytics and ML in social sciences, including economics and environmental science, many situations have arisen where the fast optimization of models for the sustainability of economic, environmental, and social outcomes is required. Chebyshev polynomials are well known as a class of polynomials that produce very efficient outcomes in the approximation of arbitrary functions (Boyd and Ong 2011; Boyd and Petschek 2014) and can be used in the non-linear approximation of large datasets (Troumbis et al. 2020). The selection of Chebyshev polynomials for further investigation was based upon encouraging results in ML. Are these polynomials optimum for use in neural networks that aim to predict the outcomes of large numbers of financial stimulus programs for low-income communities and their impact on GDP and other measures of wellbeing?

Table 1.

Database queries

Title only			Title, abstract, and keywords*
Research question 1
	Chebyshev	AND	Machine learning	AND	Economic
		OR	Neural networks	OR	Financial
Sub-question 1
	Chebyshev	AND	Approximation function
		OR	Curve fitting
		OR	Interpolation
Sub-question 2
	Chebyshev	AND	Convergence	AND	Gradient descent
Sub-question 3
	Chebyshev	AND	Activation function	AND	Machine learning
				OR	Neural networks
Sub-question 4
	Chebyshev	AND	Jacobi
Sub-question 5
	Chebyshev	AND	Legendre

Open in a new tab

*All variants of the keywords/phrases were searched

These objectives are encapsulated in the following research questions:

Research Question 1—Can Chebyshev polynomials improve computational efficiency in ML models for financial or economic portfolio valuations or predictions?

Sub-Question 1—How are Chebyshev polynomials used in the approximation of non-linear functions?

Sub-Question 2—Can Chebyshev polynomials improve the rate of convergence in gradient descent?

Sub-Question 3—Are Chebyshev polynomials useful as activation functions in machine learning?

Sub-Question 4—Chebyshev polynomials versus Jacobi polynomials in optimization problems?

Sub-Question 5—Chebyshev polynomials versus Legendre polynomials in optimization problems?

The significance of this research is embedded in the development of new computational techniques that can more readily provide responsive analyses of community-led initiatives covering a wide range of social, economic, and environmental issues related to wellbeing. The ability to engage sophisticated and highly granular analyses will provide researchers with views on the sustainability and generalizability of these initiatives, and the impact of collaborative efforts between SMEs in product development, marketing, and sales. Rapid insights into poverty alleviation strategies across all spatial and cultural domains may be derived through classification of the complex social structures that characterize low-income communities. This SLR draws upon observations made in other disciplines to construct a potential framework for optimizing sustainability models for social wellbeing. A contribution is also made in the development of an organizational structure for community-led initiatives based on the design science research methodology. A final dividend of the research is potentially from the development of procedures for the rapid reporting of progress towards the United Nations sustainable development goals (United Nations 2022).

The next section describes a comprehensive, reproducible, and rigorous methodological process for the discovery of relevant research from published literature. This is followed by discussion on the results of the literature search, an assessment of the quality and relevance of the research articles, an analysis of research findings, and finally a discussion on the application of the research findings in sustainable development and future research. This SLR follows the Prisma (2020) structure and content recommendations for systematic literature (PRISMA 2020).

Methodology

The search for relevant articles requires an approach that demands rigorous attention to detail. An iterative process to obtain an optimum collection of the most relevant and credible research articles is used. The challenge is to extract these articles from the large research repositories, each with proprietary query definitions. The combination of the bibliographic capabilities of EndNote and the qualitative search capabilities of NVivo facilitate meta-analysis and good record keeping at each iteration. There are multiple tools for the visualization of the search outcomes including Excel, Tableau, and custom code. In this SLR, Excel is used to represent 2 dimensional results. Figure 1 illustrates the search process of this SLR.

Table 1 lists keywords, phrases, and Boolean structures that were used for each research question. Scopus, Web of Science, and ProQuest databases were searched. Google Scholar does not have the capability to distinguish between data sources and hence returns many references that are secondary sources of information and is used in this SLR for verification purposes only. The search criteria require articles to be peer reviewed, published in journals or conference proceedings, English language, and published during or after 2010.

The search capabilities of each of the databases were used to undertake high-level searches on the broadest scope for each research question. EndNote was used to aggregate the search results into one database to create a single project library. The search and collation capabilities of EndNote were then used to identify relevant research articles for further examination. NVivo is an analytic tool for qualitative research on unstructured data in many different forms. The final step in this process was to present the outcomes of the search in a manner that can be readily visualized and understood. NVivo has some of this capability, but other applications have such as Excel and Tableau have greater flexibility, and data presentations are more readily changed to add greater power to the interpretive outcomes of the search process. This procedure can be readily shared with researchers and practitioners for authentication and collaboration purposes. It inherently records an audit trail, and it can be easily updated without having to recreate the project library each time.

ML is increasingly being used in many areas of social science (Franco and Santurro 2020) but much of this work has not been reported in formal academic research (Ryll and Seidens 2019; Sezer et al. 2020). Hence the limitations of the methodology can be discussed at the following levels:

Choice of databases to be queried. The databases for recovery of research articles were chosen purposively. Other databases may have contained publications that would have captured additional perspectives on the research questions.
The scope of the database search. In this SLR, the searches were limited to peer-reviewed articles presented for publication in journals or in conference proceedings. This excludes research notes, book chapters, institutional reports, dissertations, aid agency reports, and contributions by various other government and private sector organizations.
Choice of search terms and phrases. The primary goal of this SLR was to discover research in the applicability of Chebyshev polynomials in computational methods for financial forecasting of sustainable development initiatives in low-income communities. This approach may have precluded other more effective computational methods for this purpose. However, Chebyshev polynomials are well understood for their role in approximating non-linear data, a characteristic of social science phenomena in the context of sustainability studies in low-income communities. Further, search related to Research Question 1 was predicated on the mandatory presence of the terms “machine learning” and “neural networks” appearing in the title or abstract of the research article. This may have precluded other more effective techniques in statistics or econometrics.

The next section presents the results of the extensive database search using the criteria defined in this section and the manual selection process to refine the list to articles that are deemed most likely to contribute to the research questions.

Search results

Table 2 records the number of research articles returned for each query on each database. Eighteen files were created, 6 for each of Web of Science, ProQuest, and Scopus. The queries returned in Table 2, except for Google Scholar, were uploaded to Endnote. Endnote was used to create a single consolidated repository for each of the research questions. The Endnote upload process eliminated some duplicates while appending some of the research articles.

Table 2.

Database query results

Query	Web of Science	ProQuest	Scopus	Google
Research question 1	6	2	3	1,600*
Sub-question 1	134	13	169	Not queried
Sub-question 2	1	0	3	Not queried
Sub-question 3	2	3	7	Not queried
Sub-question 4	40	9	41	Not queried
Sub-question 5	73	8	62	Not queried

Open in a new tab

*Some articles were selected from the Google Scholar results

EndNote searches for the PDFs of the articles and, if successful, attaches them to the article references. Otherwise, the PDFs were extracted manually from the journal databases and attached to the article record in EndNote. EndNote queries, together with manual reading of the article abstracts, were used to further reduce the library. Many articles were eliminated because of subtle differences in author names, or the title not being detected by Endnote. Others were eliminated because they were deemed not highly relevant to the research questions even though they satisfied the search criteria. Typically, these articles were in areas of engineering or science that were very application specific. The manually selected article counts are listed in Table 3 and the details of each article are listed in Tables 1–6 of Appendix A.

Table 3.

Final number of articles selected

Query	Endnote merge	After manual selection
Research question 1	22	14
Sub-question 1	250	24
Sub-question 2	3	3
Sub-question 3	10	5
Sub-question 4	49	4
Sub-question 5	87	12

Open in a new tab

The next section describes the qualitative approach undertaken with the aid of NVivo to further identify the core articles that will be analyzed and discussed in the analysis section.

Assessment

Each of the files for the research questions were exported from EndNote to NVivo for further analysis. The purpose of this step was to provide a very high-level assessment of the frequency of occurrences of the primary and associated keywords in the articles to give a broad view of the quality and relevance of their content to the research questions. Other textual searches were performed to test for the presence of omitted relevant keywords/phrases.

Research question 1

Appendix A Table 1 lists the articles for Research Question 1. During the search process for Research Question 1 only a few articles were discovered that related somewhat to the search criteria. The final selection of 14 articles was compiled from the results of the searches returned from Scopus, Web of Science, and ProQuest to which some manually selected articles were added from Google Scholar searches. Because Research Question 1 is interested in the role of Chebyshev polynomials in neural network implementations to study financial and economic forecasting, the results in Fig. 2 show the distribution of variants of the search criteria such as “stock price*” as a proxy for “finance*” and “economic*”.

Fig. 2 — Occurrences of keywords/phrases in all articles for research question 1

Sub-question 1

Appendix A Table 2 lists the articles for Sub-question 1. The keywords/phrases makeup of Sub-Question 1 are plotted in Fig. 3. The word “algorithm” appeared frequently in the NVivo analysis and was included in overall counts. Interpolation is the focus of the algorithm, with accuracy as a key determinant of predictive analysis. Examination of Fig. 3 reveals an expectation that Chebyshev interpolation is a frequent topic.

Sub-question 2

Appendix A Table 3 lists 3 articles for Sub-question 2, but only one article could be uploaded to NVivo for technical reasons. Article 2 was processed by NVivo, and Fig. 4 shows the occurrences of the search words and the additional phrase “classification accuracy”.

Fig. 4 — Occurrences of keywords/phrases in article 2 in sub-question 2

This article was published in 2017 in the Journal for Advances in Intelligent Systems and Computing which has an impact factor of 0.570 and a H index of 34.

Sub-question 3

Appendix A Table 4 lists 5 articles for Sub-Question 3. This sub-question addresses the potential for Chebyshev polynomials to describe almost any non-linear activation function in neural networks. Non-linear activation functions facilitate the functional computation of any process. They enable efficient back propagation because their derivatives can be calculated. Observation of the frequency of “activation function” with “Chebyshev” in Fig. 5 suggests that the 4 articles are relevant to Sub-Question 3. One article was eliminated from the Appendix A Table 4 search results because it could not be imported to NVivo for technical reasons. The additional search terms “approximation” and “classification” relate to specific tasks in neural networks.

Fig. 5 — Occurrences of keywords/phrases in all articles in sub-question 3

Sub-questions 4 and 5

Appendix A Tables 5 and 6 list 6 articles for Sub-Questions 4 and 5. Analysis of the searched literature revealed that these two questions can be combined, and a single high-level appraisal made. Figure 6 is the outcome of the merger of Appendix A Tables 5 and 6 and presents a good visualization of the scope of literature that shows the relationships between the various polynomial families of which Chebyshev polynomials belong. Article 1 from Appendix Tables 5 and 6, and article 6 from Table 1 indicate good potential for the discussion on the relative benefits of the three classes of polynomials, Chebyshev, Jacobi, and Legendre. The X axis of Fig. 6 denotes Table-Article, e.g., 5-01 means Table 5 Article 1.

Fig. 6 — Occurrences of keywords/phrases in all articles in sub-questions 4 and 5

Table 5.

Design science research model for action plan for low-income community development

Design phase	Description
Problem recognition	Failings of current efforts to reduce poverty
Idea for a resolution	Solution based on community empowerment, capital, and technology
Design of trial solution	Design a model for creating a microeconomic cell that addresses social, environmental, and economic issues one community at a time
Implementation of trial	Implement a trial to measure the goals of the model, and to collect data
Evaluation	Evaluate the data, refine the model
Information	Provide metrics for poverty reduction and sustainable development

Open in a new tab

Quality appraisal

Appendix B Tables 1 lists 14 journals from which 14 articles were recovered for Research Question 1. Appendix B Table 2 lists 32 journals and 8 proceedings from which 44 articles were recovered for Sub Questions 1–5. Overall, only 4 journals contained no more than 2 articles returned from the searches. The journals and proceedings cover mathematics, engineering, financial, economic, and computer science. This high-level assessment reveals a good spread of research as reflected in the diversity of the publications. Figure 7 is a chronological profile of the articles recovered for each year since 2010. Journals for Research Question 1 are shown separately from the journals for Sub-Questions 1–5. Both categories show an upwards trend. This aligns with an observed increase in discussions and development activities in the informal media on a wide range of applications for neural networks across all disciplines, including social sciences.

There is considerable diversity in the sources of articles. The reasons for this were not researched, but one intuitive view is that this is good because the articles are drawn from a broader base of scientific thinking and possibly less biased. However, had a lot more articles been recovered by the search process then this might have provided a more compelling view. On the other hand, the lack of depth across many journals might also indicate a lack of concentrated research on the application of neural networks and data analytics in the social sciences and information systems disciplines. The proliferation of journals is taken as a positive indicator of intellectual diversity but there is a potential gap in the literature on this subject. The journals are considered credible because all articles, with two possible exceptions, are peer reviewed. The impact factors for some journals were not available.

Over the past decade, the number of articles published per year has exhibited an upward trend but there are few research articles in this field, indicating a potential research gap. However, it is noted that the application of neural networks in social sciences is a relatively new field of research and, while there is significant research activity, this has not been reflected in high numbers of peer-reviewed articles. This SLR mainly reflects peer-reviewed research and hence the risk of exclusion of early-stage research is acknowledged.

Journal impact factors (# times cited/Number of articles published over a specific period) or other journal ranking strategy would most likely give a misleading ranking result (Smith 2013). The depth of peer-reviewed research pertaining to Research Question 1 seems limited. The number of citations for each article was examined. As expected, these generally conform inversely with year of publication. Older well-cited articles are indicative of credibility, but the number of citations would also fail under any attempt to rank them. Year of publication has a little more promise with the most recent articles likely to share contemporary thinking on neural network applications in social sciences.

Quantitative ranking of the articles was facilitated through the textual search capabilities of NVivo. A simple ranking of articles based upon the frequency of the search criteria was adopted and the articles were sorted in highest to lowest order of the sum of the number of occurrences of the search keywords and phrases for each research question. Figures 2, 3, 4, 5, and 6 depict these rankings. Figure 6 shows a combination of the search results for Sub-questions 4 and 5. This is appropriate because of the interrelated nature of these two questions. A more sophisticated ranking might apply weights to each of the search criteria.

The approach to qualitative assessment of the articles was to read each article and to encode relevant text and mathematical formulae using NVivo. This produced a searchable index of the different classes of information. This is a highly organized way of discovery and retention of qualitative material which was then searched looking for specific themes. A final potential qualitative measure was simply, how readable was the article? However, readability as a ranking criterion may lead to bias and was not attempted in this SLR. The next section summarizes the key findings from the research articles.

Analysis

The field of computational social science aided by the availability of large datasets and data analytics is expanding rapidly. There is an on-going debate at the intersection of social theory and computational science with one view attempting to discredit social theory, while another view foresees a blend (Radford and Joseph 2020). The sheer volume of data from finely granular social environments can be processed to provide highly nuanced visualizations of the human condition, the natural environment, the economic environment, and the social structures. Hence, there is a compelling view that an inductive evidence-based approach is less biased and speaks dispassionately and directly to the actual social condition. The following analysis attempts to highlight the key outcomes of those research articles that are most likely to address this approach. Large medium velocity datasets, cultural diversity, spatial diversity, and highly variable environmental factors characterize the challenge of managing large numbers of SMEs into sustainability.

Analysis of qualified search results

This SLR investigates formal research in the application of ML techniques that may be used for determining the future likelihood of sustainable livelihoods in low-income communities in the presence of some artefact promoting social wellbeing. The artefact is an economic stimulus mitigated by a wide range of external factors such as climate change, political stability, threats of terrorism, fertility rates, disease, education levels, and more. In this context, the literature search did not expose any significant depth of articles and the observations of Sezer et al. (2020) and Ryll and Seidins (2019) were supported.

Research Question 1 seeks to discover the role of Chebyshev polynomials in the construct of efficient neural network designs for economic forecasting purposes. The research sub-questions are a series of primers on the characteristics of Chebyshev polynomials and their role in developing efficient computational techniques. The scale of the social initiatives that may realize a sustainable direction for poverty alleviation is very large. Low-income communities dominate the landscape in S-SA and the computational models will process potentially millions of community datasets.