Predictive models for charitable giving using machine learning techniques

Leily Farrokhvar; Azadeh Ansari; Behrooz Kamali

doi:10.1371/journal.pone.0203928

PLoS One. 2018 Oct 3;13(10):e0203928. doi: 10.1371/journal.pone.0203928

Editor: Agustin Martinez Molina²

PMCID: PMC6169901 PMID: 30281644

Abstract

Private giving represents more than three fourths of all U.S. charitable donations, about 2% of total Gross Domestic Product (GDP). Private giving is a significant factor in funding the nonprofit sector of the U.S. economy, which accounts for more than 10% of total GDP. Despite the abundance of data available through tax forms and other sources, it is unclear which factors influence private donation, and a reliable predictive mechanism remains elusive. This study aims to develop predictive models that accurately estimate future charitable giving based on a set of potentially influential factors. We have selected several factors, including unemployment rate, household income, poverty level, population, sex, age, ethnicity, education level, and number of vehicles per household. This study sheds light on the relationship between donation and these variables. We use Stepwise Regression to identify the most influential variables among those available, and we develop the predictive models based on them. Multiple Linear Regression (MLR) and machine learning techniques, including Artificial Neural Networks (ANN) and Support Vector Regression (SVR), are used to develop the predictive models. The results suggest that population, education level, and the amount of charitable giving in the previous year are the most significant independent variables. We propose three predictive models (MLR, ANN, and SVR), validate them using the 10-fold cross-validation method, and then evaluate their performance using 9 different measuring criteria. All three models are capable of predicting the amount of future donations in a given region with good accuracy. Based on the evaluation criteria, using a test data set, ANN outperforms SVR and MLR in predicting the amount of charitable giving in the following year.

Introduction

Charities are Non-Profit Organizations (NPOs) focused on humanitarian and social issues [1]. NPOs are listed as tax-exempt organizations that cannot benefit people or other corporations [2]. According to the National Center for Charitable Statistics, there are 1,406,820 tax-exempt organizations in the United States, including 945,415 public charities. Charitable contributions include cash and non-cash gifts [3]. Over two-thirds of the households that account for a significant portion of charitable giving in the United States report their own giving amounts [4]. In 2010, Americans gave approximately 2% of their disposable income to charitable causes, a share that had remained unchanged over the preceding five decades [5]. Over three-fourths of all contributions come from private or individual donors and amount to more than 2% of total U.S. Gross Domestic Product (GDP). Their contribution is essential to the nonprofit sector of the U.S. economy [5].

The National Center for Charitable Statistics stated that public charities reported $1.59 trillion in total revenues and $1.49 trillion in total expenses in 2011 [6]. Contributions and government grants made up 22% of total revenue, while program service revenues and other sources made up 72% and 6%, respectively [1]. Charitable giving has trended upward since 1975, and in 2014 it exceeded the peak level reached right before the 2008 recession [5]. It was estimated that charitable giving would increase from $298 billion in 2011 to $358 billion in 2015. Individual donors are the most common source of charitable giving [7]. Competition among non-profit organizations over time, money, and resources is growing, so there is a need to predict donations [8]. How and when donors decide to donate has long been debated, and the literature identifies many factors that can affect these decisions [9].

The factors that have been reported to drive charitable giving are awareness of need [10], solicitation [11,12], mode of communication [13], donor characteristics [14] including age [4,15,16], sex [17], ethnicity [4,17], personal income [4,15] and tax itemizing [15], education [4,15], volunteerism/civic-minded donor [4,18], moral norms/values and guilt [11,12], religiosity [4,16,19,20], obligation (wealthy) [21] or lack of family need [11], attitudes toward charitable organizations [22], historical data on donation [16], size of request [23], altruism [10,11], reputation of individual/charity [24], psychological benefits [10], and efficacy [10]. Bekkers and Wiepking [25] performed an extensive literature review on how age, education, religion, and solicitation are correlated with the amount of charitable giving. They stated that most previous works had found that education and age have a positive relationship with the amount of donation, and they investigated the effects of different characteristics of religion and solicitation on the giving amount. Wiepking and Bekkers [26] completed this literature review by investigating how sex, family composition, and income are related to charitable giving. They claimed that there is strong evidence of a positive relationship between income and the value of charitable giving.

In this manuscript, we aim to develop models capable of predicting levels of charitable giving using measurable and readily available variables describing donor characteristics for different zip codes that are most closely associated with charitable giving on a national level. We validated the models using a U.S. nationwide dataset and analyzed the results.

Methodology

Giving and demographic data

In this section, we describe the process of collecting and preparing the initial set of variables used to identify the factors with the most predictive value for the amount of charitable giving. The following demographic data from the 2010 U.S. census were gathered for a sample of 9,410 U.S. zip codes, each of which includes at least one NPO with charitable contributions in 2014 and 2015: (a) % unemployment, (b) median household income, (c) % of persons living at/below the poverty line, (d) population, (e) sex, (f) age, (g) ethnicity (% non-whites), (h) education (% college graduates), and (i) % households with 2 vehicles or more. These data were collected using Zip Atlas (www.zipatlas.com), which is a structured collection of zip code, area code, city, and state demographic, social, and economic profiles. We also included the previous year’s total contribution amount for each zip code as a potential predictive variable. The data on charitable contributions to NPOs were obtained by reviewing U.S. federal tax forms (990 and 990EZ) for two consecutive years (2014 and 2015). All organizations that have some portion of their income exempted from tax are required by the IRS to fill out one of these forms, including organizations that received charitable contributions and grants. These forms are publicly available on the Internal Revenue Service website (IRS; www.irs.gov). Overall, 25,882 organizations were reviewed for 2014 and 28,517 for 2015. To keep a consistent scope across all data elements, we aggregated the contribution data at the zip code level. All the variables are defined in Table 1. The values or ratios are all collected for each specific zip code.

Table 1. Variable definitions.

Variables	Definition
Unemployment Rate (%)	Percentage of unemployed individuals to all individuals currently in the labor force
Average Household Income	Combined incomes of all people sharing a particular household
Poverty Level (%)	Percentage of the number of people (in a given age group) whose income falls below the poverty line
Average Population Age	The age that divides a population into two numerically equal groups
Population	Number of people living in the zip code
Whites (%)	Percentage of white Americans to all population
College Graduates (%)	The ratio of the number of people with college degree to all population
Male/Female Ratio	The ratio of male to female in the population
Households with 2+ Cars (%)	Percentage of number of houses with 2 or more cars
Giving	The amount of donation (charitable giving) a NPO receives

Because the donation data set is skewed, Tukey’s outlier detection method [27] was used to identify the outliers. To be more conservative, all data points falling outside the 3 × Interquartile Range (IQR) limits were identified as outliers and Winsorized [27]. Winsorizing replaces the value of each potential outlier with the highest value in the data set that is not considered an outlier, instead of removing it. To put all the variables and coefficients on the same scale, all the variables were normalized using Eq (1).

X_{N} = \frac{X - \mathrm{Min}\{X\}}{\mathrm{Max}\{X\} - \mathrm{Min}\{X\}}

(1)

In Eq (1), X and X_N denote the original and normalized data, respectively. Each variable in the data set is normalized using its own minimum and maximum values, denoted by Min{X} and Max{X}, respectively.

In this pre-processing step, the aggregated contribution data associated with 131 zip codes were Winsorized because they were beyond the 3 × IQR limits. All 9,410 rows of data were then normalized using Eq (1). Table 2 summarizes the descriptive statistics for all the variables, including the 9 demographic variables and the 2014 and 2015 donation data.
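
The pre-processing described above can be sketched in a few lines of Python. This is a minimal illustration and not the authors' MATLAB code: the function names, the sample values, and the quartile convention (the "inclusive" method of the standard library) are assumptions made for the example.

```python
import statistics

def winsorize(values, k=3.0):
    """Clip points outside the Tukey fences [Q1 - k*IQR, Q3 + k*IQR] to the
    most extreme observed value that still lies inside the fences."""
    # Assumption: quartiles use linear interpolation ("inclusive" method).
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence, upper_fence = q1 - k * iqr, q3 + k * iqr
    inside = [v for v in values if lower_fence <= v <= upper_fence]
    low, high = min(inside), max(inside)
    return [min(max(v, low), high) for v in values]

def min_max_normalize(values):
    """Rescale values to [0, 1] using the variable's own minimum and maximum."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant column: avoid division by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

# Hypothetical giving amounts for six zip codes; 1000.0 is an extreme value.
giving = [0.0, 10.0, 20.0, 30.0, 40.0, 1000.0]
winsorized = winsorize(giving)
normalized = min_max_normalize(winsorized)
```

In this toy sample the extreme value is replaced by 40.0, the largest value inside the fences, before the rescaling step is applied.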

Table 2. Descriptive statistics for all variables.

Variables	Mean	Std. Dev.	Min	Max
Unemployment Rate (%)	0.053	0.040	0	1
Average Household Income	47,350.080	19,100.640	5,787	196,298
Poverty Level (%)	0.078	0.070	0	1
Average Population Age	37.094	5.276	16.300	75
Population	19,145.170	15,666.840	5	114,124
Whites (%)	0.828	0.186	0.006	1
College Graduates (%)	0.264	0.159	0	0.945
Male/Female Ratio	0.974	0.138	0.250	5.140
Households with 2+ Cars (%)	0.537	0.156	0	1
Giving 2014	42,836.180	64,130.710	0	1,320,216
Giving 2015	44,090.530	65,641.760	0	1,470,209
Giving 2015 (Winsorized)	42,611.990	56,739.420	0	249,165

The correlation matrix in Table 3 shows Pearson’s correlation coefficients between all the variables, including the dependent variable. Pearson’s correlation coefficients show how strongly these variables are linearly related to each other and to the output.

Table 3. Correlation matrix on dependent variables.

Variables	Unemp. Rate (%)	Income	Pov. %	Avg. Pop. Age	Pop.	Whites %	Coll. Grad.	M/ F Ratio	H/holds 2+ Cars %	Giving 2014	Giving 2015
Unemp. Rate (%)	1	-0.46	0.67	-0.24	0.07	-0.45	-0.30	0.09	-0.50	0.01	0.02
Income		1	-0.63	0.14	0.05	0.22	0.72	-0.06	0.58	0.08	0.08
Pov. (%)			1	-0.30	0.08	-0.57	-0.42	0.05	-0.59	0.02	0.02
Avg. Pop. Age				1	-0.33	0.40	0.08	-0.16	-0.03	-0.02	-0.03
Pop.					1	-0.39	0.11	-0.12	-0.11	0.18	0.19
Whites (%)						1	0.09	-0.04	0.43	-0.07	-0.08
Coll. Grad. (%)							1	-0.08	0.15	0.19	0.19
M/F Ratio								1	-0.03	0.00	-0.01
H/holds 2+ Cars (%)									1	-0.07	-0.08
Giving 2014										1	0.87
Giving 2015											1

Following data pre-processing, Stepwise Regression was applied to the entire data set to identify the most significant variables among 9 demographic variables and one variable for donation amounts in 2014 for 9,410 U.S. zip codes as the input variables to predict donation amounts for 2015.

Stepwise regression is a semi-automated process of model building that successively adds or removes variables based on the t-statistics of their estimated coefficients. This technique starts with a null model, adds the variable with the lowest p-value, and continues with forward selection and backward elimination until it reaches the point where no variable can be added or removed. The alpha-to-enter and alpha-to-remove values are often selected between 5% and 15%. We used 5% and 10% as the significance levels for forward selection and backward elimination in our analysis, respectively.

All data analysis was done in the MATLAB environment on a computer with an Intel® Core i5 2.20 GHz CPU and 8 GB of RAM. The analysis is described in detail in the following section. The result of the stepwise regression is shown in Table 4.
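
The selection loop described above can be sketched as follows. This is an illustrative Python version under stated assumptions, not the MATLAB routine used in the study: the helper names are hypothetical, the data are synthetic, and the p-values use a normal approximation to the t distribution, which is only reasonable for large samples.

```python
import math
import numpy as np

def slope_p_values(X, y):
    """Fit OLS with an intercept and return two-sided p-values of the slopes.
    Assumption: normal approximation to the t distribution (large n)."""
    n = len(y)
    A = np.column_stack([np.ones(n), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    sigma2 = resid @ resid / (n - A.shape[1])
    std_err = np.sqrt(np.diag(sigma2 * np.linalg.inv(A.T @ A)))
    t_stats = beta / std_err
    p = np.array([math.erfc(abs(t) / math.sqrt(2.0)) for t in t_stats])
    return p[1:]  # drop the intercept

def stepwise_select(X, y, alpha_enter=0.05, alpha_remove=0.10, max_steps=100):
    """Alternate forward selection and backward elimination until stable."""
    selected = []
    for _ in range(max_steps):
        changed = False
        # Forward step: add the candidate with the lowest p-value, if small enough.
        best_p, best_j = 1.0, None
        for j in range(X.shape[1]):
            if j in selected:
                continue
            p_new = slope_p_values(X[:, selected + [j]], y)[-1]
            if p_new < best_p:
                best_p, best_j = p_new, j
        if best_j is not None and best_p < alpha_enter:
            selected.append(best_j)
            changed = True
        # Backward step: drop the weakest variable if it is no longer significant.
        if selected:
            p_current = slope_p_values(X[:, selected], y)
            worst = int(np.argmax(p_current))
            if p_current[worst] > alpha_remove:
                selected.pop(worst)
                changed = True
        if not changed:
            break
    return sorted(selected)

# Synthetic data: only columns 0 and 2 truly drive the response.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(500, 4))
y_demo = 2.0 * X_demo[:, 0] - 1.5 * X_demo[:, 2] + rng.normal(scale=0.5, size=500)
chosen = stepwise_select(X_demo, y_demo)
```

With thresholds of 5% to enter and 10% to remove, a variable that has just entered cannot be removed in the same step, which is one reason the entry threshold is usually set no higher than the removal threshold.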

Table 4. Stepwise regression result.

Variable	Coefficient	Standard Error	Status	P-Value
Intercept	0.025	0.003	_	<0.0001
Unemployment Rate (%)	0.031	0.033	'Out'	0.339
Average Household Income	-0.056	0.017	'In'	<0.0001
Poverty Level (%)	0.019	0.022	'Out'	0.375
Average Population Age	-0.001	0.014	'Out'	0.972
Population	0.062	0.009	'In'	<0.0001
Whites (%)	-0.005	0.007	'Out'	0.462
College Graduates (%)	0.063	0.010	'In'	<0.0001
Male/Female Ratio	-0.025	0.042	'Out'	0.551
Households with 2+ Cars (%)	-0.013	0.010	'Out'	0.223
Giving 2014	3.996	0.025	'In'	0

The stepwise regression selected these four variables to build the predictive models: average household income, population, percentage of college graduates, and amount of charitable giving in the previous year. However, the correlation matrix shows that average household income is highly correlated with the percentage of college graduates (0.72 in Table 3). It is crucial to avoid multicollinearity, as it can cause unstable coefficient estimates. The Variance Inflation Factor (VIF) is a criterion that measures the multicollinearity between variables. The calculation of the VIF for the β_i estimate is shown in Eq (2).

\mathrm{VIF}_{i} = \frac{1}{1 - R_{i}^{2}}

(2)

In Eq (2), $R_{i}^{2}$ is the coefficient of determination of the regression with X_i on the left-hand side and all other independent variables on the right-hand side. The minimum value of VIF is 1, which indicates a model with completely independent variables. A VIF greater than 5 is considered high and can be evidence of multicollinearity. Table 5 shows the values of VIF for all independent variables in our model. As we suspected, average household income has a relatively high VIF (4.56, the highest in Table 5), suggesting that it can be removed from the model without materially affecting the prediction.
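
The VIF calculation can be illustrated with a short Python sketch. The function name and the synthetic columns are assumptions made for the example; each column is regressed on all the others with an intercept, and the resulting R² is inserted into the VIF formula.

```python
import numpy as np

def variance_inflation_factors(X):
    """Return VIF_i = 1 / (1 - R_i^2) for every column i of X."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    vifs = []
    for i in range(p):
        target = X[:, i]
        # Regress column i on an intercept plus all remaining columns.
        others = np.column_stack([np.ones(n), np.delete(X, i, axis=1)])
        beta, *_ = np.linalg.lstsq(others, target, rcond=None)
        resid = target - others @ beta
        ss_res = resid @ resid
        ss_tot = ((target - target.mean()) ** 2).sum()
        r_squared = 1.0 - ss_res / ss_tot
        vifs.append(1.0 / (1.0 - r_squared))
    return np.array(vifs)

# Synthetic example: column c is almost a copy of column a, column b is unrelated.
rng = np.random.default_rng(1)
a = rng.normal(size=1000)
b = rng.normal(size=1000)
c = a + 0.1 * rng.normal(size=1000)
vifs = variance_inflation_factors(np.column_stack([a, b, c]))
```

In this synthetic case the two nearly collinear columns receive large VIF values while the unrelated column stays close to the minimum of 1.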

Table 5. VIF measurement for independent variables.

Variable	VIF
Unemployment Rate (%)	1.93
Average Household Income	4.56
Poverty Level (%)	3.19
Average Population Age	1.61
Population	1.39
Whites (%)	2.11
College Graduates (%)	2.93
Male/Female Ratio	1.08
Households with 2+ Cars (%)	2.96
Giving 2014	1.08

In the results from stepwise regression, giving in the previous year (2014) had the most significant effect, followed by percentage of college graduates. To summarize, the most significant variables among all 10 tested variables were population, percentage of college graduates, and amount of charitable giving in the previous year. Table 5 shows no collinearity in the model with these three variables.

Predictive modelling approaches

We used the most significant variables identified in the previous section to develop three predictive models based on Multiple Linear Regression (MLR), Artificial Neural Networks (ANN), and Support Vector Regression (SVR). All these techniques use input variables to predict future values of a dependent variable. Their main difference is the way they calculate the weights on the connections between input nodes and output nodes. The MLR technique was introduced in the previous section. Here we describe the ANN and SVR approaches in more detail.

In MLR, the relationship between two or more explanatory (i.e., independent) variables and a response (i.e., dependent) variable is modeled by fitting a linear equation to the learning data. Table 6 shows a list of notations for the MLR technique.

Table 6. MLR notations.

Notation	Definition
i	Independent variable index
P	Number of predictor (Independent) variables
Y	Dependent variable (Output)
β₀	Intercept of the regression model
β_i	Coefficient of Variable 𝑖 in the Regression Model
x_i	Input Variable 𝑖 in the Regression Model (Input)
ε	Random error term

The general form of the MLR model is shown in Eq (3).

Y = β_{0} + β_{1} x_{1} + \dots + β_{i} x_{i} + \dots + β_{P} x_{P} + ε

(3)

In Eq (3), β₀ is the intercept and β₁,β₂,…,β_P are the corresponding coefficients for the independent variables, which are obtained using the generalized least squares method. The error term of the model is denoted by ε, and the output or dependent variable is denoted by Y.
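
As an illustration of how the coefficients of the MLR model can be estimated, the sketch below fits the linear model with NumPy. It assumes ordinary least squares (a simplification of the generalized least squares method mentioned above), and the function names and toy data are hypothetical.

```python
import numpy as np

def fit_mlr(X, y):
    """Estimate [β0, β1, ..., βP] by ordinary least squares (an assumption)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    design = np.column_stack([np.ones(len(y)), X])  # leading column of ones for β0
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    return beta

def predict_mlr(beta, X):
    """Evaluate Y = β0 + β1 x1 + ... + βP xP for each row of X."""
    return beta[0] + np.asarray(X, dtype=float) @ beta[1:]

# Noise-free toy data generated from Y = 1 + 2*x1 - 3*x2.
X_toy = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]])
y_toy = 1.0 + 2.0 * X_toy[:, 0] - 3.0 * X_toy[:, 1]
beta_hat = fit_mlr(X_toy, y_toy)
y_new = predict_mlr(beta_hat, [[3.0, 2.0]])
```

Because the toy data contain no noise, the fit recovers the generating coefficients, and the prediction for the new point follows directly from the fitted equation.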

The ANN model uses an algorithm to train the network and assign weights to the connections between nodes in the input layer, hidden layer(s), and output layer. This algorithm can be Back Propagation, Feed Forward, or Feed Backward. In this work, we use a Feed Forward Neural Network (FFNN), which has been widely used in forecasting applications [28]. Table 7 summarizes all the notations used to explain ANN. All neural network models start with an initial random weight for each connection and improve the weights during the learning process to better predict the output. This process continues until the prediction is close enough to the output, as determined by a threshold value.

Table 7. ANN notations.

Notation	Definition
i	Index unit in input layer or the feature number
h	Index unit in hidden layer
x_i	Independent variable i (input)
P	Number of predictor (Independent) variables
net_h	Scalar net activation of neuron h in hidden layer
net_o	Scalar net activation of output layer
f	Activation function
o_h	Weighted output of neuron h in the hidden layer
w_ih	Weight on the connection from input node i to hidden node h
w_h	Weight on the connection from hidden node h to output node

Fig 1 shows the structure of a one-hidden-layer neural network. Each connection is assigned a weight that is learned during training, and the activation function is applied to the weighted sum arriving at each node. The weight of the connection from input node i to hidden node h is denoted as w_ih, and the weight of the connection between hidden node h and the output node is denoted as w_h.

With only a few minor differences, SVR uses the same principles as the Support Vector Machine (SVM) for classification, which was first developed by Vapnik and Lerner [29]. SVR uses the same concepts to train the model for prediction purposes. The notation used for describing SVR is shown in Table 8.
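
A forward pass through such a one-hidden-layer network can be sketched in Python using the notation of Table 7. This is a minimal sketch under assumptions: bias terms are omitted, a logistic activation is used in the hidden layer with a linear output, and the weights and inputs are made-up numbers rather than trained values.

```python
import math

def logistic(z):
    """Logistic (sigmoid) activation function f."""
    return 1.0 / (1.0 + math.exp(-z))

def forward(x, w_ih, w_h):
    """Compute the network output for one input vector.

    x     : list of P inputs x_i
    w_ih  : w_ih[i][h] is the weight from input node i to hidden node h
    w_h   : w_h[h] is the weight from hidden node h to the output node
    """
    n_hidden = len(w_h)
    # net_h: weighted sum arriving at each hidden neuron (no bias, by assumption)
    net_h = [sum(x[i] * w_ih[i][h] for i in range(len(x))) for h in range(n_hidden)]
    # o_h: output of each hidden neuron after the activation function
    o_h = [logistic(v) for v in net_h]
    # net_o: weighted sum at the output node; the linear activation returns it unchanged
    net_o = sum(w_h[h] * o_h[h] for h in range(n_hidden))
    return net_o

# Hypothetical network with 3 inputs and 2 hidden neurons.
x_example = [0.2, 0.5, 0.1]
w_ih_example = [[0.0, 1.0], [0.0, -1.0], [0.0, 2.0]]
w_h_example = [2.0, 1.0]
prediction = forward(x_example, w_ih_example, w_h_example)
```

Training would adjust `w_ih` and `w_h` to reduce the prediction error; only the prediction step is shown here.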

Table 8. SVR notations.

Notation	Definition
j	Index unit of j^th data point
m	Number of data points
x_j	Input vector for j^th data point
t_j	Target vector for j^th data point
C	Cost of not falling inside the SVR tube (Penalty)
ϵ	The width of the tube (Acceptable deviation)
$ζ_{j}^{+}$	Positive deviation from acceptable region for j^th data point
$ζ_{j}^{-}$	Negative deviation from acceptable region for j^th data point
w_SVR	Regression line weight vector in SVR
b	Bias term
L_ϵ	Loss function of the tube with width of ϵ
*Gamma*	The Kernel function parameter

In SVR, a training data set is introduced as {(x₁,t₁),…,(x_j,t_j),…,(x_m,t_m)}, where x_j ∈ Rⁿ and t_j ∈ R represent the input vector and the target value, respectively. The main purpose is to obtain a function f(x) that deviates from the actual target t_j by no more than the maximum acceptable deviation ϵ for all data points in the training data set. SVR aims to determine this function such that the bias-variance trade-off is balanced. The function should be neither so complex as to cause an over-fitting problem nor so simple that it lacks the capability to capture the patterns. To avoid these problems, choosing a proper value of C is crucial. The maximum deviation from the target value is denoted by ϵ, as shown in Fig 2. The points outside the maximum deviation region are assigned a positive or negative deviation depending on their location.

The regression function is defined in Eq (4). In this expression, b is the intercept and $w_{SVR}$ is the weight vector obtained from SVR. To obtain a simpler function, a smaller norm of $w_{SVR}$ is preferred.

f(x_{j}) = w_{SVR}^{T} x_{j} + b

(4)

The loss function defined in Eq (5) is zero for points that lie within the predefined deviation ϵ shown in Fig 2 and grows linearly with the distance beyond ϵ for points outside that range.

L_{ϵ}(x_{j}, f(x_{j})) = \begin{cases} 0 & \text{if } | w_{SVR}^{T} x_{j} + b - t_{j} | \leq ϵ \\ | w_{SVR}^{T} x_{j} + b - t_{j} | - ϵ & \text{otherwise} \end{cases} \quad \forall j

(5)

Eq (6) shows the objective function, which assigns a penalty cost, C, to the deviation of each point that is outside of the acceptable range. Eq (7) shows the set of constraints in the SVR model.

\text{minimize} \quad \frac{1}{2} w_{SVR}^{T} w_{SVR} + C \sum_{j = 1}^{m} (ζ_{j}^{+} + ζ_{j}^{-})

(6)

Subject to:

\begin{aligned} w_{SVR}^{T} x_{j} + b - t_{j} & \leq ϵ + ζ_{j}^{+} \\ t_{j} - w_{SVR}^{T} x_{j} - b & \leq ϵ + ζ_{j}^{-} \\ ζ_{j}^{+}, ζ_{j}^{-} & \geq 0 \end{aligned} \quad \forall j

(7)

Where $ζ_{j}^{+}$ and $ζ_{j}^{-}$ are the positive and negative deviations from the acceptable region for the j^th data point, respectively.
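
The loss and objective above can be made concrete with a small Python sketch. It evaluates the objective for a given linear model rather than solving the optimization, the function names and numbers are hypothetical, and it relies on the fact that, for fixed weights and bias, the smallest feasible total slack of a point equals its ϵ-insensitive loss.

```python
def predict(w, b, x):
    """Linear regression function f(x) = w^T x + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

def eps_insensitive_loss(w, b, x, t, eps):
    """Zero inside the tube of half-width eps, linear outside it."""
    return max(0.0, abs(predict(w, b, x) - t) - eps)

def svr_objective(w, b, data, eps, C):
    """0.5 * w^T w + C * (total deviation outside the tube).

    For fixed w and b, the minimal slack of each point (positive plus
    negative deviation) equals its eps-insensitive loss.
    """
    regularization = 0.5 * sum(wi * wi for wi in w)
    total_slack = sum(eps_insensitive_loss(w, b, x, t, eps) for x, t in data)
    return regularization + C * total_slack

# Hypothetical model and two (input, target) pairs.
w_demo, b_demo = [1.0, 0.0], 0.0
data_demo = [([1.0, 0.0], 1.05), ([2.0, 0.0], 1.5)]
loss_inside = eps_insensitive_loss(w_demo, b_demo, *data_demo[0], eps=0.1)
loss_outside = eps_insensitive_loss(w_demo, b_demo, *data_demo[1], eps=0.1)
objective = svr_objective(w_demo, b_demo, data_demo, eps=0.1, C=1.0)
```

The first point lies inside the tube and contributes nothing, while the second misses its target by 0.5 and is charged only for the 0.4 that exceeds ϵ. A larger C makes such violations more expensive relative to the size of the weights.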

Results

Using MATLAB, the 9,410 data rows (zip codes) were randomly divided into two mutually exclusive data sets: training (80%) and test (20%). The models were developed using the training data set and tested using the test data set, which was not included in the training part. A 10-fold cross-validation technique was used for model validation.
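
The data partitioning can be sketched in Python as follows. The function names and the random seed are hypothetical, and applying the 10 folds to the training portion only is an assumption, since the text does not state how the folds were formed.

```python
import random

def train_test_split_indices(n_rows, test_fraction=0.2, seed=0):
    """Shuffle row indices and split them into disjoint training and test sets."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    n_test = round(n_rows * test_fraction)
    return indices[n_test:], indices[:n_test]  # (training, test)

def k_fold_indices(indices, k=10):
    """Yield (training, validation) index lists for k-fold cross-validation."""
    folds = [indices[i::k] for i in range(k)]  # k roughly equal folds
    for i in range(k):
        validation = folds[i]
        training = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield training, validation

train_idx, test_idx = train_test_split_indices(9410)
cv_splits = list(k_fold_indices(train_idx, k=10))
```

An 80/20 split of 9,410 rows gives 7,528 training rows and 1,882 test rows, which agrees with the total of 7,527 degrees of freedom reported for the MLR model in Table 10.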

Multiple linear regression

MLR was applied to the training data set with the selected 3 input variables identified as the most significant variables. Table 9 shows a summary of the results from the MLR technique.

Table 9. MLR predictive model parameters.

Variable	Coefficient	Std. Error	t-stat	P-Value	95% Confidence Interval
Intercept	0.021	0.003	7.344	<0.00001	0.016	0.027
Population	0.071	0.010	7.254	<0.00001	0.051	0.090
College Graduates (%)	0.038	0.008	4.757	<0.00001	0.022	0.053
Giving 2014	3.903	0.028	141.228	0	3.849	3.957

The Analysis of Variance (ANOVA) table for this model, shown in Table 10, confirms that the model is significant at the 5% significance level.

Table 10. ANOVA table for MLR predictive model.

Source	DF	Sum of Squares	Mean Square	F-Value	P-Value
Regression	3	290.291	96.764	7310.093	0
Error	7524	99.595	0.013
Total	7527	389.886
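
The entries of Table 10 can be cross-checked by hand. The arithmetic below uses only the values displayed in the table and is offered as an illustrative consistency check, not as part of the original analysis.

```latex
\begin{aligned}
MS_{\text{regression}} &= \frac{290.291}{3} \approx 96.764 \\
MS_{\text{error}} &= \frac{99.595}{7524} \approx 0.013237 \\
F &= \frac{MS_{\text{regression}}}{MS_{\text{error}}} \approx \frac{96.764}{0.013237} \approx 7310 \\
R^{2}_{\text{training}} &= \frac{290.291}{389.886} \approx 0.745
\end{aligned}
```

The reported F-value is reproduced only when the unrounded mean square error is used; dividing by the displayed 0.013 would give about 7443. The ratio of the regression sum of squares to the total sum of squares describes the fit on the training data and is therefore not the same quantity as the test-set R² reported later.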

This model was tested using the test data set. Fig 3 shows the predicted versus actual donations in 2015 on a normalized scale using the MLR technique.

Artificial neural networks

The same training data set was used to train and build a model using ANN. The model has one hidden layer. We varied the number of neurons in the hidden layer from 2 to 10 and found that 2 neurons minimized the percentage of residual variance. The network with one hidden layer of 2 neurons and a learning rate of 0.001 gives the best ANN model among all the tested networks.

Table 11 shows the characteristics of our ANN model. Logistic and Linear activation functions were adopted for the hidden and output layers, respectively, because they produced the least error in a trial-and-error comparison.

Table 11. ANN predictive model parameters.

Layer	Neurons	Activation Function	Min. Weight	Max. Weight
Input	3
Hidden	2	Logistic	-0.379	3.640
Output	1	Linear	-1.320	2.970

The predicted versus actual donations in 2015 on a normalized scale using the ANN technique are shown in Fig 4.

Support vector regression

SVR is the other machine learning technique that we use to build a predictive model for 2015 donations. The same training and test data sets are used in this approach. Table 12 lists the parameters of the best model we could obtain on this data set.

Table 12. SVR predictive model parameters.

Parameter	Value
Epsilon (ϵ)	0.018
C	1.0
Gamma	4.51
Bias	-0.113

Fig 5 shows the results from the SVR predictive model on the training and test data sets, comparing the predicted vs the actual 2015 giving in normalized scale.

Discussion

The difference between the actual (target) and the predicted value is the forecasting error which represents the accuracy measure. To evaluate the performance of the applied models in this study, some accuracy measures have been calculated as follows:

Symmetric Mean Absolute Percentage Error (SMAPE) is calculated using Eq (8). SMAPE is an alternative to the Mean Absolute Percentage Error for cases in which there are zero values for giving.
$\mathrm{SMAPE} = \frac{2}{n} \sum_{i = 1}^{n} \left| \frac{x_{i}^{P} - x_{i}^{A}}{x_{i}^{P} + x_{i}^{A}} \right|$ (8)
Mean Absolute Error (MAE) is calculated using Eq (9). An MAE that approaches zero indicates high model accuracy.
$\mathrm{MAE} = \frac{\sum_{i = 1}^{n} | x_{i}^{P} - x_{i}^{A} |}{n}$ (9)
Root Mean Square Error (RMSE) is calculated using Eq (10). Small RMSE values also indicate good model performance.
$\mathrm{RMSE} = \sqrt{\frac{\sum_{i = 1}^{n} {(x_{i}^{P} - x_{i}^{A})}^{2}}{n}}$ (10)
Normalized Root Mean Square Error (NRMSE) is calculated using Eq (11). An NRMSE value close to 1 indicates poor model performance, whereas a value close to 0 indicates good model performance.
$\mathrm{NRMSE} = \sqrt{\frac{\sum_{i = 1}^{n} {(x_{i}^{P} - x_{i}^{A})}^{2}}{\sum_{i = 1}^{n} {(x_{i}^{A})}^{2}}}$ (11)
Mean Square Error (MSE) is calculated using Eq (12). The value of MSE depends on the scale of the data, so we report the MSE values on the normalized data for easier comparison.
$\mathrm{MSE} = \frac{1}{n} \sum_{i = 1}^{n} {(x_{i}^{P} - x_{i}^{A})}^{2}$ (12)

Where $x_{i}^{P}$ is the predicted value and $x_{i}^{A}$ is the actual value of giving for the i^th observation.
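
The five error measures defined above can be computed together as in the following Python sketch. The function name and the sample values are hypothetical, and treating a SMAPE term with a zero denominator as zero is an assumption that the text does not specify.

```python
import math

def accuracy_measures(predicted, actual):
    """Compute SMAPE, MAE, RMSE, NRMSE, and MSE for paired predictions."""
    n = len(actual)
    errors = [p - a for p, a in zip(predicted, actual)]
    squared_error_sum = sum(e * e for e in errors)
    # Assumption: a term whose denominator is zero contributes 0 to SMAPE.
    smape = (2.0 / n) * sum(
        abs(p - a) / abs(p + a) if (p + a) != 0 else 0.0
        for p, a in zip(predicted, actual)
    )
    mae = sum(abs(e) for e in errors) / n
    mse = squared_error_sum / n
    rmse = math.sqrt(mse)
    nrmse = math.sqrt(squared_error_sum / sum(a * a for a in actual))
    return {"SMAPE": smape, "MAE": mae, "RMSE": rmse, "NRMSE": nrmse, "MSE": mse}

# Hypothetical normalized predictions and actual values for three zip codes.
measures = accuracy_measures(predicted=[0.1, 0.4, 0.0], actual=[0.2, 0.4, 0.0])
```

Since RMSE is the square root of MSE, the two measures always rank models in the same order; reporting both mainly helps with interpretation on the scale of the data.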

Using and comparing several accuracy measures alongside each other enables us to better evaluate the results, as each accuracy measure has its own advantages and limitations and no single measure is universally applicable under all conditions [31]. The values of all accuracy measures for all three models are shown in Table 13.

Table 13. Test results comparison of all three techniques.

Criteria	Definition	MLR	ANN	SVR
SMAPE	Symmetric Mean Absolute Percentage Error	0.829	0.765	0.759*
MAE	Mean Absolute Error	0.067	0.055*	0.057
RMSE	Root Mean Squared Error	0.111	0.098*	0.105
NRMSE	Normalized Root Mean Square Error	0.396	0.350*	0.374
MSE	Mean Squared Error	0.012	0.010*	0.011
Residual	Unexplained variance after model fit	23.294	18.268*	20.837
R²	Proportion of variance explained by model	0.753	0.807*	0.783
Max Err	Maximum error	0.933	0.611*	0.877
R	Correlation between actual and predicted	0.868	0.898*	0.885

* Best-performing technique

As Table 13 shows, ANN outperforms SVR and MLR in predicting charitable giving from the three significant variables on this data set, based on R-Squared (R²) and the error measures. ANN performs best on eight of the nine measuring criteria, although its margin over SVR is small, and SVR is slightly better only on SMAPE (0.759 versus 0.765).

Conclusions and future work

In this paper, we studied the effect of 10 factors (including population, household income, education level, unemployment rate, poverty, and charitable giving in the previous year) on the receipt of charitable giving. Stepwise regression identified the four most influential variables among these 10 tested variables to be average household income, population, percentage of college graduates, and giving amounts for the previous year. To avoid multicollinearity, average household income was excluded, as it was highly correlated with the percentage of college graduates. All three remaining variables have a positive relationship with donation. Previous year donation is the most highly correlated factor, and percentage of college graduates is the least correlated. Three forecasting models were developed using MLR, ANN, and SVR and tested on a test data set. To compare the models, 9 evaluation criteria were calculated for the results of each model. All three models are capable of predicting the amount of future donation in a given region with good accuracy; however, ANN outperforms SVR and MLR in most cases.

Moving forward, we plan to expand our study using other machine learning techniques and to develop comparative studies that identify potential ways to improve our suggested models. There are many reasons why being able to forecast charitable giving will help organizations plan better. For example, the occurrence of natural and man-made disasters may affect giving in both positive and negative ways [32]. For instance, national-level disasters could potentially decrease giving, while a regional natural disaster may increase giving from those unaffected. After the terrorist attacks of September 11, 2001 (New York City, Washington D.C., Somerset County, PA), 65% of US households made charitable contributions [33]. The largest not-for-profit recipient organization received approximately $2 billion USD by the end of 2001. This was the largest giving in recorded US history. Although a blessing, such a large, unexpected, and rapid charitable influx can create its own complications for the receiving organization, as a lack of preparedness may result in mismanagement of the contributions [34]. This example illustrates why charitable organizations, especially those involved in disaster relief efforts, need to be able to accurately forecast charitable giving to promote more equitable and efficient use of resources.

We plan to study the effects of natural or man-made disasters on donor behavior, and to develop models and evaluate them using recent disasters, such as Hurricane Sandy. Furthermore, future research could identify and incorporate national economic factors in the predictive models.

Data Availability

All of the data sources are publicly available and the data sources are mentioned in the paper and references. Data sources are mainly US Federal Tax forms available on the IRS website, and demographic factors available on US Census (we have collected these from zipatlas.com).

Funding Statement

The authors received no specific funding for this work.

References

1. Lacasse R-M, Lambert B. Cracking the complexity code of charities. International Journal of Technology Management & Sustainable Development. 2016;15(2):159–75.
2. Mularz CM, Ülkü MA. Analytics for Nonprofits. Encyclopedia of Business Analytics and Optimization: IGI Global; 2014. p. 115–23.
3. Barrett KS. Panel-data estimates of charitable giving: A synthesis of techniques. National Tax Journal. 1991;44(3):365–81.
4. Gittell R, Tebaldi E. Charitable giving: Factors influencing giving in US states. Nonprofit and Voluntary Sector Quarterly. 2006;35(4):721–36.
5. Navigator C. Giving USA 2015, the annual report on philanthropy. 2015.
6. McKeever BS. The Nonprofit Sector in Brief 2015: Public Charities. Giving, and. 2015.
7. Brown S, Greene WH, Harris MN, Taylor K. An inverse hyperbolic sine heteroskedastic latent class panel tobit model: An application to modelling charitable donations. Economic Modelling. 2015;50:228–36.
8. de Vries NJ, Reis R, Moscato P. Clustering consumers based on trust, confidence and giving behaviour: Data-driven model building for charitable involvement in the Australian not-for-profit sector. PLoS ONE. 2015;10(4):e0122133. doi: 10.1371/journal.pone.0122133
9. Dickert S, Kleber J, Västfjäll D, Slovic P. Mental imagery, impact, and affect: A mediation model for charitable giving. PLoS ONE. 2016;11(2):e0148274. doi: 10.1371/journal.pone.0148274
10. Bekkers R, Wiepking P. A literature review of empirical studies of philanthropy: Eight mechanisms that drive charitable giving. Nonprofit and Voluntary Sector Quarterly. 2011;40(5):924–73.
11. Sikkel D, Schoenmakers E. Bequests to health‐related charitable organisations: a structural model. International Journal of Nonprofit and Voluntary Sector Marketing. 2012;17(3):183–97.
12. Yörük BK. How responsive are charitable donors to requests to give? Journal of Public Economics. 2009;93(9–10):1111–7.
13. Sargeant A. Charitable giving: Towards a model of donor behaviour. Journal of Marketing Management. 1999;15(4):215–38.
14. Hernández-Murillo R, Roisman D. The economics of charitable giving: what gives? The Regional Economist. 2005(October):12–3.
15. Schervish PG, Havens JJ. Social participation and charitable giving: A multivariate analysis. Voluntas: International Journal of Voluntary and Nonprofit Organizations. 1997;8(3):235–60.
16. Leslie LM, Snyder M, Glomb TM. Who gives? Multilevel effects of gender and ethnicity on workplace charitable giving. Journal of Applied Psychology. 2013;98(1):49. doi: 10.1037/a0029943
17. Saunders TJ, Taylor AH, Atkinson QD. No evidence that a range of artificial monitoring cues influence online donations to charity in an MTurk sample. Royal Society Open Science. 2016;3(10):150710. doi: 10.1098/rsos.150710
18. Ranganathan SK, Henley WH. Determinants of charitable donation intentions: a structural equation model. International Journal of Nonprofit and Voluntary Sector Marketing. 2008;13(1):1–11.
19. Ostrower F. Why the wealthy give: The culture of elite philanthropy: Princeton University Press; 1997.
20. Webb DJ, Green CL, Brashear TG. Development and validation of scales to measure attitudes influencing monetary donations to charitable organizations. Journal of the Academy of Marketing Science. 2000;28(2):299–309.
21. Winterich KP, Mittal V, Aquino K. When does recognition increase charitable behavior? Toward a moral identity-based model. Journal of Marketing. 2013;77(3):121–34.
22. Chiang K-P. Prediction of monetary donation: a path analysis. Advances in Marketing. 2003:138.
23. Yen ST. An econometric analysis of household donations in the USA. Applied Economics Letters. 2002;9(13):837–41.
24. Ülkü MA, Bell KM, Wilson SG. Modeling the impact of donor behavior on humanitarian aid operations. Annals of Operations Research. 2015;230(1):153–68.
25.Bekkers R, Wiepking P. Who gives? A literature review of predictors of charitable giving part one: religion, education, age and socialisation. Voluntary Sector Review. 2011;2(3):337–65. [Google Scholar]
26.Wiepking P, Bekkers R. Who gives? A literature review of predictors of charitable giving. Part Two: Gender, family composition and income. Voluntary Sector Review. 2012;3(2):217–45. [Google Scholar]
27.Tukey JW. The future of data analysis. The annals of mathematical statistics. 1962;33(1):1–67. [Google Scholar]
28.Schalkoff RJ. Artificial neural networks: McGraw-Hill New York; 1997.
29.Vapnik V, Lerner AY. Recognition of patterns with help of generalized portraits. Avtomat i Telemekh. 1963;24(6):774–80. [Google Scholar]
30.Vapnik V. The nature of statistical learning theory: Springer science & business media; 2013.
31.Zhang G, Patuwo BE, Hu MY. Forecasting with artificial neural networks:: The state of the art. International journal of forecasting. 1998;14(1):35–62. [Google Scholar]
32.Brown MS, Rooney PM. Giving following a crisis: an historical analysis. 2010.
33.Steinberg KS, Rooney PM. America gives: A survey of Americans’ generosity after September 11. Nonprofit and voluntary sector quarterly. 2005;34(1):110–35. [Google Scholar]
34.Steuerle CE. Managing Charitable Giving in the Wake of Disaster. 2002.
