Abstract
We study the impact that financial intermediation can have on productivity through the alleviation of credit constraints in occupation choice and/or an improved allocation of risk, using both static and dynamic structural models as well as reduced form OLS and IV regressions. Our goal in this paper is to bring these two strands of the literature together. Even though, under certain assumptions, IV regressions can recover accurately the true model-generated local average treatment effect, these are quantitatively different, in order of magnitude and even sign, from other policy impact parameters (e.g., ATE and TT). We also show that laying out clearly alternative models can guide the search for instruments. On the other hand adding more margins of decision, i.e., occupation choice and intermediation jointly, or adding more periods with promised utilities as key state variables, as in optimal multi-period contracts, can cause the misinterpretation of IV as the causal effect of interest.
Keywords: Contract Theory, Financial Intermediation and Econometric Policy Evaluation
1 Introduction
This paper links contract theory models of financial intermediation to econometric policy evaluation. We study a variety of static and dynamic models in which financial intermediation has an impact on productivity through the alleviation of credit constraints in occupation choice and/or an improved allocation of risk. These models of intermediation are structural choice models which are known in the literature, and, more recently, estimated with cross sectional or panel data from developing countries (e.g., Thailand). On the other hand there is a large empirical literature which takes advantage of natural experiments, or instruments, to assess the impact that policy variation and financial institutions are having on incomes, occupations, risk sharing and a variety of other variables (some also in Thailand). Our goal in this paper is to bring these two strands of the literature together. Even though, under certain assumptions, an instrumental variable (IV) strategy can recover accurately the true model-generated local average treatment effects (LATE), these are quantitatively different, in order of magnitude and even sign, from other policy impact parameters (e.g., treatment on the treated TT, the average treatment effect ATE, etc). We also show that laying out clearly alternative models can guide the search for instruments. Mechanism design can deliver natural lotteries or randomization that can be used as sources of identification in empirical analyses. On the other hand, adding more margins of decision, i.e., occupation choice and intermediation jointly, or adding more periods with promised utilities as key state variables, as in optimal multi-period contracts, can cause the researcher to lose key identifying assumptions associated with the IV strategy (e.g., uniformity), so that IV and LATE might no longer coincide. Our objective is to help researchers and policy makers assess accurately the impact of financial intermediation.
The models we use are simple models of discrete choice when there are credit constraints. Typically some households are in financial autarky and others in a fully intermediated sector. There is a cost to entering the financial system, and this is imagined to pick up both the actual cost of traveling to a financial institution (bank) as well as policy distortions which limit access for some agents. We imagine there is variation in the cost/policy in the data, so that some households are financial sector participants and others are not. Indeed, we can generate cross sectional or panel data from a given model (sometimes using parameters which have been estimated from emerging market economies) and ask whether that data would allow an accurate quantification of the gain in the population produced by different policy variations (that emerging market countries had actually experienced). Of course in the model itself we implement the envisioned policy and compare various techniques that assess impact.
A key ingredient in this exercise is heterogeneity in the population, both observed and potentially unobserved. This means that there can be a nontrivial distribution of gains and/or losses in the population, depending on the policy. This is what can make the LATE (identified using the subsidy as instrument) different from TT and ATE, and at realistic parameter values, these can be quite distinct. The logic of the models also makes clear why this is likely to happen. For example, a subsidy can induce relatively inefficient households to enter business, whereas the larger population of businesses consists of talented households who were not on the margin of decision. This makes LATE negative and ATE positive. In other instances heterogeneity in one dimension destroys monotonicity in another. A new, nearby branch of a bank can facilitate intermediation by lowering costs, for those on the margin, and though some talented households will borrow to go into business, other richer, inefficient households will withdraw from low return business and put their money in savings in the bank. Talent is not observed. This makes it difficult without the economic model to assess the impact of intermediation on profits of entrepreneurs. This also means that widely used econometric techniques can potentially give misleading estimates, depending on what one is willing to assume and what one is trying to measure.
In section 2, we focus first on observable and unobservable characteristics such as wealth and talent in a simple model of occupation choice, to clarify some key issues. The credit constraint is extreme: self finance only. The utility functions are linear, but the financing constraint (sale of wealth), makes the problem non-linear. Indeed to guage the impact of this, and for expositional clarity, we begin with exogenous variation in business subsidies for those in financial autarky, computing various measures of welfare gains and comparing the numbers to IV estimates. Section 3 then introduces the full model with intermediation costs and policy variation, distinguishing which instruments are valid for intermediation, and which are valid for occupation choice only.
In section 4 we adopt a long horizon dynamic programming formulation to study endogenous financial deepening in a model with unobserved preferences and financial participation costs. We show that unobserved preference heterogeneity can create the need for instruments, as the decision to go to a bank and the outcome of being banked, and unbanked, can depend on what for an econometrician would be a common error. Importantly, participation costs can be used as instruments. Here IV and LATE coincide if policy variation on the participation costs comes as a surprise or if the participation decisions are made initially and the unobserved shocks in the model are independent and serially uncorrelated. But even in those cases, the identification of other treatment effects, such as TT or ATE, require much more work. Of course anticipated policy changes lowering costs cause the researcher to lose the validity of the instrument, as is well known, and we provide a clear example of this.
Section 5 introduces a model of financial intermediation with moral hazard and unobserved talent. In the model, unobserved talent is an input in the production technology, determining (counterfactual) consumption levels and individual’s preferences for financial intermediation. The key role played by unobserved heterogeneity, a feature shared by all the models considered in this paper, generates heterogeneity in impact parameters. We discuss under what assumptions the economic model generates instrumental variables that can be used to identify a causal effect of financial intermediation on consumption. In other words, we use the model to discuss its consequences for policy evaluations. We study its static and dynamic versions. We show how, in the static case, random assignment of wealth through a lottery can help us to recover instrumental variables at least over specified ranges of ex-ante wealth. Intuitively, we show how individual specific variables affecting the probability of winning the lottery, but independent from potential outcomes associated with intermediation (e.g., costs of entering the randomization), can be used to identify a causal effect of financial intermediation on consumption. Section 5.2 shows however, that in dynamic mechanism design problems the levels of promised utilities in the future matter for choices today, and one so typically looses the availability of instrumental variables even with random assignment of wealth. Essentially promised utilities for tomorrow depend on outcomes today, to induce proper incentives, along with contemporary rewards today. But those promises for tomorrow vary with the costs of intermediation, so we loose the independence of the outcomes from the instrument.
Section 6 presents our conclusions.
2 A Standard Model of Occupation Choice
We start the analysis with a static model of occupational choice without intermediation. We use this simplified financial autarky framework to illustrate some of the general issues which arise later in the paper. This occupation choice model originated with Lloyd-Ellis and Bernhardt (2000) and has been used by Gine and Townsend (2004) and Jeong and Townsend (2008) to understand how occupation choice and the spread of financial infrastructure can create growth in per capita income, movements in inequality, and more generally, to quantify the welfare gains in the population from the spread of financial intermediation.
Let us assume that the individual has linear preferences over current period consumption, of the form u(c) = c, that is u’(c) > 0 and u”(c) = 0. The individual faces the budget constraint c ≤ W where end-of-period wealth W depends on the within-period occupational choice of the agent.1 The individual has beginning-of-period wealth bi, assumed to be observed perfectly by the econometrician, so the initial distribution of wealth is known. This is the source of observable heterogeneity. The individual has an unobserved (from the point of the analyst) business entry cost . Such entry costs are standard in the industrial organization literature. See Salop (1979) for an early example. The individual also has an unobserved talent as wage earner . These two unobserved talents are as if randomly assigned in the population, again a source of unobserved heterogeneity. For simplicity, we assume that θW and θE are independent. We denote by fθj (·) the density function of θj with j = {E, W}, and we assume E (θW) = E (θE) = 0. We put additional structure on these densities in future sections. The literature cited earlier did not include unobserved talent in wage work.
The occupational choice of the individual is between enterprise and wage work. These two alternatives can be described by their associated potential outcomes. Specifically, for individual i we have that end-of-period wealth is the sum of initial wealth plus within-period earnings,
| (1) |
Here w is the market wage for (unskilled) labor2 and represents the profit function obtained after solving the production/profit maximization problem
| (2) |
| (3) |
The production function technology f(k, l) is common to all potential firms. Here labor hired l is measured in efficiency units, not number of people per se. k is the level of capitalization measured in units of wealth. In financial autarky, the unobserved entry cost and capital k must be self-financed from wealth bi. A household is said to be constrained when capital is equal to total wealth minus setup costs, i.e., , and this is binding. Indeed in the original model, if it is simply not possible to establish a business. In this case we can not ask what would be the earnings of someone who has not entered business for that reason. We modify the model below to take this into account. On the other hand, this constraint has been used in structural estimation via likelihood methods as it provides a source of identification. We discuss this point in section 2.2 below.
The decision rule associated with this occupation choice model can be presented as:
Therefore, if we denote by D a binary variable such that D = 1 if the agent becomes an entrepreneur, and 0 otherwise, we can write
This model is standard in the development literature. The model can be interpreted more generally as a Roy model (Roy, 1951) in which the occupational selection is based, given the individual’s talents and , and wealth bi, on the comparison of the potential gains.3
2.1 Standard Econometric Approaches for The Analysis of the Impact of Occupational Decisions
We focus on a simple issue: whether we can identify the effect of occupation choice on earnings using a reduced form approach instead of the full structural model.
In this static model the econometrician observes either or , depending on whether the choice Di = 1 or Di = 0 is taken by the individual i. Thus, if we denote by Yi the end-of-period observed outcome we have:
where without additional structure profits are non-linear in entrepreneur talent (), wealth (bi), and market wage (w) . However, the empirical literature primarily uses linear and separable models. That is,
| (4) |
This set-up is particularly attractive if one notes that
which can be expressed as a linear regression model
| (5) |
where , and the term in parenthesis (ϕbbi + (ϕw − 1) w) represents the gain in gross income that does not depend on unobserved talent. Notice that the random variable Di is by construction correlated with ε)i, so the OLS regression of observed earnings onto an occupational dummy (conditioning on wealth)
would provide a biased estimator of this gain, ϕbbi + (ϕw − 1) w. We illustrate the consequences of this selection problem below. Importantly, the interaction between unobserved talents, potential outcomes and occupational choice that generates the selection problem is not a result of a linear and separable profit function but a general consequence of the theoretical framework with unobserved talent and endogenous selection.
A widely used alternative is the instrumental variable method. In order to consider this approach, we introduce a policy distortion (instrument) into the model. This distortion affects occupation choices in a simple way. Specifically, we assume the existence of an exogenous subsidy that increases ex-post profits at the end-of-period by ψ. This subsidy is randomly assigned in the population, so that ψ is a random variable with ψ > 0 and known to the econometrician even if the choices of the household is to be a wage earner. Intuitively, it can be interpreted as an experiment or exogenous policy treatment affecting the occupation choices of the individuals but received only if the choice is to setup a firm. However, this subsidy cannot be used to finance k and so the constraint 0 ≤ k ≤ b − θE is unaltered.
The policy distortion impacts the decision rule:
where ψi represents the subsidy to agent i in the event of becoming an entrepreneur. More simply, and to emphasize the role of ψi, we use the notation D(ψi) below but clearly this binary variable is a function of other observable and unobservable variables. We assume that talents (θE,θW) and subsidy ψ are independent. Indeed, the government cannot see θE (or θW) but has total control over the random subsidy. The subsidy ψ affects the decision rule, but not the potential outcomes net of the subsidy, as it enters additively. Therefore, the maximization problem of the household as a firm, if it becomes a firm, and its choice of k and l are unaltered. It gets the subsidy independent of the behavior as a firm.
The subsidy ψi appears to be a valid instrument. It influences choices but not the potential outcomes.4 Additionally, in this setup the subsidy satisfies the uniformity/monotonicity condition (Imbens and Angrist, 1994; Heckman et al., 2006). That is, for each individual an increase (decrease) in the subsidy unambiguously increases (reduces) the chances of becoming an entrepreneur. Indeed, suppose that the subsidy can take on two values and . In this case, and without imposing a linear separable model for profits, we can use the instrument ψ to estimate
which, under the assumption of uniformity, identifies the local average treatment effect (LATE) in income for those in the population induced to enter entrepreneurship due to the change of ψ from to (the treatment here is to become an entrepreneur), or more formally
This parameter does not pick up the earnings difference for those who would be entrepreneurs, versus wage earners, regardless of the value of the instrument. Instead, the local average treatment effect ΔLATE naturally provides the answer to a policy experiment.5
Given that the model features heterogeneous treatment effects, we can complete the analysis by computing two alternative treatment effects: the treatment on the treated ΔTT (average benefits of becoming an entrepreneur for individuals that actually decide to become entrepreneur) and the average treatment effect ΔATE (the earnings gain or loss of becoming an entrepreneur versus a wage earner in the entire population). Specifically, and presenting the treatment parameters for a particular wealth level b, we have:
| (6) |
| (7) |
If there were no heterogeneity or all heterogeneity were observed, then all these effects (including LATE) would be equivalent (see Heckman and Vytlacil, 2001). Otherwise, ΔTT (b) and ΔATE (b) depend on counterfactual wages and profits for a given wealth level b, and the estimation of these parameters is not straightforward.
2.2 Parametric and Semi-Parametric Identification of Treatment Effect Parameters
Following Gine and Townsend (2004), we assume
and the profit function can be written
| (8) |
where , , . The model delivers a quadratic occupation partition as depicted in figure 1 (Panel A) and a nonlinear profit function.
Figure 1.
Occupational Choice Maps and The Effect of the Subsidy
For expositional simplicity, we set θW = 0 and assume in figure 1A. The points θE*, b* and determine entirely the can be shape of curves. the These points expressed of functions C0(w), C1(w) and C2.
This framework also allows us to illustrate the effect of the subsidy. Panel B in figure 1 shows the effect of moving ψ from to . This change essentially shifts the line of indifference vertically upward as the subsidy simply adds to the net profits of entrepreneurs. (This upward shift is not present when the household is constrained by beginning-of-period wealth). Now for every value of wealth b there exists a group of θE households who weakly shift into business. The impact of the subsidy is “uniform” (or monotone in the language of Imbens and Angrist, 1994), that is, the movement is (at most) in one direction only. This is the group of individuals that provides the source of variation used when estimating ΔLATE.
Finally, under the assumption σ2/ρ = β and optimal capital (k* = b − θ), we can obtain linear profit functions. We want to emphasize that this approximation is not designed to exactly characterize the economic model but to show how to link the theory with common econometric practice. Therefore, from this point forward, we follow the traditional econometric approach and assume a linear and additively separable approximation for the profit function.
By itself the assumption of linear and additive separable profit functions is not sufficient for the computation of treatment effects. We need additional structure to deal with the selection problems. Consider first the case of independent and normally distributed unobserved talents, i.e., , . In this context, we can define the probability of being an entrepreneur in our model as
where Φ (·) represents the cumulative distribution function of a standard normal distribution. Therefore, given the normality assumption, the structure of this last expression and with information on occupational choice (D), subsidy (ψ), wealth (b), the observed average wage in the economy (w) and profits (π) for those households with D = 1, we can use a probit model to identify the parameters (ϕw − 1), ϕb, and .
The mean observed profit (conditional on bi and ψi) can be written as:
Given that and are standard jointly normally distributed random variables, we have that
| (9) |
where λ (·) represents the Mills’ ratio.6 Expression (9) justifies the estimation of a linear regression model of observed profits/earnings onto the wage w (intercept), wealth bi and the Mills’ ratio λ (.), to obtain (since is known from the probit), and also . The parameters ϕθ and cannot be identified separately from this regression.
On the other hand, although unobserved, average wages among those choosing to be entrepreneurs can be written as
| (10) |
which depends only on identified parameters, so it can be constructed for any value of wealth and subsidy. Thus, we can compute the treatment on the treated (ΔTT (b, ψ)) as
and, likewise, the average treatment effect ΔATE(bi)
The unconditional version of ΔTT (b, ψi), i.e. ΔTT (b), can be obtained by simply integrating out ψ over the relevant region.
The normality assumption for the identification of treatment parameters can be relaxed at the price of additional conditions. In particular, let θE and θW be two independent random variables distributed according to a (general) joint distribution function fθE,θW (·, ·). As shown in the context of the economic model, these variables, which are unobserved by the analyst, determine profits, wages and occupational choices.
On the other hand, and provided enough data variation, we can non-parametrically estimate the probability of Di = 1 using information on bi, w, ψi and actual choices Di (Matzkin, 1992). Let p (w, bi, ψi) denote this probability, also known in the literature as the propensity score. We can then write the conditional expectation of observed outcome Yi as a function of the probability of selection and wealth:
| (11) |
where and for notational convenience, we use pi instead of p (w, bi, ψi). As shown by Heckman and Vytlacil (2001) we can use this conditional expectation to form ΔTT (b) and ΔATE(b), expressions (6) and (7), respectively, without imposing normality. In particular, these authors show how by computing
usually called the local instrumental variable estimator, the analyst can identify the treatment parameter
where ΔMTE (p, b) represents the treatment effect for those individuals indifferent between occupations given a particular value (p) for the random variable (conditional on wealth level b).7 Finally, Heckman and Vytlacil (2001) show that ΔATE(b) and ΔTT (b) can be obtained as weighted averages of ΔMTE (p, b) according to the following expressions:
where ωATE (u) = 1, ωTT (u, b) = Pr (p(w, b, ψ) > u) / ∫ Pr(p(w, b, ψ) > u)du. The argument of integration u is associated with the random variable which is uniformly distributed.8
The question then becomes how to compute ΔLIV (p, b). We can use formal semi-parametric techniques to estimate E(Yi|pi, bi) (expression (11)), and its derivative with respect to p. An alternative and simpler way to estimate this function is by approximating it using a polynomial on pi (see Heckman et al., 2006).9
2.3 Measuring the Impact of Occupations on Income
We illustrate the importance of the previous discussion by computing and comparing different estimates of the effect of occupational decisions on income. In order to do this we simulate data from our model. We utilize the quadratic production function described above, and consider the parameterization in table 1. These parameter values are taken directly from Gine and Townsend (2004).10 We assume a discrete subsidy. Specifically, we assume that the subsidy ψ can take two values: 0 or 1. The value of the subsidy is randomly assigned in the population.
Table 1.
Parameter Values
| Parameter | Value |
|---|---|
| α | 0.54561 |
| β | −0.39064 |
| σ | 0.1021 |
| ξ | 0.2582 |
| ρ | −0.03384 |
| w | 0.048 |
Note: The numbers in this table are obtained from Gine and Townsend (2004). Using data from Thailand, Gine and Townsend (2004) estimate a model of occupational choice with similar characteristics to the one studied in this paper.
Wealth (b), talents (θW and θE) and the subsidy (ψ) are assumed to be distributed as follows:
We first reproduce the analysis that a researcher could carry out using a cross-sectional data set with information on wealth, occupation and observed (factual) incomes.
2.3.1 Using Cross-Sectional Information to Estimate the Effect of Occupational Choice
Table 2 presents the sorting into occupations obtained from model generated cross-sections of 25,000 individuals.
Table 2.
Sorting by Occupational Choices Model of Occupational Choice - Simulated Cross-sectional Data
| Occupational Choice | Sample Size |
|---|---|
| Wage Earners | 6,109 |
| Entrepreneur | 18,891 |
| Constrained | 14,519 |
| Unconstrained | 4,372 |
| Total | 25,000 |
Note: The number of observations in each occupation is the result of the endogenous decision process faced by each of the 25,000 simulated individuals.
Consider an “agnostic” empirical approach in which the researcher tries to estimate the “effect” of the occupational choice on outcomes using a simple regression model. In particular, suppose that observed income (profits/wages) Y is written as:
where Di takes a value of 1 if the individual i is an entrepreneur, and 0 otherwise.11 Notice that in this equation we do not incorporate the talent explicitly. This because in practice the analyst does not observe this variable so it must be excluded from the list of controls (and contained in the error term).
Table 3 presents the estimated effect of Di on Yi obtained using OLS and IV.12 The results show large differences between the two approaches. The results from IV deliver negative impacts whereas OLS would suggest a positive impact. The large discrepancies are a clear manifestation of the biases caused by the selection process. A researcher would draw dramatically different conclusions depending on how she interpreted the policy impact coefficients in the IV regression. In practice, likely instruments can show up as correlated with unobserved error producing the misinterpretation of the results. The analyst must understand the economics behind the selection mechanism before drawing conclusions.
Table 3.
OLS and IV Estimates Model of Occupational Choice - Estimates from Cross-sectional Data
| Parameter | Estimates | |
|---|---|---|
| Δ OLS | Δ IV | |
|
| ||
| κ 0 | 0.606** | 1.189** |
| κ 1 | 1.155** | 1.142** |
| κ 2 | −0.136** | −0.082 |
| κ 3 | 0.457** | −0.356* |
|
|
||
| Average Effect (κ2b‾ + κ3) | 0.303** | −0.450 |
Note: This table presents the parameters obtained from a linear regression of observed income (profits or wages depending on individual’s occupation) on wealth, the occupational dummy, and the interaction between wealth and occupation (dummy). In addition, the column ΔIV presents the estimates when ψ is used as instrument. Overall these results illustrate what the analyst can obtain using information produced from the model (observed outcome, wealth, and occupation) using a reduced-form strategy.
denotes statistical significance at 5%
denotes statistical significance at 1%.
Interestingly, the negative effect estimated by IV is intuitively correct since the individuals switching occupations as a result of the variation in the instruments are those with lower profits (and higher wages). That is, since the subsidy is not included in the income gains, it induces inefficient choices. On the other hand, there are others who benefits from the subsidy but would have chosen to be entrepreneurs in any event, and they had efficient rents which dominate wage earnings.
2.3.2 Using the Structure of the Model to Generate Counterfactual Outcomes and The Causal Effects of Occupational Choices
Given our knowledge of the model, we can study the consequences of exogenous policy changes. Specifically, we provide individuals that did not receive a subsidy when it was originally assigned with the subsidy. We then use the sample of individuals switching occupation due to the change in subsidy status (from ψ = 0 to ψ = 1) to compute the model generated local average treatment effect (ΔLATE (1, 0)) (i.e., the average effect of the treatment for those individuals switching occupations as a result of a change in the instrument). However, since occupation status also depends on wealth, we first compute ΔLATE (1, 0; bk) where bk represent the k-th percentile of the wealth distribution, and then we compute ΔLATE (1, 0) as the (weighted) averages of ΔLATE (1, 0; bk).13
As a result of our experiment 1,861 of our original wage earners become entrepreneurs. This is precisely the group from which we can compute the model generated local average treatment effect. Additionally, from our knowledge of the model we can directly compute the average treatment effect (ATE) and the treatment effect of those treated (TT).
Table 4 presents the model generated treatment effects. Notice the similarities between the model generated LATEs (ΔLATE in table 4) and the IV effect estimated using the cross-sectional data sets (ΔIV in table 3). The discrepancies can be attributed to the linear approximation used in the regression model. We relax this assumption in the next sections.14 In our model, ΔLATE is negative as the subsidy induces low productivity individuals to enter business and the subsidy is not counted as part of the gain. This is exactly the same conclusion draw in the context of table 3.
Table 4.
Model Generated Treatment Paramaters Model of Occupational Choice - The Causal Effects of Occupation on Income
| Parameter | Value |
|---|---|
| Δ ATE | 0.619 |
| Δ TT | 1.270 |
| ΔLATE(1, 0) | −0.459 |
Note: The numbers in this table represents the model’s underlying treatment parameters associated with the effect of occupation on income (or model-generated treatment parameters). In order to obtain them, we use the structure of the model to simulate data on wages, profits and choices for 25,000 individuals. The analyst would need to characterize the structure of the model (counter-factual outcomes and decision rule) before producing these treatment parameters (as opposed to a reduced form strategy).
TT and ATE on the other hand are positive numbers indicating the positive benefits associated with entrepreneurship.
Finally, figure 2 presents the local average treatment effects by percentile of the wealth distribution. The figure presents the model generated LATE (ΔLATE(1, 0; b) and the estimated IV (ΔIV(ψ)(1, 0; b)) by wealth level. As expected, although the model generated LATE fluctuates across levels of wealth (a result of our sample size), on average it is close to what the standard econometric technique delivers.
Figure 2.
Model Generated and Estimated Local Average Treatment Effect by Percentile of the Wealth Distribution
This example illustrates how the economic model delivers a valid instrument, how this instrument allows the identification of a causal effect of interest, and how this causal effect can differ from other relevant treatment parameters.
3 Occupational Choice Under Financial Intermediation
The simple model presented in section 2, with the subsidy to firms, can be easily extended to incorporate an intermediated sector. The analysis in Gine and Townsend (2004) does exactly that. We follow their approach. The underlying model in this case is similar to the model in section 2, but now there is borrowing and lending of capital and wealth. We denote by Qi the individual-specific cost of using the intermediated sector. Examples of Qi include travel time to district center or branch office, whether or not a particular intermediary has been active in a city or village according to history, particular policies of financial institutions which vary in effectiveness, new credit in a city or village divided by the number of households, etc. See Kaboski and Townsend (2005, 2009) and Alem and Townsend (2008) for examples.
We take the initial distribution of Q as given and, for simplicity, focus on a binary Q. The analysis can be extended directly to a continuous-valued Q. We assume Q independent from ψ, and denote by r the (equilibrium) interest rate.
An entrepreneur using the intermediated sector solves the following problem
| (13) |
There is a neoclassical separation between production and household wealth. In effect, the agent can put all his wealth bi in financial markets and earn interest r. Meanwhile the firm (individual) can borrow what it needs to finance k and set up cost . There is lot of indeterminacy in between, in financing, i.e., self invest and borrow/lend the difference with wealth, but real quantities and net income are all pinned down.
The wage is common to both sectors, as households are allowed to work wherever they prefer. They can join an intermediary and put their money in a saving account if they do not become firms.
As before, denote by Di a binary variable such that Di = 1 if agent i decides to become an entrepreneur, and 0 otherwise. Thus, the occupation choice when the agent is participating in the intermediated sector can be described by:
where denotes the resulting profits after solving (13).
In this context, the researcher would observe or depending on the value of . Thus, if we denote by the outcome observed under intermediation, we have
| (14) |
and the cost Qi and the subsidy ψi are not subtracted or added, respectively, from YI, that is, we have gross gains.
On the other hand, recall that without financial intermediation the occupation choice model is
so that the observed outcome under financial autarky (and not counting the subsidy) can be presented as:
| (15) |
In sum, the sub-index k in Yk indicated the sector (financial autarky A or intermediation I). We use this notation in what follows.
The choice of sector, autarky versus intermediation, is made by a simple comparison of the potential associated outcomes and but adjusting in the choice for the subsidy ψi and entry cost Qi. Note that, in general, the heterogeneity does not enter additively into or . Thus, let Υi be a binary variable that takes a value of 1 if the individual decides to use the financial intermediary, and 0 otherwise. Then,
This simple framework allows us to analyze policies regarding the access to financial intermediation.
3.1 Identifying the Effects of Financial Intermediation
In the context of our model, the effect of having access to financial intermediation at the individual level (agent i) is defined as
the average treatment effect (ATE) associated with financial intermediation is
and the average effect of the treatment on those treated (TT) equals
where again for simplicity we use Υi instead of . Additionally, in what follows we use Di and Di (r) to denote the occupation choices and under financial autarky and the intermediated sector, respectively.
In order to analyze whether conventional econometric methods (OLS and IV) allow the identification of any of these effects, we first denote by ξi the observed outcome, i.e.,
which after substituting expressions (14) and (15) can be written as:
| (16) |
This expression illustrates the fact that all potential choices and outcomes play a role even when the researcher is only interested in the impact of having access to financial intermediation.
Following the conventional empirical strategy, we assume profit functions of the form:
Substituting these expressions into equation (16), and after some algebra, we obtain
| (17) |
where so contains all the terms involving unobserved talents and . Using expression (17), we can define the individual effect of having access to financial intermediation, , as
| (18) |
Notice that (conditional on wealth b) depends on the occupation of the individual under each regime and the unobserved talents.
On empirical grounds, expression (17) suggests the estimation of the parameters defining through a regression of ξi on bi, Υibi, Di(1 − Υi), Di(r) Υi, biDi(1 − Υi) and biΥiDi(r). However, since unobserved talents (contained in the error term) affect both choices and potential outcomes, without further assumptions, conventional OLS estimates will not provide unbiased estimates of the parameters in the model.
An alternative approach is the instrumental variable method. The economic model provides one natural instrument for Υi, namely Qi. The cost Qi affects the choice of sector but does not affect the potential outcomes. In addition, notice that changes in Qi produce uniform (monotonic) responses in choice Υi. Consequently, given two values for the instrument Qi, and (lowering the cost so that ) and conditioning on wealth b, we can estimate
| (19) |
to identify the local treatment effect of financial intermediation on income
| (20) |
where and . Intuitively, in this case the local IV (expression (19)) identifies the gains in outcomes (including profits and wages but not the subsidy nor the intermediary cost) for those individuals induced to join the financial system as a consequence of the reduction in intermediation cost.
Importantly, one cannot interpret this parameter as the effect of financial intermediation on profits for entrepreneurs or on income for wage earners. This is because the change in Q also induces changes in occupational decisions in a non-uniform way. That is, changes in Q may endogenously induce individuals to switch from the wage sector to entrepreneurship and vice-versa.
Additionally, although in principle the analyst could use the information on occupations to compute versions of among wage earners and/or entrepreneurs, in general, these estimates would not identify the local causal effects of financial intermediation (as defined in (20)) in those populations. This is again a consequence of the non-uniform responses in occupational decisions induced by the changes in Q. Intuitively, by restricting the estimation of ΔIV(Q) to entrepreneurs (wage earners) the analyst would be erroneously excluding the gains on outcomes from those initial entrepreneurs (wage earners) who would become wage earners (entrepreneurs) as a result of the change in Q. In other words, the analyst would conceptually identify the effect of financial intermediation for those entrepreneurs (wage earners) who would not have switched occupations as a result of the change in the instrument. Given the economic incentives operating in the model, the ΔIV(Q) estimated in this way would only provide a partial response to the question of the effect of financial intermediation among entrepreneurs (wage earners).15 We illustrate this point below.
We can use the same logic to identify the local average treatment effect of occupation (entrepreneurship) on income through the following local IV estimator:
where is Di (r) Υi + Di (1 − Υi). Under uniformity of ψ on , this parameter identifies the local treatment effect of occupation on income.
Analogously to the case of financial intermediation, one cannot use to determine the gains in income for those induced to enter to the financial system as a result of the subsidy. This is because the change in the subsidy does not produce necessarily uniform (or monotonic) movements with respect to intermediation choice.16
The complications of identifying by occupation or by status of financial intermediation are due to the presence of two margins of choice in the model. Strictly speaking, the model includes four categories or possible treatments: wage sector and financial autarky, wage sector and financial intermediation, entrepreneurship and financial autarky, and entrepreneurship and financial intermediation. Indeed, we could phrase our discussion in the context of a model with multiple treatments and multiple instruments. In this framework, the definition of treatment effects is not as straightforward as in the binary case. Specifically, the pair-wise comparison of the outcomes associated with different alternatives needs to be supplemented by considerations of the alternatives left out from the comparison. This adds a new level of complexities to the definition of treatment effects. As an example, notice that we can define the effect of financial intermediation on profits (i.e., the effect of intermediation among businesses) but for individuals effectively participating in the wage sector. This is not an intuitive treatment effect, but it is well defined in the context of a model with multiple treatments.
Heckman et al. (2006) analyze the identification power of instrumental variables in the context of models with multiple treatments and unobserved heterogeneity. They show that provided a variable (instrument) determining the preferences for a particular alternative but excluded from its potential outcome (e.g., instrument Zj determining utility associated with alternative/option j, Vj), in models such as the one considered in this section, a local IV strategy (using Zj as the instrument and based on a regression of observed income on dummy variables describing individual’s observed decisions) would identify the effect of option j versus the next best alternative.17 This result complements our discussion about the difficulties of further interpreting ΔIV(Q) or ΔIV(ψ).18 See Heckman et al. (2006) for additional discussion.
3.2 Example
Following the same logic utilized in our previous example (section 2.3), we investigate the consequences of using different econometric techniques when estimating the effects of financial intermediation and occupational choices. We use the parameterization presented in table 1. Our main results are robust to different parameterizations. In addition to the structure presented in table 1, we assume
with Qi independent of ψi, , and bi. Table 5 presents the sorting simulated from the model for a sample size of 25,000 individuals. Given our parameterization, approximately one fourth of the individuals become wage earners (most of them working under financial autarky), more than half of the individuals are entrepreneur under autarky (most of whom are unconstrained), and the rest are entrepreneurs with access to financial intermediation.
Table 5.
Sorting by Occupational Choices and Access to Financial Intermediation Model of Occupational Choice and Financial Intermediation Simulated Cross-sectional Data
| Occupational Choice | Financial Intermediation | Autarky |
|---|---|---|
| Wage Earners | 940 | 5,072 |
| Entrepreneurs | 2,678 | 16,310 |
| Constrained | - | 14,015 |
| Unconstrained | - | 2,295 |
Note: The number of observations in each cell is the result of the endogenous decision process faced by each of the 25,000 simulated individuals.
3.2.1 Using Cross-Sectional Information to Estimate the Effect of Financial Intermediation and Occupational Choices
Suppose the econometrician focuses first on the impact of financial intermediation proposing the following linear model:
| (21) |
where Υi takes a value of 1 if the individual i has access to financial intermediation, and 0 otherwise.
Table 6 presents the results from OLS and IV on equation (21). The results suggest positive average effects of financial intermediation. However, because of the selection bias, the effect suggested by OLS is almost double the effect estimated by IV.
Table 6.
OLS and IV Estimates of the Effect of Financial Intermediation on Income Model of Occupational Choice and Financial Intermediation
| κ 0 | κ 1 | κ 2 | κ 3 | Average Effect (κ2b‾ + κ3) | |
|---|---|---|---|---|---|
| Δ OLS | 1.015** | 0.954** | 0.313** | 0.227** | 0.585** |
| Δ IV(Q) | 0.933** | 1.071** | 0.076* | 0.236** | 0.323** |
Note: This table presents the parameters obtained from a linear regression of observed income (profits or wages depending on individual’s occupation) on wealth, the financial intermediation dummy, and the interaction between wealth and financial intermediation (dummy). In addition, the row ΔIV(Q) presents the estimates when Q is used as instrument for endogenous financial intermediation. Overall these results illustrate what the analyst can obtain using information produced from the model (observed outcome, wealth and financial intermediation) using a reduced-form strategy.
denotes statistical significance at 5%
denotes statistical significance at 1%.
On the other hand, suppose that the analyst proposes the following linear model to investigate the effects of occupation on income.
where Di takes a value of 1 if the individual i is an entrepreneur, and 0 otherwise. These model follows closely the one presented in section 2.3. Table 7 presents the IV and OLS of the “effect” of Di on Yi from our model generated data. As in the previous section, the OLS estimate delivers a positive effect whereas the IV suggests a negative effect of occupation on income/profit.19
Table 7.
OLS and IV Estimates of the Effect of Occupation on Income Model of Occupational Choice and Financial Intermediation
| τ 0 | τ 1 | τ 2 | τ 3 | Average Effect (τ2b‾ + τ3) | |
|---|---|---|---|---|---|
| Δ OLS | 0.578* | 1.259* | −0.146* | 0.433* | 0.266* |
| Δ IV(ψ) | 1.212* | 1.177* | −0.027 | −0.426* | −0.458** |
Note: This table presents the parameters obtained from a linear regression of observed income (profits or wages depending on individual’s occupation) on wealth, the occupational dummy, and the interaction between wealth and occupation (dummy). In addition, the row ΔIV(ψ) presents the estimates when ψ is used as instrument for the endogenous occupational decision. Overall these results illustrate what the analyst can obtain using information produced from the model (observed outcome, wealth and occupation) using a reduced-form strategy.
denotes statistical significance at 5%
denotes statistical significance at 1%.
3.2.2 Using the Structure of the Model to Generate Counterfactual Outcomes and the Causal Effect of Financial Intermediation and Choices
Table 8 presents the model generated local average treatment effects (LATE). These LATEs are not obtained using econometric techniques, but generated using the structure of the model. The table displays both ΔLATE(ψ)(1, 0) (LATE associated with the effect of occupation) and ΔLATE(Q)(0.25, 1) (LATE associated with the effect of financial intermediation).
Table 8.
Model Generated Local Average Treatment Effects Model of Occupational Choice and Financial Intermediation
| Parameter | Value | Number of Movers |
Direction |
|---|---|---|---|
| ΔLATE(ψ)(1, 0) | −0.466 | 2,219 | From Wage Earner to Entrepreneur |
|
| |||
| −0.444 | 1,548 | From Wage Worker under Autarky to Entrepreneur under Autarky |
|
| −0.278 | 278 | From Wage Worker under Autarky to Entrepreneur under Financial Intermediation |
|
| −0.724 | 322 | From Wage Worker under Financial Intermediation Entrepreneur under Autarky |
|
| −0.519 | 71 | From Wage Worker under Financial Intermediation to Entrepreneur under Financial Intermediation |
|
|
| |||
| ΔLATE(Q)(0.25, 1) | 0.388 | 3,757 | From Autarky to Financial Intermediation |
|
| |||
| 0.355 | 911 | From Wage Worker under Autarky to Wage Worker under Financial Intermediation |
|
| −0.203 | 176 | From Wage Worker under Autarky to Entrepreneur under Financial Intermediation |
|
| 0.752 | 75 | From Entrepreneur under Autarky to Wage Worker under Financial Intermediation |
|
| 0.430 | 2,595 | From Entrepreneur under Autarky to Entrepreneur under Financial Intermediation |
|
Note: The numbers in the table are obtained using the factual and counterfactual information on income generated by the economic model. Specifically, for each individual in the sample we analyze the consequences of modifying the values of the instruments initially assigned. We study the individual’s changes in occupational choices as well as the changes in decisions involving the financial system. Then, for each individual modifying her decisions as a result of the changes in Q or ψ, we compute the associated effects on income. This table presents the average effects on income generated using this logic. It also displays the number of individuals switching decisions as a result of the changes in the instrument (column Number of Movers).
Importantly, our knowledge of the model allows us to generate not only an overall local average treatment effect (bold numbers in table 8) but also the local effects of the treatment for specific groups of individuals. For example, in the case of financial intermediation, we obtain the local treatment effects for individuals switching from “wage-earner under autarky” to “wage-earner with access to financial system” (as a result of the exogenous change in the instrument) as well as the local effect for those “wage-earners under autarky” becoming “entrepreneurs under intermediation” (also as a result of a change in the instrument). This analysis cannot be done without a structural analysis.
Notice, as expected, the model generated overall local treatment effects are very close to the effects estimated using the IV strategy (tables 6 and 7).
Table 8 also displays how the individuals in our model react to changes in the instrument. Interestingly, we observe how changes in Q induces people to move away from entrepreneurship and into the wage sector. In particular, and given our parameterization, 75 entrepreneurs would have become wage earners as a result of a change in Q. This illustrates our previous comment about the difficulty of interpreting ΔIV(Q) as the effect of financial intermediation on profits for entrepreneurs and income for wage earners. The change in Q induces (non-uniform) changes in occupation. A similar logic prevents interpreting ΔIV(Q) as the effect of occupation on income for individual using the financial system or for individuals under financial autarky. As table 8 shows, a change in ψ induces (non-uniform) changes in the financial participation decisions of the individuals in the model.
Finally, table 9 presents the model generated ATE and TT for the effect of financial intermediation and occupational choice. These causal parameters are presented for all the different groups of interest. It is worth noting the significant differences between these treatment effects and the local effects reported in table 8. This illustrates the potential discrepancies between the different treatment parameters. All these parameters represent causal effects, but in our model with selection based on unobserved talents and gains, they all answer different economic questions.
Table 9.
Model Generated Treatment Parameters associated with Occupational Choices and Financial Intermediation
| Treatment Parameter | Alternatives Considered in the Comparison | Value |
|---|---|---|
| Δ ATE | Entrepreneurship vs. Wage Sector under Financial Autarky | 0.619 |
| Δ ATE | Entrepreneurship vs. Wage Sector under Financial Intermediation | 0.607 |
| Δ ATE | Financial Intermediation vs. Autarky for Wage Earners | 0.227 |
| Δ ATE | Financial Intermediation vs. Autarky for Entrepreneurs | 0.215 |
|
| ||
| Δ TT | Entrepreneurship vs. Wage Sector under Financial Autarky | 1.205 |
| Δ TT | Entrepreneurship vs. Wage Sector under Financial Intermediation | 1.734 |
| Δ TT | Financial Intermediation vs. Autarky for Wage Earners | 0.364 |
| Δ TT | Financial Intermediation vs. Autarky for Entrepreneurs | 0.433 |
Note: The table presents the treatment parameters associated with the pairwise comparison of different alternatives in the model conditional on a specific alternative for the margin not considered in the comparison. For example, the first row presents the mean difference between profits and wages for individuals not participating in the financial system. The other rows can be interpreted using the same logic.
4 Dynamics, Risk Sharing, Unobserved Heterogeneity and Occupational Choice
In this section we follow the analysis of Greenwood and Jovanovic (1999) (from hereafter GJ), Townsend and Ueda (2006, 2009), Jeong and Townsend (2008), and Felkner and Townsend (2007) with additional modifications. This literature discusses endogenous financial deepening and how well it fits both mmicroeconomic and macroeconomic data, examining for targeting of government development banks and interest rate distortions that created a crisis and increased government involvement in the banking sector.
Consider a dynamic problem with an infinite horizon. Household i maximizes discounted expected utility
where u(·) is strictly concave and initial wealth is ki,0 = bi,0. E0(·) denotes the expectation given the information available at t = 0. We incorporate unobserved heterogeneity by allowing the individuals to differ in their discount factors. Specifically, we assume , where θi is an individual specific component known to the agent only, and is common knowledge.
In autarky there is a law of motion for wealth as a function of savings, investment in specific occupations, and an exogenous random endowment. Let sit denote the savings rate of household i at date t expressed as a fraction of wealth kit at date t. Let be the proportion of the savings invested in a risky enterprise sector and be the proportion invested in wage sector activities. Additionally, one unit of wealth invested in enterprise E yields units of capital (wealth), whereas one unit invested in wage activity W yields an ex-post rate of return of . The returns and are realized at the end of date t and are unknown when within-period decisions are made.
The law of motion for wealth in autarky is thus
| (23) |
Consumption in autarky at t is the residual, i.e., .
The value function W0 associated with financial autarky, A, exists under standard regularity conditions. It satisfies the Bellman equation:
subject to (23). The function W0 (kit, θi) is strictly concave in kit. Under general preferences, the saving and investment policies are functions of wealth kit. However, for CRRA preferences they are constant. More precisely, under these preferences
where , is the income from enterprise, is the labor income, i.e.,
Therefore, and since by definition βi = β‾ + θi, we can write the equation describing optimal consumption in autarky A as:
where yit is the sum of all sources of income , and where is the unobserved component.
Participation in the intermediated sector on the other hand, allows household to share any idiosyncratic shock and, as in GJ, get perfect advanced information on aggregate shocks , .20 The bank directs all investment as if each household were exchanging shares in its own return stream for shares in a common mutual fund. The law of motion for wealth is then
| (24) |
where τ is the marginal intermediation transaction cost. The value function VI for those in the intermediated sector, I, satisfies the Bellman equation
subject to (24). Again VI(kit, θi) is strictly concave in kit. Policy sit might be a nonlinear function of kit, but again under CRRA preferences, sit is linear in kit. Thus,
where the aggregate shock At is equal to max , and is equal to . Following our previous analysis, we can write:
where and is the unobserved component.
4.1 Once-And-For-All Participation Decisions and Participation Costs as Instruments
In this section we extend the analysis of GJ. In particular, while GJ has endogenous entry determined by the solution to a dynamic programming problem with a period-by-period decision, we consider the special case of a once-and-for-all entry decision at an initial date. For an empirical application of this idea see Alem and Townsend (2008).
Initially at t = 0, given ki0, the household decides whether to participate in the financial sector or not. Once decided there is no going back. Let Zi denote an individual specific participation costs. This subtracts from wealth ki0. Again this cost is meant to capture exogenous variation in the ability to access intermediation, through either policy variation of physical infrastructure. These can be thought of as household specific transaction costs (with any correlation across individuals taken into account by other control variables, which is the way we treat wealth below). In the original GJ model, these costs are subtracted upon entry to the financial system. These are also transaction costs models in the finance literature, e.g. Vissing-Jorgensen (2002).
Then, with VI and W0 strictly concave in kit, the decision to participate depends on participation cost Zi and wealth ki0. More precisely, if we denoted by Ii0 the participation decision, we can write
Additionally, we can write observed consumption at t as a function of potential consumption levels and the participation decision Ii0:
| (25) |
where . Equation (25) shows how, if intermediation is effective for those who choose it, idiosyncratic income yit should not determine consumption.
Notice that the error term in (25), vit, depends on the decision made at t = 0, Ii0, so there is a selection bias argument that prevents the researcher of using OLS in the estimation of (25). In this context, an IV strategy becomes an appealing alternative.
The obvious issue is then how to come up with a valid instrument. Interestingly, the economic model delivers a natural instrument, namely Zi. In order to see this, notice that under autarky and the assumption of CRRA preferences, optimal saving rates and proportions of savings invested in each sectors do not depend on kit. As a result of this, potential consumption in the intermediated and autarky sectors do not depend on the choice of intermediation other than at t = 0 (when the costs are paid). Consequently, although Zi affects the initial choice of intermediation sector versus financial autarky, for all time periods t > 0 the individual participation cost does not affect the potential levels of consumption and . These two conditions make Zi a valid instrument for the effect intermediation on consumption.
Using the instrument Zi the researcher can identify LATE, a causal relationship between financial intermediation and consumption.
Estimating the average treatment effect (ATE) or the treatment effect on those treated (TT) is more delicate. Notice that due to the role of θi in the model, Ii0 is correlated with each of the components of vit, namely and . This structure is similar to the one discussed in the context of the models introduced in sections 2 and 3. As in those cases, the presence of unobserved components and the endogenous selection of the individuals into sectors (based on the comparison of counterfactual outcomes affected by unobserved variables) produces heterogeneity in treatment effects. In this context, we can show that under the assumption of a uniform response of Ii0 to changes in Zi (for all i), the instrumental variable estimator will indeed identify a causal effect of Ii0 on cit (see Heckman et al., 2006; Imbens and Angrist, 1994). But the causal effect identified by IV might be different from, for example, ATE or TT. Only under the special case of no selection on unobserved gains IV, ATE and TT would be identical. However, the presence of unobserved components and the endogenous selection process make this case unlikely.21
4.2 Sequential Participation Decisions
Now suppose the choice of sector takes place each period t, not just initially. Then for those not yet in the intermediated sector at t ≥ 0, but may choose so at t + 1, the value function satisfies the Bellman equation
subject to .
There is a critical family of values k* (Zi, θi) which define thresholds for participation. Under some regularity conditions entry is permanent. However, saving st(kit) and investments , are generally functions of wealth kit even with CRRA utility. It can be established, in fact, that savings and investment in risky assets will rise with kit as that wealth approaches critical entry k* (Zi, θi). See Townsend and Ueda (2006).
Thus variation in Zi determines both k* and pre participation outcomes. Therefore, Zi cannot be considered as a potential instrument. Careful researchers do take into account the impacts of anticipated policy when designing experiments. Subjects are not given full information of what is to happen step by step. See Olken (2007).
4.3 The Identification Power of Policies
Interesting, the existence of unanticipated policies can allow us to identify the effect of financial intermediation on consumption. For example, assume a once-and-never-more policy shifting at some date t* the cost of participation Zi. Then period t* can be interpreted as period zero and the earlier analysis applies (except we have pre-intervention data, and savings and investment are non linear in wealth kit). At period t* we have pre-established positions for those not yet in, and the participation decision for them is:
In effect, the policy change can be interpreted as a once-and-for-all wealth shock in the event of joining the financial sector. Consumption equations are as before. For t > t*, we have
Then, if the agent enters at t*, induced by the sudden and temporary policy change, we can analyze this decision as if it would have been a “once for all” decision. In this case, the policy changes the entry decision, but it does not affect the potential outcomes at t > t*.22
However, if the policy is permanent, then the policy is subject to the same qualifications as the case when choice of sector takes place each period. Subsequently, pre entry behavior for those not yet entering at period t* will be altered.
5 A Model of Financial Intermediation with Moral Hazard and Collateral Constraints
5.1 Statics
In this section, we study the consequences for impact evaluation of a model with financial intermediation with moral hazard. Our model is similar to the one discussed in Paulson et al. (2006) estimated using data from Thailand. This follows the tradition of the earlier literature on occupation choice but attempts to estimate the financial regime in place, i.e., moral hazard versus limited commitment. Here we focus on moral hazard and the endogeneity of the intermediation decision.
We first introduce the static version of the model though for simplicity we suppress the occupation choice and focus on firms. We also focus our interest on the empirical consequences of randomized contracts. We then go to the dynamics.23
We denote by u(ci, ei) the utility function associated with individual i. This function is increasing in consumption ci and decreasing in effort ei. The technology in the model is described by a stochastic production function Pr(qi|ei, θi) where θi denotes outcome and θi represents individual’s talent or type.
The individual as firm must decide whether or not to participate in a lottery determining who gets intermediated. If participating in the lottery, she must pay an amount bi to the bank and, as a results of this, she gets a randomized contract determining if she will have to run her business in autarky or if her output will depend on a transfer agreement associated with credit and insurance. The entry into randomization has a fixed and individual-specific costs Zi. However, Zi produces a natural source of variation that can be used to identify and estimate the effect of financial intermediation on consumption. We illustrate this point in our example.
Overall, the timing of the model is as follow. First, wealth bi is transferred to the bank. Then, the outcome of the lottery is revealed. If the result is autarky, some wealth may be transferred from bank to the individual before she “opens” her business. This amount is such that the on average the individual ends up with the same wealth level as autarky. Let wAi denote the optimal transfer and ΠA (wAi, qi, ei) be the joint distribution of the transfer, production and effort.
Here ΠA (wAi, qi, ei) allows non trivial probabilities but much of the outcomes can be deterministic. The ΠA (wAi, qi, ei) makes the problem linear. The following expressions characterize the problem of determining the optimal transferred level wAi, for each θi type;
s.t.
| (26) |
| (27) |
| (28) |
The first constraint in this program implies that, regardless of the initial transfer, the distribution of output given the effort level is consistent with the production function associated with the individual’s type θi, Pr(qi|ei, θi). The second constraint gives back to the agent his wealth in expectation.
On the other hand, if the outcome of the lottery is “intermediation”, the contract defines first a recommended level of effort, and then a distribution of consumption conditional on output. These choices are described by the joint distribution of consumption, output and effort under intermediation (ΠI(ci, qi, ei)). Again, ei may be deterministic and ci a non trivial function of qi. Additionally, we assume the existence of an individual-specific utility cost kIi in case of being intermediated, as otherwise intermediation would always dominate autarky.
Since effort is only known by the individuals, we need to add the following constraint that makes recommended effort ei weakly dominate any e‾i:
| (29) |
Additionally, the joint distribution of consumption, output and effort under intermediation must be consistent with the production technology Pr(qi|ei, θi). Thus, we require
| (30) |
In sum, the contract can be characterized by the joint distribution of transfer, output and effort under financial autarky ΠA (wAi, qi, ei) and by the joint distribution of consumption, output and effort under intermediation ΠI(ci, qi, ei).
We must impose:
| (31) |
| (32) |
Finally, we impose a zero expected profit condition to our bank. Therefore, the following constraint must hold for each θi type:
| (33) |
This constraint implies that the net expected amount of transfers to the individuals (under both regimes) equals the initial transfer received by the bank.
Program 1 describes the efficient arrangements given bi.
Program 1
s.t. (26),(27),(28),(29),(30),(31),(32),(33). The outcome of this program U(bi; θi, Zi) is the indirect utility from the contract given a wealth level bi, the individual’s type θi and cost Zi. Notice the important role of kIi. If kIi < 0 we will have intermediation with probability one. On the contrary, a non-negative kIi will make intermediation more attractive for low values of wealth bi. Therefore, the possibility of randomization will occur only for non-negative values of kIi.
5.1.1 The Role of Lotteries and Z as a Valid instrument
Random assignments of wealth can help us to recover instruments at least over specified ranges of ex-ante wealth. Figure 3 illustrates this point.
Figure 3.
Random Assignments of Wealth as a Source of Instruments
For values wealth between bL < bi − Zi < bU a lottery puts mass on participation and autarky points in proportion to the utility distance. That is, suppose that an individual with initial wealth bi in this range forfeits Zi in wealth and enters the lottery with bi − Zi. Then, the effect of cost Zi is to shift ex-ante wealth to the left and increased the probability of loosing the lottery, that is becoming poor and needing the financial system.
Figure 3 shows that when bi < bL, intermediation is chosen with probability one, and those agents do not play the lottery (and do not pay costs Zi).
The point is that in the relevant range of wealth (and only in that range) costs Zi affect the probability of participation without changing outcomes associated with the participation decision. This logic produces the instrument. Additionally, changes in the instrument produce uniform or monotonic responses in the chances of getting access to the financial system. Therefore, even under the presence of unobserved talent driving consumption levels and probabilities of intermediation, the IV strategy will identify a causal effect associated with financial intermediation.
We note that ex-ante expected utility is a function of the instrument and we come back to this in a consideration of dynamics.
5.1.2 Example
In order to understand the consequences of our analysis for the impact evaluation of financial intermediation, we generate data from our theoretical model and estimate what the effect of financial intermediation would be using different econometric techniques. Specifically, we use our model to generate data on consumption, wealth and financial intermediation for a sample of approximately 1,800,000 individuals.24 As previously explained, talent plays a critical role in our theoretical model, but since talent is observed only by the individual, we do not condition on it.
Table 10 presents our parameterization of the theoretical model. In our data, we observe 67.90% of the individuals (endogenously) reporting financial intermediation. Notice that wealth (b) and the instrument (Z) are defined as continuous random variables. However, once we identify the region in which randomization is non trivial, we solve the model for a set of discrete values of b and Z. Specifically, we work with ten values for both wealth (b1, … , b10) and the instrument (Z1, … , Z10).25 This not only allows us to make the numerical solution of the theoretical problem feasible but also to mimic what an analyst would face in reality.
Table 10.
Model of Financial Intermediation with Moral Hazard and Collateral Constraints Parameterization
| Utility Function | u(c, e) = −100c−1.5 − v(e) |
| Dis-utility of Effort | v(0.06) = 2.9, v(16) = 3 |
| Probability of High Output | |
| Effort Grid | e ∈ {0.06, 16} |
| Output Grid | q ∈ {0.5, 15} |
| Cost of Intermediation | κI = 0.1 |
| Talent (θ) and Wealth (b) | |
| Instrument (Z) | Z ~ U(0, 1) with Z independent from θ and b |
First, we estimate the effect of financial intermediation using OLS and IV techniques. We denote by ci and Di observed consumption and financial intermediation (dummy variable), respectively. bi denotes individual’s wealth level. We carry out the estimation considering two different models of consumption, as illustrative:26
| (34) |
| (35) |
where, in (35), the dependency of the coefficients α and β on values of wealth (bk) indicates that the equation is estimated for each bk. In this manner, the second equation represents a more flexible functional form than the first (and standard) model. The needs for instruments comes from the fact that intermediation Di is an endogenous variable. Additionally, given the presence of unobserved talent and the endogenous selection mechanism driving the intermediation decisions, behind expressions (34) and (35) we have a model with heterogeneous treatment effects.
We denote by ΔOLS(bk) and ΔIV (bk) the effect of financial intermediation on consumption (conditional on wealth) obtained from model (35). Table 11 presents our results. The results are ordered increasingly on wealth (i.e., b1 < b2 < … < b10). The last two rows of table 11 present the overall effects obtained from (34) and (35) (across wealth levels). The comparison of these columns illustrates the empirical consequences of imposing a priori the restricted functional form (34), i.e., and for all (k, k’), and the biased results delivered by OLS.
Table 11.
IV and OLS Estimates computed by Wealth Level - Static Model
| Wealth Level |
ΔIV (bk) (1) |
ΔOLS(bk) (2) |
% Difference |
|---|---|---|---|
| b 1 | −0.91 | −0.87 | −4.8% |
| b 2 | −2.02 | −1.85 | −8.9% |
| b 3 | −2.92 | −2.83 | −3.1% |
| b 4 | −3.97 | −3.92 | −1.4% |
| b 5 | −4.80 | −4.91 | 2.3% |
| b 6 | −5.76 | −5.97 | 3.5% |
| b 7 | −6.91 | −7.00 | 1.3% |
| b 8 | −7.91 | −8.04 | 1.7% |
| b 9 | −9.09 | −9.04 | −0.5% |
| b 10 | −9.75 | −10.05 | 3.0% |
|
| |||
| Overall Effect-Restricted Model (Equation (34)) | −5.40 | −5.72 | 5.6% |
| Overall Effect-Full Interactions (Equation (35)) | −5.38 | −5.45 | 1.3% |
Note: All estimates are statistically signi cant at 5% level.
Additionally, and following Imbens and Angrist (1994), we can decompose the IV estimates (ΔIV (bk)) presented in table 11 into its local components. Specifically, we can write:
where , and the weights are such that λl(bk) ≥ 0 and Σlλl(bk) = 1 for all bk with k = 1, … , 10. Table 12 presents the estimated ΔIV (Zl+1, Zl; bk) obtained using data generated from the model, whereas table 13 presents the associated weights. The variability of ΔIV (Zl+1, Zl; bk) demonstrates the presence of unobserved heterogeneity in our model across levels of wealth. The IVs presented in table 11 gives a partial picture of the local effects contained in the data.27
Table 12.
Local IV Estimates by Wealth Level ΔIV (Zl+1, Zl; bk) Static Model
| Wealth Level | (Z2, Z1) | (Z3, Z2) | (Z4, Z3) | (Z5, Z4) | (Z6, Z5) | (Z7, Z6) | (Z8, Z7) | (Z9, Z8) | (Z10, Z9) | ΔIV (bk) |
|---|---|---|---|---|---|---|---|---|---|---|
| b 1 | 11.89 | −8.95 | −0.29 | −5.53 | 7.73 | −10.81 | 5.60 | −6.13 | 9.03 | −0.92 |
| b 2 | −7.49 | −1.31 | 1.29 | −1.32 | −2.97 | −6.23 | −2.69 | 1.74 | 1.05 | −2.02 |
| b 3 | −3.36 | −2.79 | −3.17 | −1.63 | −0.93 | −6.39 | −4.06 | 0.10 | −4.17 | −2.92 |
| b 4 | −3.76 | −3.50 | −1.95 | −7.04 | −5.36 | −2.91 | −2.22 | −3.25 | −6.09 | −3.97 |
| b 5 | −7.05 | −3.88 | −2.43 | −4.84 | −6.72 | −7.51 | −3.65 | −1.40 | −5.79 | −4.80 |
| b 6 | −2.84 | −5.24 | −3.58 | −8.85 | −5.27 | −7.12 | −6.86 | −3.94 | −5.26 | −5.76 |
| b 7 | −5.58 | −7.88 | −4.96 | −6.42 | −8.63 | −6.88 | −8.22 | −6.18 | −6.00 | −6.91 |
| b 8 | −9.72 | −6.58 | −8.43 | −10.29 | −10.13 | −4.44 | −10.11 | −3.57 | −6.01 | −7.91 |
| b 9 | −4.17 | −8.68 | −12.31 | −15.37 | −1.80 | −12.74 | 1.77 | −19.21 | −7.99 | −9.10 |
| b 10 | −16.29 | −11.53 | 0.92 | −24.90 | −12.82 | 7.29 | −22.50 | 5.74 | −17.65 | −9.75 |
Table 13.
IV Weights by Wealth Level (λl(bk)) Static Model
| Wealth Level |
(Z2, Z1) (1) |
(Z3, Z2) (2) |
(Z4, Z3) (3) |
(Z5, Z4) (4) |
(Z6, Z5) (5) |
(Z7, Z6) (6) |
(Z8, Z7) (7) |
(Z9, Z8) (8) |
(Z10, Z9) (9) |
Σlλl(bk) (1)+(2)+…+(9) |
|---|---|---|---|---|---|---|---|---|---|---|
| b 1 | 0.05 | 0.10 | 0.13 | 0.15 | 0.15 | 0.14 | 0.13 | 0.10 | 0.05 | 1.00 |
| b 2 | 0.05 | 0.10 | 0.13 | 0.15 | 0.14 | 0.15 | 0.13 | 0.10 | 0.05 | 1.00 |
| b 3 | 0.05 | 0.10 | 0.13 | 0.15 | 0.15 | 0.15 | 0.13 | 0.10 | 0.05 | 1.00 |
| b 4 | 0.05 | 0.10 | 0.13 | 0.15 | 0.15 | 0.15 | 0.13 | 0.10 | 0.05 | 1.00 |
| b 5 | 0.05 | 0.10 | 0.12 | 0.15 | 0.15 | 0.15 | 0.13 | 0.10 | 0.05 | 1.00 |
| b 6 | 0.06 | 0.10 | 0.13 | 0.14 | 0.15 | 0.15 | 0.13 | 0.10 | 0.05 | 1.00 |
| b 7 | 0.05 | 0.10 | 0.13 | 0.14 | 0.15 | 0.15 | 0.13 | 0.09 | 0.05 | 1.00 |
| b 8 | 0.05 | 0.10 | 0.12 | 0.16 | 0.15 | 0.14 | 0.13 | 0.10 | 0.06 | 1.00 |
| b 9 | 0.06 | 0.10 | 0.13 | 0.15 | 0.14 | 0.15 | 0.13 | 0.09 | 0.05 | 1.00 |
| b 10 | 0.06 | 0.11 | 0.12 | 0.13 | 0.15 | 0.14 | 0.14 | 0.09 | 0.06 | 1.00 |
Importantly, the local IV estimates presented in table 12 have a causal interpretation. Specifically, they identify the effects of the treatment for those individuals induced to switch regime as a result of a change in the instrument. In other words, ΔIV (Zl+1, Zl; bk) identifies where and denote the consumption levels under intermediation and autarky, respectively, and Di(Zl) denotes the value for the dummy variable associated with intermediation when individual i faces Z = Zl. This causal interpretation of IV comes from the fact that Z is a valid instrument and from the assumption of a uniform (or monotonic) effect of Z on D (from the lottery). In the next section, we show how this causal interpretation of local IV breaks down in the context of a dynamic model.28
As previously discussed, in the context of models with unobserved heterogeneity, reduced form approaches (including IVs) might not give estimates of the average effect of the treatment (ATE) or the treatment effect on those treated (TT). This is because each of these parameters depend in one way or another on counterfactual outcomes, and therefore, their estimation requires additional structure. Fortunately for us, full control of our model allows us to generate these counterfactual states, and consequently, all the treatment parameters. Table 14 presents these treatment parameters. It also presents the average treatment effects for those untreated or TUT. We immediately observe that there are differences among the treatment parameters, which is again a manifestation of the presence of unobserved heterogeneity.
Table 14.
Model Generated Treatment Parameters: ATE, TT and TUT, by Wealth Level Static Model
| Wealth Level |
ΔATE(bk) | ΔTT (bk) | ΔTUT (bk) | Consumption Under Financial Autarky |
|---|---|---|---|---|
| b 1 | −0.81 | −0.81 | −0.83 | 22.92 |
| b 2 | −1.83 | −1.84 | −1.82 | 23.95 |
| b 3 | −2.85 | −2.87 | −2.79 | 24.97 |
| b 4 | −3.87 | −3.88 | −3.86 | 25.99 |
| b 5 | −4.89 | −4.90 | −4.87 | 27.01 |
| b 6 | −5.91 | −5.92 | −5.91 | 28.03 |
| b 7 | −6.93 | −6.93 | −6.94 | 29.05 |
| b 8 | −7.95 | −7.93 | −7.99 | 30.07 |
| b 9 | −8.97 | −8.97 | −8.98 | 31.09 |
| b 0 | −9.99 | −10.00 | −9.99 | 32.11 |
|
| ||||
| Overall | −5.40 | −5.17 | −5.90 | 27.52 |
5.2 Dynamic Mechanism Design
Suppose now there are two time periods in our contract model. We continue defining Zi as a cost of entering the lottery which is subtracted from wealth bi.
We denote by the wealth level in the second period which is a “decision” variable in the context of the first period. Individuals in our model are allowed to switch from intermediation today to autarky tomorrow, and also the opposite.
The program introduced in section 5.1 already determined the optimal arrangement in the second period. Importantly, in the first period, not only consumption but also the characteristics of the future arrangement are used to reward individuals. But indirect utility carries all the information from the second period arrangement that is relevant for the characterization of the optimal contract in the first period as only utility matters for incentives. We use this fact in what follows.
When the result of the lottery is autarky in the first period, a particular distribution of transfer to the individual, wAi, is determined. Then, the individual decides the amount of effort, the output level is obtained, and finally, the individual decides how to split his resources qi + wAi between consumption today and wealth level for the second period. Thus, the program determining the optimal policy given the available resources bi can be written as:
s.t.
| (36) |
| (37) |
| (38) |
In this dynamic version of the model, the contract under intermediation can be characterized by the joint distribution of next period’s wealth, and the first period levels of consumption, production and effort, .
The incentive constraint under intermediation in the first period is,
| (39) |
and the constraint securing that be consistent with the stochastic technology is
| (40) |
Finally, the probability constraints are:
| (41) |
| (42) |
The bank faces the following zero expected profit condition:
| (43) |
This constraint assures that the amount initially given to the bank equals the expected amount transferred for the individual either under intermediation or autarky, plus the cost of intermediation Zi. (A similar constraint is already imposed in the second period).
Therefore, given the initial promise , the first period program describing the efficient allocation of resources becomes:
Program 2
Our main interest in this model is the critical role of ex-ante promise utility in the second period . This variable determines the incentives in the first period, and this fact has consequence for the interpretation of Zi as a valid instrument. Note from our earlier discussion that expected utility in the static problem depends on Zi. Now the static problem is the second period problem, and so one sees the intuition that varying levels of utility depend on Z through these second period promised utilities. These promises thus impact current period incentives and so vary with Z. In order to see this, notice the higher is Zi the less surplus there will be during the second period to maintain a given level of promise utility . As a result of this, we might expect U(·) to be a monotone decreasing function of Zi. On the other hand, assignment to intermediation in the second period when Zi is high (low) allows the assignment of a low (high) promise as a threat for bad outcomes in the first period. Therefore, either way the level promise U(·) depends on the instrument Zi. Thus, we lose the desirable properties of the Zi as a potential instrument. Promised utility in the second period depends on Zi and promised utility determines current period incentives
5.2.1 An Example
As in the static case, we study the differences between the model generated treatment parameters and the estimates obtained by a researcher using observational data on consumption, wealth, and intermediation. We use the same parameterization as in the previous case (see table 10, and equations (34) and (35)). In this case, we observe 33.10% of the individuals (endogenously) reporting financial intermediation.29
Table 15 presents the IV and OLS estimates (overall and by level of wealth). In general, the differences between IV and OLS are larger than what we computed in table 11. Table 16 on the other hand, presents the local IV estimates ΔIV (Zl+1, Zl; bk) for k = (1, … , 9), which - jointly with the weights presented in table 17 - produce the IV estimates reported in table 15. We again observe a larger variability in IVs (across wealth levels) compared to the results in table 12. This fact reflects the strong selection process driving the decision into intermediation.
Table 15.
IV and OLS Estimates computed by Wealth Level - Dynamic Model
| Wealth Level |
ΔIV (bk) (1) |
ΔOLS(bk) (2) |
% Difference |
|---|---|---|---|
| b 1 | −1.85 | −1.48 | −25.0% |
| b 2 | −2.59 | −2.16 | −19.7% |
| b 3 | −3.08 | −2.86 | −7.8% |
| b 4 | −3.77 | −3.55 | −6.1% |
| b 5 | −4.46 | −4.24 | −5.2% |
| b 6 | −5.14 | −4.92 | −4.4% |
| b 7 | −5.70 | −5.61 | −1.6% |
| b 8 | −6.42 | −6.30 | −1.9% |
| b 9 | −7.16 | −6.97 | −2.8% |
|
| |||
| Overall Effect-Restricted Model (Equation (34)) | −4.74 | −4.33 | −9.5% |
| Overall Effect-Full Interactions (Equation (35)) | −4.70 | −4.50 | −4.5% |
Note: All estimates are statistically significant at 5% level.
Table 16.
Estimated Local IV by Wealth Level ΔIV (Zl+1, Zl; bk)Dynamic Model
| Wealth Level | (Z2, Z1) | (Z3, Z2) | (Z4, Z3) | (Z5, Z4) | (Z6, Z5) | (Z7, Z6) | (Z8, Z7) | (Z9, Z8) | (Z10, Z9) | ΔIV (bk) |
|---|---|---|---|---|---|---|---|---|---|---|
| b 1 | 3.56 | 3.93 | −1.94 | −8.45 | 1.40 | −6.88 | −5.11 | −1.83 | −0.80 | −1.85 |
| b 2 | −3.80 | −3.23 | −2.08 | −1.64 | −1.70 | −4.54 | −3.19 | −4.19 | −1.72 | −2.59 |
| b 3 | −5.14 | −0.55 | −4.26 | −3.37 | −2.13 | −4.58 | −3.60 | −4.67 | −2.25 | −3.08 |
| b 4 | −7.10 | 0.74 | −4.96 | −3.69 | −2.86 | −6.26 | −5.72 | −3.45 | −3.45 | −3.77 |
| b 5 | −5.58 | −2.92 | −3.32 | −6.42 | −2.64 | −7.46 | −6.08 | −4.38 | −3.83 | −4.46 |
| b 6 | −2.27 | −4.45 | −5.32 | −4.00 | −4.76 | −7.03 | −5.48 | −6.76 | −5.06 | −5.14 |
| b 7 | −9.50 | −2.87 | −5.88 | −6.01 | −5.30 | −7.78 | −5.62 | −5.60 | −5.81 | −5.70 |
| b 8 | −7.44 | −6.39 | −4.18 | −7.43 | −5.67 | −6.46 | −7.87 | −6.27 | −6.54 | −6.42 |
| b 9 | −8.66 | −0.31 | −8.88 | −8.79 | −6.32 | −8.90 | −6.74 | −7.94 | −5.80 | −7.16 |
Table 17.
IV Weights by Wealth Level (λl(bk)) Dynamic Model
| Wealth Level |
(Z2, Z1) (1) |
(Z3, Z2) (2) |
(Z4, Z3) (3) |
(Z5, Z4) (4) |
(Z6, Z5) (5) |
(Z7, Z6) (6) |
(Z8, Z7) (7) |
(Z9, Z8) (8) |
(Z10, Z9) (9) |
Σlλl(bk) (1)+(2)+…+(9) |
|---|---|---|---|---|---|---|---|---|---|---|
| b 1 | 0.03 | 0.09 | 0.09 | 0.12 | 0.25 | 0.08 | 0.13 | 0.11 | 0.10 | 1.00 |
| b 2 | 0.03 | 0.09 | 0.09 | 0.12 | 0.26 | 0.07 | 0.14 | 0.11 | 0.10 | 1.00 |
| b 3 | 0.03 | 0.08 | 0.09 | 0.13 | 0.26 | 0.06 | 0.14 | 0.11 | 0.10 | 1.00 |
| b 4 | 0.02 | 0.07 | 0.09 | 0.13 | 0.25 | 0.06 | 0.15 | 0.11 | 0.10 | 1.00 |
| b 5 | 0.02 | 0.07 | 0.10 | 0.14 | 0.25 | 0.06 | 0.15 | 0.11 | 0.09 | 1.00 |
| b 6 | 0.02 | 0.06 | 0.10 | 0.14 | 0.25 | 0.06 | 0.16 | 0.12 | 0.09 | 1.00 |
| b 7 | 0.02 | 0.06 | 0.10 | 0.15 | 0.24 | 0.06 | 0.16 | 0.12 | 0.09 | 1.00 |
| b 8 | 0.02 | 0.05 | 0.10 | 0.15 | 0.23 | 0.06 | 0.16 | 0.12 | 0.09 | 1.00 |
| b 9 | 0.02 | 0.05 | 0.10 | 0.16 | 0.23 | 0.07 | 0.16 | 0.13 | 0.09 | 1.00 |
Tables 18 and 19 present the model generated treatment parameters. The numbers in these tables are obtained using the counterfactual consumption levels delivered by the model, which would not be available in observational data (as the one used to generate the numbers in tables 15, 16, and 17).
Table 18.
Model Generated Treatment Parameters: ATE, TT and TUT, by Wealth Level Dynamic Model
| Wealth Level |
ΔATE(bk) | ΔTT (bk) | ΔTUT (bk) | Consumption Under Financial Autarky |
|---|---|---|---|---|
| b 1 | −1.49 | −1.47 | −1.51 | 22.00 |
| b 2 | −2.18 | −2.17 | −2.19 | 22.69 |
| b 3 | −2.87 | −2.84 | −2.89 | 23.38 |
| b 4 | −3.56 | −3.52 | −3.58 | 24.07 |
| b 5 | −4.25 | −4.22 | −4.26 | 24.76 |
| b 6 | −4.94 | −4.92 | −4.95 | 25.46 |
| b 7 | −5.63 | −5.59 | −5.64 | 26.17 |
| b 8 | −6.32 | −6.30 | −6.32 | 26.87 |
| b 9 | −7.01 | −7.05 | −7.00 | 27.57 |
|
| ||||
| Overall | −4.51 | −4.17 | −4.69 | 25.04 |
Table 19.
Model Generated Local Average Treatment Effects by Wealth Level ΔLATE(Zl+1, Zl; bk) Dynamic Model
| Wealth Level | (Z2, Z1) | (Z3, Z2) | (Z4, Z3) | (Z5, Z4) | (Z6, Z5) | (Z7, Z6) | (Z8, Z7) | (Z9, Z8) | (Z10, Z9) | ΔLATE(bk) |
|---|---|---|---|---|---|---|---|---|---|---|
| b 1 | −1.37 | −1.51 | −1.77 | −1.55 | −1.28 | −1.76 | −1.64 | −1.60 | −1.45 | −1.55 |
| b 2 | −2.25 | −2.22 | −2.01 | −2.12 | −2.10 | −1.92 | −2.21 | −2.49 | −2.26 | −2.17 |
| b 3 | −2.79 | −2.78 | −2.78 | −2.80 | −2.80 | −2.67 | −2.89 | −3.03 | −3.06 | −2.85 |
| b 4 | −3.48 | −3.41 | −3.45 | −3.40 | −3.51 | −3.49 | −3.58 | −3.62 | −3.75 | −3.53 |
| b 5 | −4.15 | −4.25 | −4.20 | −4.23 | −4.06 | −4.26 | −4.27 | −4.45 | −4.45 | −4.27 |
| b 6 | −4.92 | −4.88 | −4.88 | −4.75 | −4.83 | −4.87 | −4.88 | −5.15 | −5.04 | −4.92 |
| b 7 | −5.81 | −5.51 | −5.58 | −5.69 | −5.43 | −5.39 | −5.61 | −5.67 | −5.89 | −5.62 |
| b 8 | −6.47 | −6.37 | −6.16 | −6.26 | −6.04 | −6.08 | −6.32 | −6.29 | −6.47 | −6.27 |
| b 9 | −6.51 | −6.81 | −6.96 | −6.73 | −7.22 | −7.00 | −6.66 | −7.22 | −7.15 | −7.00 |
The results in table 18 show important differences between the average treatment effect (ATE), the treatment effect on the treated (TT), and the treatment effect on the untreated (TUT). These differences illustrate how the presence of unobserved talent and the sorting mechanism into financial intermediation generate heterogenous treatment parameters. In this context, the analyst must first state the question she wants to answer, and then use the appropriate empirical approach to identify the treatment parameters of interest.
Table 19 on the other hand, presents the model generated local average treatment effects . Given the problematic definition of Zi as a proper instrument, the model generated LATE (table 19) and the estimated local IVs (table 16) are now different. Table 20 summarizes these large differences. In our dynamic model with dynamic incentives, local IVs would not identify the well-defined causal parameter LATE.
Table 20.
Model Generated Local Average Treatment Effect versus Estimated Local IVs, by Wealth Level Dynamic Model
| Wealth Level |
ΔLATE(bk) (1) |
ΔIV (bk) (2) |
% Difference |
|---|---|---|---|
| b 1 | −1.55 | −1.85 | 16.2% |
| b 2 | −2.17 | −2.59 | 16.0% |
| b 3 | −2.85 | −3.08 | 7.6% |
| b 4 | −3.53 | −3.77 | 6.4% |
| b 5 | −4.27 | −4.46 | 4.2% |
| b 6 | −4.92 | −5.14 | 4.2% |
| b 7 | −5.62 | −5.70 | 1.4% |
| b 8 | −6.27 | −6.42 | 2.4% |
| b 9 | −7.00 | −7.16 | 2.3% |
|
| |||
| Overall | −4.50 | −4.70 | 4.3% |
6 Conclusions
This paper links contract theory models of financial intermediation to econometric policy evaluation. We have discussed a variety of economic models with unobserved heterogeneity and endogenous decisions involving financial intermediation. We also analyzed econometric techniques and policy evaluation which are appropriate or inappropriate, depending on the vision of the underlying model, the assumptions one is willing to make, and the data at hand.
Even though, under certain assumptions, an IV strategy can recover accurately a true model-generated causal effect (LATE), these are quantitatively different, in order of magnitude and even sign, from other policy impact parameters (e.g., treatment on the treated and the average treatment effect). We also show that laying out clearly alternative models can guide the search for instruments. Mechanism design can deliver natural lotteries of randomization that can be used as sources of identification in empirical analyses. On the other hand adding more margins of decision, i.e., occupation choice and intermediation jointly, or adding more periods with promised utilities as key state variables, as in optimal multi-period contracts, can cause the misinterpretation of the IV estimates as the causal parameter of interest (e.g., uniformity), so that IV and LATE might no longer coincide.
Our objective is to help researchers and policy makers assess accurately the impact of financial intermediation. In order to identify the impact of financial intermediation, researchers and policy makers need a clear understanding of the role of unobserved heterogeneity (coming from preferences, costs or talents) and the economic mechanisms driving individual’s endogenous decisions. A limited understanding of the economic fundamentals could result in a misinterpretation of policy parameters estimated from observational data. The good news is that there is a wide array of options, so it is a matter of choosing carefully.
Acknowledgments
Research funding from NICHD, NFS, Templeton Foundation, and Bill and Melinda Gates Foundations to the University of Chicago is gratefully acknowledge. The views expressed in this paper are those of the authors and not necessarily those of the funders listed here. We have received helpful comments from Cynthia Kinnan, Benjamin Olken, Marti Mestieri and Gabriel Madeira. Gabriel Madeira provided excellent research assistance.
Footnotes
This is easily modified to allow a choice between savings s and consumption c where c + s ≤ W and preferences are determined by a Cobb-Douglas utility function, giving a (myopic) savings rate.
Although this wage w is taken as given for each individual choice problem, it is consistent with a market clearing equilibrium wage.
See Rubin (1974) and Heckman and Honoré (1990) for a formal exposition of the Roy model.
This since we assume that the subsidy ψ is not correlated with unobserved talents θW and θE.
If the subsidy takes on a finite number of discrete values, and we order them according to their magnitudes (ψ0 < ψ1 < … < ψK), then ΔIV can be written as a weighted average of ΔLATE (ψk, ψk+1) with k = 1, .., K − 1, where the weights are related to the probability of going into business at the various values of the subsidy (see Yitzhaki, 1989; Imbens and Angrist, 1994). Additionally, if we take the limit as subsidy ψk approaches ψk+1, this delivers the marginal treatment effect (MTE) for those households just indifferent to becoming business (see Heckman and Vytlacil, 2001).
Notice that this parameter is the limit version of the average local treatment effect. More specifically, ΔMTE (p, b) = limp’→p ΔLATE (p, p’, b).
The empirical implementation of the local instrumental variable estimator involves the non-parametric estimation of the derivative of E(Yi|pi, bi) with respect to pi. Although the implementation of non-parametric techniques can be considered standard, in small samples they can be infeasible. See Heckman et al. (2008) and Heckman et al. (2006) for different empirical approaches when implementing the local instrumental variable estimator.
Using data from Thailand, Gine and Townsend (2004) estimate a model with similar characteristics to the one studied here.
Notice that this expression follows directly from the theoretical model (see equation (11)).
| (12) |
Formally, suppose individuals decide among J different options. Each option has associated a utility level Vj for j = 1, … , J. Let Dj = 1 if the individual selects the j-th alternative, and 0 otherwise. Furthermore, as in the model of this section, assume Dj = 1 if Vj = max{V1, … , VJ} for j = 1, … , J. Let Yj denote the potential outcome associated with option j. Valid instruments affect choices but are independent from potential outcomes. Let Zj denote the instrument associated with option j. We present the relationship between instrument and options as Vj(Zj), i.e., instrument Zj determines the utility level Vj. Vj also depends on unobserved components which can be correlated with potential outcome Yj. For notational simplicity we leave this dependence implicit. Observed outcome Y can be written as . Heckman et al. (2006) shows that where z = (z1, … , zj, … , zj) and , so that only the variation from Zj is utilized to compute ΔIV (Zj), identifies the effect on outcome of option j versus the next best option.
Notice that when phrasing our model as a model of multiple treatments, intermediation costs Q and subsidy ψ are not valid instruments, in the sense of Zj entering only Vj (see previous footnote), for any of the four alternatives.
| (22) |
The risk sharing role of formal financial institutions is tested in Alem and Townsend (2008).
Notice that if the individual does not know her unobserved preference parameter θi or, alternatively, if she knows θi but for some reason does not act on it, then the selection process would not be based on unobserved gains. Formally, in this case , and the model would produce homogeneous treatment effects.
A literature on sudden devaluations causing wealth losses from dollar denominated loans is not unrelated.
See Karaivanov and Townsend (2009) for further work with the Thai data and the estimation of financial regimes in a dynamic context.
It is worth mentioning that experimenting with different sample sizes suggests that reducing the number of observations produces significant losses in the accuracy of the local IV estimates.
More precisely, we work with (b1, b2, b3, b4, b5, b6, b7, b8, b9, b10) = (10.5, 10.6, 10.7, 10.88, 10.9, 11.1, 11.2, 11.3, 11.4, 11.5), and (Z1, Z2, Z3, Z4, Z5, Z6, Z7, Z8, Z9, Z10) = (0, 0.03, 0.06, 0.1, 0.13, 0.16, 0.2, 0.23, 0.26, 0.3). Given the structure of the model and our ordering, the resulting probabilities associated with the lottery is increasing in Zj and decreasing in bk. We also consider a discrete grid for talent θ. Specifically, we solve the model for (θ1, θ2, θ3, θ4, θ5, θ6, θ7, θ8, θ9) = (0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4). The distribution of talent and wealth generated using the discrete grids respects the joint distribution associated with these random variables presented in table 10.
A more complicated version would include risk sharing. See Alem and Townsend (2008).
The numbers presented in table 11 are obtained using ΔIV (Zl+1, Zl; bk) and the IV weights presented in tables 12 and 13, respectively.
We do not present the model generated LATE in this case. This is because, as in the previous examples of a valid instrument satisfying uniformity, they will be close to the estimated local IV estimates, and so we prefer not to repeat the argument.
Here we work with the following values for wealth and the instrument (b1, b2, b3, b4, b5, b6, b7, b8, b9) = (18.25, 18.33, 18.41, 18.5, 18.58, 18.66, 18.75, 18.83, 18.91), and (Z1, Z2, Z3, Z4, Z5, Z6, Z7, Z8, Z9, Z10) = (0, 0.03, 0.07, 0.1, 0.14, 0.18, 0.21, 0.25, 0.29, 0.32). Given the structure of the model and our ordering, the resulting probabilities associated with the lottery tends to be increasing in Zj and decreasing in bj. For talent θ, we solve the model using (θ1, θ2, θ3, θ4, θ5, θ6, θ7, θ8, θ9) = (0.86, 0.88, 0.9, 0.92, 0.93, 0.95, 0.97, 0.98, 1).
Contributor Information
Robert M. Townsend, Department of Economics, MIT
Sergio S. Urzua, Department of Economics Northwestern University
References
- [1].Alem Mauro, Townsend Robert. An evaluation of safety nets and financial institutions in crisis and growth. University of Chicago, Department of Economics; 2008. unpublished manuscript. [Google Scholar]
- [2].Felkner John, Townsend Robert M. Enterprise and the wealth of villages. University of Chicago; 2007. unpublished manuscript. [Google Scholar]
- [3].Gine Xavier, Townsend Robert. Evaluation of financial liberalization: a general equilibrium model with constrained occupation choice. Journal of Development Economics. 2004;74:269–307. [Google Scholar]
- [4].Greenwood Jeremy, Jovanovic Boyan. Financial development, growth, and the distribution of income. Journal of Political Economy. 1999;98:1076–1107. [Google Scholar]
- [5].Heckman James J., Honoré Bo E. The empirical content of the Roy model. Econometrica. 1990;58:1121–1149. [Google Scholar]
- [6].Heckman James J., Schmierer Daniel, Urzua Sergio. Journal of Econometrics. University of Chicago, Department of Economics under revision; 2008. Testing the correlated random coefficient model. unpublished manuscript. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Heckman James J., Urzua Sergio, Vytlacil Edward J. Understanding instrumental variables in models with essential heterogeneity. Review of Economics and Statistics. 2006;88:389–432. [Google Scholar]
- [8].Heckman James J., Vytlacil Edward J. In: Cheng Hsiao, Kimio Morimune, Powell James L., editors. Local instrumental variables; Nonlinear Statistical Modeling: Proceedings of the Thirteenth International Symposium in Economic Theory and Econometrics: Essays in Honor of Takeshi Amemiya; New York: Cambridge University Press. 2001.pp. 1–46. [Google Scholar]
- [9].Imbens Guido W., Angrist Joshua D. Identification and estimation of local average treatment effects. Econometrica. 1994;62:467–475. [Google Scholar]
- [10].Jeong Hyeok, Townsend Robert. Growth and inequality: Model evaluation based on an estimation-calibration strategy. Macroeconomic Dynamics. 2008;12:231–284. doi: 10.1017/S1365100507070149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Kaboski Joseph P., Townsend Robert M. Policies and impact: An analysis of village-level micro finance institutions. Journal of the European Economic Association. 2005;3:1–50. [Google Scholar]
- [12].Kaboski Joseph P., Townsend Robert M. The impact of credit on village economies. Ohio State University; 2009. unpublished manuscript. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Karaivanov Alexander K., Townsend Robert M. Enterprise dynamics and finance: Distinguishing mechanism design from exogenously incomplete markets models. Department of Economics, Simon Fraser University; 2009. unpublished manuscript. [Google Scholar]
- [14].Lloyd-Ellis Huw, Bernhardt Dan. Enterprise, inequality and economic development. Review of Economic Studies. 2000;67:147–168. [Google Scholar]
- [15].Matzkin Rosa L. Nonparametric and distribution-free estimation of the binary threshold crossing and the binary choice models. Econometrica. 1992;60:239–270. [Google Scholar]
- [16].Olken Benjamin. Monitoring corruption: Evidence from a field experiment in Indonesia. Journal of Political Economy. 2007;115:200–249. [Google Scholar]
- [17].Paulson Anna L., Townsend Robert M., Karaivanov Alex. Distinguishing limited liability from moral hazard in a model of entrepreneurship. Journal of Political Economy. 2006;114:100–144. [Google Scholar]
- [18].Roy AD. Some thoughts on the distribution of earnings. Oxford Economic Papers. 1951;3:135–146. [Google Scholar]
- [19].Rubin Donald B. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66:688–701. [Google Scholar]
- [20].Salop Steven C. Strategic entry deterrence. American Economic Review. 1979;69:335–338. [Google Scholar]
- [21].Townsend Robert M., Ueda Kenichi. Financial deepening, inequality, and growth: A model-based quantitative evaluation. Review of Economic Studies. 2006;73:251–293. [Google Scholar]
- [22].Townsend Robert M., Ueda Kenichi. Welfare gains from financial liberalization. International Economic Review. 2009 doi: 10.1111/j.1468-2354.2010.00593.x. Forthcoming. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Vissing-Jorgensen Annette. Towards an explanation of household portfolio choice heterogeneity: Non-financial income and participation cost structures. National Bureau of Economic Research; 2002. Working Paper 8884. [Google Scholar]
- [24].Yitzhaki Shlomo. On using linear regression in welfare economics. Department of Economics, Hebrew University; 1989. Working Paper 217. [Google Scholar]



