Measuring the Impact of Financial Intermediation: Linking Contract Theory to Econometric Policy Evaluation

Robert M Townsend; Sergio S Urzua

doi:10.1017/S1365100509090178

. Author manuscript; available in PMC: 2010 Jun 1.

Published in final edited form as: Macroecon Dyn. 2009 Sep 1;13(Suppl S2):268–316. doi: 10.1017/S1365100509090178

Measuring the Impact of Financial Intermediation: Linking Contract Theory to Econometric Policy Evaluation ^{^*}

Robert M Townsend ¹, Sergio S Urzua ²

PMCID: PMC2860337 NIHMSID: NIHMS163624 PMID: 20436953

Abstract

We study the impact that financial intermediation can have on productivity through the alleviation of credit constraints in occupation choice and/or an improved allocation of risk, using both static and dynamic structural models as well as reduced form OLS and IV regressions. Our goal in this paper is to bring these two strands of the literature together. Even though, under certain assumptions, IV regressions can recover accurately the true model-generated local average treatment effect, these are quantitatively different, in order of magnitude and even sign, from other policy impact parameters (e.g., ATE and TT). We also show that laying out clearly alternative models can guide the search for instruments. On the other hand adding more margins of decision, i.e., occupation choice and intermediation jointly, or adding more periods with promised utilities as key state variables, as in optimal multi-period contracts, can cause the misinterpretation of IV as the causal effect of interest.

Keywords: Contract Theory, Financial Intermediation and Econometric Policy Evaluation

1 Introduction

This paper links contract theory models of financial intermediation to econometric policy evaluation. We study a variety of static and dynamic models in which financial intermediation has an impact on productivity through the alleviation of credit constraints in occupation choice and/or an improved allocation of risk. These models of intermediation are structural choice models which are known in the literature, and, more recently, estimated with cross sectional or panel data from developing countries (e.g., Thailand). On the other hand there is a large empirical literature which takes advantage of natural experiments, or instruments, to assess the impact that policy variation and financial institutions are having on incomes, occupations, risk sharing and a variety of other variables (some also in Thailand). Our goal in this paper is to bring these two strands of the literature together. Even though, under certain assumptions, an instrumental variable (IV) strategy can recover accurately the true model-generated local average treatment effects (LATE), these are quantitatively different, in order of magnitude and even sign, from other policy impact parameters (e.g., treatment on the treated TT, the average treatment effect ATE, etc). We also show that laying out clearly alternative models can guide the search for instruments. Mechanism design can deliver natural lotteries or randomization that can be used as sources of identification in empirical analyses. On the other hand, adding more margins of decision, i.e., occupation choice and intermediation jointly, or adding more periods with promised utilities as key state variables, as in optimal multi-period contracts, can cause the researcher to lose key identifying assumptions associated with the IV strategy (e.g., uniformity), so that IV and LATE might no longer coincide. Our objective is to help researchers and policy makers assess accurately the impact of financial intermediation.

The models we use are simple models of discrete choice when there are credit constraints. Typically some households are in financial autarky and others in a fully intermediated sector. There is a cost to entering the financial system, and this is imagined to pick up both the actual cost of traveling to a financial institution (bank) as well as policy distortions which limit access for some agents. We imagine there is variation in the cost/policy in the data, so that some households are financial sector participants and others are not. Indeed, we can generate cross sectional or panel data from a given model (sometimes using parameters which have been estimated from emerging market economies) and ask whether that data would allow an accurate quantification of the gain in the population produced by different policy variations (that emerging market countries had actually experienced). Of course in the model itself we implement the envisioned policy and compare various techniques that assess impact.

A key ingredient in this exercise is heterogeneity in the population, both observed and potentially unobserved. This means that there can be a nontrivial distribution of gains and/or losses in the population, depending on the policy. This is what can make the LATE (identified using the subsidy as instrument) different from TT and ATE, and at realistic parameter values, these can be quite distinct. The logic of the models also makes clear why this is likely to happen. For example, a subsidy can induce relatively inefficient households to enter business, whereas the larger population of businesses consists of talented households who were not on the margin of decision. This makes LATE negative and ATE positive. In other instances heterogeneity in one dimension destroys monotonicity in another. A new, nearby branch of a bank can facilitate intermediation by lowering costs, for those on the margin, and though some talented households will borrow to go into business, other richer, inefficient households will withdraw from low return business and put their money in savings in the bank. Talent is not observed. This makes it difficult without the economic model to assess the impact of intermediation on profits of entrepreneurs. This also means that widely used econometric techniques can potentially give misleading estimates, depending on what one is willing to assume and what one is trying to measure.

In section 2, we focus first on observable and unobservable characteristics such as wealth and talent in a simple model of occupation choice, to clarify some key issues. The credit constraint is extreme: self finance only. The utility functions are linear, but the financing constraint (sale of wealth), makes the problem non-linear. Indeed to guage the impact of this, and for expositional clarity, we begin with exogenous variation in business subsidies for those in financial autarky, computing various measures of welfare gains and comparing the numbers to IV estimates. Section 3 then introduces the full model with intermediation costs and policy variation, distinguishing which instruments are valid for intermediation, and which are valid for occupation choice only.

In section 4 we adopt a long horizon dynamic programming formulation to study endogenous financial deepening in a model with unobserved preferences and financial participation costs. We show that unobserved preference heterogeneity can create the need for instruments, as the decision to go to a bank and the outcome of being banked, and unbanked, can depend on what for an econometrician would be a common error. Importantly, participation costs can be used as instruments. Here IV and LATE coincide if policy variation on the participation costs comes as a surprise or if the participation decisions are made initially and the unobserved shocks in the model are independent and serially uncorrelated. But even in those cases, the identification of other treatment effects, such as TT or ATE, require much more work. Of course anticipated policy changes lowering costs cause the researcher to lose the validity of the instrument, as is well known, and we provide a clear example of this.

Section 5 introduces a model of financial intermediation with moral hazard and unobserved talent. In the model, unobserved talent is an input in the production technology, determining (counterfactual) consumption levels and individual’s preferences for financial intermediation. The key role played by unobserved heterogeneity, a feature shared by all the models considered in this paper, generates heterogeneity in impact parameters. We discuss under what assumptions the economic model generates instrumental variables that can be used to identify a causal effect of financial intermediation on consumption. In other words, we use the model to discuss its consequences for policy evaluations. We study its static and dynamic versions. We show how, in the static case, random assignment of wealth through a lottery can help us to recover instrumental variables at least over specified ranges of ex-ante wealth. Intuitively, we show how individual specific variables affecting the probability of winning the lottery, but independent from potential outcomes associated with intermediation (e.g., costs of entering the randomization), can be used to identify a causal effect of financial intermediation on consumption. Section 5.2 shows however, that in dynamic mechanism design problems the levels of promised utilities in the future matter for choices today, and one so typically looses the availability of instrumental variables even with random assignment of wealth. Essentially promised utilities for tomorrow depend on outcomes today, to induce proper incentives, along with contemporary rewards today. But those promises for tomorrow vary with the costs of intermediation, so we loose the independence of the outcomes from the instrument.

Section 6 presents our conclusions.

2 A Standard Model of Occupation Choice

We start the analysis with a static model of occupational choice without intermediation. We use this simplified financial autarky framework to illustrate some of the general issues which arise later in the paper. This occupation choice model originated with Lloyd-Ellis and Bernhardt (2000) and has been used by Gine and Townsend (2004) and Jeong and Townsend (2008) to understand how occupation choice and the spread of financial infrastructure can create growth in per capita income, movements in inequality, and more generally, to quantify the welfare gains in the population from the spread of financial intermediation.

Let us assume that the individual has linear preferences over current period consumption, of the form u(c) = c, that is u’(c) > 0 and u”(c) = 0. The individual faces the budget constraint c ≤ W where end-of-period wealth W depends on the within-period occupational choice of the agent.¹ The individual has beginning-of-period wealth b_i, assumed to be observed perfectly by the econometrician, so the initial distribution of wealth is known. This is the source of observable heterogeneity. The individual has an unobserved (from the point of the analyst) business entry cost $θ_{i}^{E}$ . Such entry costs are standard in the industrial organization literature. See Salop (1979) for an early example. The individual also has an unobserved talent as wage earner $θ_{i}^{W}$ . These two unobserved talents are as if randomly assigned in the population, again a source of unobserved heterogeneity. For simplicity, we assume that θ^W and θ^E are independent. We denote by f_θ^j (·) the density function of θ^j with j = {E, W}, and we assume E (θ^W) = E (θ^E) = 0. We put additional structure on these densities in future sections. The literature cited earlier did not include unobserved talent in wage work.

The occupational choice of the individual is between enterprise and wage work. These two alternatives can be described by their associated potential outcomes. Specifically, for individual i we have that end-of-period wealth is the sum of initial wealth plus within-period earnings,

W_{i} = {\begin{matrix} ω + θ_{i}^{W} + b_{i} & if wage earner \\ π (θ_{i}^{E}, b_{i}, ω) + b_{i} & if entrepreneur . \end{matrix}

(1)

Here w is the market wage for (unskilled) labor² and $π (θ_{i}^{E}, b_{i}, w)$ represents the profit function obtained after solving the production/profit maximization problem

π (θ_{i}^{E}, b_{i}, w) = \max_{{k, l}} f (k, l) - w l - k - θ_{i}^{E}

(2)

subject to 0 \leq k \leq b_{i} - θ_{i}^{E}

(3)

The production function technology f(k, l) is common to all potential firms. Here labor hired l is measured in efficiency units, not number of people per se. k is the level of capitalization measured in units of wealth. In financial autarky, the unobserved entry cost and capital k must be self-financed from wealth b_i. A household is said to be constrained when capital is equal to total wealth minus setup costs, i.e., $k = b_{i} - θ_{i}^{E}$ , and this is binding. Indeed in the original model, if $θ_{i}^{E} > b_{i}$ it is simply not possible to establish a business. In this case we can not ask what would be the earnings of someone who has not entered business for that reason. We modify the model below to take this into account. On the other hand, this constraint has been used in structural estimation via likelihood methods as it provides a source of identification. We discuss this point in section 2.2 below.

The decision rule associated with this occupation choice model can be presented as:

If π (θ_{i}^{E}, b_{i}, w) > w + θ_{i}^{W}, then the individual becomes an entrepreneur

If π (θ_{i}^{E}, b_{i}, w) \leq w + θ_{i}^{W}, then the individual becomes a wage earner .

Therefore, if we denote by D a binary variable such that D = 1 if the agent becomes an entrepreneur, and 0 otherwise, we can write

D (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w) = {\begin{matrix} 1 & if π (θ_{i}^{E}, b_{i}, w) > w + θ_{i}^{W} \\ 0 & if π (θ_{i}^{E}, b_{i}, w) \leq w + θ_{i}^{W} . \end{matrix}

This model is standard in the development literature. The model can be interpreted more generally as a Roy model (Roy, 1951) in which the occupational selection is based, given the individual’s talents $θ_{i}^{E}$ and $θ_{i}^{W}$ , and wealth b_i, on the comparison of the potential gains.³

2.1 Standard Econometric Approaches for The Analysis of the Impact of Occupational Decisions

We focus on a simple issue: whether we can identify the effect of occupation choice on earnings using a reduced form approach instead of the full structural model.

In this static model the econometrician observes either $π (θ_{i}^{E}, b_{i}, w) + b_{i}$ or $w + θ_{i}^{W} + b_{i}$ , depending on whether the choice D_i = 1 or D_i = 0 is taken by the individual i. Thus, if we denote by Y_i the end-of-period observed outcome we have:

Y_{i} \equiv D_{i} (π (θ_{i}^{E}, b_{i}, w) + b_{i}) + (1 - D_{i}) (w + θ_{i}^{W} + b_{i}) .

where without additional structure profits are non-linear in entrepreneur talent ( $θ_{i}^{E}$ ), wealth (b_i), and market wage (w) . However, the empirical literature primarily uses linear and separable models. That is,

π (θ_{i}^{E}, b_{i}, w) ≃ ϕ_{w} w + ϕ_{θ} θ_{i}^{E} + ϕ_{b} b_{i} .

(4)

This set-up is particularly attractive if one notes that

Y_{i} = D_{i} [ϕ_{w} w + ϕ_{θ} θ_{i}^{E} + ϕ_{b} b_{i} + b_{i}] + (1 - D_{i}) [w + θ_{i}^{W} + b_{i}] = w + b_{i} + (ϕ_{b} b_{i} + (ϕ_{w} - 1) w) D_{i} + θ_{i}^{W} + (ϕ_{θ} θ_{i}^{E} - θ_{i}^{W}) D_{i}

which can be expressed as a linear regression model

Y_{i} = w + b_{i} + (ϕ_{b} b_{i} + (ϕ_{w} - 1) w) D_{i} + ε_{i}

(5)

where $ε_{i} = θ_{i}^{W} + (ϕ_{θ} θ_{i}^{E} - θ_{i}^{W}) D_{i}$ , and the term in parenthesis (ϕ_bb_i + (ϕ_w − 1) w) represents the gain in gross income that does not depend on unobserved talent. Notice that the random variable D_i is by construction correlated with ε)i, so the OLS regression of observed earnings onto an occupational dummy (conditioning on wealth)

{\hat{ϕ}}_{1}^{OLS} = \frac{Cov (Y, D ∣ b_{i} = b)}{Var (D ∣ b_{i} = b)}

would provide a biased estimator of this gain, ϕ_bb_i + (ϕ_w − 1) w. We illustrate the consequences of this selection problem below. Importantly, the interaction between unobserved talents, potential outcomes and occupational choice that generates the selection problem is not a result of a linear and separable profit function but a general consequence of the theoretical framework with unobserved talent and endogenous selection.

A widely used alternative is the instrumental variable method. In order to consider this approach, we introduce a policy distortion (instrument) into the model. This distortion affects occupation choices in a simple way. Specifically, we assume the existence of an exogenous subsidy that increases ex-post profits at the end-of-period by ψ. This subsidy is randomly assigned in the population, so that ψ is a random variable with ψ > 0 and known to the econometrician even if the choices of the household is to be a wage earner. Intuitively, it can be interpreted as an experiment or exogenous policy treatment affecting the occupation choices of the individuals but received only if the choice is to setup a firm. However, this subsidy cannot be used to finance k and so the constraint 0 ≤ k ≤ b − θ^E is unaltered.

The policy distortion impacts the decision rule:

D (θ_{i}^{E}, θ_{i}^{W}, b_{i}, ψ_{i}, w) = {\begin{matrix} 1 & if π (θ_{i}^{E}, b_{i}, w) + ψ_{i} > w + θ_{i}^{W} \\ 0 & if π (θ_{i}^{E}, b_{i}, w) + ψ_{i} \leq w + θ_{i}^{W} \end{matrix}

where ψ_i represents the subsidy to agent i in the event of becoming an entrepreneur. More simply, and to emphasize the role of ψ_i, we use the notation D(ψ_i) below but clearly this binary variable is a function of other observable and unobservable variables. We assume that talents (θ^E,θ^W) and subsidy ψ are independent. Indeed, the government cannot see θ^E (or θ^W) but has total control over the random subsidy. The subsidy ψ affects the decision rule, but not the potential outcomes net of the subsidy, as it enters additively. Therefore, the maximization problem of the household as a firm, if it becomes a firm, and its choice of k and l are unaltered. It gets the subsidy independent of the behavior as a firm.

The subsidy ψ_i appears to be a valid instrument. It influences choices but not the potential outcomes.⁴ Additionally, in this setup the subsidy satisfies the uniformity/monotonicity condition (Imbens and Angrist, 1994; Heckman et al., 2006). That is, for each individual an increase (decrease) in the subsidy unambiguously increases (reduces) the chances of becoming an entrepreneur. Indeed, suppose that the subsidy can take on two values $\overset{‒}{ψ}$ and $\bar{\overset{‒}{ψ}}$ . In this case, and without imposing a linear separable model for profits, we can use the instrument ψ to estimate

Δ^{IV} (\bar{\overset{‒}{ψ}}, \overset{‒}{ψ}; b) = \frac{E (Y_{i} ∣ ψ_{i} = \bar{\overset{‒}{ψ}}, b_{i} = b) - E (Y_{i} ∣ ψ_{i} = \overset{‒}{ψ}, b_{i} = b)}{E (D_{i} ∣ ψ_{i} = \bar{\overset{‒}{ψ}}, b_{i} = b) - E (D_{i} ∣ ψ_{i} = \overset{‒}{ψ}, b_{i} = b)},

which, under the assumption of uniformity, identifies the local average treatment effect (LATE) in income for those in the population induced to enter entrepreneurship due to the change of ψ from $\overset{‒}{ψ}$ to $\bar{\overset{‒}{ψ}}$ (the treatment here is to become an entrepreneur), or more formally

Δ^{LATE} (\bar{\overset{‒}{ψ}}, \overset{‒}{ψ}; b) = E [π (θ_{i}^{E}, b_{i}, w) - w - θ_{i}^{W} ∣ D_{i} (\bar{\overset{‒}{ψ}}) = 1, D_{i} (\overset{‒}{ψ}) = 0, b_{i} = b]

This parameter does not pick up the earnings difference for those who would be entrepreneurs, versus wage earners, regardless of the value of the instrument. Instead, the local average treatment effect Δ^LATE naturally provides the answer to a policy experiment.⁵

Given that the model features heterogeneous treatment effects, we can complete the analysis by computing two alternative treatment effects: the treatment on the treated Δ^TT (average benefits of becoming an entrepreneur for individuals that actually decide to become entrepreneur) and the average treatment effect Δ^ATE (the earnings gain or loss of becoming an entrepreneur versus a wage earner in the entire population). Specifically, and presenting the treatment parameters for a particular wealth level b, we have:

Δ^{T T} (b) = E (π (θ_{i}^{E}, b_{i}, w) - (w + θ_{i}^{W}) ∣ D_{i} = 1, b_{i} = b)

(6)

Δ^{A T E} (b) = E (π (θ_{i}^{E}, b_{i}, w) - (w + θ_{i}^{W}) ∣ b_{i} = b) .

(7)

If there were no heterogeneity or all heterogeneity were observed, then all these effects (including LATE) would be equivalent (see Heckman and Vytlacil, 2001). Otherwise, Δ^TT (b) and Δ^ATE (b) depend on counterfactual wages and profits for a given wealth level b, and the estimation of these parameters is not straightforward.

2.2 Parametric and Semi-Parametric Identification of Treatment Effect Parameters

Following Gine and Townsend (2004), we assume

f (k, l) = α k - \frac{1}{2} β k^{2} + σ k l + ξ l - \frac{1}{2} ρ l^{2},

and the profit function can be written

π (θ_{i}^{E}, w, k) = C_{0} (w) + C_{1} (w) k + C_{2} k^{2} - θ_{i}^{E}

(8)

where $C_{0} (w) = \frac{{(ξ - w)}^{2}}{2 ρ}$ , $C_{1} (w) = α - 1 + σ (\frac{ξ - w}{ρ})$ , $C_{2} = \frac{1}{2} (\frac{σ^{2}}{ρ} - β)$ . The model delivers a quadratic occupation partition as depicted in figure 1 (Panel A) and a nonlinear profit function.

Occupational Choice Maps and The Effect of the Subsidy

For expositional simplicity, we set θ^W = 0 and assume $π (θ_{i}^{E}, b_{i}, w) = b_{i} - θ^{E} > 0$ in figure 1A. The points θ^E*, b* and $({\hat{θ}}^{E}, \hat{b})$ determine entirely the can be shape of curves. the These points expressed of functions C₀(w), C₁(w) and C₂.

This framework also allows us to illustrate the effect of the subsidy. Panel B in figure 1 shows the effect of moving ψ from $\overset{‒}{ψ}$ to $\bar{\overset{‒}{ψ}}$ . This change essentially shifts the line of indifference vertically upward as the subsidy simply adds to the net profits of entrepreneurs. (This upward shift is not present when the household is constrained by beginning-of-period wealth). Now for every value of wealth b there exists a group of θ^E households who weakly shift into business. The impact of the subsidy is “uniform” (or monotone in the language of Imbens and Angrist, 1994), that is, the movement is (at most) in one direction only. This is the group of individuals that provides the source of variation used when estimating Δ^LATE.

Finally, under the assumption σ²/ρ = β and optimal capital (k* = b − θ), we can obtain linear profit functions. We want to emphasize that this approximation is not designed to exactly characterize the economic model but to show how to link the theory with common econometric practice. Therefore, from this point forward, we follow the traditional econometric approach and assume a linear and additively separable approximation for the profit function.

By itself the assumption of linear and additive separable profit functions is not sufficient for the computation of treatment effects. We need additional structure to deal with the selection problems. Consider first the case of independent and normally distributed unobserved talents, i.e., $θ^{E} ~ N (0, σ_{E}^{2})$ , $θ^{W} ~ N (0, σ_{W}^{2})$ . In this context, we can define the probability of being an entrepreneur in our model as

\Pr (π (θ_{i}^{E}, b_{i}, w) + ψ_{i} > (w + θ_{i}^{W})) = \Pr (ϕ_{w} w + ϕ_{θ} θ_{i}^{E} + ϕ_{b} b_{i} + ψ_{i} > (w + θ_{i}^{W})) \equiv Φ (\frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}})

where Φ (·) represents the cumulative distribution function of a standard normal distribution. Therefore, given the normality assumption, the structure of this last expression and with information on occupational choice (D), subsidy (ψ), wealth (b), the observed average wage in the economy (w) and profits (π) for those households with D = 1, we can use a probit model to identify the parameters (ϕ_w − 1), ϕ_b, and $\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}$ .

The mean observed profit (conditional on b_i and ψ_i) can be written as:

E (π (θ_{i}^{E}, b_{i}, w) ∣ D_{i} = 1, b_{i}, ψ_{i}) = E (ϕ_{w} w + ϕ_{θ} θ_{i}^{E} + ϕ_{b} b_{i} ∣ ϕ_{w} w + ϕ_{θ} θ_{i}^{E} + ϕ_{b} b_{i} + ψ_{i} > (w + θ_{i}^{W})) = ϕ_{w} w + ϕ_{b} b_{i} + ϕ_{θ} σ_{E} E (\frac{θ_{i}^{E}}{σ_{E}} ∣ \frac{θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}} < \frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}}) .

Given that $\frac{θ_{i}^{E}}{σ_{E}}$ and $\frac{θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}}$ are standard jointly normally distributed random variables, we have that

E (π (θ_{i}^{E}, b_{i}, w) ∣ D_{i} = 1, b_{i}, ψ_{i}) = ϕ_{w} w + ϕ_{b} b_{i} - \frac{ϕ_{θ}^{2} σ_{E}^{2}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}} λ (\frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}})

(9)

where λ (·) represents the Mills’ ratio.⁶ Expression (9) justifies the estimation of a linear regression model of observed profits/earnings onto the wage w (intercept), wealth b_i and the Mills’ ratio λ (.), to obtain $ϕ_{θ}^{2} σ_{E}^{2}$ (since $\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}$ is known from the probit), and also $σ_{W}^{2}$ . The parameters ϕ_θ and $σ_{E}^{2}$ cannot be identified separately from this regression.

On the other hand, although unobserved, average wages among those choosing to be entrepreneurs can be written as

E (w + θ_{i}^{W} ∣ D_{i} = 1, b_{i}, ψ_{i}) = E (w + θ_{i}^{W} ∣ ϕ_{w} w + ϕ_{θ} θ_{i}^{E} + ϕ_{b} b_{i} + ψ_{i} > (w + θ_{i}^{W})) = w + \frac{σ_{W}^{2}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}} λ (\frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}})

(10)

which depends only on identified parameters, so it can be constructed for any value of wealth and subsidy. Thus, we can compute the treatment on the treated (Δ^TT (b, ψ)) as

Δ^{T T} (b, ψ) = \underset{Identified from expression (9)}{\underset{︸}{E (π (θ_{i}^{E}, b_{i}, w) ∣ D_{i} = 1, b_{i} = b, ψ_{i} = ψ)}} - \underset{Identified from expression (10)}{\underset{︸}{E (w + θ_{i}^{W} ∣ D_{i} = 1, b = b_{i}, ψ_{i} = ψ)}}

and, likewise, the average treatment effect Δ^ATE(b_i)

Δ^{A T E} (b) = E (π (θ_{i}^{E}, b_{i}, w) - (w + θ_{i}^{W}) ∣ b_{i} = b) = (ϕ_{w} - 1) w + ϕ_{b} b .

The unconditional version of Δ^TT (b, ψ_i), i.e. Δ^TT (b), can be obtained by simply integrating out ψ over the relevant region.

The normality assumption for the identification of treatment parameters can be relaxed at the price of additional conditions. In particular, let θ^E and θ^W be two independent random variables distributed according to a (general) joint distribution function f_θ^E,θ^W (·, ·). As shown in the context of the economic model, these variables, which are unobserved by the analyst, determine profits, wages and occupational choices.

On the other hand, and provided enough data variation, we can non-parametrically estimate the probability of D_i = 1 using information on b_i, w, ψ_i and actual choices D_i (Matzkin, 1992). Let p (w, b_i, ψ_i) denote this probability, also known in the literature as the propensity score. We can then write the conditional expectation of observed outcome Y_i as a function of the probability of selection and wealth:

E (Y_{i} ∣ p (w, b_{i}, ψ_{i}), b_{i}) = w + b_{i} + (ϕ_{b} b_{i} + (ϕ_{w} - 1) w) E (D_{i} ∣ p (w, b_{i}, ψ_{i})) + E (θ_{i}^{W} + (ϕ_{θ} θ_{i}^{E} - θ_{i}^{W}) D_{i} ∣ p (w, b_{i}, ψ_{i})) = w + b_{i} + (ϕ_{b} b_{i} + (ϕ_{w} - 1) w) p_{i} + Λ (p_{i}, b_{i})

(11)

where $Λ (p_{i}, b_{i}) \equiv E (θ_{i}^{W} + (ϕ_{θ} θ_{i}^{E} - θ_{i}^{W}) D_{i} ∣ p (w, b_{i}, ψ_{i}), b_{i})$ and for notational convenience, we use p_i instead of p (w, b_i, ψ_i). As shown by Heckman and Vytlacil (2001) we can use this conditional expectation to form Δ^TT (b) and Δ^ATE(b), expressions (6) and (7), respectively, without imposing normality. In particular, these authors show how by computing

Δ^{L I V} (p, b) = {\frac{\partial E (Y_{i} ∣ p_{i}, b_{i} = b)}{\partial p_{i}} ∣}_{p_{i} = p},

usually called the local instrumental variable estimator, the analyst can identify the treatment parameter

Δ^{M T E} (p, b) \equiv E (π (θ_{i}^{E}, b_{i}, w) - (w + θ_{i}^{W}) ∣ b_{i} = b, θ_{i}^{W} - ϕ_{θ} θ_{i}^{E} = p)

where Δ^MTE (p, b) represents the treatment effect for those individuals indifferent between occupations given a particular value (p) for the random variable $θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}$ (conditional on wealth level b).⁷ Finally, Heckman and Vytlacil (2001) show that Δ^ATE(b) and Δ^TT (b) can be obtained as weighted averages of Δ^MTE (p, b) according to the following expressions:

Δ^{T T} (b) = \int Δ^{M T E} (u, b) ω^{T T} (u, b) d u

Δ^{A T E} (b) = \int Δ^{M T E} (u, b) ω^{A T E} (u) d u

where ω^ATE (u) = 1, ω^TT (u, b) = Pr (p(w, b, ψ) > u) / ∫ Pr(p(w, b, ψ) > u)du. The argument of integration u is associated with the random variable $U = F_{θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}} (θ_{i}^{W} - ϕ_{θ} θ_{i}^{E})$ which is uniformly distributed.⁸

The question then becomes how to compute Δ^LIV (p, b). We can use formal semi-parametric techniques to estimate E(Y_i|p_i, b_i) (expression (11)), and its derivative with respect to p. An alternative and simpler way to estimate this function is by approximating it using a polynomial on p_i (see Heckman et al., 2006).⁹

2.3 Measuring the Impact of Occupations on Income

We illustrate the importance of the previous discussion by computing and comparing different estimates of the effect of occupational decisions on income. In order to do this we simulate data from our model. We utilize the quadratic production function described above, and consider the parameterization in table 1. These parameter values are taken directly from Gine and Townsend (2004).¹⁰ We assume a discrete subsidy. Specifically, we assume that the subsidy ψ can take two values: 0 or 1. The value of the subsidy is randomly assigned in the population.

Table 1.

Parameter Values

Parameter	Value
α	0.54561
β	−0.39064
σ	0.1021
ξ	0.2582
ρ	−0.03384
w	0.048

Open in a new tab

Note: The numbers in this table are obtained from Gine and Townsend (2004). Using data from Thailand, Gine and Townsend (2004) estimate a model of occupational choice with similar characteristics to the one studied in this paper.

Wealth (b), talents (θ^W and θ^E) and the subsidy (ψ) are assumed to be distributed as follows:

b \sim Log N (0, 1),

θ^{W} \sim N (0, 1), θ^{E} \sim N (0, 1),

ψ \sim Binomial (0, 1) .

We first reproduce the analysis that a researcher could carry out using a cross-sectional data set with information on wealth, occupation and observed (factual) incomes.

2.3.1 Using Cross-Sectional Information to Estimate the Effect of Occupational Choice

Table 2 presents the sorting into occupations obtained from model generated cross-sections of 25,000 individuals.

Table 2.

Sorting by Occupational Choices Model of Occupational Choice - Simulated Cross-sectional Data

Occupational Choice	Sample Size
Wage Earners	6,109
Entrepreneur	18,891
Constrained	14,519
Unconstrained	4,372
Total	25,000

Open in a new tab

Note: The number of observations in each occupation is the result of the endogenous decision process faced by each of the 25,000 simulated individuals.

Consider an “agnostic” empirical approach in which the researcher tries to estimate the “effect” of the occupational choice on outcomes using a simple regression model. In particular, suppose that observed income (profits/wages) Y is written as:

Y_{i} = κ_{0} + κ_{1} b_{i} + κ_{2} b_{i} D_{i} + κ_{3} D_{i} + ε_{i}

where D_i takes a value of 1 if the individual i is an entrepreneur, and 0 otherwise.¹¹ Notice that in this equation we do not incorporate the talent explicitly. This because in practice the analyst does not observe this variable so it must be excluded from the list of controls (and contained in the error term).

Table 3 presents the estimated effect of D_i on Y_i obtained using OLS and IV.¹² The results show large differences between the two approaches. The results from IV deliver negative impacts whereas OLS would suggest a positive impact. The large discrepancies are a clear manifestation of the biases caused by the selection process. A researcher would draw dramatically different conclusions depending on how she interpreted the policy impact coefficients in the IV regression. In practice, likely instruments can show up as correlated with unobserved error producing the misinterpretation of the results. The analyst must understand the economics behind the selection mechanism before drawing conclusions.

Table 3.

OLS and IV Estimates Model of Occupational Choice - Estimates from Cross-sectional Data

Parameter	Estimates
	Δ ^OLS	Δ ^IV

κ ₀	0.606^**	1.189^**
κ ₁	1.155^**	1.142^**
κ ₂	−0.136^**	−0.082
κ ₃	0.457^**	−0.356^*

Average Effect (κ₂b‾ + κ₃)	0.303^**	−0.450

Open in a new tab

Note: This table presents the parameters obtained from a linear regression of observed income (profits or wages depending on individual’s occupation) on wealth, the occupational dummy, and the interaction between wealth and occupation (dummy). In addition, the column Δ^IV presents the estimates when ψ is used as instrument. Overall these results illustrate what the analyst can obtain using information produced from the model (observed outcome, wealth, and occupation) using a reduced-form strategy.

denotes statistical significance at 5%

^**

denotes statistical significance at 1%.

Interestingly, the negative effect estimated by IV is intuitively correct since the individuals switching occupations as a result of the variation in the instruments are those with lower profits (and higher wages). That is, since the subsidy is not included in the income gains, it induces inefficient choices. On the other hand, there are others who benefits from the subsidy but would have chosen to be entrepreneurs in any event, and they had efficient rents which dominate wage earnings.

2.3.2 Using the Structure of the Model to Generate Counterfactual Outcomes and The Causal Effects of Occupational Choices

Given our knowledge of the model, we can study the consequences of exogenous policy changes. Specifically, we provide individuals that did not receive a subsidy when it was originally assigned with the subsidy. We then use the sample of individuals switching occupation due to the change in subsidy status (from ψ = 0 to ψ = 1) to compute the model generated local average treatment effect (Δ^LATE (1, 0)) (i.e., the average effect of the treatment for those individuals switching occupations as a result of a change in the instrument). However, since occupation status also depends on wealth, we first compute Δ^LATE (1, 0; b_k) where b_k represent the k-th percentile of the wealth distribution, and then we compute Δ^LATE (1, 0) as the (weighted) averages of Δ^LATE (1, 0; b_k).¹³

As a result of our experiment 1,861 of our original wage earners become entrepreneurs. This is precisely the group from which we can compute the model generated local average treatment effect. Additionally, from our knowledge of the model we can directly compute the average treatment effect (ATE) and the treatment effect of those treated (TT).

Table 4 presents the model generated treatment effects. Notice the similarities between the model generated LATEs (Δ^LATE in table 4) and the IV effect estimated using the cross-sectional data sets (Δ^IV in table 3). The discrepancies can be attributed to the linear approximation used in the regression model. We relax this assumption in the next sections.¹⁴ In our model, Δ^LATE is negative as the subsidy induces low productivity individuals to enter business and the subsidy is not counted as part of the gain. This is exactly the same conclusion draw in the context of table 3.

Table 4.

Model Generated Treatment Paramaters Model of Occupational Choice - The Causal Effects of Occupation on Income

Parameter	Value
Δ ^ATE	0.619
Δ ^TT	1.270
Δ^LATE(1, 0)	−0.459

Open in a new tab

Note: The numbers in this table represents the model’s underlying treatment parameters associated with the effect of occupation on income (or model-generated treatment parameters). In order to obtain them, we use the structure of the model to simulate data on wages, profits and choices for 25,000 individuals. The analyst would need to characterize the structure of the model (counter-factual outcomes and decision rule) before producing these treatment parameters (as opposed to a reduced form strategy).

TT and ATE on the other hand are positive numbers indicating the positive benefits associated with entrepreneurship.

Finally, figure 2 presents the local average treatment effects by percentile of the wealth distribution. The figure presents the model generated LATE (Δ^LATE(1, 0; b) and the estimated IV (Δ^IV(ψ)(1, 0; b)) by wealth level. As expected, although the model generated LATE fluctuates across levels of wealth (a result of our sample size), on average it is close to what the standard econometric technique delivers.

Model Generated and Estimated Local Average Treatment Effect by Percentile of the Wealth Distribution

This example illustrates how the economic model delivers a valid instrument, how this instrument allows the identification of a causal effect of interest, and how this causal effect can differ from other relevant treatment parameters.

3 Occupational Choice Under Financial Intermediation

The simple model presented in section 2, with the subsidy to firms, can be easily extended to incorporate an intermediated sector. The analysis in Gine and Townsend (2004) does exactly that. We follow their approach. The underlying model in this case is similar to the model in section 2, but now there is borrowing and lending of capital and wealth. We denote by Q_i the individual-specific cost of using the intermediated sector. Examples of Q_i include travel time to district center or branch office, whether or not a particular intermediary has been active in a city or village according to history, particular policies of financial institutions which vary in effectiveness, new credit in a city or village divided by the number of households, etc. See Kaboski and Townsend (2005, 2009) and Alem and Townsend (2008) for examples.

We take the initial distribution of Q as given and, for simplicity, focus on a binary Q. The analysis can be extended directly to a continuous-valued Q. We assume Q independent from ψ, and denote by r the (equilibrium) interest rate.

An entrepreneur using the intermediated sector solves the following problem

\max_{k, l} f (k, l, θ_{i}^{E}) - w l - (1 + r) (k + θ_{i}^{E})

(13)

There is a neoclassical separation between production and household wealth. In effect, the agent can put all his wealth b_i in financial markets and earn interest r. Meanwhile the firm (individual) can borrow what it needs to finance k and set up cost $θ_{i}^{E}$ . There is lot of indeterminacy in between, in financing, i.e., self invest and borrow/lend the difference with wealth, but real quantities and net income are all pinned down.

The wage is common to both sectors, as households are allowed to work wherever they prefer. They can join an intermediary and put their money in a saving account if they do not become firms.

As before, denote by D_i a binary variable such that D_i = 1 if agent i decides to become an entrepreneur, and 0 otherwise. Thus, the occupation choice when the agent is participating in the intermediated sector can be described by:

D (θ_{i}^{E}, θ_{i}^{W}, w, r) = {\begin{matrix} 1 & if π (θ_{i}^{E}, w, r) + b_{i} (1 + r) - Q_{i} + ψ_{i} > w + θ_{i}^{W} + b_{i} (1 + r) - Q_{i} \\ 0 & otherwise \end{matrix},

where $π (θ_{i}^{E}, w, r)$ denotes the resulting profits after solving (13).

In this context, the researcher would observe $π (θ_{i}^{E}, b_{i}, w, r) + b_{i} (1 + r)$ or $w + b_{i} (1 + r) + θ_{i}^{W}$ depending on the value of $D (θ_{i}^{E}, θ_{i}^{W}, w, r)$ . Thus, if we denote by $Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r)$ the outcome observed under intermediation, we have

Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r) = D (θ_{i}^{E}, θ_{i}^{W}, w, r) (π (θ_{i}^{E}, w, r) + b_{i} (1 + r)) + (1 - D (θ_{i}^{E}, θ_{i}^{W}, w, r)) (w + b_{i} (1 + r) + θ_{i}^{W})

(14)

and the cost Q_i and the subsidy ψ_i are not subtracted or added, respectively, from Y_I, that is, we have gross gains.

On the other hand, recall that without financial intermediation the occupation choice model is

D (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w) = {\begin{matrix} 1 & if π (θ_{i}^{E}, b_{i}, w) + ψ_{i} > w + θ_{i}^{W} \\ 0 & otherwise, \end{matrix}

so that the observed outcome under financial autarky $Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)$ (and not counting the subsidy) can be presented as:

Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w) = D (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w) (π (θ_{i}^{E}, b_{i}, w) + b_{i}) + (1 - D (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)) (w + θ_{i}^{W} + b_{i}) .

(15)

In sum, the sub-index k in Y_k indicated the sector (financial autarky A or intermediation I). We use this notation in what follows.

The choice of sector, autarky versus intermediation, is made by a simple comparison of the potential associated outcomes $Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)$ and $Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r)$ but adjusting in the choice for the subsidy ψ_i and entry cost Q_i. Note that, in general, the heterogeneity $(θ_{i}^{E}, θ_{i}^{W})$ does not enter additively into $Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)$ or $Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r)$ . Thus, let Υ_i be a binary variable that takes a value of 1 if the individual decides to use the financial intermediary, and 0 otherwise. Then,

Υ_{i} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r, ψ_{i}, Q_{i}) = {\begin{matrix} 1 & if (\begin{matrix} [Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r) - Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)] \\ + [D (θ_{i}^{E}, θ_{i}^{W}, w, r) - D (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)] ψ_{i} \\ - Q_{i} \end{matrix}) \geq 0 \\ 0 & otherwise \end{matrix} .

This simple framework allows us to analyze policies regarding the access to financial intermediation.

3.1 Identifying the Effects of Financial Intermediation

In the context of our model, the effect of having access to financial intermediation at the individual level (agent i) is defined as

Δ_{i}^{Υ} = Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r) - Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w),

the average treatment effect (ATE) associated with financial intermediation is

E (Δ_{i}^{Υ}) = E (Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r) - Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)),

and the average effect of the treatment on those treated (TT) equals

E (Δ_{i}^{Υ} ∣ Υ_{i} = 1) = E (Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r) - Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w) ∣ Υ_{i} = 1)

where again for simplicity we use Υ_i instead of $Υ (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r, Q_{i})$ . Additionally, in what follows we use D_i and D_i (r) to denote the occupation choices $D (θ_{i}^{E}, θ_{i}^{W}, w, b_{i})$ and $D (θ_{i}^{E}, θ_{i}^{W}, w, r)$ under financial autarky and the intermediated sector, respectively.

In order to analyze whether conventional econometric methods (OLS and IV) allow the identification of any of these effects, we first denote by ξ_i the observed outcome, i.e.,

ξ_{i} = Υ_{i} \times Y_{I} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r) + (1 - Υ_{i}) \times Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w)

which after substituting expressions (14) and (15) can be written as:

ξ_{i} = Υ_{i} \times [\begin{matrix} D_{i} (r) (π (θ_{i}^{E}, w, r) + b_{i} (1 + r)) \\ + \\ (1 - D_{i} (r)) (w + θ_{i}^{W} + b_{i} (1 + r)) \end{matrix}] + (1 - Υ_{i}) \times [\begin{matrix} D_{i} (π (θ_{i}^{E}, b_{i}, w) + b_{i}) \\ + \\ (1 - D_{i}) (w + θ_{i}^{W} + b_{i}) \end{matrix}] .

(16)

This expression illustrates the fact that all potential choices and outcomes play a role even when the researcher is only interested in the impact of having access to financial intermediation.

Following the conventional empirical strategy, we assume profit functions of the form:

π (θ_{i}^{E}, b_{i}, w) = γ_{w} w + γ_{b} b_{i} + γ_{θ} θ_{i}^{E} (Financial Autarky)

π (θ_{i}^{E}, w, r) = δ_{w} w + δ_{r} r + δ_{θ} θ_{i}^{E} (Intermediation)

Substituting these expressions into equation (16), and after some algebra, we obtain

ξ_{i} = w + b_{i} + r Υ_{i} b_{i} + ((γ_{w} - 1) w) D_{i} (1 - Υ_{i}) + γ_{b} b_{i} D_{i} (1 - Υ_{i}) + ((δ_{w} - 1) w + δ_{r} r) Υ_{i} D_{i} (r) + δ_{b} b_{i} Υ D_{i} (r) + η_{i} (θ_{i}^{E}, θ_{i}^{W}, r, Q_{i}),

(17)

where $η_{i} (θ_{i}^{E}, θ_{i}^{W}, r, Q_{i}) = (δ_{θ} θ_{i}^{E} - θ_{i}^{W}) Υ_{i} D_{i} (r) - (γ_{θ} θ_{i}^{E} - θ_{i}^{W}) Υ_{i} D_{i} + (γ_{θ} θ_{i}^{E} - θ_{i}^{W}) D_{i} + θ_{i}^{W}$ so $η_{i} (θ_{i}^{E}, θ_{i}^{W}, r, Q_{i})$ contains all the terms involving unobserved talents $θ_{i}^{E}$ and $θ_{i}^{W}$ . Using expression (17), we can define the individual effect of having access to financial intermediation, $Δ_{i}^{Υ}$ , as

Δ_{i}^{Υ} = \frac{Δ ξ_{i}}{Δ Υ_{i}} = r b_{i} + ((δ_{w} - 1) w + δ_{r} r) D_{i} (r) - ((γ_{w} - 1) w - γ_{b} b_{i}) D_{i} + \frac{Δ η_{i} (θ_{i}^{E}, θ_{i}^{W}, r, Q_{i})}{Δ Υ_{i}} .

(18)

Notice that $Δ_{i}^{Υ}$ (conditional on wealth b) depends on the occupation of the individual under each regime and the unobserved talents.

On empirical grounds, expression (17) suggests the estimation of the parameters defining $Δ_{i}^{Υ}$ through a regression of ξ_i on b_i, Υ_ib_i, D_i(1 − Υ_i), D_i(r) Υ_i, b_iD_i(1 − Υ_i) and b_iΥ_iD_i(r). However, since unobserved talents (contained in the error term) affect both choices and potential outcomes, without further assumptions, conventional OLS estimates will not provide unbiased estimates of the parameters in the model.

An alternative approach is the instrumental variable method. The economic model provides one natural instrument for Υ_i, namely Q_i. The cost Q_i affects the choice of sector but does not affect the potential outcomes. In addition, notice that changes in Q_i produce uniform (monotonic) responses in choice Υ_i. Consequently, given two values for the instrument Q_i, $\overset{‒}{Q}$ and $\bar{\overset{‒}{Q}}$ (lowering the cost so that $\bar{\overset{‒}{Q}} < \overset{‒}{Q}$ ) and conditioning on wealth b, we can estimate

Δ^{IV (Q)} (\bar{\overset{‒}{Q}}, \overset{‒}{Q}; b) = \frac{E (ξ_{i} ∣ Q_{i} = \bar{\overset{‒}{Q}}, b_{i} = b) - E (ξ_{i} ∣ Q_{i} = \overset{‒}{Q}, b_{i} = b)}{E (Υ_{i} ∣ Q_{i} = \bar{\overset{‒}{Q}}, b_{i} = b) - E (Υ_{i} ∣ Q_{i} = \overset{‒}{Q}, b_{i} = b)}

(19)

to identify the local treatment effect of financial intermediation on income

Δ^{LATE (Q)} (\bar{\overset{‒}{Q}}, \overset{‒}{Q}; b) = E (Y_{i} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r) - Y_{A} (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w) ∣ b_{i} = b, Υ_{i} (\bar{\overset{‒}{Q}}) = 1, Υ_{i} (\overset{‒}{Q}) = 0)

(20)

where $Υ (\overset{‒}{Q}) = Υ (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r, ψ_{i}, \overset{‒}{Q})$ and $Υ (\bar{\overset{‒}{Q}}) = Υ (θ_{i}^{E}, θ_{i}^{W}, b_{i}, w, r, ψ_{i}, \bar{\overset{‒}{Q}})$ . Intuitively, in this case the local IV (expression (19)) identifies the gains in outcomes (including profits and wages but not the subsidy nor the intermediary cost) for those individuals induced to join the financial system as a consequence of the reduction in intermediation cost.

Importantly, one cannot interpret this parameter as the effect of financial intermediation on profits for entrepreneurs or on income for wage earners. This is because the change in Q also induces changes in occupational decisions in a non-uniform way. That is, changes in Q may endogenously induce individuals to switch from the wage sector to entrepreneurship and vice-versa.

Additionally, although in principle the analyst could use the information on occupations to compute versions of $Δ^{IV (Q)} (\bar{\overset{‒}{Q}}, \overset{‒}{Q}; b)$ among wage earners and/or entrepreneurs, in general, these estimates would not identify the local causal effects of financial intermediation (as defined in (20)) in those populations. This is again a consequence of the non-uniform responses in occupational decisions induced by the changes in Q. Intuitively, by restricting the estimation of Δ^IV(Q) to entrepreneurs (wage earners) the analyst would be erroneously excluding the gains on outcomes from those initial entrepreneurs (wage earners) who would become wage earners (entrepreneurs) as a result of the change in Q. In other words, the analyst would conceptually identify the effect of financial intermediation for those entrepreneurs (wage earners) who would not have switched occupations as a result of the change in the instrument. Given the economic incentives operating in the model, the Δ^IV(Q) estimated in this way would only provide a partial response to the question of the effect of financial intermediation among entrepreneurs (wage earners).¹⁵ We illustrate this point below.

We can use the same logic to identify the local average treatment effect of occupation (entrepreneurship) on income through the following local IV estimator:

Δ^{IV (ψ)} (\bar{\overset{‒}{ψ}}, \overset{‒}{ψ}; b) = \frac{E (ξ_{i} ∣ ψ_{i} = \bar{\overset{‒}{ψ}}, b_{i} = b) - E (ξ_{i} ∣ ψ_{i} = \overset{‒}{ψ}, b_{i} = b)}{E ({\tilde{D}}_{i} ∣ ψ_{i} = \overset{‒}{ψ}, b_{i} = b)}

where ${\tilde{D}}_{i}$ is D_i (r) Υ_i + D_i (1 − Υ_i). Under uniformity of ψ on $\tilde{D}$ , this parameter identifies the local treatment effect of occupation on income.

Analogously to the case of financial intermediation, one cannot use $Δ^{IV (ψ)} (\bar{\overset{‒}{ψ}}, \overset{‒}{ψ}; b)$ to determine the gains in income for those induced to enter to the financial system as a result of the subsidy. This is because the change in the subsidy does not produce necessarily uniform (or monotonic) movements with respect to intermediation choice.¹⁶

The complications of identifying $Δ^{LATE (Q)} (\bar{\overset{‒}{Q}}, \overset{‒}{Q}; b)$ by occupation or $Δ^{LATE (ψ)} (\bar{\overset{‒}{ψ}}, \overset{‒}{ψ}; b)$ by status of financial intermediation are due to the presence of two margins of choice in the model. Strictly speaking, the model includes four categories or possible treatments: wage sector and financial autarky, wage sector and financial intermediation, entrepreneurship and financial autarky, and entrepreneurship and financial intermediation. Indeed, we could phrase our discussion in the context of a model with multiple treatments and multiple instruments. In this framework, the definition of treatment effects is not as straightforward as in the binary case. Specifically, the pair-wise comparison of the outcomes associated with different alternatives needs to be supplemented by considerations of the alternatives left out from the comparison. This adds a new level of complexities to the definition of treatment effects. As an example, notice that we can define the effect of financial intermediation on profits (i.e., the effect of intermediation among businesses) but for individuals effectively participating in the wage sector. This is not an intuitive treatment effect, but it is well defined in the context of a model with multiple treatments.

Heckman et al. (2006) analyze the identification power of instrumental variables in the context of models with multiple treatments and unobserved heterogeneity. They show that provided a variable (instrument) determining the preferences for a particular alternative but excluded from its potential outcome (e.g., instrument Z^j determining utility associated with alternative/option j, V_j), in models such as the one considered in this section, a local IV strategy (using Z_j as the instrument and based on a regression of observed income on dummy variables describing individual’s observed decisions) would identify the effect of option j versus the next best alternative.¹⁷ This result complements our discussion about the difficulties of further interpreting Δ^IV(Q) or Δ^IV(ψ).¹⁸ See Heckman et al. (2006) for additional discussion.

3.2 Example

Following the same logic utilized in our previous example (section 2.3), we investigate the consequences of using different econometric techniques when estimating the effects of financial intermediation and occupational choices. We use the parameterization presented in table 1. Our main results are robust to different parameterizations. In addition to the structure presented in table 1, we assume

Q_{i} \sim Binomial (0.25, 1)

with Q_i independent of ψ_i, $θ_{i}^{E}$ , $θ_{i}^{W}$ and b_i. Table 5 presents the sorting simulated from the model for a sample size of 25,000 individuals. Given our parameterization, approximately one fourth of the individuals become wage earners (most of them working under financial autarky), more than half of the individuals are entrepreneur under autarky (most of whom are unconstrained), and the rest are entrepreneurs with access to financial intermediation.

Table 5.

Sorting by Occupational Choices and Access to Financial Intermediation Model of Occupational Choice and Financial Intermediation Simulated Cross-sectional Data

Occupational Choice	Financial Intermediation	Autarky
Wage Earners	940	5,072
Entrepreneurs	2,678	16,310
Constrained	-	14,015
Unconstrained	-	2,295

Open in a new tab

Note: The number of observations in each cell is the result of the endogenous decision process faced by each of the 25,000 simulated individuals.

3.2.1 Using Cross-Sectional Information to Estimate the Effect of Financial Intermediation and Occupational Choices

Suppose the econometrician focuses first on the impact of financial intermediation proposing the following linear model:

Y_{i} = κ_{0} + κ_{1} b_{i} + κ_{2} b_{i} Υ_{i} + κ_{3} Υ_{i} + ε_{i}

(21)

where Υ_i takes a value of 1 if the individual i has access to financial intermediation, and 0 otherwise.

Table 6 presents the results from OLS and IV on equation (21). The results suggest positive average effects of financial intermediation. However, because of the selection bias, the effect suggested by OLS is almost double the effect estimated by IV.

Table 6.

OLS and IV Estimates of the Effect of Financial Intermediation on Income Model of Occupational Choice and Financial Intermediation

	κ ₀	κ ₁	κ ₂	κ ₃	Average Effect (κ₂b‾ + κ₃)
Δ ^OLS	1.015^**	0.954^**	0.313^**	0.227^**	0.585^**
Δ ^IV(Q)	0.933^**	1.071^**	0.076^*	0.236^**	0.323^**

Open in a new tab

Note: This table presents the parameters obtained from a linear regression of observed income (profits or wages depending on individual’s occupation) on wealth, the financial intermediation dummy, and the interaction between wealth and financial intermediation (dummy). In addition, the row Δ^IV(Q) presents the estimates when Q is used as instrument for endogenous financial intermediation. Overall these results illustrate what the analyst can obtain using information produced from the model (observed outcome, wealth and financial intermediation) using a reduced-form strategy.

denotes statistical significance at 5%

^**

denotes statistical significance at 1%.

On the other hand, suppose that the analyst proposes the following linear model to investigate the effects of occupation on income.

Y_{i} = τ_{0} + τ_{1} b_{i} + τ_{2} b_{i} D_{i} + τ_{3} D_{i} + ε_{i}

where D_i takes a value of 1 if the individual i is an entrepreneur, and 0 otherwise. These model follows closely the one presented in section 2.3. Table 7 presents the IV and OLS of the “effect” of D_i on Y_i from our model generated data. As in the previous section, the OLS estimate delivers a positive effect whereas the IV suggests a negative effect of occupation on income/profit.¹⁹

Table 7.

OLS and IV Estimates of the Effect of Occupation on Income Model of Occupational Choice and Financial Intermediation

	τ ₀	τ ₁	τ ₂	τ ₃	Average Effect (τ₂b‾ + τ₃)
Δ ^OLS	0.578^*	1.259^*	−0.146^*	0.433^*	0.266^*
Δ ^IV(ψ)	1.212^*	1.177^*	−0.027	−0.426^*	−0.458^**

Open in a new tab

Note: This table presents the parameters obtained from a linear regression of observed income (profits or wages depending on individual’s occupation) on wealth, the occupational dummy, and the interaction between wealth and occupation (dummy). In addition, the row Δ^IV(ψ) presents the estimates when ψ is used as instrument for the endogenous occupational decision. Overall these results illustrate what the analyst can obtain using information produced from the model (observed outcome, wealth and occupation) using a reduced-form strategy.

denotes statistical significance at 5%

^**

denotes statistical significance at 1%.

3.2.2 Using the Structure of the Model to Generate Counterfactual Outcomes and the Causal Effect of Financial Intermediation and Choices

Table 8 presents the model generated local average treatment effects (LATE). These LATEs are not obtained using econometric techniques, but generated using the structure of the model. The table displays both Δ^LATE(ψ)(1, 0) (LATE associated with the effect of occupation) and Δ^LATE(Q)(0.25, 1) (LATE associated with the effect of financial intermediation).

Table 8.

Model Generated Local Average Treatment Effects Model of Occupational Choice and Financial Intermediation

Parameter	Value	Number of Movers	Direction

Δ^LATE(ψ)(1, 0)	−0.466	2,219	From Wage Earner to Entrepreneur

	−0.444	1,548	From Wage Worker under Autarky to Entrepreneur under Autarky
	−0.278	278	From Wage Worker under Autarky to Entrepreneur under Financial Intermediation
	−0.724	322	From Wage Worker under Financial Intermediation Entrepreneur under Autarky
	−0.519	71	From Wage Worker under Financial Intermediation to Entrepreneur under Financial Intermediation

Δ^LATE(Q)(0.25, 1)	0.388	3,757	From Autarky to Financial Intermediation

	0.355	911	From Wage Worker under Autarky to Wage Worker under Financial Intermediation
	−0.203	176	From Wage Worker under Autarky to Entrepreneur under Financial Intermediation
	0.752	75	From Entrepreneur under Autarky to Wage Worker under Financial Intermediation
	0.430	2,595	From Entrepreneur under Autarky to Entrepreneur under Financial Intermediation

Open in a new tab

Note: The numbers in the table are obtained using the factual and counterfactual information on income generated by the economic model. Specifically, for each individual in the sample we analyze the consequences of modifying the values of the instruments initially assigned. We study the individual’s changes in occupational choices as well as the changes in decisions involving the financial system. Then, for each individual modifying her decisions as a result of the changes in Q or ψ, we compute the associated effects on income. This table presents the average effects on income generated using this logic. It also displays the number of individuals switching decisions as a result of the changes in the instrument (column Number of Movers).

Importantly, our knowledge of the model allows us to generate not only an overall local average treatment effect (bold numbers in table 8) but also the local effects of the treatment for specific groups of individuals. For example, in the case of financial intermediation, we obtain the local treatment effects for individuals switching from “wage-earner under autarky” to “wage-earner with access to financial system” (as a result of the exogenous change in the instrument) as well as the local effect for those “wage-earners under autarky” becoming “entrepreneurs under intermediation” (also as a result of a change in the instrument). This analysis cannot be done without a structural analysis.

Notice, as expected, the model generated overall local treatment effects are very close to the effects estimated using the IV strategy (tables 6 and 7).

Table 8 also displays how the individuals in our model react to changes in the instrument. Interestingly, we observe how changes in Q induces people to move away from entrepreneurship and into the wage sector. In particular, and given our parameterization, 75 entrepreneurs would have become wage earners as a result of a change in Q. This illustrates our previous comment about the difficulty of interpreting Δ^IV(Q) as the effect of financial intermediation on profits for entrepreneurs and income for wage earners. The change in Q induces (non-uniform) changes in occupation. A similar logic prevents interpreting Δ^IV(Q) as the effect of occupation on income for individual using the financial system or for individuals under financial autarky. As table 8 shows, a change in ψ induces (non-uniform) changes in the financial participation decisions of the individuals in the model.

Finally, table 9 presents the model generated ATE and TT for the effect of financial intermediation and occupational choice. These causal parameters are presented for all the different groups of interest. It is worth noting the significant differences between these treatment effects and the local effects reported in table 8. This illustrates the potential discrepancies between the different treatment parameters. All these parameters represent causal effects, but in our model with selection based on unobserved talents and gains, they all answer different economic questions.

Table 9.

Model Generated Treatment Parameters associated with Occupational Choices and Financial Intermediation

Treatment Parameter	Alternatives Considered in the Comparison	Value
Δ ^ATE	Entrepreneurship vs. Wage Sector under Financial Autarky	0.619
Δ ^ATE	Entrepreneurship vs. Wage Sector under Financial Intermediation	0.607
Δ ^ATE	Financial Intermediation vs. Autarky for Wage Earners	0.227
Δ ^ATE	Financial Intermediation vs. Autarky for Entrepreneurs	0.215

Δ ^TT	Entrepreneurship vs. Wage Sector under Financial Autarky	1.205
Δ ^TT	Entrepreneurship vs. Wage Sector under Financial Intermediation	1.734
Δ ^TT	Financial Intermediation vs. Autarky for Wage Earners	0.364
Δ ^TT	Financial Intermediation vs. Autarky for Entrepreneurs	0.433

Open in a new tab

Note: The table presents the treatment parameters associated with the pairwise comparison of different alternatives in the model conditional on a specific alternative for the margin not considered in the comparison. For example, the first row presents the mean difference between profits and wages for individuals not participating in the financial system. The other rows can be interpreted using the same logic.

4 Dynamics, Risk Sharing, Unobserved Heterogeneity and Occupational Choice

In this section we follow the analysis of Greenwood and Jovanovic (1999) (from hereafter GJ), Townsend and Ueda (2006, 2009), Jeong and Townsend (2008), and Felkner and Townsend (2007) with additional modifications. This literature discusses endogenous financial deepening and how well it fits both mmicroeconomic and macroeconomic data, examining for targeting of government development banks and interest rate distortions that created a crisis and increased government involvement in the banking sector.

Consider a dynamic problem with an infinite horizon. Household i maximizes discounted expected utility

E_{0} \sum_{t = 0}^{\infty} β_{i}^{t} u (c_{i t})

where u(·) is strictly concave and initial wealth is k_i,0 = b_i,0. E₀(·) denotes the expectation given the information available at t = 0. We incorporate unobserved heterogeneity by allowing the individuals to differ in their discount factors. Specifically, we assume $β_{i} = \overset{‒}{β} + θ_{i}$ , where θ_i is an individual specific component known to the agent only, and $\overset{‒}{β}$ is common knowledge.

In autarky there is a law of motion for wealth as a function of savings, investment in specific occupations, and an exogenous random endowment. Let s_it denote the savings rate of household i at date t expressed as a fraction of wealth k_it at date t. Let $Ψ_{t}^{E}$ be the proportion of the savings invested in a risky enterprise sector and $Ψ_{t}^{W}$ be the proportion invested in wage sector activities. Additionally, one unit of wealth invested in enterprise E yields $δ_{t}^{E} + ε_{i t}^{E}$ units of capital (wealth), whereas one unit invested in wage activity W yields an ex-post rate of return of $δ_{t}^{W} + ε_{i t}^{W}$ . The returns $δ_{t}^{E}$ and $δ_{t}^{W}$ are realized at the end of date t and are unknown when within-period decisions are made.

The law of motion for wealth in autarky is thus

k_{i t + 1} = s_{i t} \times [Ψ_{t}^{E} \times (δ_{t}^{E} + ε_{i t}^{E}) + Ψ_{t}^{W} \times (δ_{t}^{W} + ε_{i t}^{W})] \times k_{i t} .

(23)

Consumption in autarky at t $c_{i t}^{A}$ is the residual, i.e., $c_{i t}^{A} = (1 - s_{i t}) k_{i t}$ .

The value function W₀ associated with financial autarky, A, exists under standard regularity conditions. It satisfies the Bellman equation:

W_{0} (k_{i t}, θ_{i}) = \max_{Ψ_{i}^{E}, Ψ_{i}^{W}, c_{i t}, s_{i t}} u (c_{i t}) + β_{i} E (W_{0} (k_{i t + 1}, θ_{i}))

subject to (23). The function W₀ (k_it, θ_i) is strictly concave in k_it. Under general preferences, the saving and investment policies are functions of wealth k_it. However, for CRRA preferences $(u (c_{i t}) = c_{i t}^{γ})$ they are constant. More precisely, under these preferences

c_{i t}^{A} = {\tilde{α}}_{i}^{A} k_{i t} = {\tilde{α}}_{i}^{A} (y_{i t}^{E} + y_{i t}^{W})

where ${\tilde{α}}_{i}^{A} = (1 - β_{i})$ , $y_{i t}^{E}$ is the income from enterprise, $y_{i t}^{W}$ is the labor income, i.e.,

y_{i t}^{E} = Ψ_{t - 1}^{E} (δ_{t - 1}^{E} + ε_{i t - 1}^{E}) k_{i t - 1} s_{i t - 1}

y_{i t}^{W} = Ψ_{t - 1}^{W} (δ_{t - 1}^{W} + ε_{i t - 1}^{W}) k_{i t - 1} s_{i t - 1} .

Therefore, and since by definition β_i = β‾ + θ_i, we can write the equation describing optimal consumption in autarky A as:

c_{i t}^{A} = (1 - \overset{‒}{β} - θ_{i}) y_{i t} = α^{A} y_{i t} + ε_{i t}^{A}

where y_it is the sum of all sources of income $(y_{i t} = y_{i t}^{E} + y_{i t}^{W})$ , $α^{A} = 1 - \overset{‒}{β}$ and where $ε_{i t}^{A} = - θ_{i} y_{i t}$ is the unobserved component.

Participation in the intermediated sector on the other hand, allows household to share any idiosyncratic shock and, as in GJ, get perfect advanced information on aggregate shocks $δ_{t}^{E}$ , $δ_{t}^{W}$ .²⁰ The bank directs all investment as if each household were exchanging shares in its own return stream for shares in a common mutual fund. The law of motion for wealth is then

k_{i t + 1} = s_{i t} k_{i t} \max {δ_{t}^{W}, δ_{t}^{E}} (1 - τ)

(24)

where τ is the marginal intermediation transaction cost. The value function V_I for those in the intermediated sector, I, satisfies the Bellman equation

V_{I} (k_{i t}, θ_{i}) = \max_{c_{i t}, s_{i t}} [u (c_{i t}) + β_{i} E (V_{I} (k_{i t + 1}, θ_{i}))]

subject to (24). Again V_I(k_it, θ_i) is strictly concave in k_it. Policy s_it might be a nonlinear function of k_it, but again under CRRA preferences, s_it is linear in k_it. Thus,

c_{i t}^{I} = {\tilde{α}}_{i}^{I} A_{t}

where the aggregate shock A_t is equal to max $\max {δ_{t - 1}^{W}, δ_{t - 1}^{E}} (1 - τ)$ , and ${\tilde{α}}_{i}^{I}$ is equal to $(1 - \overset{‒}{β} - θ_{i})$ . Following our previous analysis, we can write:

c_{i t}^{I} = α^{I} A_{t} + ε_{i t}^{I}

where $α^{I} = 1 - \overset{‒}{β}$ and $ε_{i t}^{I} = - θ_{i} A_{t}$ is the unobserved component.

4.1 Once-And-For-All Participation Decisions and Participation Costs as Instruments

In this section we extend the analysis of GJ. In particular, while GJ has endogenous entry determined by the solution to a dynamic programming problem with a period-by-period decision, we consider the special case of a once-and-for-all entry decision at an initial date. For an empirical application of this idea see Alem and Townsend (2008).

Initially at t = 0, given k_i0, the household decides whether to participate in the financial sector or not. Once decided there is no going back. Let Z_i denote an individual specific participation costs. This subtracts from wealth k_i0. Again this cost is meant to capture exogenous variation in the ability to access intermediation, through either policy variation of physical infrastructure. These can be thought of as household specific transaction costs (with any correlation across individuals taken into account by other control variables, which is the way we treat wealth below). In the original GJ model, these costs are subtracted upon entry to the financial system. These are also transaction costs models in the finance literature, e.g. Vissing-Jorgensen (2002).

Then, with V_I and W₀ strictly concave in k_it, the decision to participate depends on participation cost Z_i and wealth k_i0. More precisely, if we denoted by I_i0 the participation decision, we can write

I_{i 0} = 1 \Leftrightarrow V_{i} (k_{i 0} - Z_{i}, θ_{i}) \geq W_{0} (k_{i 0}, θ_{i}) .

Additionally, we can write observed consumption at t as a function of potential consumption levels $(c_{i t}^{I}, c_{i t}^{A})$ and the participation decision I_i0:

\begin{matrix} c_{i t} = c_{i t}^{A} (1 - I_{i 0}) + c_{i t}^{I} I_{i 0} \\ c_{i t} = α^{A} y_{i t} + (α^{I} A_{t} - α^{A} y_{i t}) I_{i 0} + v_{i t} \end{matrix}

(25)

where $v_{i t} = ε_{i t}^{A} + I_{i 0} (ε_{i t}^{I} - ε_{i t}^{A})$ . Equation (25) shows how, if intermediation is effective for those who choose it, idiosyncratic income y_it should not determine consumption.

Notice that the error term in (25), v_it, depends on the decision made at t = 0, I_i0, so there is a selection bias argument that prevents the researcher of using OLS in the estimation of (25). In this context, an IV strategy becomes an appealing alternative.

The obvious issue is then how to come up with a valid instrument. Interestingly, the economic model delivers a natural instrument, namely Z_i. In order to see this, notice that under autarky and the assumption of CRRA preferences, optimal saving rates and proportions of savings invested in each sectors do not depend on k_it. As a result of this, potential consumption in the intermediated and autarky sectors do not depend on the choice of intermediation other than at t = 0 (when the costs are paid). Consequently, although Z_i affects the initial choice of intermediation sector versus financial autarky, for all time periods t > 0 the individual participation cost does not affect the potential levels of consumption $c_{i t}^{A}$ and $c_{i t}^{I}$ . These two conditions make Z_i a valid instrument for the effect intermediation on consumption.

Using the instrument Z_i the researcher can identify LATE, a causal relationship between financial intermediation and consumption.

Estimating the average treatment effect (ATE) or the treatment effect on those treated (TT) is more delicate. Notice that due to the role of θ_i in the model, I_i0 is correlated with each of the components of v_it, namely $ε_{i t}^{A}$ and $I_{i 0} (ε_{i t}^{I} - ε_{i t}^{A})$ . This structure is similar to the one discussed in the context of the models introduced in sections 2 and 3. As in those cases, the presence of unobserved components and the endogenous selection of the individuals into sectors (based on the comparison of counterfactual outcomes affected by unobserved variables) produces heterogeneity in treatment effects. In this context, we can show that under the assumption of a uniform response of I_i0 to changes in Z_i (for all i), the instrumental variable estimator will indeed identify a causal effect of I_i0 on c_it (see Heckman et al., 2006; Imbens and Angrist, 1994). But the causal effect identified by IV might be different from, for example, ATE or TT. Only under the special case of no selection on unobserved gains IV, ATE and TT would be identical. However, the presence of unobserved components and the endogenous selection process make this case unlikely.²¹

4.2 Sequential Participation Decisions

Now suppose the choice of sector takes place each period t, not just initially. Then for those not yet in the intermediated sector at t ≥ 0, but may choose so at t + 1, the value function satisfies the Bellman equation

W_{0} (k_{i t}, θ_{i}) = \max_{Ψ_{t}^{E}, Ψ_{t}^{W}, c_{i t}, s_{i t}} {U (c_{i t}) + β_{i} E \max {W_{0} (k_{i t + 1}, θ_{i}), V_{1} (k_{i t + 1} - Z_{i}, θ_{i})}}

subject to $k_{i t + 1} = s_{i t} \times [Ψ_{t}^{E} \times (δ_{t}^{E} + ε_{i t}^{E}) + Ψ_{t}^{W} \times (δ_{t}^{W} + ε_{i t}^{W})] \times k_{i t}$ .

There is a critical family of values k* (Z_i, θ_i) which define thresholds for participation. Under some regularity conditions entry is permanent. However, saving s_t(k_it) and investments $Ψ_{t}^{E} (k_{i t})$ , $Ψ_{t}^{W} (k_{i t})$ are generally functions of wealth k_it even with CRRA utility. It can be established, in fact, that savings and investment in risky assets will rise with k_it as that wealth approaches critical entry k* (Z_i, θ_i). See Townsend and Ueda (2006).

Thus variation in Z_i determines both k* and pre participation outcomes. Therefore, Z_i cannot be considered as a potential instrument. Careful researchers do take into account the impacts of anticipated policy when designing experiments. Subjects are not given full information of what is to happen step by step. See Olken (2007).

4.3 The Identification Power of Policies

Interesting, the existence of unanticipated policies can allow us to identify the effect of financial intermediation on consumption. For example, assume a once-and-never-more policy shifting at some date t* the cost of participation Z_i. Then period t* can be interpreted as period zero and the earlier analysis applies (except we have pre-intervention data, and savings and investment are non linear in wealth k_it). At period t* we have pre-established positions for those not yet in, and the participation decision for them is:

I_{i t^{*}} > 0 \Leftrightarrow V_{1} (k_{i t^{*}} - Z_{i}, θ_{i}) \geq W_{1} (k_{i t^{*}}, θ_{i})

In effect, the policy change can be interpreted as a once-and-for-all wealth shock in the event of joining the financial sector. Consumption equations are as before. For t > t*, we have

c_{i t} = c_{i t}^{A} \times (1 - I_{i t}) + c_{i t}^{I} \times I_{i t}

Then, if the agent enters at t*, induced by the sudden and temporary policy change, we can analyze this decision as if it would have been a “once for all” decision. In this case, the policy changes the entry decision, but it does not affect the potential outcomes at t > t*.²²

However, if the policy is permanent, then the policy is subject to the same qualifications as the case when choice of sector takes place each period. Subsequently, pre entry behavior for those not yet entering at period t* will be altered.

5 A Model of Financial Intermediation with Moral Hazard and Collateral Constraints

5.1 Statics

In this section, we study the consequences for impact evaluation of a model with financial intermediation with moral hazard. Our model is similar to the one discussed in Paulson et al. (2006) estimated using data from Thailand. This follows the tradition of the earlier literature on occupation choice but attempts to estimate the financial regime in place, i.e., moral hazard versus limited commitment. Here we focus on moral hazard and the endogeneity of the intermediation decision.

We first introduce the static version of the model though for simplicity we suppress the occupation choice and focus on firms. We also focus our interest on the empirical consequences of randomized contracts. We then go to the dynamics.²³

We denote by u(c_i, e_i) the utility function associated with individual i. This function is increasing in consumption c_i and decreasing in effort e_i. The technology in the model is described by a stochastic production function Pr(q_i|e_i, θ_i) where θ_i denotes outcome and θ_i represents individual’s talent or type.

The individual as firm must decide whether or not to participate in a lottery determining who gets intermediated. If participating in the lottery, she must pay an amount b_i to the bank and, as a results of this, she gets a randomized contract determining if she will have to run her business in autarky or if her output will depend on a transfer agreement associated with credit and insurance. The entry into randomization has a fixed and individual-specific costs Z_i. However, Z_i produces a natural source of variation that can be used to identify and estimate the effect of financial intermediation on consumption. We illustrate this point in our example.

Overall, the timing of the model is as follow. First, wealth b_i is transferred to the bank. Then, the outcome of the lottery is revealed. If the result is autarky, some wealth may be transferred from bank to the individual before she “opens” her business. This amount is such that the on average the individual ends up with the same wealth level as autarky. Let w_Ai denote the optimal transfer and Π_A (w_Ai, q_i, e_i) be the joint distribution of the transfer, production and effort.

Here Π_A (w_Ai, q_i, e_i) allows non trivial probabilities but much of the outcomes can be deterministic. The Π_A (w_Ai, q_i, e_i) makes the problem linear. The following expressions characterize the problem of determining the optimal transferred level w_Ai, for each θ_i type;

\max_{Π_{A}} \sum_{w_{A i}, q_{i}, e_{i}} Π_{A} (w_{A i}, q_{i}, e_{i}) u (q_{i} + w_{A i}, e_{i})

s.t.

\sum_{w_{A i}} Π_{A} (w_{A i}, {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}) = \Pr ({\overset{‒}{q}}_{i} ∣ {\overset{‒}{e}}_{i}, θ_{i}) \sum_{w_{A i}, q_{i}} Π_{A} (w_{A i}, q_{i}, {\overset{‒}{e}}_{i}) \forall {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}

(26)

\sum_{w_{A i}, q_{i}, e_{i}} Π_{A} (w_{A i}, q_{i}, e_{i}) w_{A i} = b_{i}

(27)

Π_{A} \geq 0 and \sum_{w_{A i}, q_{i}, e_{i}} Π_{A} (w_{A i}, q_{i}, e_{i}) = 1

(28)

The first constraint in this program implies that, regardless of the initial transfer, the distribution of output given the effort level is consistent with the production function associated with the individual’s type θ_i, Pr(q_i|e_i, θ_i). The second constraint gives back to the agent his wealth in expectation.

On the other hand, if the outcome of the lottery is “intermediation”, the contract defines first a recommended level of effort, and then a distribution of consumption conditional on output. These choices are described by the joint distribution of consumption, output and effort under intermediation (Π_I(c_i, q_i, e_i)). Again, e_i may be deterministic and c_i a non trivial function of q_i. Additionally, we assume the existence of an individual-specific utility cost k_Ii in case of being intermediated, as otherwise intermediation would always dominate autarky.

Since effort is only known by the individuals, we need to add the following constraint that makes recommended effort e_i weakly dominate any e‾_i:

\sum_{c_{i}, q_{i}} Π_{I} (c_{i}, q_{i}, e_{i}) u (c_{i}, e_{i}) \geq \sum_{c_{i}, q_{i}} \frac{\Pr (q_{i} ∣ {\overset{‒}{e}}_{i}, θ_{i})}{\Pr (q_{i} ∣ e_{i}, θ_{i})} Π_{I} (c_{i}, q_{i}, e_{i}) u (c_{i}, {\overset{‒}{e}}_{i}) \forall {\overset{‒}{e}}_{i}, e_{i}

(29)

Additionally, the joint distribution of consumption, output and effort under intermediation must be consistent with the production technology Pr(q_i|e_i, θ_i). Thus, we require

\sum_{c_{i}} Π_{I} (c_{i}, {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}) = \Pr ({\overset{‒}{q}}_{i} ∣ {\overset{‒}{e}}_{i}, θ_{i}) \sum_{c_{i}, q_{i}} Π_{I} (c_{i}, q_{i}, {\overset{‒}{e}}_{i}) \forall {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}

(30)

In sum, the contract can be characterized by the joint distribution of transfer, output and effort under financial autarky Π_A (w_Ai, q_i, e_i) and by the joint distribution of consumption, output and effort under intermediation Π_I(c_i, q_i, e_i).

We must impose:

Π_{I}, Π_{A} \geq 0

(31)

\sum_{c_{i}, q_{i}, e_{i}} Π_{I} (c_{i}, q_{i}, e_{i}) + \sum_{w_{A i}, q_{i}, e_{i}} Π_{A} (w_{A i}, q_{i}, e_{i}) = 1

(32)

Finally, we impose a zero expected profit condition to our bank. Therefore, the following constraint must hold for each θ_i type:

\sum_{c_{i}, q_{i}, e_{i}} Π_{I} (c_{i}, q_{i}, e_{i}) (c_{i} - q_{i}) + \sum_{w_{A i}, q_{i}, e_{i}} Π_{A} (w_{A i}, q_{i}, e_{i}) w_{A i} = b_{i} - Z_{i}

(33)

This constraint implies that the net expected amount of transfers to the individuals (under both regimes) equals the initial transfer received by the bank.

Program 1 describes the efficient arrangements given b_i.

Program 1

U (b_{i}; θ_{i}, Z_{i}) = \max_{Π_{I}, Π_{A}} Σ_{c_{i}, q_{i}, e_{i}} Π_{I} (c_{i}, q_{i}, e_{i}) [u (c_{i}, e_{i}) - κ_{I i}] + Σ_{w_{A i}, q_{i}, e_{i}} Π_{A} (w_{A i}, q_{i}, e_{i}) u (q_{i} + w_{A i}, e_{i})

s.t. (26),(27),(28),(29),(30),(31),(32),(33). The outcome of this program U(b_i; θ_i, Z_i) is the indirect utility from the contract given a wealth level b_i, the individual’s type θ_i and cost Z_i. Notice the important role of k_Ii. If k_Ii < 0 we will have intermediation with probability one. On the contrary, a non-negative k_Ii will make intermediation more attractive for low values of wealth b_i. Therefore, the possibility of randomization will occur only for non-negative values of k_Ii.

5.1.1 The Role of Lotteries and Z as a Valid instrument

Random assignments of wealth can help us to recover instruments at least over specified ranges of ex-ante wealth. Figure 3 illustrates this point.

Random Assignments of Wealth as a Source of Instruments

For values wealth between b_L < b_i − Z_i < b_U a lottery puts mass on participation and autarky points in proportion to the utility distance. That is, suppose that an individual with initial wealth b_i in this range forfeits Z_i in wealth and enters the lottery with b_i − Z_i. Then, the effect of cost Z_i is to shift ex-ante wealth to the left and increased the probability of loosing the lottery, that is becoming poor and needing the financial system.

Figure 3 shows that when b_i < b_L, intermediation is chosen with probability one, and those agents do not play the lottery (and do not pay costs Z_i).

The point is that in the relevant range of wealth (and only in that range) costs Z_i affect the probability of participation without changing outcomes associated with the participation decision. This logic produces the instrument. Additionally, changes in the instrument produce uniform or monotonic responses in the chances of getting access to the financial system. Therefore, even under the presence of unobserved talent driving consumption levels and probabilities of intermediation, the IV strategy will identify a causal effect associated with financial intermediation.

We note that ex-ante expected utility is a function of the instrument and we come back to this in a consideration of dynamics.

5.1.2 Example

In order to understand the consequences of our analysis for the impact evaluation of financial intermediation, we generate data from our theoretical model and estimate what the effect of financial intermediation would be using different econometric techniques. Specifically, we use our model to generate data on consumption, wealth and financial intermediation for a sample of approximately 1,800,000 individuals.²⁴ As previously explained, talent plays a critical role in our theoretical model, but since talent is observed only by the individual, we do not condition on it.

Table 10 presents our parameterization of the theoretical model. In our data, we observe 67.90% of the individuals (endogenously) reporting financial intermediation. Notice that wealth (b) and the instrument (Z) are defined as continuous random variables. However, once we identify the region in which randomization is non trivial, we solve the model for a set of discrete values of b and Z. Specifically, we work with ten values for both wealth (b₁, … , b₁₀) and the instrument (Z₁, … , Z₁₀).²⁵ This not only allows us to make the numerical solution of the theoretical problem feasible but also to mimic what an analyst would face in reality.

Table 10.

Model of Financial Intermediation with Moral Hazard and Collateral Constraints Parameterization

Utility Function	u(c, e) = −100c^−1.5 − v(e)
Dis-utility of Effort	v(0.06) = 2.9, v(16) = 3
Probability of High Output	$\Pr (q H ∣ e, θ) = \frac{θ^{0.5} e^{0.5}}{1 + θ^{0.5} e^{0.5}}$
Effort Grid	e ∈ {0.06, 16}
Output Grid	q ∈ {0.5, 15}
Cost of Intermediation	κ_I = 0.1
Talent (θ) and Wealth (b)	$(θ, b) ~ [(0, 0), (\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix})]$
Instrument (Z)	Z ~ U(0, 1) with Z independent from θ and b

Open in a new tab

First, we estimate the effect of financial intermediation using OLS and IV techniques. We denote by c_i and D_i observed consumption and financial intermediation (dummy variable), respectively. b_i denotes individual’s wealth level. We carry out the estimation considering two different models of consumption, as illustrative:²⁶

c_{i} = α + β D_{i} + δ b_{i} + ∊_{i}

(34)

c_{i} = α (b_{k}) + β (b_{k}) D_{i} + ∊_{i} for b_{k} = b_{1}, \dots, b_{10}

(35)

where, in (35), the dependency of the coefficients α and β on values of wealth (b_k) indicates that the equation is estimated for each b_k. In this manner, the second equation represents a more flexible functional form than the first (and standard) model. The needs for instruments comes from the fact that intermediation D_i is an endogenous variable. Additionally, given the presence of unobserved talent and the endogenous selection mechanism driving the intermediation decisions, behind expressions (34) and (35) we have a model with heterogeneous treatment effects.

We denote by Δ^OLS(b_k) and Δ^IV (b_k) the effect of financial intermediation on consumption (conditional on wealth) obtained from model (35). Table 11 presents our results. The results are ordered increasingly on wealth (i.e., b₁ < b₂ < … < b₁₀). The last two rows of table 11 present the overall effects obtained from (34) and (35) (across wealth levels). The comparison of these columns illustrates the empirical consequences of imposing a priori the restricted functional form (34), i.e., $α (b_{k}) = α (b_{k}^{'})$ and $β (b_{k}) = β (b_{k}^{'})$ for all (k, k’), and the biased results delivered by OLS.

Table 11.

IV and OLS Estimates computed by Wealth Level - Static Model

Wealth Level	Δ^IV (b_k) (1)	Δ^OLS(b_k) (2)	% Difference $(\frac{(1) - (2)}{∣ (2) ∣})$

b ₁	−0.91	−0.87	−4.8%
b ₂	−2.02	−1.85	−8.9%
b ₃	−2.92	−2.83	−3.1%
b ₄	−3.97	−3.92	−1.4%
b ₅	−4.80	−4.91	2.3%
b ₆	−5.76	−5.97	3.5%
b ₇	−6.91	−7.00	1.3%
b ₈	−7.91	−8.04	1.7%
b ₉	−9.09	−9.04	−0.5%
b ₁₀	−9.75	−10.05	3.0%

Overall Effect-Restricted Model (Equation (34))	−5.40	−5.72	5.6%
Overall Effect-Full Interactions (Equation (35))	−5.38	−5.45	1.3%

Open in a new tab

Note: All estimates are statistically signi cant at 5% level.

Additionally, and following Imbens and Angrist (1994), we can decompose the IV estimates (Δ^IV (b_k)) presented in table 11 into its local components. Specifically, we can write:

Δ^{IV} (b_{k}) = \sum_{l = 1}^{9} Δ^{IV} (Z_{l + 1}, Z_{l}; b_{k}) λ_{l} (b_{k}) \forall b_{k} with k = 1, \dots, 10,

where $Δ^{IV} (Z_{l + 1}, Z_{l}; b_{k}) = \frac{E (c_{i} ∣ Z = Z_{l + 1}, b = b_{k}) - E (c_{i} ∣ Z = Z_{l}, b = b_{k})}{\Pr (D_{i} = 1 ∣ Z = Z_{l + 1}, b = b_{k}) - \Pr (D_{i} = 1 ∣ Z = Z_{l}, b = b_{k})}$ , and the weights are such that λ_l(b_k) ≥ 0 and Σ_lλ_l(b_k) = 1 for all b_k with k = 1, … , 10. Table 12 presents the estimated Δ^IV (Z_l+1, Z_l; b_k) obtained using data generated from the model, whereas table 13 presents the associated weights. The variability of Δ^IV (Z_l+1, Z_l; b_k) demonstrates the presence of unobserved heterogeneity in our model across levels of wealth. The IVs presented in table 11 gives a partial picture of the local effects contained in the data.²⁷

Table 12.

Local IV Estimates by Wealth Level Δ^IV (Z_l+1, Z_l; b_k) Static Model

Wealth Level	(Z₂, Z₁)	(Z₃, Z₂)	(Z₄, Z₃)	(Z₅, Z₄)	(Z₆, Z₅)	(Z₇, Z₆)	(Z₈, Z₇)	(Z₉, Z₈)	(Z₁₀, Z₉)	Δ^IV (b_k)
b ₁	11.89	−8.95	−0.29	−5.53	7.73	−10.81	5.60	−6.13	9.03	−0.92
b ₂	−7.49	−1.31	1.29	−1.32	−2.97	−6.23	−2.69	1.74	1.05	−2.02
b ₃	−3.36	−2.79	−3.17	−1.63	−0.93	−6.39	−4.06	0.10	−4.17	−2.92
b ₄	−3.76	−3.50	−1.95	−7.04	−5.36	−2.91	−2.22	−3.25	−6.09	−3.97
b ₅	−7.05	−3.88	−2.43	−4.84	−6.72	−7.51	−3.65	−1.40	−5.79	−4.80
b ₆	−2.84	−5.24	−3.58	−8.85	−5.27	−7.12	−6.86	−3.94	−5.26	−5.76
b ₇	−5.58	−7.88	−4.96	−6.42	−8.63	−6.88	−8.22	−6.18	−6.00	−6.91
b ₈	−9.72	−6.58	−8.43	−10.29	−10.13	−4.44	−10.11	−3.57	−6.01	−7.91
b ₉	−4.17	−8.68	−12.31	−15.37	−1.80	−12.74	1.77	−19.21	−7.99	−9.10
b ₁₀	−16.29	−11.53	0.92	−24.90	−12.82	7.29	−22.50	5.74	−17.65	−9.75

Open in a new tab

Table 13.

IV Weights by Wealth Level (λ_l(b_k)) Static Model

Wealth Level	(Z₂, Z₁) (1)	(Z₃, Z₂) (2)	(Z₄, Z₃) (3)	(Z₅, Z₄) (4)	(Z₆, Z₅) (5)	(Z₇, Z₆) (6)	(Z₈, Z₇) (7)	(Z₉, Z₈) (8)	(Z₁₀, Z₉) (9)	Σ_lλ_l(b_k) (1)+(2)+…+(9)
b ₁	0.05	0.10	0.13	0.15	0.15	0.14	0.13	0.10	0.05	1.00
b ₂	0.05	0.10	0.13	0.15	0.14	0.15	0.13	0.10	0.05	1.00
b ₃	0.05	0.10	0.13	0.15	0.15	0.15	0.13	0.10	0.05	1.00
b ₄	0.05	0.10	0.13	0.15	0.15	0.15	0.13	0.10	0.05	1.00
b ₅	0.05	0.10	0.12	0.15	0.15	0.15	0.13	0.10	0.05	1.00
b ₆	0.06	0.10	0.13	0.14	0.15	0.15	0.13	0.10	0.05	1.00
b ₇	0.05	0.10	0.13	0.14	0.15	0.15	0.13	0.09	0.05	1.00
b ₈	0.05	0.10	0.12	0.16	0.15	0.14	0.13	0.10	0.06	1.00
b ₉	0.06	0.10	0.13	0.15	0.14	0.15	0.13	0.09	0.05	1.00
b ₁₀	0.06	0.11	0.12	0.13	0.15	0.14	0.14	0.09	0.06	1.00

Open in a new tab

Importantly, the local IV estimates presented in table 12 have a causal interpretation. Specifically, they identify the effects of the treatment for those individuals induced to switch regime as a result of a change in the instrument. In other words, Δ^IV (Z_l+1, Z_l; b_k) identifies $Δ^{LATE} (Z_{l + 1}, Z_{l}; b^{k}) = E (c_{i}^{I} - c_{i}^{A} ∣ D_{i} (Z_{l} + 1) - D_{i} (Z_{l}) = 1, b = b_{k})$ where $c_{i}^{I}$ and $c_{i}^{A}$ denote the consumption levels under intermediation and autarky, respectively, and D_i(Z_l) denotes the value for the dummy variable associated with intermediation when individual i faces Z = Z_l. This causal interpretation of IV comes from the fact that Z is a valid instrument and from the assumption of a uniform (or monotonic) effect of Z on D (from the lottery). In the next section, we show how this causal interpretation of local IV breaks down in the context of a dynamic model.²⁸

As previously discussed, in the context of models with unobserved heterogeneity, reduced form approaches (including IVs) might not give estimates of the average effect of the treatment (ATE) or the treatment effect on those treated (TT). This is because each of these parameters depend in one way or another on counterfactual outcomes, and therefore, their estimation requires additional structure. Fortunately for us, full control of our model allows us to generate these counterfactual states, and consequently, all the treatment parameters. Table 14 presents these treatment parameters. It also presents the average treatment effects for those untreated or TUT. We immediately observe that there are differences among the treatment parameters, which is again a manifestation of the presence of unobserved heterogeneity.

Table 14.

Model Generated Treatment Parameters: ATE, TT and TUT, by Wealth Level Static Model

Wealth Level	Δ^ATE(b_k)	Δ^TT (b_k)	Δ^TUT (b_k)	Consumption Under Financial Autarky
b ₁	−0.81	−0.81	−0.83	22.92
b ₂	−1.83	−1.84	−1.82	23.95
b ₃	−2.85	−2.87	−2.79	24.97
b ₄	−3.87	−3.88	−3.86	25.99
b ₅	−4.89	−4.90	−4.87	27.01
b ₆	−5.91	−5.92	−5.91	28.03
b ₇	−6.93	−6.93	−6.94	29.05
b ₈	−7.95	−7.93	−7.99	30.07
b ₉	−8.97	−8.97	−8.98	31.09
b ₀	−9.99	−10.00	−9.99	32.11

Overall	−5.40	−5.17	−5.90	27.52

Open in a new tab

5.2 Dynamic Mechanism Design

Suppose now there are two time periods in our contract model. We continue defining Z_i as a cost of entering the lottery which is subtracted from wealth b_i.

We denote by $b_{i}^{'}$ the wealth level in the second period which is a “decision” variable in the context of the first period. Individuals in our model are allowed to switch from intermediation today to autarky tomorrow, and also the opposite.

The program introduced in section 5.1 already determined the optimal arrangement in the second period. Importantly, in the first period, not only consumption but also the characteristics of the future arrangement are used to reward individuals. But indirect utility $U (b_{i}^{'}; θ_{i}, Z_{i})$ carries all the information from the second period arrangement that is relevant for the characterization of the optimal contract in the first period as only utility matters for incentives. We use this fact in what follows.

When the result of the lottery is autarky in the first period, a particular distribution of transfer to the individual, w_Ai, is determined. Then, the individual decides the amount of effort, the output level is obtained, and finally, the individual decides how to split his resources q_i + w_Ai between consumption today and wealth level for the second period. Thus, the program determining the optimal policy given the available resources b_i can be written as:

U_{A i} (b_{i}; θ_{i}, Z_{i}) = \max_{Π_{A}} \sum_{b_{i}^{'} w_{A i}, q_{i}, e_{i}} Π_{A} (b_{i}^{'}, w_{A i}, q_{i}, e_{i}) [u (q_{i} + w_{A i} - b_{i}^{'}, e_{i}) + U (b_{i}^{'}; θ_{i}, Z_{i})]

s.t.

\sum_{b_{i}^{'}, w_{A i}} Π_{A} (b_{i}^{'}, w_{A i}, {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}) = \Pr ({\overset{‒}{q}}_{i} ∣ {\overset{‒}{e}}_{i}, θ_{i}) \sum_{b_{i}^{'}, w_{A i}, q_{i}} Π_{A} (b_{i}^{'}, w_{A i}, {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}) \forall {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}

(36)

\sum_{b_{i}^{'}, w_{A i}, q_{i}, e_{i}} Π_{A} (b_{i}^{'}, w_{A i}, q_{i}, e_{i}) w_{A i} = b_{i}

(37)

Π_{A} \geq 0 and \sum_{b_{i}^{'}, w_{A i}, q_{i}, e_{i}} Π_{A} (b_{i}^{'}, w_{A i}, q_{i}, e_{i}) = 1

(38)

In this dynamic version of the model, the contract under intermediation can be characterized by the joint distribution of next period’s wealth, and the first period levels of consumption, production and effort, $Π_{I} (b_{i}^{'}, c_{i}, q_{i}, e_{i})$ .

The incentive constraint under intermediation in the first period is,

\sum_{b_{i}^{'}, c_{i}, q_{i}} Π_{I} (b_{i}^{'}, c_{i}, q_{i}, e_{i}) [u (c_{i}, e_{i}) + U (b_{i}^{'}; θ_{i}, Z_{i})] \geq \sum_{b_{i}^{'}, c_{i}, q_{i}} Π_{I} (b_{i}^{'}, c_{i}, q_{i}, e_{i}) \frac{\Pr (q_{i} ∣ {\overset{‒}{e}}_{i}, θ_{i})}{\Pr (q_{i} ∣ e_{i}, θ_{i})} [u (c_{i}, {\overset{‒}{e}}_{i}) + U (b_{i}^{'}; θ_{i}, Z_{i})] \forall {\overset{‒}{e}}_{i}, e_{i}

(39)

and the constraint securing that $Π_{I} (b_{i}^{'}, c_{i}, q_{i}, e_{i})$ be consistent with the stochastic technology is

\sum_{b_{i}^{'}, c_{i}} Π_{I} (b_{i}^{'}, c_{i}, {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}) = \Pr ({\overset{‒}{q}}_{i} ∣ {\overset{‒}{e}}_{i}, θ_{i}) \sum_{b_{i}^{'}, c_{i}, q_{i}} Π_{I} (b_{i}^{'}, c_{i}, q_{i}, {\overset{‒}{e}}_{i}) \forall {\overset{‒}{q}}_{i}, {\overset{‒}{e}}_{i}

(40)

Finally, the probability constraints are:

Π_{I}, Π_{A} \geq 0

(41)

\sum_{b_{i}^{'}, c_{i}, q_{i}, e_{i}} Π_{I} (b_{i}^{'}, c_{i}, q_{i}, e_{i}) + \sum_{b_{i}^{'}, w_{A i}, q_{i}, e_{i}} Π_{A} (b_{i}^{'}, w_{A i}, q_{i}, e_{i}) = 1

(42)

The bank faces the following zero expected profit condition:

\sum_{b_{i}^{'}, c_{i}, q_{i}, e_{i}} Π_{I} (b_{i}^{'}, c_{i}, q_{i}, e_{i}) (c_{i} + b_{i}^{'} - q_{i}) + \sum_{b_{i}^{'}, w_{A i}, q_{i}, e_{i}} Π_{A} (b_{i}^{'}, w_{A i}, q_{i}, e_{i}) w_{A i} = b_{i} - Z_{i}

(43)

This constraint assures that the amount initially given to the bank equals the expected amount transferred for the individual either under intermediation or autarky, plus the cost of intermediation Z_i. (A similar constraint is already imposed in the second period).

Therefore, given the initial promise $U (b_{i}^{'}; θ_{i}, Z_{i})$ , the first period program describing the efficient allocation of resources becomes:

Program 2

\max_{Π_{I}, Π_{A}} Σ_{b_{i}^{'}, c_{i}, q_{i}, e_{i}} Π_{I} (b_{i}^{'}, c_{i}, q_{i}, e_{i}) [U (b_{i}^{'}; θ_{i}, Z_{i}) + u (c_{i}, e_{i}) - κ_{I i}] + Σ_{b_{i}^{'}, w_{A i}, q_{i}, e_{i}} Π_{A} (b_{i}^{'}, w_{A i}, q_{i}, e_{i}) [U (b_{i}^{'}; θ_{i}, Z_{i}) + u (q_{i} + w_{A i} - b_{i}^{'}, e_{i})]

Our main interest in this model is the critical role of ex-ante promise utility in the second period $U (b_{i}^{'}; θ_{i}, Z_{i})$ . This variable determines the incentives in the first period, and this fact has consequence for the interpretation of Z_i as a valid instrument. Note from our earlier discussion that expected utility in the static problem depends on Z_i. Now the static problem is the second period problem, and so one sees the intuition that varying levels of utility depend on Z through these second period promised utilities. These promises thus impact current period incentives and so vary with Z. In order to see this, notice the higher is Z_i the less surplus there will be during the second period to maintain a given level of promise utility $U (b_{i}^{'}; θ_{i}, Z_{i})$ . As a result of this, we might expect U(·) to be a monotone decreasing function of Z_i. On the other hand, assignment to intermediation in the second period when Z_i is high (low) allows the assignment of a low (high) promise as a threat for bad outcomes in the first period. Therefore, either way the level promise U(·) depends on the instrument Z_i. Thus, we lose the desirable properties of the Z_i as a potential instrument. Promised utility in the second period depends on Z_i and promised utility determines current period incentives

5.2.1 An Example

As in the static case, we study the differences between the model generated treatment parameters and the estimates obtained by a researcher using observational data on consumption, wealth, and intermediation. We use the same parameterization as in the previous case (see table 10, and equations (34) and (35)). In this case, we observe 33.10% of the individuals (endogenously) reporting financial intermediation.²⁹

Table 15 presents the IV and OLS estimates (overall and by level of wealth). In general, the differences between IV and OLS are larger than what we computed in table 11. Table 16 on the other hand, presents the local IV estimates Δ^IV (Z_l+1, Z_l; b_k) for k = (1, … , 9), which - jointly with the weights presented in table 17 - produce the IV estimates reported in table 15. We again observe a larger variability in IVs (across wealth levels) compared to the results in table 12. This fact reflects the strong selection process driving the decision into intermediation.

Table 15.

IV and OLS Estimates computed by Wealth Level - Dynamic Model

Wealth Level	Δ^IV (b_k) (1)	Δ^OLS(b_k) (2)	% Difference $(\frac{(1) - (2)}{∣ (2) ∣})$

b ₁	−1.85	−1.48	−25.0%
b ₂	−2.59	−2.16	−19.7%
b ₃	−3.08	−2.86	−7.8%
b ₄	−3.77	−3.55	−6.1%
b ₅	−4.46	−4.24	−5.2%
b ₆	−5.14	−4.92	−4.4%
b ₇	−5.70	−5.61	−1.6%
b ₈	−6.42	−6.30	−1.9%
b ₉	−7.16	−6.97	−2.8%

Overall Effect-Restricted Model (Equation (34))	−4.74	−4.33	−9.5%
Overall Effect-Full Interactions (Equation (35))	−4.70	−4.50	−4.5%

Open in a new tab

Note: All estimates are statistically significant at 5% level.

Table 16.

Estimated Local IV by Wealth Level Δ^IV (Z_l+1, Z_l; b_k)Dynamic Model

Wealth Level	(Z₂, Z₁)	(Z₃, Z₂)	(Z₄, Z₃)	(Z₅, Z₄)	(Z₆, Z₅)	(Z₇, Z₆)	(Z₈, Z₇)	(Z₉, Z₈)	(Z₁₀, Z₉)	Δ^IV (b_k)
b ₁	3.56	3.93	−1.94	−8.45	1.40	−6.88	−5.11	−1.83	−0.80	−1.85
b ₂	−3.80	−3.23	−2.08	−1.64	−1.70	−4.54	−3.19	−4.19	−1.72	−2.59
b ₃	−5.14	−0.55	−4.26	−3.37	−2.13	−4.58	−3.60	−4.67	−2.25	−3.08
b ₄	−7.10	0.74	−4.96	−3.69	−2.86	−6.26	−5.72	−3.45	−3.45	−3.77
b ₅	−5.58	−2.92	−3.32	−6.42	−2.64	−7.46	−6.08	−4.38	−3.83	−4.46
b ₆	−2.27	−4.45	−5.32	−4.00	−4.76	−7.03	−5.48	−6.76	−5.06	−5.14
b ₇	−9.50	−2.87	−5.88	−6.01	−5.30	−7.78	−5.62	−5.60	−5.81	−5.70
b ₈	−7.44	−6.39	−4.18	−7.43	−5.67	−6.46	−7.87	−6.27	−6.54	−6.42
b ₉	−8.66	−0.31	−8.88	−8.79	−6.32	−8.90	−6.74	−7.94	−5.80	−7.16

Open in a new tab

Table 17.

IV Weights by Wealth Level (λ_l(b_k)) Dynamic Model

Wealth Level	(Z₂, Z₁) (1)	(Z₃, Z₂) (2)	(Z₄, Z₃) (3)	(Z₅, Z₄) (4)	(Z₆, Z₅) (5)	(Z₇, Z₆) (6)	(Z₈, Z₇) (7)	(Z₉, Z₈) (8)	(Z₁₀, Z₉) (9)	Σ_lλ_l(b_k) (1)+(2)+…+(9)
b ₁	0.03	0.09	0.09	0.12	0.25	0.08	0.13	0.11	0.10	1.00
b ₂	0.03	0.09	0.09	0.12	0.26	0.07	0.14	0.11	0.10	1.00
b ₃	0.03	0.08	0.09	0.13	0.26	0.06	0.14	0.11	0.10	1.00
b ₄	0.02	0.07	0.09	0.13	0.25	0.06	0.15	0.11	0.10	1.00
b ₅	0.02	0.07	0.10	0.14	0.25	0.06	0.15	0.11	0.09	1.00
b ₆	0.02	0.06	0.10	0.14	0.25	0.06	0.16	0.12	0.09	1.00
b ₇	0.02	0.06	0.10	0.15	0.24	0.06	0.16	0.12	0.09	1.00
b ₈	0.02	0.05	0.10	0.15	0.23	0.06	0.16	0.12	0.09	1.00
b ₉	0.02	0.05	0.10	0.16	0.23	0.07	0.16	0.13	0.09	1.00

Open in a new tab

Tables 18 and 19 present the model generated treatment parameters. The numbers in these tables are obtained using the counterfactual consumption levels delivered by the model, which would not be available in observational data (as the one used to generate the numbers in tables 15, 16, and 17).

Table 18.

Model Generated Treatment Parameters: ATE, TT and TUT, by Wealth Level Dynamic Model

Wealth Level	Δ^ATE(b_k)	Δ^TT (b_k)	Δ^TUT (b_k)	Consumption Under Financial Autarky
b ₁	−1.49	−1.47	−1.51	22.00
b ₂	−2.18	−2.17	−2.19	22.69
b ₃	−2.87	−2.84	−2.89	23.38
b ₄	−3.56	−3.52	−3.58	24.07
b ₅	−4.25	−4.22	−4.26	24.76
b ₆	−4.94	−4.92	−4.95	25.46
b ₇	−5.63	−5.59	−5.64	26.17
b ₈	−6.32	−6.30	−6.32	26.87
b ₉	−7.01	−7.05	−7.00	27.57

Overall	−4.51	−4.17	−4.69	25.04

Open in a new tab

Table 19.

Model Generated Local Average Treatment Effects by Wealth Level Δ^LATE(Z_l+1, Z_l; b_k) Dynamic Model

Wealth Level	(Z₂, Z₁)	(Z₃, Z₂)	(Z₄, Z₃)	(Z₅, Z₄)	(Z₆, Z₅)	(Z₇, Z₆)	(Z₈, Z₇)	(Z₉, Z₈)	(Z₁₀, Z₉)	Δ^LATE(b_k)
b ₁	−1.37	−1.51	−1.77	−1.55	−1.28	−1.76	−1.64	−1.60	−1.45	−1.55
b ₂	−2.25	−2.22	−2.01	−2.12	−2.10	−1.92	−2.21	−2.49	−2.26	−2.17
b ₃	−2.79	−2.78	−2.78	−2.80	−2.80	−2.67	−2.89	−3.03	−3.06	−2.85
b ₄	−3.48	−3.41	−3.45	−3.40	−3.51	−3.49	−3.58	−3.62	−3.75	−3.53
b ₅	−4.15	−4.25	−4.20	−4.23	−4.06	−4.26	−4.27	−4.45	−4.45	−4.27
b ₆	−4.92	−4.88	−4.88	−4.75	−4.83	−4.87	−4.88	−5.15	−5.04	−4.92
b ₇	−5.81	−5.51	−5.58	−5.69	−5.43	−5.39	−5.61	−5.67	−5.89	−5.62
b ₈	−6.47	−6.37	−6.16	−6.26	−6.04	−6.08	−6.32	−6.29	−6.47	−6.27
b ₉	−6.51	−6.81	−6.96	−6.73	−7.22	−7.00	−6.66	−7.22	−7.15	−7.00

Open in a new tab

The results in table 18 show important differences between the average treatment effect (ATE), the treatment effect on the treated (TT), and the treatment effect on the untreated (TUT). These differences illustrate how the presence of unobserved talent and the sorting mechanism into financial intermediation generate heterogenous treatment parameters. In this context, the analyst must first state the question she wants to answer, and then use the appropriate empirical approach to identify the treatment parameters of interest.

Table 19 on the other hand, presents the model generated local average treatment effects $Δ^{LATE} (Z_{l + 1}, Z_{l}; b^{k}) = E (c_{i}^{I} - c_{i}^{A} ∣ D_{i} (Z_{l} + 1) - D_{i} (Z_{l}) = 1, b_{i} = b_{k})$ . Given the problematic definition of Z_i as a proper instrument, the model generated LATE (table 19) and the estimated local IVs (table 16) are now different. Table 20 summarizes these large differences. In our dynamic model with dynamic incentives, local IVs would not identify the well-defined causal parameter LATE.

Table 20.

Model Generated Local Average Treatment Effect versus Estimated Local IVs, by Wealth Level Dynamic Model

Wealth Level	Δ^LATE(b_k) (1)	Δ^IV (b_k) (2)	% Difference $(\frac{(1) - (2)}{∣ (2) ∣})$

b ₁	−1.55	−1.85	16.2%
b ₂	−2.17	−2.59	16.0%
b ₃	−2.85	−3.08	7.6%
b ₄	−3.53	−3.77	6.4%
b ₅	−4.27	−4.46	4.2%
b ₆	−4.92	−5.14	4.2%
b ₇	−5.62	−5.70	1.4%
b ₈	−6.27	−6.42	2.4%
b ₉	−7.00	−7.16	2.3%

Overall	−4.50	−4.70	4.3%

Open in a new tab

6 Conclusions

This paper links contract theory models of financial intermediation to econometric policy evaluation. We have discussed a variety of economic models with unobserved heterogeneity and endogenous decisions involving financial intermediation. We also analyzed econometric techniques and policy evaluation which are appropriate or inappropriate, depending on the vision of the underlying model, the assumptions one is willing to make, and the data at hand.

Even though, under certain assumptions, an IV strategy can recover accurately a true model-generated causal effect (LATE), these are quantitatively different, in order of magnitude and even sign, from other policy impact parameters (e.g., treatment on the treated and the average treatment effect). We also show that laying out clearly alternative models can guide the search for instruments. Mechanism design can deliver natural lotteries of randomization that can be used as sources of identification in empirical analyses. On the other hand adding more margins of decision, i.e., occupation choice and intermediation jointly, or adding more periods with promised utilities as key state variables, as in optimal multi-period contracts, can cause the misinterpretation of the IV estimates as the causal parameter of interest (e.g., uniformity), so that IV and LATE might no longer coincide.

Our objective is to help researchers and policy makers assess accurately the impact of financial intermediation. In order to identify the impact of financial intermediation, researchers and policy makers need a clear understanding of the role of unobserved heterogeneity (coming from preferences, costs or talents) and the economic mechanisms driving individual’s endogenous decisions. A limited understanding of the economic fundamentals could result in a misinterpretation of policy parameters estimated from observational data. The good news is that there is a wide array of options, so it is a matter of choosing carefully.

Acknowledgments

Research funding from NICHD, NFS, Templeton Foundation, and Bill and Melinda Gates Foundations to the University of Chicago is gratefully acknowledge. The views expressed in this paper are those of the authors and not necessarily those of the funders listed here. We have received helpful comments from Cynthia Kinnan, Benjamin Olken, Marti Mestieri and Gabriel Madeira. Gabriel Madeira provided excellent research assistance.

Footnotes

This is easily modified to allow a choice between savings s and consumption c where c + s ≤ W and preferences are determined by a Cobb-Douglas utility function, giving a (myopic) savings rate.

Although this wage w is taken as given for each individual choice problem, it is consistent with a market clearing equilibrium wage.

See Rubin (1974) and Heckman and Honoré (1990) for a formal exposition of the Roy model.

⁴

This since we assume that the subsidy ψ is not correlated with unobserved talents θ^W and θ^E.

⁵

If the subsidy takes on a finite number of discrete values, and we order them according to their magnitudes (ψ₀ < ψ₁ < … < ψ_K), then Δ^IV can be written as a weighted average of Δ^LATE (ψ_k, ψ_k+1) with k = 1, .., K − 1, where the weights are related to the probability of going into business at the various values of the subsidy (see Yitzhaki, 1989; Imbens and Angrist, 1994). Additionally, if we take the limit as subsidy ψ_k approaches ψ_k+1, this delivers the marginal treatment effect (MTE) for those households just indifferent to becoming business (see Heckman and Vytlacil, 2001).

⁶

Formally,

λ (\frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}}) = E (\frac{θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}} ∣ \frac{θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}} > \frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}}) = \frac{ϕ (\frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}})}{Φ (\frac{(ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i}}{\sqrt{σ_{W}^{2} + ϕ_{θ}^{2} σ_{E}^{2}}})}

where ϕ and Φ represents the probability density and cumulative distribution functions associated with a standard normal distribution, respectively.

⁷

Notice that this parameter is the limit version of the average local treatment effect. More specifically, Δ^MTE (p, b) = lim_p’→p Δ^LATE (p, p’, b).

⁸

This follows from the fact that

θ_{i}^{W} - ϕ_{θ} θ_{i}^{E} < (ϕ_{w} - 1) w + ϕ_{b} b_{i} + ψ_{i} \Leftrightarrow U < p (w, b_{i}, ψ_{i})

where

U = F_{θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}} (\cdot)

represents the cumulative distribution function associated with the random variable

θ_{i}^{W} - ϕ_{θ} θ_{i}^{E}

⁹

The empirical implementation of the local instrumental variable estimator involves the non-parametric estimation of the derivative of E(Y_i|p_i, b_i) with respect to p_i. Although the implementation of non-parametric techniques can be considered standard, in small samples they can be infeasible. See Heckman et al. (2008) and Heckman et al. (2006) for different empirical approaches when implementing the local instrumental variable estimator.

¹⁰

Using data from Thailand, Gine and Townsend (2004) estimate a model with similar characteristics to the one studied here.

¹¹

Notice that this expression follows directly from the theoretical model (see equation (11)).

¹²

The average effects defined in table 3 comes from the following expression

{\frac{Δ E (Y ∣ D, b)}{Δ D} ∣}_{b = \overset{‒}{b}} = κ_{2} \overset{‒}{b} + κ_{3}

where

\overset{‒}{b}

represents the average wealth in the population. We use

\frac{Δ Y}{Δ D}

to denote a change in Y due to a change in the discrete variable D.

¹³

Formally, from the model we can generate

Δ^{LATE} (1, 0; b) = \frac{E (Y_{i} ∣ ψ_{i} = 1, b_{i} = b) - E (Y_{i} ∣ ψ = 0, b_{i} = b)}{E (D_{i} ∣ ψ_{i} = 1, b_{i} = b) - E (D_{i} ∣ ψ_{i} = 0, b_{i} = b)}

and then, we compute

Δ^{LATE} (1, 0) = \int Δ^{LATE} (1, 0; t) d F_{b} (t)

(12)

where F_b(t) represents the cumulative distribution of wealth for those individuals switching occupations as a result of the change in the value of the instrument.

¹⁴

Our linear regression model implies the following approximation for LATE (as a function of wealth):

Δ^{IV} (1, 0; b) = κ_{2} b + κ_{3},

and consequently,

Δ^{IV} (1, 0) = κ_{2} \overset{‒}{b} + κ_{3}

where b‾ denotes the average wealth level. The comparison of Δ^IV (1, 0) and Δ^LATE (1, 0) in expression (12) illustrates the source of discrepancies between our estimates.

¹⁵

Under particular populations in which the occupational decision becomes irrelevant, we can use this method to determine the gains in profits for entrepreneurs induced to use the financial system. Suppose that the random assignment of ψ is such that there exists a population for which the subsidy is so high, ψ*, so that there are only firms regardless of the assigned values of the Q. In this case, we can estimate the local average treatment effect as:

\frac{E (ξ_{i} ∣ Q_{i} = \bar{\overset{‒}{Q}}, b_{i} = b, ψ_{i} = ψ^{*}) - E (ξ_{i} ∣ Q_{i} = \overset{‒}{Q}, b_{i} = b, ψ_{i} = ψ^{*})}{E (Υ_{i} ∣ Q_{i} = \bar{\overset{‒}{Q}}, b_{i} = b, ψ_{i} = ψ^{*}) - E (Υ_{i} ∣ Q_{i} = \overset{‒}{Q}, b_{i} = b, ψ_{i} = ψ^{*})},

which (under uniformity) identifies the income gains associated with intermediation for those who are isolated from the wage sector, or

E (π (θ_{i}^{E}, w, r) - π (θ_{i}^{E}, b_{i}, w) ∣ b_{i} = b, ψ_{i} = ψ^{*}, Υ_{i} (\bar{\overset{‒}{Q}}) = 1, Υ_{i} (\overset{‒}{Q}) = 0) .

¹⁶

However, as in the case of financial intermediation, under particular populations we can use the local treatment effect to identify the effect of entrepreneurship for individuals under financial autarky. Specifically, suppose that the random assignment of Q is such that there exists a population for which the costs of using the financial intermediary are too high, Q*, so that regardless of the assigned values of the subsidy they choose to be in financial autarky. In this case, we can use the instrument ψ to compute:

\frac{E (ξ_{i} ∣ ψ_{i} = \bar{\overset{‒}{ψ}}, b_{i} = b, Q_{i} = Q^{*}) - E (ξ_{i} ∣ ψ_{i} = \overset{‒}{ψ}, b_{i} = b, Q_{i} = Q^{*})}{E (D_{i} ∣ ψ_{i} = \bar{\overset{‒}{ψ}}, b_{i} = b, Q_{i} = Q^{*}) - E (D_{i} ∣ ψ_{i} = \overset{‒}{ψ}, b_{i} = b, Q_{i} = Q^{*})}

which (under uniformity) identifies

E (π (θ_{i}^{E}, b_{i}, w) - (w + θ_{i}^{W}) ∣ b_{i} = b, Q_{i} = Q^{*}, D_{i} (\bar{\overset{‒}{ψ}}) = 1, D_{i} (\overset{‒}{ψ}) = 0)

which is the income gains associated with entrepreneurship for those individuals isolated from financial intermediation.

¹⁷

Formally, suppose individuals decide among J different options. Each option has associated a utility level V_j for j = 1, … , J. Let D_j = 1 if the individual selects the j-th alternative, and 0 otherwise. Furthermore, as in the model of this section, assume D_j = 1 if V_j = max{V₁, … , V_J} for j = 1, … , J. Let Y_j denote the potential outcome associated with option j. Valid instruments affect choices but are independent from potential outcomes. Let Z_j denote the instrument associated with option j. We present the relationship between instrument and options as V_j(Z_j), i.e., instrument Z_j determines the utility level V_j. V_j also depends on unobserved components which can be correlated with potential outcome Y_j. For notational simplicity we leave this dependence implicit. Observed outcome Y can be written as $Y = Σ_{j = 1}^{J} D_{j} Y_{j}$ . Heckman et al. (2006) shows that $Δ^{IV (Z_{j})} = \frac{E (Y ∣ Z = z) - E (Y ∣ Z = z^{'})}{\Pr (D_{j} = 1 ∣ Z = z) - \Pr (D_{j} = 1 ∣ Z = z^{'})}$ where z = (z₁, … , z_j, … , z_j) and $z = (z_{1}, . . ., z_{j}^{'}, . . ., z_{J})$ , so that only the variation from Z_j is utilized to compute Δ^{IV (Z_j)}, identifies the effect on outcome of option j versus the next best option.

¹⁸

Notice that when phrasing our model as a model of multiple treatments, intermediation costs Q and subsidy ψ are not valid instruments, in the sense of Z_j entering only V_j (see previous footnote), for any of the four alternatives.

¹⁹

One could present the following regression model for the simultaneous analysis of the effects of occupation and financial intermediation:

Ψ_{i} = κ_{0} + κ_{1} b_{i} + κ_{2} Υ_{i} b_{i} + κ_{3} D_{i} (1 - Υ_{i}) + κ_{4} b_{i} D_{i} (1 - Υ_{i}) + κ_{5} Υ_{i} D_{i} (r) + κ_{6} b_{i} Υ_{i} D_{i} (r) + ε_{i}

(22)

In this case, the information from both instruments (Ψ_i and Q_i) should be used to control for the endogeneity provoked by the selection processes. As previously explained, this model, in which the two margins are simultaneously modeled, has additional complications that go beyond the scope of our analysis in this paper. See Heckman et al. (2006) for an analysis of this case.

²⁰

The risk sharing role of formal financial institutions is tested in Alem and Townsend (2008).

²¹

Notice that if the individual does not know her unobserved preference parameter θ_i or, alternatively, if she knows θ_i but for some reason does not act on it, then the selection process would not be based on unobserved gains. Formally, in this case $E (ε_{i t}^{I} - ε_{i t}^{A} ∣ I_{i 0}) = 0$ , and the model would produce homogeneous treatment effects.

²²

A literature on sudden devaluations causing wealth losses from dollar denominated loans is not unrelated.

²³

See Karaivanov and Townsend (2009) for further work with the Thai data and the estimation of financial regimes in a dynamic context.

²⁴

It is worth mentioning that experimenting with different sample sizes suggests that reducing the number of observations produces significant losses in the accuracy of the local IV estimates.

²⁵

More precisely, we work with (b₁, b₂, b₃, b₄, b₅, b₆, b₇, b₈, b₉, b₁₀) = (10.5, 10.6, 10.7, 10.88, 10.9, 11.1, 11.2, 11.3, 11.4, 11.5), and (Z₁, Z₂, Z₃, Z₄, Z₅, Z₆, Z₇, Z₈, Z₉, Z₁₀) = (0, 0.03, 0.06, 0.1, 0.13, 0.16, 0.2, 0.23, 0.26, 0.3). Given the structure of the model and our ordering, the resulting probabilities associated with the lottery is increasing in Z_j and decreasing in b_k. We also consider a discrete grid for talent θ. Specifically, we solve the model for (θ₁, θ₂, θ₃, θ₄, θ₅, θ₆, θ₇, θ₈, θ₉) = (0.6, 0.7, 0.8, 0.9, 1, 1.1, 1.2, 1.3, 1.4). The distribution of talent and wealth generated using the discrete grids respects the joint distribution associated with these random variables presented in table 10.

²⁶

A more complicated version would include risk sharing. See Alem and Townsend (2008).

²⁷

The numbers presented in table 11 are obtained using Δ^IV (Z_l+1, Z_l; b_k) and the IV weights presented in tables 12 and 13, respectively.

²⁸

We do not present the model generated LATE in this case. This is because, as in the previous examples of a valid instrument satisfying uniformity, they will be close to the estimated local IV estimates, and so we prefer not to repeat the argument.

²⁹

Here we work with the following values for wealth and the instrument (b₁, b₂, b₃, b₄, b₅, b₆, b₇, b₈, b₉) = (18.25, 18.33, 18.41, 18.5, 18.58, 18.66, 18.75, 18.83, 18.91), and (Z₁, Z₂, Z₃, Z₄, Z₅, Z₆, Z₇, Z₈, Z₉, Z₁₀) = (0, 0.03, 0.07, 0.1, 0.14, 0.18, 0.21, 0.25, 0.29, 0.32). Given the structure of the model and our ordering, the resulting probabilities associated with the lottery tends to be increasing in Z_j and decreasing in b_j. For talent θ, we solve the model using (θ₁, θ₂, θ₃, θ₄, θ₅, θ₆, θ₇, θ₈, θ₉) = (0.86, 0.88, 0.9, 0.92, 0.93, 0.95, 0.97, 0.98, 1).

Contributor Information

Robert M. Townsend, Department of Economics, MIT

Sergio S. Urzua, Department of Economics Northwestern University

References

[1].Alem Mauro, Townsend Robert. An evaluation of safety nets and financial institutions in crisis and growth. University of Chicago, Department of Economics; 2008. unpublished manuscript. [Google Scholar]
[2].Felkner John, Townsend Robert M. Enterprise and the wealth of villages. University of Chicago; 2007. unpublished manuscript. [Google Scholar]
[3].Gine Xavier, Townsend Robert. Evaluation of financial liberalization: a general equilibrium model with constrained occupation choice. Journal of Development Economics. 2004;74:269–307. [Google Scholar]
[4].Greenwood Jeremy, Jovanovic Boyan. Financial development, growth, and the distribution of income. Journal of Political Economy. 1999;98:1076–1107. [Google Scholar]
[5].Heckman James J., Honoré Bo E. The empirical content of the Roy model. Econometrica. 1990;58:1121–1149. [Google Scholar]
[6].Heckman James J., Schmierer Daniel, Urzua Sergio. Journal of Econometrics. University of Chicago, Department of Economics under revision; 2008. Testing the correlated random coefficient model. unpublished manuscript. [DOI] [PMC free article] [PubMed] [Google Scholar]
[7].Heckman James J., Urzua Sergio, Vytlacil Edward J. Understanding instrumental variables in models with essential heterogeneity. Review of Economics and Statistics. 2006;88:389–432. [Google Scholar]
[8].Heckman James J., Vytlacil Edward J. In: Cheng Hsiao, Kimio Morimune, Powell James L., editors. Local instrumental variables; Nonlinear Statistical Modeling: Proceedings of the Thirteenth International Symposium in Economic Theory and Econometrics: Essays in Honor of Takeshi Amemiya; New York: Cambridge University Press. 2001.pp. 1–46. [Google Scholar]
[9].Imbens Guido W., Angrist Joshua D. Identification and estimation of local average treatment effects. Econometrica. 1994;62:467–475. [Google Scholar]
[10].Jeong Hyeok, Townsend Robert. Growth and inequality: Model evaluation based on an estimation-calibration strategy. Macroeconomic Dynamics. 2008;12:231–284. doi: 10.1017/S1365100507070149. [DOI] [PMC free article] [PubMed] [Google Scholar]
[11].Kaboski Joseph P., Townsend Robert M. Policies and impact: An analysis of village-level micro finance institutions. Journal of the European Economic Association. 2005;3:1–50. [Google Scholar]
[12].Kaboski Joseph P., Townsend Robert M. The impact of credit on village economies. Ohio State University; 2009. unpublished manuscript. [DOI] [PMC free article] [PubMed] [Google Scholar]
[13].Karaivanov Alexander K., Townsend Robert M. Enterprise dynamics and finance: Distinguishing mechanism design from exogenously incomplete markets models. Department of Economics, Simon Fraser University; 2009. unpublished manuscript. [Google Scholar]
[14].Lloyd-Ellis Huw, Bernhardt Dan. Enterprise, inequality and economic development. Review of Economic Studies. 2000;67:147–168. [Google Scholar]
[15].Matzkin Rosa L. Nonparametric and distribution-free estimation of the binary threshold crossing and the binary choice models. Econometrica. 1992;60:239–270. [Google Scholar]
[16].Olken Benjamin. Monitoring corruption: Evidence from a field experiment in Indonesia. Journal of Political Economy. 2007;115:200–249. [Google Scholar]
[17].Paulson Anna L., Townsend Robert M., Karaivanov Alex. Distinguishing limited liability from moral hazard in a model of entrepreneurship. Journal of Political Economy. 2006;114:100–144. [Google Scholar]
[18].Roy AD. Some thoughts on the distribution of earnings. Oxford Economic Papers. 1951;3:135–146. [Google Scholar]
[19].Rubin Donald B. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66:688–701. [Google Scholar]
[20].Salop Steven C. Strategic entry deterrence. American Economic Review. 1979;69:335–338. [Google Scholar]
[21].Townsend Robert M., Ueda Kenichi. Financial deepening, inequality, and growth: A model-based quantitative evaluation. Review of Economic Studies. 2006;73:251–293. [Google Scholar]
[22].Townsend Robert M., Ueda Kenichi. Welfare gains from financial liberalization. International Economic Review. 2009 doi: 10.1111/j.1468-2354.2010.00593.x. Forthcoming. [DOI] [PMC free article] [PubMed] [Google Scholar]
[23].Vissing-Jorgensen Annette. Towards an explanation of household portfolio choice heterogeneity: Non-financial income and participation cost structures. National Bureau of Economic Research; 2002. Working Paper 8884. [Google Scholar]
[24].Yitzhaki Shlomo. On using linear regression in welfare economics. Department of Economics, Hebrew University; 1989. Working Paper 217. [Google Scholar]

[R1] [1].Alem Mauro, Townsend Robert. An evaluation of safety nets and financial institutions in crisis and growth. University of Chicago, Department of Economics; 2008. unpublished manuscript. [Google Scholar]

[R2] [2].Felkner John, Townsend Robert M. Enterprise and the wealth of villages. University of Chicago; 2007. unpublished manuscript. [Google Scholar]

[R3] [3].Gine Xavier, Townsend Robert. Evaluation of financial liberalization: a general equilibrium model with constrained occupation choice. Journal of Development Economics. 2004;74:269–307. [Google Scholar]

[R4] [4].Greenwood Jeremy, Jovanovic Boyan. Financial development, growth, and the distribution of income. Journal of Political Economy. 1999;98:1076–1107. [Google Scholar]

[R5] [5].Heckman James J., Honoré Bo E. The empirical content of the Roy model. Econometrica. 1990;58:1121–1149. [Google Scholar]

[R6] [6].Heckman James J., Schmierer Daniel, Urzua Sergio. Journal of Econometrics. University of Chicago, Department of Economics under revision; 2008. Testing the correlated random coefficient model. unpublished manuscript. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] [7].Heckman James J., Urzua Sergio, Vytlacil Edward J. Understanding instrumental variables in models with essential heterogeneity. Review of Economics and Statistics. 2006;88:389–432. [Google Scholar]

[R8] [8].Heckman James J., Vytlacil Edward J. In: Cheng Hsiao, Kimio Morimune, Powell James L., editors. Local instrumental variables; Nonlinear Statistical Modeling: Proceedings of the Thirteenth International Symposium in Economic Theory and Econometrics: Essays in Honor of Takeshi Amemiya; New York: Cambridge University Press. 2001.pp. 1–46. [Google Scholar]

[R9] [9].Imbens Guido W., Angrist Joshua D. Identification and estimation of local average treatment effects. Econometrica. 1994;62:467–475. [Google Scholar]

[R10] [10].Jeong Hyeok, Townsend Robert. Growth and inequality: Model evaluation based on an estimation-calibration strategy. Macroeconomic Dynamics. 2008;12:231–284. doi: 10.1017/S1365100507070149. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] [11].Kaboski Joseph P., Townsend Robert M. Policies and impact: An analysis of village-level micro finance institutions. Journal of the European Economic Association. 2005;3:1–50. [Google Scholar]

[R12] [12].Kaboski Joseph P., Townsend Robert M. The impact of credit on village economies. Ohio State University; 2009. unpublished manuscript. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] [13].Karaivanov Alexander K., Townsend Robert M. Enterprise dynamics and finance: Distinguishing mechanism design from exogenously incomplete markets models. Department of Economics, Simon Fraser University; 2009. unpublished manuscript. [Google Scholar]

[R14] [14].Lloyd-Ellis Huw, Bernhardt Dan. Enterprise, inequality and economic development. Review of Economic Studies. 2000;67:147–168. [Google Scholar]

[R15] [15].Matzkin Rosa L. Nonparametric and distribution-free estimation of the binary threshold crossing and the binary choice models. Econometrica. 1992;60:239–270. [Google Scholar]

[R16] [16].Olken Benjamin. Monitoring corruption: Evidence from a field experiment in Indonesia. Journal of Political Economy. 2007;115:200–249. [Google Scholar]

[R17] [17].Paulson Anna L., Townsend Robert M., Karaivanov Alex. Distinguishing limited liability from moral hazard in a model of entrepreneurship. Journal of Political Economy. 2006;114:100–144. [Google Scholar]

[R18] [18].Roy AD. Some thoughts on the distribution of earnings. Oxford Economic Papers. 1951;3:135–146. [Google Scholar]

[R19] [19].Rubin Donald B. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66:688–701. [Google Scholar]

[R20] [20].Salop Steven C. Strategic entry deterrence. American Economic Review. 1979;69:335–338. [Google Scholar]

[R21] [21].Townsend Robert M., Ueda Kenichi. Financial deepening, inequality, and growth: A model-based quantitative evaluation. Review of Economic Studies. 2006;73:251–293. [Google Scholar]

[R22] [22].Townsend Robert M., Ueda Kenichi. Welfare gains from financial liberalization. International Economic Review. 2009 doi: 10.1111/j.1468-2354.2010.00593.x. Forthcoming. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] [23].Vissing-Jorgensen Annette. Towards an explanation of household portfolio choice heterogeneity: Non-financial income and participation cost structures. National Bureau of Economic Research; 2002. Working Paper 8884. [Google Scholar]

[R24] [24].Yitzhaki Shlomo. On using linear regression in welfare economics. Department of Economics, Hebrew University; 1989. Working Paper 217. [Google Scholar]

PERMALINK

Measuring the Impact of Financial Intermediation: Linking Contract Theory to Econometric Policy Evaluation *

Robert M Townsend

Sergio S Urzua

Abstract

1 Introduction

2 A Standard Model of Occupation Choice

2.1 Standard Econometric Approaches for The Analysis of the Impact of Occupational Decisions

2.2 Parametric and Semi-Parametric Identification of Treatment Effect Parameters

Figure 1.

2.3 Measuring the Impact of Occupations on Income

Table 1.

2.3.1 Using Cross-Sectional Information to Estimate the Effect of Occupational Choice

Table 2.

Table 3.

2.3.2 Using the Structure of the Model to Generate Counterfactual Outcomes and The Causal Effects of Occupational Choices

Table 4.

Figure 2.

3 Occupational Choice Under Financial Intermediation

3.1 Identifying the Effects of Financial Intermediation

3.2 Example

Table 5.

3.2.1 Using Cross-Sectional Information to Estimate the Effect of Financial Intermediation and Occupational Choices

Table 6.

Table 7.

3.2.2 Using the Structure of the Model to Generate Counterfactual Outcomes and the Causal Effect of Financial Intermediation and Choices

Table 8.

Table 9.

4 Dynamics, Risk Sharing, Unobserved Heterogeneity and Occupational Choice

4.1 Once-And-For-All Participation Decisions and Participation Costs as Instruments

4.2 Sequential Participation Decisions

4.3 The Identification Power of Policies

5 A Model of Financial Intermediation with Moral Hazard and Collateral Constraints

5.1 Statics

Program 1

5.1.1 The Role of Lotteries and Z as a Valid instrument

Figure 3.

5.1.2 Example

Table 10.

Table 11.

Table 12.

Table 13.

Table 14.

5.2 Dynamic Mechanism Design

Program 2

5.2.1 An Example

Table 15.

Table 16.

Table 17.

Table 18.

Table 19.

Table 20.

6 Conclusions

Acknowledgments

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Measuring the Impact of Financial Intermediation: Linking Contract Theory to Econometric Policy Evaluation ^{^*}