The Government Finance Database: A Common Resource for Quantitative Research in Public Financial Analysis

Kawika Pierson; Michael L Hand; Fred Thompson

doi:10.1371/journal.pone.0130119

. 2015 Jun 24;10(6):e0130119. doi: 10.1371/journal.pone.0130119

The Government Finance Database: A Common Resource for Quantitative Research in Public Financial Analysis

Kawika Pierson ^1,^*, Michael L Hand ¹, Fred Thompson ¹

Editor: Frank Emmert-Streib²

PMCID: PMC4479543 PMID: 26107821

Abstract

Quantitative public financial management research focused on local governments is limited by the absence of a common database for empirical analysis. While the U.S. Census Bureau distributes government finance data that some scholars have utilized, the arduous process of collecting, interpreting, and organizing the data has led its adoption to be prohibitive and inconsistent. In this article we offer a single, coherent resource that contains all of the government financial data from 1967-2012, uses easy to understand natural-language variable names, and will be extended when new data is available.

Introduction

Widely shared and easy to use databases facilitate quantitative research and render the replication of findings practical and convenient [1]. Indeed, much of what we know about public finance has been tested against large microdata sets–in the United States, primarily merged information files based on household-level data from the IRS Individual Public-Use Tax Files, the Current Population Survey, the Consumer Expenditure Survey, and the triennial Survey of Consumer Finances. Unfortunately, students of public financial management at the local government level must often rely on one-off, custom-built datasets to pursue their inquiries, which is costly, inimical to replication, and leaves practitioners uncertain about the utility of academic insights.

For someone from outside the field of public financial management the lack of widely used and consistently applied data might seem an unlikely obstacle. After all, scholars of public financial management have access to a database that is in many respects ideally suited to their needs. The U.S. Census Bureau has surveyed state and local governments annually since 1967, and, as the Director of the U.S. Census Bureau stated in a letter accompanying the 2013 request for financial information: “This survey is the only comprehensive source of information on the finances of local governments in the United States.”

Many examples of research using these data exist, including recent papers by Gore [2], Baber and Gore [3], Kido et al. [4], Murray et al. [5], Carroll [6], Mullins [7], and Fisher and Papke [8], among others. However because the government financial data retrieved from the census require substantial effort to obtain, interpret, translate, consolidate, and use, every example of its scholarly application is unique in the years included for analysis, variables consolidated or ignored, and types of governments considered.

The diversity of treatments and time horizons in work using the Census of Governments data isn’t surprising given the investment of time and resources necessary to work with the data, but it is potentially damaging to the interpretation and application of research in our field. By consolidating the Census Bureau’s government financial data into a single, coherent database we hope to alleviate these concerns and move quantitative research in public finance progressively forward.

Arguably, the situation is similar to the situation in corporate finance prior to the availability of the CRSP-COMPUSTAT database of stock prices and accounting data. Accounting data were available from the Securities and Exchange Commission and data on share prices could be obtained from various vendors, but merging and matching observations from these files was prohibitively costly. Consequently, the data were rarely used and, when they were, it was nearly impossible to explain, let alone resolve, the numerous discrepancies in the findings that resulted, which held back sustained intellectual progress in the field. Corporate financial research no longer suffers from this problem. The CRSP-COMPUSTAT database has secured the field’s sustained progress.

This article describes the steps we have taken to make the Census Bureau’s annual surveys of state and local government finances equally easy to interpret and use. It offers a single, comprehensive database of government finance statistics, which includes detailed financial data from states, municipalities, townships, special districts, and school districts for the years 1967 through 2012, processed to make it user friendly–uncomplicated to use and convenient for replication. The database is freely available and can be downloaded from: http://www.willamette.edu/mba/research_impact/public_datasets/.

We will demonstrate some applications of the database here, but its potential for scholarly inquiry is staggering. The data include extensive information on government revenue from both tax and non-tax sources, facilitating a more general understanding of strategies to increase revenue streams [9], the interdependencies of local government and school district revenue [10], or the budgetary impacts of revenue diversity [11], just to name a few possibilities.

The data include detailed breakdowns of expenditures by both type and function, which can propel answers to questions about spending on education and transportation [12], the importance of the business cycle for budgets [13], geographic impacts on categories of municipal spending [14], or the applicability of aggregate budget functions [15]. The database also contains information about the cash positions of governments, the issuance and retirement of debt, and the investments of social insurance trusts.

Some caveats are appropriate however. The government finance database is not a perfect resource. In particular the data do not include measures of accomplishment or effort, except where money spent is a reasonable proxy, and so the database must be supplemented if such measures are important to the question being studied, e.g. by merging it with performance data, such as the Texas school-district performance data [16]. However, given the push towards both methodological [17] and theoretical [18] innovation in public administration research, and given the existing diversity the field displays in those areas (see for instance [19] [20]) this breadth of financial information provided from a single, standardized source has the potential to facilitate a diverse body of inquiry.

After explaining the overall structure of the data, what variables are included, and how the data is transformed from its raw state, we will transition to discussing several insights that arose from our initial analysis of the data. These include examples of using the data to better understand patterns in government finance as well as important advice for other researchers working with the database.

Materials and Methods

Issues with Data Availability and Coding

The basic unit of reporting in government accounting is the fund, essentially a separate bucket of financial resources tasked with accomplishing some objective [21]. While more recent accounting guidance mandates that some government-wide information be reported in addition to fund level reports [22], the census extends this reference frame by consolidating information across funds and presenting all of its data on a government-wide basis. This approach is broadly beneficial for studies that seek to understand something about governments as separate financial entities, and better conforms to the way that citizens and financial intermediaries (as opposed to governmental managers charged with oversight) use government accounting data [23].

While many government funds report information on a modified-accrual basis, some are required to report using a cash basis, a modified cash basis, or (rarely) a full accrual basis [24]. This diversity of reporting practices within and across governments presents some difficulty to anyone attempting to present or utilize government financial data in a consistent manner. Given that a transformation of data between the different accounting treatments is not possible, the census adopts the accounting basis declared by each government fund “so long as that basis (1) conforms to generally accepted accounting procedures and (2) is applied consistently from year-to-year.” In practice this means that the data are best conceptualized as roughly equivalent to cash flows, even though they will not always represent actual cash flows during the periods reported.

The data are reported in thousands of nominal dollars, unadjusted for changes in prices or wages over time, allowing researchers to choose whether and how best to convert the information into real dollars. The time period represented in the data is 1967–2012, however the number of governments included varies significantly from year to year. The primary source of this variation is the fact that the Census Bureau collects financial data from governments in two separate, but related efforts. During years ending in a 2 or 7 the government collects a census (essentially a population) of government financial statistics in the “Census of Government Finance and Employment Data”. Every year when a census is not being conducted a sample of governments report data through the “Annual Survey of State & Local Government Finances”.

The data include federal (type 6), state (type 0), county (type 1), municipal (type 2), township (type 3), special district (type 4), and school district data (type 5), each of which can be isolated by censoring the data on the “Type_Code” variable. While every state is included in the sample every year the coverage for other government types is less complete. Fig 1 shows the number of governments of each type that are included in the data each year.

Fig 1 highlights several important insights into the coverage of the data over time. Reporting rates are uniformly high during years when a full census was conducted. In addition, school districts report at much higher rates than other governments, but show a large reduction in reporting during the years between 1993 and 1996. Closer examination of the data for other government types shows a similar (but less visually pronounced) reduction in coverage during those years.

In 1993 the census began sampling a smaller, but still significant, portion of all government types. Because of their work with the National Center for Education Statistics, the census was able to resume nearly complete coverage of school districts following the 1997 census, but the other government types were never again sampled at the levels seen in the late 1980’s.

One of the largest hurdles in the process of organizing the government financial data as a single, coherent database is learning to interpret the codes used by the census to identify what each data point represents. The database we present replaces these codes with natural language variable names borrowed from the census’ classification manual, however understanding the codes that the census uses internally will help readers to validate, interpret, and apply our work.

Each census code combines an “object code” with a “function code”. Object codes are one character long and represent large categories or types of data. For instance, the object code T is used for all tax revenues. Function codes are double digit numbers that indicate what the funds in question were used for. Combining an object code, such as A, for current charges, with a function code, such as 12, for elementary and secondary education, results in a pointer to a particular variable, in this case A12: current charges from elementary and secondary education. Function codes are not applied consistently across the entire data set, but are still useful to understanding data within large sections of it. For instance, the function code 01 represents property taxes whenever it is used with object code T, but represents air transportation with every expenditure function code.

Creating a Single, Coherent Database

The government financial data comes in two forms. Data from 1967 through 2007 is more or less organized in the manner that researchers expect from panel data. The files are divided by year. Each row of each file corresponds to one government. There are several columns for identifying information and a column for each financial variable. These columns are all labeled with natural language names that make it easy to understand what they represent. One wrinkle arises from the fact that data for this period is always provided in three separate text files each year. Each file contains a row for every government and some identifying information, but the three files contain different subsets of the financial information available.

Overcoming this challenge is straightforward. Given the consistent naming scheme used by the census for these years we simply merge the three data files each year so that all of the columns are available in one large matrix. We then loop through the years available and continue aggregating the data into what we call the “early database”.

The newer data presents a much more substantial challenge, and the process of consolidating it with the early database to create one source of data is our main contribution. Data after 2007 is organized into two files per year. The first file is a fixed width text file called the “Individual Unit File”. On each row of this file there is one government ID number, one census data code, one number representing data, the year of the data, and a character that encodes something about how the data was gathered.

This organization presents the first major hurdle to merging the recent data with the early database, since each row of the individual unit file holds data that must comprise one cell in the final matrix. For this reason the individual unit file is transposed so that there is one row per government and one column for each census data code.

The second file the Census provides contains identifying information for every government in that year’s data, and is organized by government ID code. This “Government ID” file has the name of each government, population figures, and several other pieces of identifying information. Once the individual unit files are transposed they are merged with this identifying information to create the “recent database”.

The second major hurdle presented by the more recent government financial data is the fact that the data are not encoded with natural language variable names the way that the early data is. This needs to be fixed, and so the final step in our data consolidation process is a mapping of each of the census codes onto the variable names used in the early database. The Census provides some resources to facilitate the process, including a user’s guide to the early data and classification manuals describing the recent data, but the process is still time consuming and meticulous in a way that likely deters other researchers from incorporating the recent data into their studies.

In the end we take the recoded recent data and merge it with the early data to present a single coherent database of government financial data between 1967 and 2012. Specific instructions for replicating our consolidation, the SAS code we employed, and a mapping of data codes to variable names is available in S1 File and are also included with the database when you download it as one of our supporting information files (S2, S3, S4, S5, S6 and S7 Files) which each contain data for one government type. A high-level view of the process of organizing and consolidating all of the government financial data is shown in Fig 2.

Fig 2 — A conceptual overview of the process of consolidating the various data files to form a single coherent database.

Categories of Financial Data

At a high level the data for each government are grouped into four large categories: revenue flows, expenditure flows, cash and investment positions, and debt positions.

Revenue

The revenue data are organized by sector into general revenue, utility revenue, liquor store revenue, and social insurance trust revenue. Each of these sectors is comprised of a number of smaller subcategories, as shown in Table 1.

Table 1. Revenue Categories.

Revenue Categories	Census Object	Description
General Revenue	T B C D A U	All revenue not arising from utilities, liquor stores, or social insurance
Taxes	T	All taxes other than those assessed for social insurance
Intergovernmental Revenue	B C D	Transfers to the government from others, including grants and shared taxes
From Federal	B	Intergovernmental revenue from federal sources
From State	C	Intergovernmental revenue from state sources
From Local	D	Intergovernmental revenue from local sources
Current Charges	A	Fees collected for providing services, other than utility service charges or liquor store charges
Miscellaneous General Rev	U	Other general revenue from a government’s own sources
Utility Revenue	A	Revenue from providing water, electric, gas, or transportation services
Liquor Store Revenue	A	Sales revenue from government run liquor stores
Social Insurance Trust Rev	X Y	Contributions and investment earnings (or losses) for all social insurance programs.
Retirement Plans	X	Contributions and investment earnings (or losses) for public employee retirement programs
Unemployment Revenue	Y	Contributions and investment earnings (or losses) for the unemployment compensation insurance system

Open in a new tab

This table shows the high-level organization of the different revenue variables in the database. It references the census object codes used to create these categories, and provides a short description of each. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each category can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

The revenue data within each subcategory are further broken down in order to identify more specific sources of funds. Tax revenues have the largest number of subcategories in the data. Table 2 summarizes the organization of tax revenue subcategories.

Table 2. Tax Revenue Categories.

Tax Revenue Categories	Census Code	Description
Total Taxes		The sum of all of the tax categories
Property Tax	T01	All taxes on property that use its value as a basis
Total Sales Taxes		The sum of general and selective sales taxes
General Sales Tax	T09	Taxes on the sale of all types of goods and services
Total Selective Sales Taxes		The sum of the eight selective sales tax categories
Alcoholic Beverage	T10	Sales taxes on government and private sales of alcohol
Amusement	T11	Sales taxes on all types of amusement businesses
Insurance Premium	T12	Sales taxes on insurance
Motor Fuel	T13	Sales taxes on fuels for vehicles and aircraft
Pari-mutuels	T14	Sales taxes on wagers and betting
Public Utilities	T15	Sales taxes on government owned utilities
Tobacco	T16	Sales taxes on tobacco products
Other Selective Sales Tax	T19	All other selective sales taxes
Total License Taxes		The sum of the nine licensing tax subcategories
Alcoholic Beverage	T20	Licenses pertaining to alcohol
Amusement	T21	Licenses pertaining to amusement businesses
Corporate	T22	Licenses pertaining to all corporations
Hunting and Fishing	T23	Licenses pertaining to hunting and fishing
Total Motor Vehicle		The sum of motor vehicle and operator licenses
Motor Vehicle	T24	Licenses pertaining to the right to operate a vehicle (Registration, plates, inspection ect.)
Operator Licenses	T25	Licenses pertaining to the right to drive
Public Utility	T27	Licenses imposed on public utilities
Occupation and business	T28	Licenses for certain professions and businesses
Other Licenses	T29	All other licenses.
Total income Taxes		The sum of individual and corporate income taxes.
Individual	T40	Taxes on the income of individuals
Corporate	T41	Taxes on the income of corporations
Death and Gift Tax	T50	Taxes on the transfer of property after death
Documentary Tax	T51	Taxes on the transfer of documents
Severance Tax	T53	Taxes on the removal of natural resources
Taxes NEC	T99	All other taxes not listed above

Open in a new tab

This table gives a detailed breakdown of the different tax revenue data reported. NEC stands for not elsewhere classified. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each category can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

Intergovernmental revenue data is first separated based on its source (from the federal, state, or local government) as shown in Table 1. Within each of these sources intergovernmental revenue is categorized by its intended use. Table 3 displays this structure.

Table 3. Intergovernmental Revenue Functions.

Intergovernmental Revenue Categories	Census Number	Description
Air Transportation	01	Aid in support of public airports
Interschool revenue	11	Aid from one school district to another (schools only)
Education	21	Aid for public schools
Employment Security	22	Transfers to the states from the federal government for unemployment insurance
General Support	30	Aid that can be applied for any purpose
Health and Hospitals	42	Aid intended for public health or hospitals
Highways	46	Aid to be used for roads, streets, and highways
Transit Subsidies	94	Aid for mass transit systems
Housing and Community Dev	50	Aid for public housing and other community development
Natural Resources	59	Federal aid for conservation resource protection
Public Welfare	79	Aid for social welfare programs
Sewerage	80	Aid for sewage systems, disposal and treatment
Other Uses	89	All other aid not classified above

Open in a new tab

This table describes a detailed breakdown of the different intergovernmental revenue function codes reported by governments in the data. More detailed descriptions of each function code can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

The precise application of each of these categories changes somewhat based on the source of the intergovernmental revenue. For instance federally sourced intergovernmental revenue for public welfare includes programs such as TANF (Temporary Assistance for Needy Families) and Medicaid, whereas state sourced intergovernmental revenue for public welfare includes pass-through of these programs, as well as revenue arising from state specific programs. An exhaustive documentation of what each variable contains and excludes is available in the census classification manual included in the supporting information files (S2, S3, S4, S5, S6, or S7 Filess).

Current charges are amounts that the government collects from individuals and corporations in exchange for providing services. They are reported in gross amounts, ignoring any cost of service. Liquor stores and utilities are excluded from current charges and given their own category of revenue in order to distinguish them from general revenue. Charges are separated based on the type of service provided as shown in Table 4.

Table 4. Current Charge Functions.

Charge Functions	Census Number	Description
Total General Charges		The sum of all charges
Airport Charges	01	Charges relating to air transportation
Misc. Commercial Charges	03	Charges from all publicly owned enterprises NEC
Total Education Charges		The sum of the three education subcategories
Total Elem-Secondary		The sum of the next three variables
School Lunch	09	Revenue from the sale of milk and school lunches
Tuition	10	Charges for tuition and transportation
Other	12	Other charges (athletics, textbooks ect.)
Higher Education	16 18	All charges from public higher education
All Other Education	21	Charges from all other state or federally run schools
Hospital Charges	36	Charges for care in publicly run hospitals
Total Highway Charges		The sum of the next two variables
Regular Highways	44	Assessments and fees for the maintenance of non-toll roads
Toll Highways	45	Fees from toll roads
Housing and Com Dev	50	Revenue from the rental of public housing
Natural Resources	56 59	Charges from forestry and other natural resources
Parking Charges	60	Charges from on and off-street parking, and lots
Parks and Recreation	61	Revenue from facilities, parks, stadiums ect.
Sewerage	80	Charges for sewage connection, collection and disposal
Solid Waste Management	81	Fees from garbage collection and the operation of landfills
Water Transport	87	Charges relating to port terminals and canal operation
All Other General Charges	89	All charges NEC

Open in a new tab

This table describes a detailed breakdown of the different current charge function codes reported in the data. NEC stands for not elsewhere classified. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each function code can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

Liquor store and utility revenue are not disaggregated to the extent that the other revenue data is. Total liquor store revenue is reported, and utility revenue is broken into revenue from each of the four types of utilities: water, electricity, gas, and mass transit.

Several categories of general revenue are listed under miscellaneous general revenue. Their organization is shown in Table 5.

Table 5. Miscellaneous General Revenue Variables.

Variable Name	Census Code	Description
Total Charges and Misc. Revenue		The sum of total charges and total misc. revenue
Total Misc. General Revenue		The sum of the seven variables below
Special Assessments	U01	Charges to individuals benefiting from improvements
Property Sale Other	U11	Gross receipts from all property sales
Interest Revenue	U20	Interest earnings from all sources
Fines and Forfeits	U30	Revenue from legal penalties
Rents and Royalties	U40 U41	The sum of rent and royalty income
Net Lottery Revenue	U95	Lottery proceeds net of the cost of prizes
Misc. General Revenue NEC	U99	All general revenue NEC

Open in a new tab

This table describes the coding of the miscellaneous revenue variables and provides a short description of each. NEC stands for not elsewhere classified. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each variable can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

The last category of revenue is revenue from social insurance trusts. Insurance trust revenue is separated into retirement plan revenue and unemployment revenue, and several smaller partitions of both are reported as shown in Table 6.

Table 6. Insurance Trust Revenue Variables.

Variable Name	Census Code	Description
Total Insurance Trust Revenue		The sum of all insurance trust revenue
Total Insurance Trust Contributions		The sum of the contribution variables below
Total Trust Investment Revenue		The sum of the investment variables below
Total Retirement Plan Revenue		The sum of all retirement plan revenue
Total Retirement Contributions		The sum of the following four contribution variables
Local Government Employees	X01	Contributions from employees of local governments
State Government Employees	X04	Contributions from employees of state governments
From Other Governments	X05	Contributions coming from other governments
Contribution to Own System	X06	Contributions to the government’s own system
Investment Earnings	X08	All earnings on the investments of the retirement plan
Total Unemployment Revenue		The sum of the following three variables
Unemployment Payroll Tax	Y01	Included in total insurance trust contributions
Unemployment Interest Revenue	Y02	Included in total investment revenue
Unemployment Federal Advances	Y04	Funds received when taxes and investments cannot cover the benefits due to unemployed workers

Open in a new tab

This table describes the coding of the social insurance trust revenue variables and provides a short description of each. NEC stands for not elsewhere classified. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each variable can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

Expenditure

Expenditures are organized according to their category and function. The category of each expenditure refers to how the cash was used, while the function of the expenditure refers to the type of service it was used to accomplish. In general every expenditure variable is a combination of one category and one function, following the logic of the census codes. For instance, “Air Transportation Capital Outlay” is in the capital outlay category and was used for the air transportation function.

Table 7 shows the different categories of expenditures that are recorded in the data. Total expenditures are the sum of direct expenditures and intergovernmental expenditures. Direct expenditures can further be broken down into current expenditures used to pay employees, purchase supplies and hire contractors; construction expenditures used to build long term assets; and expenditures used to purchase (rather than build) long term assets. Capital outlay expenditures are the sum of construction and purchase expenditures.

Table 7. Expenditure Categories.

Expenditure Categories	Census Object	Description
Total	E F G L M	The sum of all expenditures
Direct	E F G	Current expenditures (such as salaries and supplies), plus any expenditures for capital improvements
Capital Outlay	F G	Purchase or construction of capital improvements
Construction	F	Construction expenditures only
Intergovernmental to State	L	Paid to state governments for performance of functions or aid related to those functions
Intergovernmental to Local	M	Paid to local governments for performance of functions or aid related to those functions

Open in a new tab

Every set of expenditure data follows a similar organization. This table shows how to interpret the names given to the variables in the database, references the census object codes used to create them, and provides a short description of each. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each category can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

Intergovernmental expenditures are defined by the census as “amounts paid to other governments for performance of specific functions or for general financial support.” They are included in total expenditure, and are separated based on whether the funds went to state governments or local ones.

In a very small number of instances assistance, subsidies, and interest on debt are added to direct expenditures and total expenditures. Assistance and subsidies are coded by the census as object J, and occur four times in the data: state government scholarships (J19), federal categorical assistance programs (J67), other cash assistance programs (J68), and federal and state veterans’ services (J85). Interest on debt is coded by the census as object I and occurs five times: interest on general debt (I89), and interest on debt for the four classes of utilities (I91, I92, I93, and I94). When this occurs the data always include a separate line item reporting the amount of assistance, subsidies, or interest, allowing researchers to correct for their inclusion in direct expenditure if necessary.

Expenditures are also separated by function within the database. Table 8 shows the various expenditure functions considered in the data and the census function codes that correspond to them. Some of the expenditure functions recorded by the census only exist at the federal level and have been excluded from the database otherwise. Other codes exist in the newest census data but do not exist for years prior to 2007 and have been consolidated into their earlier versions to create a more coherent database.

Table 8. Expenditure Function Codes.

Expenditure Functions	Census Number	Expenditure Functions Continued	Census Number
Air Transport	01	Parking Facilities	60
Miscellaneous Commercial Activities, NEC	03	Parks and Recreation	61
Correctional Institutions	04	Police Protection	62
Elementary and Secondary Education	12	Protective Inspection & Reg., NEC	66
Higher Education	16 18	Public Welfare–sum of several smaller functions	67 68 74 75 77 79
State Government Scholarships	19	Federal Categorical Assistance	67
Education NEC	21	Other Cash Assistance Programs	68
Employment Security Administration	22	Vendor Payments Medical Care	74
Financial Administration	23	Vendor Payments Other Purposes	75
Fire Protection	24	Institutions	77
Judicial and Legal	25	Public Welfare—Other	79
Central Staff Services	29	Sewerage	80
General Public Buildings	31	Solid Waste Management	81
Health	32	Sea and Inland Port Facilities	87
Hospitals	36	General Expenditure NEC	89
Federal Owned Hospitals—Veterans	37	Liquor Stores	90
Federal Other Hospitals—Veterans	39	Utilities Total–sum of several smaller functions	91 92 93 94
Regular (non-toll) Highways	44	Water Supply	91
Toll Highways	45	Electric Power	92
Housing and Community Development	50	Gas Supply	93
Libraries	52	Public Mass Transit Systems	94
Natural Resources	55 56 59

Open in a new tab

The data include a number of different functional separations for expenditures and the table above shows the name of each along with the corresponding census function number or numbers included in that expenditure function. NEC stands for not elsewhere classified. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each expenditure function can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

Cash and Investment Positions

Several of the cash and investment positions of each government are recorded in the data. Other current and long term assets, such as those recorded on a typical statement of net position are not included by the census. A summary of these variables is shown in table 9.

Table 9. Cash and Investment Security Variables.

Variable Name	Census Code	Description
Total Cash and Securities		The sum of all cash and securities held
Insurance Trust Cash and Securities		The sum of retirement and unemployment investments
Employee Retirement Cash and Sec.		The sum of all employee retirement cash and security amounts
Employee Retirement Cash	X21	Cash held by the employee retirement system
Employee Retirement Securities		The sum of the following two subcategories
Federal Securities	X30	Amount invested in federal government securities
Non-Governmental Securities		The sum of the following five variables
Corporate Bonds	Z77	All forms of corporate debt
Corporate Stock	Z78	All forms of corporate equity investments
Mortgages	X42	Mortgages owed to the retirement system
Other Investments	X44	Mutual funds, international investments, loans to members and several other investments
Miscellaneous Investments	X47	All investments of the retirement system NEC
State and Local Government Sec.	X35	Included in X44 but also reported separately
Unemployment Cash and Securities		The sum of the following two variables
Unemployment in US Treasuries	Y07	The balance held in federal securities
Other Unemployment Balances	Y08	Negative when states borrow from the federal gov.
Non-Insurance Trust Cash and Sec.		The sum of the following three variables
Sinking Fund Cash and Securities	W01	Funds held in order to service debt
Bond Fund Cash and Securities	W31	Proceeds of bond issues awaiting disbursement
Other Non-Insurance Trust C&S	W61	All other non-insurance trust cash and investments

Open in a new tab

This table describes the coding of the cash and investment security variables and provides a short description of each. NEC stands for not elsewhere classified. The indentation of the variables in the first column indicates how subcategories of data collapse into larger categories. More detailed descriptions of each variable can be found in the Census’ 2006 classification manual (http://www.census.gov/govs/classification/) included with the database download.

It may seem like an odd choice to report state and local government securities twice given the explicit division between governmental and non-governmental securities in the data, but given that defaults by state and local governments are more likely than federal defaults ([25] and many others) including state and local government bonds with non-federal securities is reasonable. For situations where this combination is unwanted state and local government securities can be subtracted out of the non-governmental securities variable.

Debt Positions

Debt statistics were significantly simplified following the 2005 redesign of the Census’ government finance statistics program. Prior to this simplification data on debt were separated based on whether the debt was issued with the backing of the full faith and credit of the government in question, whether it was not guaranteed, or whether the guarantee was unspecified. Within each of those categories the debt was broken out by function: debt to be used for each of the four utilities (water, electric, gas, and transit), general use debt, elementary and secondary education debt, or higher education debt. Measures of debt outstanding, debt issued, and debt retired were recorded for each of these guarantees and functions.

Debt outstanding, issued, and retired are still reported variables, but the distinctions between the guarantee levels and functions of debt have been removed. Instead, debt variables are disaggregated into public debt for private purposes, and debt for all other general purposes.

Because the data prior to 2005 have substantial additional detail, the government finance database keeps all of the potential categories of debt, even though many of these values are missing for the most recent years. Any research using the finer-grained debt data should exclude years prior to 2005, but the larger debt categories are comparable over the entire timespan of the data.

Discussion

Implications of Unbalanced Panel Data

The government finance database is an unbalanced panel dataset because the annual samples vary in size, and so any analysis of the data should be informed by traditional approaches to working with unbalanced panels, such as fixed effects models [26]. However, a deeper understanding of how the sample varies over time can provide us with advantages over the simple application of statistical tools, by guiding future research designs and by helping to interpret results.

One particularly striking finding from a high level analysis of the data is that smaller (larger) sample sizes indicate that the sample is skewed towards larger (smaller) governments. The graphs in Figs 3 and 4 show a clear, inverse relationship for counties (r = -0.93) and municipalities (r = -0.78) between the number of governments sampled each year and the median population of the governments in the sample, providing strong evidence that larger municipalities are more likely to be sampled during non-census years. The two government types that are exceptions to this pattern are states and school districts, because of the uniformly high reporting rates for both.

Fig 3 — This figure shows how the median population reported in the data for counties correlates with the number of total records found in the data for counties. Median population is graphed against the left hand axis and the number of records is graphed against the right hand axis.

Fig 4 — This figure shows how the median population reported in the data for municipalities correlates with the number of total records found in the data for municipalities. Median population is graphed against the left hand axis and the number of records is graphed against the right hand axis.

This relationship indicates several actionable steps, beyond the straightforward advice to apply year fixed effects, for quantitative research using this data. First, considering government size in your research design will be essential. Directly controlling for size, or being able to make a plausible argument for why size is not important for the question being investigated is an important bar for studies using this database to clear. If such controls or arguments are missing, academics and policy makers should be very wary of generalizing their results.

Second, your research focus may inform the data cleaning and selection process in novel ways. For instance, studies that aim to identify long term financial trends across all governments may want to only use the data from years ending in a 2 or a 7, because that will ensure that every measure they calculate is representative of a population of governments. Some of the time series we graph later in this paper will clearly show the impact that ignoring this advice can have.

On the other hand, studies that include or truncate data based on the population of each observation may claim to be including all of the data, but are actually removing much of what is available and are prejudicing their sample towards including more observations during the most recent years. While this type of data cleaning is often implemented without much thought in other fields, reviewers of work using the government finance database should ask authors to justify (or test to ensure) that the choice to include only governments with a certain population does not bias the results of the study.

While the number of governments sampled in any given year varies considerably, impacting the median population of the sample, it has long been understood that city populations follow a power law, or Pareto distribution ([27] or [28]), and thus it is reasonable to ask whether years with a small number of governments might nonetheless cover a large fraction of the population. Figs 5 and 6 take advantage of the fact that all states report data for every year to calculate the percentage of the total population covered by various government types each year.

Fig 5 — This figure stacks together the population covered by municipalities and townships, and compares it to the population indicated at the state level for every year of the data.

Fig 6 — This figure graphs the population covered by the counties in the database, and compares it to the population indicated at the state level for every year of the data.

What these figures show is that even though the samples are skewed towards governments with the largest populations, and so are not representative of all cities or all counties, they do capture a sizeable portion of the overall population in both cases.

An important implication of this is that studies which use the government finance database to measure the overall economic force of a particular category of government cash flow are likely to come very close to an accurate estimate, even during small sample years. Per capita numbers, which are easy to compute in the database, will often be a reasonable tool to use given that the data cover so much of the population. These measures will still be weighted towards representing people living in the largest governments during years when the sample is the smallest, but most people live in places with large populations and so per capita measures will be broadly representative.

There are likely several reasons why none of the years reach 100% coverage for population. One is that Connecticut and Rhode Island do not report county data, even though they both have counties. In addition, the District of Columbia is coded as a state and not as a local government. In practice however these reasons do not account for much of the gap. Other sources potentially include systematic non-reporting from less obvious sources, or the possibility that state population estimates are updated more often than other governments and so display growth sooner.

Another important consideration in working with unbalanced panel data is that requiring a long, uninterrupted time series of observations will limit the generality of your results. More specifically in this case, depending on the type of government being researched, requiring consecutive observations is likely to bias your sample towards including larger governments and data measured during the years in the late 1980’s when the samples were larger. The size of this effect is controlled by the number of concurrent observations your research design requires however, so even small differences in such requirements have the potential to sizably impact your findings. Table 10 shows the number of observations that have consecutive data of a given length, and Table 11 shows how average population changes in those samples.

Table 10. The Impact of Requiring Consecutive Data on Sample Size.

Consecutive Years Required	All Reports	2-Year	3-Year	4-Year	5-Year
Federal	25	24	23	22	21
State	1,800	1,700	1,650	1,600	1,550
County	93,365	76,582	70,505	65,096	60,109
Municipality	361,300	208,715	170,286	144,915	121,995
Township	273,063	133,936	100,474	80,014	60,311
Special Districts	393,918	159,349	137,548	116,732	98,437
School Districts	488,319	425,588	395,090	367,150	339,564
All Types	1,611,790	1,005,894	875,576	775,529	681,987

Open in a new tab

This table displays how the sample size will change when researchers require consecutive years of data. The calculations are shown by government type, and each column increases the number of required years by one.

Table 11. The Impact of Requiring Consecutive Data on Average Population.

Consecutive Years Required	All Reports	2-Year	3-Year	4-Year	5-Year
Federal	229,703,936	229,703,936	229,703,936	229,703,936	229,703,936
State	5,148,833	5,148,833	5,148,833	5,148,833	5,148,833
County	95,796	108,182	113,368	118,457	123,753
Municipality	15,568	23,986	28,017	31,423	35,677
Township	4,963	7,521	8,923	10,087	11,967

Open in a new tab

This table displays how the average population of included governments will change when researchers require consecutive years of data. The calculations are shown by government type, and each column increases the number of required years by one.

Fig 7 graphs these sample sizes as a proportion of all of the available data for each government type.

These points also suggest some practical considerations that arise when using unbalanced panel data in less academic settings. For instance, if you are interested in discovering something about the revenues or expenses of a particular local government you are not likely to find a complete time pattern of behavior in this database. In fact, unless the government you are interested in serves a particularly large population you may find that data only exist once every five years. Individuals who want to dig deeply into the finances of a particular local government are likely to have much better luck asking for financial records directly from the local government they are interested in.

Results

In the process of organizing and cleaning the data we were struck by its wide applicability to many different areas of public administration. In this section we present a number of simple analyses that illustrate both the flexibility and usability of the government finance database.

Fig 8 is a good example. It shows a time series of the average number of tax revenue sources computed for both municipalities and school districts. These data were constructed by adding an indicator variable to the database for each type of tax revenue. The indicator was coded as a 0 whenever the total amount of that tax was either missing or equal to zero, and was coded as a 1 otherwise. The indicators for property taxes, sales taxes, income taxes, license taxes, and other taxes were then summed and the average was calculated, by year, for each government type.

The results show two interesting features. The first is a quantitative confirmation of the often-noted trend towards increasing revenue diversification by municipal governments [29]. This trend is mirrored by school districts, a fact which is far less well known. The second notable feature is that the number of municipal tax revenue sources looks much more variable than the number of school district tax revenue sources. In fact, much of that variability is induced by the different sample sizes (and therefore the different average populations) each year.

Fig 9 shows a similar analysis that also highlights several additional considerations for using the data. It graphs average, real, per capita government debt at both the state and municipal levels.

Scaling by population is easy, since population figures are included in the data, but because the database is recorded in nominal thousands of dollars any analysis that wants to control for inflation needs to merge an appropriate scaling factor into the database. In this case we used the annual average CPI levels from the Bureau of Labor Statistics (with 1983 ≅ 1), scaled total debt outstanding by both CPI and population, and multiplied each resulting figure by 1,000 (to correct for the fact that all data in the government finance database is recorded in thousands of dollars). The government level figures were then averaged, by year, for each government type.

On the surface the graph in Fig 9 shows many of the features described by Hildreth and Zorn [30], including a substantial increase in debt levels following the Tax Reform Act of 1986, decreasing new issues in the early 1990’s, and a general upward trend in debt outstanding since. Beyond those well-known trends however the real, per capita levels tell an interesting story about how large and small municipalities have used debt markets differently.

Prior to 1986 census years show relatively little difference from annual samples in terms of the average level of real debt per person. Following the 1987 census however those differences dominate the figure, indicating that large municipalities have taken advantage of the Tax Reform Act of 1986 far more than small municipalities, even in inflation adjusted per capita terms. While city size has been studied in relation to its impact on interest rates [31] [32], this previously unnoticed pattern between city size and the level of outstanding municipal debt is a potential area for future research.

There is no reason why the data need to be analyzed from the aggregate perspective our previous two figures used. Breaking the data out and studying one particular government is also an interesting exercise. For instance, Fig 10 shows the state of Oregon’s total revenue and total expenditure, in billions of nominal dollars through time.

Fig 10 — This figure shows the total revenue and total expenditure of Oregon State government over time.

The most striking feature of this graph is the sizeable impact of the great recession on total revenue in 2009. Contrary to what you might think, this change is not the result of a large decrease in taxes collected or any other traditional revenue source, instead virtually all of the difference between the 2008 and 2009 numbers comes from the approximately $12 billion dollar loss from public employee retirement system investment revenue.

The visual impact of this loss on the graph is small compared to the actual impact losses like this had on public retirement systems across the nation and the world [32], but it drives home an important point about the flexibility of the government finance database. Isolating more stable government revenues through the use of general revenue, rather than total revenue, is likely to be advisable in many situations, and further isolating your data from the impact of intergovernmental revenue by using the “own source” versions of either revenue number is also possible.

Another option for segmenting the data is to look more closely at patterns within a particular government type. Table 12 shows one such analysis for special districts. While the general pattern of growth in special districts is well known [34] [35], and there are a few isolated studies that attempt to understand what is driving that growth (cf. [36], a study using one year of data, to our table), there are no studies describing which types of special districts have contributed most to that growth.

Table 12. Special District Growth.

Special District Category	Code	1977 Ct.	2007 Ct.	Δ%	1977%	2007%
Total	-	25,987	35,574	37%	100.00%	100.00%
Local Fire Protection	24	4,186	5,814	39%	16.11%	16.34%
Water Supply Utility	91	2,481	3,424	38%	9.55%	9.63%
Housing and Community Development	50	2,412	3,391	41%	9.28%	9.53%
Other Multi-function Districts	99	517	2,545	392%	1.99%	7.15%
Soil and Water Conservation	88	2,431	2,531	4%	9.35%	7.11%
Drainage	51	2,254	2,021	-10%	8.67%	5.68%
Sewerage	80	1,608	1,867	16%	6.19%	5.25%
Libraries	52	588	1,663	183%	2.26%	4.67%
Cemeteries	2	1,615	1,588	-2%	6.21%	4.46%
Sewerage and Water Supply	98	1,064	1,359	28%	4.09%	3.82%
Parks and Recreation	61	830	1,320	59%	3.19%	3.71%
Other Single Function Districts	89	312	926	197%	1.20%	2.60%
Irrigation	64	933	827	-11%	3.59%	2.32%
Regular Highways	44	652	813	25%	2.51%	2.29%
Health	32	356	768	116%	1.37%	2.16%
Hospitals	40	717	671	-6%	2.76%	1.89%
Flood Control	63	681	588	-14%	2.62%	1.65%
School Building Authorities	9	1,019	522	-49%	3.92%	1.47%
Air Transportation	1	299	490	64%	1.15%	1.38%
Solid Waste Management	81	71	425	499%	0.27%	1.19%
Public Mass Transit Utility	94	96	356	271%	0.37%	1.00%
Other Natural Resources	59	179	336	88%	0.69%	0.94%
Miscellaneous Commercial Activities	3	0	297	-	0.00%	0.83%
Industrial Development	41	0	168	-	0.00%	0.47%
Sea and Inland Port Facilities	87	166	162	-2%	0.64%	0.46%
Electric Power Utility	92	82	154	88%	0.32%	0.43%
Reclamation	86	114	149	31%	0.44%	0.42%
Natural Resources and Water Supply	97	71	87	23%	0.27%	0.24%
Fire Protection and Water Supply	96	66	59	-11%	0.25%	0.17%
Gas Supply Utility	93	46	57	24%	0.18%	0.16%
Public Welfare Institutions	77	0	51	-	0.00%	0.14%
Mortgage Credit	42	0	39	-	0.00%	0.11%
Parking Facilities	60	122	32	-74%	0.47%	0.09%
Correctional Institutions	4	0	23	-	0.00%	0.06%
Police Protection	62	0	23	-	0.00%	0.06%
Toll Highways	45	0	15	-	0.00%	0.04%
Public Welfare	79	0	12	-	0.00%	0.03%
Other Corrections	5	0	1	-	0.00%	0.00%
Unknown	0	19	0	-100%	0.07%	0.00%

Open in a new tab

This table displays the growth of special districts by comparing the 1977 data with the 2007 data, and is ordered by the number of districts existing in the 2007 data. The absolute number of special districts of each type is displayed, along with the percentage change between the two years, and the proportion of all special districts that each type comprises. The code column is the special district function code used by the census.

Our findings demonstrate a number of interesting patterns. First, much of the growth seems to be an organic expansion of the most common special districts without much change in their proportion. For instance, even though local fire protection districts added 1,628 to their total and grew almost 40% over the 30 years, they represented a very stable 16% of all special districts at both points in time. Second, some of the most dramatic growth came from the other multi-function district category, which grew from around 2% to over 7% of all districts, indicating that citizens who form special districts are increasingly deciding that the efficiency of combining multiple functions (perhaps from economies of scale, or reductions in administrative costs), outweighs the burden arising from additional complexity.

There are many other interesting storylines that we might draw from this Table 12, including the reduction in school building authorities and cemeteries potentially representing shifts in population demographics, or the strong growth of library, health, and solid waste management districts potentially representing increased demand for those services in areas without the population to support them previously. The diversity of potential insights from this relatively simple analysis highlights the fact that we can only begin to characterize the full extent of the flexibility and utility of the government finance database here.

Conclusion

A trade-off between ease of use and purity exists with any data cleaning effort. On the one hand we would like to present researchers with a database that is free from abnormalities and can be easily used in the widest range of circumstances. On the other hand we also want a database that is as close to the raw data as possible in order to limit statistician induced measurement error.

In this effort we have made several decisions that ensure the purity of the data even when there is some reduction in its usability. We plan to implement fixes for these issues, but have reserved these changes for a later work, since this will give us an opportunity to describe our approach to the data cleaning in a complete way, and because our approaches are potentially controversial. Academics who appreciate the changes we plan to make are free to apply them or use our revised database, and those who disagree with us or would prefer to use an alternate method can still have access to the government financial data in this form. A brief description of the issues we would like to fix is warranted however, since our choosing to not amend the database now means that the data may have less utility for some studies.

The two primary issues surround the population figures and the fiscal year end dates. The issue with the population numbers is that they do not update annually. Given the fact that per capita levels are a common, useful transformation to apply to government finance statistics, the use of old population figures means that per capita variables are likely to be measured with error in many cases. Short of conducting a retrospective count of populations for every government in the dataset the best solution is to model what the population must have been in every year when the population estimate is not current. This model could take many different forms, so we will reserve a discussion of how to create it for the future.

The issue with the fiscal year end dates is threefold: inconsistent coding of dates, a large number of error codes that we can interpret, and a surprising collection of other strange entries that are harder to interpret. The bluntest illustration of this problem is that fact that there are 520 unique values of fiscal year end dates, and only 365 days in a year. There are several avenues for correcting this problem, none of which is perfect. In the meantime however research that relies on fiscal year end dates should be careful of dropping observations that don’t conform to the expected format of this data field.

While the data we provide is far from perfect it still represents a substantial step forward for quantitative research in public financial analysis, and helps to solve a long-standing problem created by the lack of standardized, cross-sectional databases in local government finance.

All of the data was collected by the U.S. Census Bureau’s annual surveys and five-year censuses of state and local government finance, but prior to our work the use of government financial data in public accounting and finance research always involved a substantial investment of time into data cleaning and organization. As a result there was very little standardization in the time periods and government types covered, and the interpretability, accessibility, and replicability of research suffered.

We offer this database in the hopes that it can bring more consistency and transparency to quantitative research in public financial management. In the process it should also make conducting this type of research less costly, and may provide a template for others with access to unique data sources who want to provide them to our field.

Supporting Information

S1 File. Appendix for the Government Finance Database.

This file contains detailed information showing how to replicate our work creating the database described in the paper. The three appendices include step by step instructions, a mapping of our variable names to the census data codes, and the SAS code we used to consolidate the data files we received from the Census into the government finance database.

(DOC)

Click here for additional data file.^{(445KB, doc)}

S2 File. The Government Finance Database State Data.

This file includes the state data from the Government Finance Database at the time of publication, the appendix for the Government Finance Database, and the U.S. Census 2006 Classification manual. For more up to date versions of the data please visit http://www.willamette.edu/mba/research_impact/public_datasets/.

(ZIP)

Click here for additional data file.^{(5.4MB, zip)}

S3 File. The Government Finance Database County Data.

This file includes the county data from the Government Finance Database at the time of publication, the appendix for the Government Finance Database, and the U.S. Census 2006 Classification manual. For more up to date versions of the data please visit http://www.willamette.edu/mba/research_impact/public_datasets/.

(ZIP)

Click here for additional data file.^{(33.9MB, zip)}

S4 File. The Government Finance Database Municipal Data.

This file includes the municipal data from the Government Finance Database at the time of publication, the appendix for the Government Finance Database, and the U.S. Census 2006 Classification manual. For more up to date versions of the data please visit http://www.willamette.edu/mba/research_impact/public_datasets/.

(ZIP)

Click here for additional data file.^{(91.3MB, zip)}

S5 File. The Government Finance Database Township Data.

This file includes the township data from the Government Finance Database at the time of publication, the appendix for the Government Finance Database, and the U.S. Census 2006 Classification manual. For more up to date versions of the data please visit http://www.willamette.edu/mba/research_impact/public_datasets/.

(ZIP)

Click here for additional data file.^{(39.9MB, zip)}

S6 File. The Government Finance Database Special District Data.

This file includes the special district data from the Government Finance Database at the time of publication, the appendix for the Government Finance Database, and the U.S. Census 2006 Classification manual. For more up to date versions of the data please visit http://www.willamette.edu/mba/research_impact/public_datasets/.

(ZIP)

Click here for additional data file.^{(42.3MB, zip)}

S7 File. The Government Finance Database School District Data.

This file includes the school district data from the Government Finance Database at the time of publication, the appendix for the Government Finance Database, and the U.S. Census 2006 Classification manual. For more up to date versions of the data please visit http://www.willamette.edu/mba/research_impact/public_datasets/.

(ZIP)

Click here for additional data file.^{(81.3MB, zip)}

Data Availability

The Government Finance Database is available at: http://www.willamette.edu/mba/research_impact/public_datasets/. The data are also included as Supporting Information files.

Funding Statement

The authors received no specific funding for this work.

References

1. Gill J, Meier KJ (2000) Public administration research and practice: A methodological manifesto. Journal of Public Administration Research and Theory, 10(1), 157–199. [Google Scholar]
2. Gore AK (2004) The effects of GAAP regulation and bond market interaction on local government disclosure. Journal of Accounting and Public Policy, 23(1), 23–52. [Google Scholar]
3. Baber WR, Gore AK (2008) Consequences of GAAP disclosure regulation: Evidence from municipal debt issues. The Accounting Review, 83(3), 565–592. [Google Scholar]
4. Kido N, Petacchi R, Weber J (2012) The influence of elections on the accounting choices of governmental entities. Journal of Accounting Research, 50(2), 443–476. [Google Scholar]
5.Murray SE, Evans WN, Schwab RM (1998) Education-finance reform and the distribution of education resources. American Economic Review, 789–812.
6. Carroll DA (2009) Diversifying municipal government revenue structures: Fiscal illusion or instability?. Public Budgeting & Finance, 29(1), 27–48. [Google Scholar]
7. Mullins DR (2004) Tax and Expenditure Limitations and the Fiscal Response of Local Government: Asymmetric Intra‐Local Fiscal Effects. Public Budgeting & Finance, 24(4), 111–147. [Google Scholar]
8.Fisher RC, Papke LE (2000) Local government responses to education grants. National Tax Journal, 153–168.
9. Mikesell JL, Ross JM (2012) Fast money? The contribution of state tax amnesties to public revenue systems. National Tax Journal, 65(3), 529–562. [Google Scholar]
10. Kurban H, Gallagher RM, Persky JJ (2012) Estimating local redistribution through property-tax-funded public school systems. National Tax Journal, 65(3), 629–65 [Google Scholar]
11. Chapman J, Gorina E (2012) Effects of the Form of Government and Property Tax Limits on Local Finance in the Context of Revenue and Expenditure Simultaneity. Public Budgeting & Finance, 32(4), 19–45. [Google Scholar]
12. Witko C, Newmark AJ (2010) The Strange Disappearance of Investment in Human and Physical Capital in the United States. Journal of Public Administration Research and Theory, 20(1), 215–232. [Google Scholar]
13. Hou Y (2006) Budgeting for fiscal stability over the business cycle: A countercyclical fiscal policy and the multiyear perspective on budgeting. Public Administration Review, 66(5), 730–741. [Google Scholar]
14. Vallés‐Giménez J, Zárate‐Marco A (2011) Disparities in the Spending Needs of Highland Municipalities. Public Budgeting & Finance, 31(3), 26–48. [Google Scholar]
15. Breunig C, Koski C, Mortensen PB (2010) Stability and punctuations in public spending: A comparative study of budget functions. Journal of Public Administration Research and Theory, 20(3), 703–722. [Google Scholar]
16. Meier KJ, Morton T, Molina A (2010) Texas School District Performance Data 1993–2009 College Station, TX: Project For Equity, Representation and Governance. [Google Scholar]
17. Krause GA, Meier KJ (2003) Politics, policy, and organizations: frontiers in the scientific study of bureaucracy University of Michigan Press. [Google Scholar]
18. Kelman S (2005) Public management needs help!. Academy of Management Journal, 48(6), 967–969. [Google Scholar]
19. Bartle JR (2001) Evolving Theories of Public Budgeting Amsterdam, New York: Emerald Group Publishing Limited. [Google Scholar]
20. Raadschelders JC (2013) Public administration: The interdisciplinary study of government Oxford University Press. [Google Scholar]
21. GASB. Concepts Statement No.1 of the Governmental Accounting Standards Board: Objectives of Financial Reporting (Norwalk, CT: GASB, 1987). [Google Scholar]
22.GASB. Statement No. 34 (Norwalk, CT: GASB, 1999).
23. Voorhees WR, Kravchuk RS (2001) The new governmental financial reporting model under GASB statement no. 34: an emphasis on accountability. Public Budgeting & Finance, 21(3), 1–30. [Google Scholar]
24.Ingram RW (1984) Economic incentives and the choice of state government accounting practices. Journal of Accounting Research, 126–144.
25. Chalmers JM (1998) Default risk cannot explain the muni puzzle: Evidence from municipal bonds that are secured by US Treasury obligations. Review of Financial Studies, 11(2), 281–308. [Google Scholar]
26. Baltagi B (2008) Econometric analysis of panel data (Vol. 1). John Wiley & Sons. [Google Scholar]
27. Beckmann MJ (1958) City hierarchies and the distribution of city size. Economic Development and Cultural Change, 6(3), 243–248. [Google Scholar]
28. Blank A, Solomon S (2000) Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components). Physica A: Statistical Mechanics and its Applications, 287(1), 279–288. [Google Scholar]
29. Hendrick R (2002) Revenue diversification: Fiscal illusion or flexible financial management. Public budgeting & finance, 22(4), 52–72. [Google Scholar]
30. Hildreth WB, Zorn C (2005) The evolution of the state and local government municipal debt market over the past quarter century. Public Budgeting & Finance, 25(4s), 127–153. [Google Scholar]
31. Rivers MJ, Yates BM (1997) City size and geographic segmentation in the municipal bond market. The Quarterly Review of Economics and Finance, 37(3), 633–645. [Google Scholar]
32. Simonsen B, Robbins MD, Helgerson L (2001) The influence of jurisdiction size and sale type on municipal bond interest rates: An empirical analysis. Public Administration Review, 61(6), 709–717. [Google Scholar]
33. Pino A. Yermo J (2010) The impact of the 2007–2009 crisis on social security and private pension funds: A threat to their financial soundness? International Social Security Review, 63(2), 5–30. [Google Scholar]
34. Carr JB (2006) Local government autonomy and state reliance on special district governments: A reassessment. Political Research Quarterly, 59(3), 481–492. [Google Scholar]
35.McCabe BC (2000) Special-district formation among the states. State & Local Government Review, 121–131.
36. Nunn S, Schoedel C (1997) Special districts, city governments, and infrastructure spending in 105 US metropolitan areas. Journal of Urban Affairs,19(1), 59–72. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Appendix for the Government Finance Database.

(DOC)

Click here for additional data file.^{(445KB, doc)}

S2 File. The Government Finance Database State Data.

(ZIP)

Click here for additional data file.^{(5.4MB, zip)}

S3 File. The Government Finance Database County Data.

(ZIP)

Click here for additional data file.^{(33.9MB, zip)}

S4 File. The Government Finance Database Municipal Data.

(ZIP)

Click here for additional data file.^{(91.3MB, zip)}

S5 File. The Government Finance Database Township Data.

(ZIP)

Click here for additional data file.^{(39.9MB, zip)}

S6 File. The Government Finance Database Special District Data.

(ZIP)

Click here for additional data file.^{(42.3MB, zip)}

S7 File. The Government Finance Database School District Data.

(ZIP)

Click here for additional data file.^{(81.3MB, zip)}

Data Availability Statement

The Government Finance Database is available at: http://www.willamette.edu/mba/research_impact/public_datasets/. The data are also included as Supporting Information files.

[pone.0130119.ref001] 1. Gill J, Meier KJ (2000) Public administration research and practice: A methodological manifesto. Journal of Public Administration Research and Theory, 10(1), 157–199. [Google Scholar]

[pone.0130119.ref002] 2. Gore AK (2004) The effects of GAAP regulation and bond market interaction on local government disclosure. Journal of Accounting and Public Policy, 23(1), 23–52. [Google Scholar]

[pone.0130119.ref003] 3. Baber WR, Gore AK (2008) Consequences of GAAP disclosure regulation: Evidence from municipal debt issues. The Accounting Review, 83(3), 565–592. [Google Scholar]

[pone.0130119.ref004] 4. Kido N, Petacchi R, Weber J (2012) The influence of elections on the accounting choices of governmental entities. Journal of Accounting Research, 50(2), 443–476. [Google Scholar]

[pone.0130119.ref005] 5.Murray SE, Evans WN, Schwab RM (1998) Education-finance reform and the distribution of education resources. American Economic Review, 789–812.

[pone.0130119.ref006] 6. Carroll DA (2009) Diversifying municipal government revenue structures: Fiscal illusion or instability?. Public Budgeting & Finance, 29(1), 27–48. [Google Scholar]

[pone.0130119.ref007] 7. Mullins DR (2004) Tax and Expenditure Limitations and the Fiscal Response of Local Government: Asymmetric Intra‐Local Fiscal Effects. Public Budgeting & Finance, 24(4), 111–147. [Google Scholar]

[pone.0130119.ref008] 8.Fisher RC, Papke LE (2000) Local government responses to education grants. National Tax Journal, 153–168.

[pone.0130119.ref009] 9. Mikesell JL, Ross JM (2012) Fast money? The contribution of state tax amnesties to public revenue systems. National Tax Journal, 65(3), 529–562. [Google Scholar]

[pone.0130119.ref010] 10. Kurban H, Gallagher RM, Persky JJ (2012) Estimating local redistribution through property-tax-funded public school systems. National Tax Journal, 65(3), 629–65 [Google Scholar]

[pone.0130119.ref011] 11. Chapman J, Gorina E (2012) Effects of the Form of Government and Property Tax Limits on Local Finance in the Context of Revenue and Expenditure Simultaneity. Public Budgeting & Finance, 32(4), 19–45. [Google Scholar]

[pone.0130119.ref012] 12. Witko C, Newmark AJ (2010) The Strange Disappearance of Investment in Human and Physical Capital in the United States. Journal of Public Administration Research and Theory, 20(1), 215–232. [Google Scholar]

[pone.0130119.ref013] 13. Hou Y (2006) Budgeting for fiscal stability over the business cycle: A countercyclical fiscal policy and the multiyear perspective on budgeting. Public Administration Review, 66(5), 730–741. [Google Scholar]

[pone.0130119.ref014] 14. Vallés‐Giménez J, Zárate‐Marco A (2011) Disparities in the Spending Needs of Highland Municipalities. Public Budgeting & Finance, 31(3), 26–48. [Google Scholar]

[pone.0130119.ref015] 15. Breunig C, Koski C, Mortensen PB (2010) Stability and punctuations in public spending: A comparative study of budget functions. Journal of Public Administration Research and Theory, 20(3), 703–722. [Google Scholar]

[pone.0130119.ref016] 16. Meier KJ, Morton T, Molina A (2010) Texas School District Performance Data 1993–2009 College Station, TX: Project For Equity, Representation and Governance. [Google Scholar]

[pone.0130119.ref017] 17. Krause GA, Meier KJ (2003) Politics, policy, and organizations: frontiers in the scientific study of bureaucracy University of Michigan Press. [Google Scholar]

[pone.0130119.ref018] 18. Kelman S (2005) Public management needs help!. Academy of Management Journal, 48(6), 967–969. [Google Scholar]

[pone.0130119.ref019] 19. Bartle JR (2001) Evolving Theories of Public Budgeting Amsterdam, New York: Emerald Group Publishing Limited. [Google Scholar]

[pone.0130119.ref020] 20. Raadschelders JC (2013) Public administration: The interdisciplinary study of government Oxford University Press. [Google Scholar]

[pone.0130119.ref021] 21. GASB. Concepts Statement No.1 of the Governmental Accounting Standards Board: Objectives of Financial Reporting (Norwalk, CT: GASB, 1987). [Google Scholar]

[pone.0130119.ref022] 22.GASB. Statement No. 34 (Norwalk, CT: GASB, 1999).

[pone.0130119.ref023] 23. Voorhees WR, Kravchuk RS (2001) The new governmental financial reporting model under GASB statement no. 34: an emphasis on accountability. Public Budgeting & Finance, 21(3), 1–30. [Google Scholar]

[pone.0130119.ref024] 24.Ingram RW (1984) Economic incentives and the choice of state government accounting practices. Journal of Accounting Research, 126–144.

[pone.0130119.ref025] 25. Chalmers JM (1998) Default risk cannot explain the muni puzzle: Evidence from municipal bonds that are secured by US Treasury obligations. Review of Financial Studies, 11(2), 281–308. [Google Scholar]

[pone.0130119.ref026] 26. Baltagi B (2008) Econometric analysis of panel data (Vol. 1). John Wiley & Sons. [Google Scholar]

[pone.0130119.ref027] 27. Beckmann MJ (1958) City hierarchies and the distribution of city size. Economic Development and Cultural Change, 6(3), 243–248. [Google Scholar]

[pone.0130119.ref028] 28. Blank A, Solomon S (2000) Power laws in cities population, financial markets and internet sites (scaling in systems with a variable number of components). Physica A: Statistical Mechanics and its Applications, 287(1), 279–288. [Google Scholar]

[pone.0130119.ref029] 29. Hendrick R (2002) Revenue diversification: Fiscal illusion or flexible financial management. Public budgeting & finance, 22(4), 52–72. [Google Scholar]

[pone.0130119.ref030] 30. Hildreth WB, Zorn C (2005) The evolution of the state and local government municipal debt market over the past quarter century. Public Budgeting & Finance, 25(4s), 127–153. [Google Scholar]

[pone.0130119.ref031] 31. Rivers MJ, Yates BM (1997) City size and geographic segmentation in the municipal bond market. The Quarterly Review of Economics and Finance, 37(3), 633–645. [Google Scholar]

[pone.0130119.ref032] 32. Simonsen B, Robbins MD, Helgerson L (2001) The influence of jurisdiction size and sale type on municipal bond interest rates: An empirical analysis. Public Administration Review, 61(6), 709–717. [Google Scholar]

[pone.0130119.ref033] 33. Pino A. Yermo J (2010) The impact of the 2007–2009 crisis on social security and private pension funds: A threat to their financial soundness? International Social Security Review, 63(2), 5–30. [Google Scholar]

[pone.0130119.ref034] 34. Carr JB (2006) Local government autonomy and state reliance on special district governments: A reassessment. Political Research Quarterly, 59(3), 481–492. [Google Scholar]

[pone.0130119.ref035] 35.McCabe BC (2000) Special-district formation among the states. State & Local Government Review, 121–131.

[pone.0130119.ref036] 36. Nunn S, Schoedel C (1997) Special districts, city governments, and infrastructure spending in 105 US metropolitan areas. Journal of Urban Affairs,19(1), 59–72. [Google Scholar]

PERMALINK

The Government Finance Database: A Common Resource for Quantitative Research in Public Financial Analysis

Kawika Pierson

Michael L Hand

Fred Thompson

Roles

Abstract

Introduction

Materials and Methods

Issues with Data Availability and Coding

Fig 1. Report Counts by Government Type.

Creating a Single, Coherent Database

Fig 2. Consolidation of the Census of Governments Data.

Categories of Financial Data

Revenue

Table 1. Revenue Categories.

Table 2. Tax Revenue Categories.

Table 3. Intergovernmental Revenue Functions.

Table 4. Current Charge Functions.

Table 5. Miscellaneous General Revenue Variables.

Table 6. Insurance Trust Revenue Variables.

Expenditure

Table 7. Expenditure Categories.

Table 8. Expenditure Function Codes.

Cash and Investment Positions

Table 9. Cash and Investment Security Variables.

Debt Positions

Discussion

Implications of Unbalanced Panel Data

Fig 3. The Relationship between Sample Size and Population for Counties.

Fig 4. The Relationship between Sample Size and Population for Municipalities.

Fig 5. Population Coverage for Sampled Local Governments.

Fig 6. Population Coverage for Sampled County Governments.

Table 10. The Impact of Requiring Consecutive Data on Sample Size.

Table 11. The Impact of Requiring Consecutive Data on Average Population.

Fig 7. Proportion of Governments with Consecutive Years of Data.

Results

Fig 8. Number of Tax Revenue Sources by Government Type.

Fig 9. Real per Capita Debt by Government Type.

Fig 10. Oregon Total Revenue and Total Expenditure.

Table 12. Special District Growth.

Conclusion

Supporting Information

Data Availability

Funding Statement

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases