Internet TAXSIM

Version 3.(DEPRECATED)

TAXSIM is the NBER's FORTRAN program for calculating liabilities under US Federal and State income tax laws from individual data. Scholars interested in using TAXSIM to calculate tax liabilities and marginal tax rates should follow the these instructions for submitting data. The program itself is not released.

Version 3.0Beta introduces:

File to upload:
Column Labels: Yes No
Summary: Yes No
Marginal Rates with respect to: Other Income Wage Income Not Calculated

Instructions for preparing data for TAXSIM.

Users are asked to prepare a file of cases, each line representing a potential tax return. The file may have any number of cases, and we return a file with tax liability, etc for each case. In order to upload a file, you must use Netscape 3.0+, MSIE 4.0+, or another browser that supports file uploading. When the data file is ready enter the filename in the box and select "calculate".

Here are the required fields for each taxpayer:
(You may use the fields below to enter information about a single taxpayer).

1. Case ID (must be numeric)
2. Tax year (4 digits between 1960 and 1999, but state must be zero if year is before 1977 or after 1998. Those are the only state tax laws we have coded. 1999 is coming soon.
3. State (SOI codes. See list below,and also item two above.).
4. Marital Status (see list below)
5. Dependent Exemptions (number of children)
6. Age exemptions (number of taxpayers over 65 years of age)
7. Wage and salary income of Taxpayer
8. Wage and salary income of Spouse
9. Dividend income
10. Other property income, including interest, Self-employment, may be negative. Also alimony, fellowships and other taxable income.
11. Pensions
12. Gross Social Security Income
13. Other non-taxable transfer Income such as welfare, municipal bond interest, child support that would affect eligibility for state property tax rebates.
14. Rent Paid (used for state property tax rebates)
15. Property taxes paid
16. Itemized deductions except state income tax and local property tax. If you have medical expenses remember to only include the excess over 3%,5% or 7,5% of AGI (for years 60-82, 92-86 and 87+)
17. Child care expenses
18. Unemployment compensation (New Variable in Version 2.)

Marital Status:
  1. single
  2. joint
  3. head of household (single parent)
State: 0.state unknown 1.alabama 2.alaska 3.arizona 4.arkansas 5.california 6.colorado 7.conn 8.delaware 9.dc 10.florida 11.georgia 12.hawaii 13.idaho 14.illinois 15.indiana 16.iowa 17.kansas 18.kentucky 19.louisiana 20.maine 21.maryland 22.mass 23.michigan 24.minnesota 25.mississippi 26.missouri 27.montana 28.nebraska 29.nevada 30.nh 31.nj 32.nm 33.ny 34.nc 35.nd 36.ohio 37.oklahoma 38.oregon 39.penn 40.ri 41.sc 42.sd 43.tenn. 44.texas 45.utah 46.vermont 47.virginia 48.washington 49.wv 50.wisconsin 51.wyoming 52.other

A crosswalk of Census state codes and SOI codes is available, but you have to do the conversion yourself. If you don't want state tax calculations, just set the state id to zero.

This is a simplified list of input variables selected to encompass what might be available in a non-tax oriented survey such as the CPS, CEX, PSID, etc. The items are rich enough to cover the ordinary and supertax brackets, the earned income credit, the child care credit, the partial taxation of Social Security, and UI, the secondary earner deduction, the maximum tax on earned income and other important features of the tax code. At the state level, low income allowances, property and rent credits are calculated if data is available. If you don't know the wage split between husbands and wives, you can put it all in one category. The computer won't get the perfect answer, but only you can judge the suitability of the simplification. Obviously ignored are the capital gains deduction, limits, floors and ceilings on deductions, adjustments, tax preferences, minimum taxes, separate filing etc. These are available in the full TAXSIM model used internally at the NBER, but have been left out of Internet TAXSIM for the sake of simplicity.

Here is a sample entry for a single parent of one child in Idaho with only labor income, and $2000 in child care expenses. Notice that no missing value indicator is used - zero amounts are coded as zero.:

12345 1989 13 3 1 30000 0 0 0 0 0 0 0 0 0 2000 0 0

Sample input and output files are are available to allow users to make a quick test of the functioning of the server. To run the test right click on the sample file link to download the 14 case test file to your PC. Then submit it back to our server using the file selection box above. Compare the values returned with the cannonical output file stored on the server. If they are the same, the server has passed a simple test.

The file may include any number of such lines. While the format is flexible, and values may be of variable size, each value must be separated from the next by one or more spaces or a single comma. If one line is short of values, the next will be read to make up the set of 18, but a new case always begins on a new line. We cannot read .prn files, or any file that uses commas to mark thousands. Spreadsheet files of any kind can not be read. Before uploading your file please check for the following common errors:

The tax calculator stops on any error and returns a range error with the case ID. Some users are concerned about imputations. Internet TAXSIM only does one. Social security taxes on wages and business income are imputed for the states that have a deduction for that tax. Anything not listed above becomes a zero on the Form 1040. Zero is always a valid entry, although it may be inaccurate. Formerly we did an imputation for rent when property taxes were less than one percent of income. This has been dropped.

TAXSIM will calculate the federal and state tax liability, and also the marginal tax rate calculated on an additional $1000 in wage or non-wage income. Since the federal and state tax liabilities may depend upon each other, the calculations are iterated, with the calculated state tax entered on the federal form as a (potential) deduction and the federal tax similarly placed in the state calculation. (Yes, we know that in law it is only lagged taxes that matter, but that is less interesting for our purposes). Refundable credits appear as negative liabilities.

Notches

In many cases the marginal rate of tax taken from schedule X, Y or Z of the form 1040 correctly characterizes the tax wedge. But for many taxpayers, their effective marginal rate is affected by a phase-out or other special provision of the tax code.

These effective marginal rates are important to us, because one of the most important uses TAXSIM can be put to is calculating correct after-tax prices in models of economic behavior. A problem has been the presence of decreasing marginal tax rates (notches) and otherwise implausible marginal rates in these calculations. We maintain a list of significant notches that we have noticed. Although notches generally involved very small sums of money, they could have disproportionate effects on the results of a linear regression, which is sensitive to the squared difference of any observation from the mean.

Some of the anomalies arose from defects in our FORTRAN code, these have been corrected where found. Some arose from actual notches in the tax laws. For example, in the federal law the child care credit is an integer percent of child care expenses, depending on AGI. In this case we have used a smooth interpolation between adjacent percentages even though the tax code prescribes a step function. Especially at the state level some credits are just a fixed amount for taxpayers with income less than X. In that case the range for an smoothing isn't obvious and we have not done any. So some notches will remain. We have reduced the maximum magnitude of the notches by calculating the marginal rate over a $1000 range instead of the $100 range used in Version 2 of Internet TAXSIM.

We have also set limits on the range of marginal rates the state tax calculator will report. If any calculated rate exceeds 25 percent (or 40 percent for taxpayers with negative liability generated by refundable credits) we recalculate the marginal rate using a $1000 decrease in business income. Since there probably is no notch on the decrease side (and business income is less subject to notches than wage income) this alternative calculation generally returns a more routine estimate of the marginal rate. Because of the biased selection of taxpayers to receive this treatment, the procedure adds a tiny bias to an estimate of the average marginal tax rate, but it may make for more accurate estimates in regressions using the marginal tax rates as regressors. In a run with the 1995 Current Population Survey as the data source, 21 taxpayers out of 51,000 were truncated in this fashion.

We are very interested in other solutions to this problem, and suggestions are welcome.

The returned eight columns of results include:

  1. Case ID
  2. Federal tax liability
  3. State tax liability
  4. Effective federal marginal rate
  5. Effective state marginal rate
  6. Federal bracket rate (including phase-out of 15% bracket)
  7. State bracket rate
  8. Itemization status (1 for itemizer).

Bug Reporting

If you suspect any case has been calculated incorrectly, please extract that case (or a exemplar, if there are many cases) and email it to me with a statement of what you think is wrong. I will get back to you within a couple of days with an explanation or a fix. It is important that you not send large files or attachments. Just send an email with a few cases, the TAXSIM response and the reason you believe the response to be in error.

We do answer all mail, and we do follow up on queries, but it is important to abide by the guidelines in the previous paragraph. An email message with the statement that "state x is broken for feature y" will be returned with a request for an example in Internet TAXSIM format as described above. Once we get that, we will compare our results with hand generated results done from a tax form and any problem will soon be resolved. Comments from users have led to significant improvements in the program.

Some users have expressed concern about our ability to process large files, but files up to 100,000 cases are not a problem. Larger files should probably be broken up and run in the evening. Anytime you are getting timed out, please wait several hours and try again, as the time-outs are a sign that something is overloaded.

Confidentiality

Users of the AHEAD and HRS are reminded that they are not permitted to send state identifiers to us. If such users need state tax calculations they should send each record 51 times, once for each possible state, and select the correct record from among the returned records. Some problem records are retained on our server.

For more information about TAXSIM see:

Feenberg, Daniel Richard, and Elizabeth Coutts, An Introduction to the TAXSIM Model, Journal of Policy Analysis and Management vol 12 no 1, Winter 1993, pages 189-194.

This paper and this URL (http://www.nber.org/taxsim) should be cited if any results from TAXSIM are circulated or published. If you intend to make any serious scholarly (or other) use of Internet TAXSIM you should speak with me about your project on the telephone. I am at 617-588-0343 and can offer advice etc.

Daniel Feenberg
NBER
1050 Mass Ave.
Cambridge MA 02138
617-588-0343
feenberg@nber.org


Date last modified: June 2nd, 2000