Data Sets

For Assignments and Examples

You can find copies of data used for examples or assignments. To retrieve them go to the "page" containing the data and then from the file menu select "Save as..." Or, place the pointer over the name and click the left mouse button and select "Save link as."

Data for Fall 2000

Comparative Education Example
• A small set of socio-economic indicators for counties in Delaware, Maryland, and New Jersey.
• There are six (6) varaibles:
• 1. County idenfication (FIPS) number. The first two digits are state codes. (c1)
• 10 = Delaware
• 24 = Maryland
• 34 = New Jersey
• 2. Total population (c2)
• 3. Percent of county residents who graduated from high school. (c3)
• 4. Median family income (c4)
• 5. Number of residents living below poverty level (c5)
• 6 Number of femaled headed households (c6)
• Pennslyvania County Data
• A set of three variables that provides an opportunity to practice calculating simple statistics and drawing graphs.
• Variables
1. County identification (FIPS) code. Of no substantive value.
2. Percent of all persons in the county who are poor
3. Percent of children 5 to 17 who are poor
4. Median family income.
• For Class notes 6 and Assignment 3
• D.C. data not included.
• Right click the file name and select Save Link As...
• A single column of data for assignment 4.
• N = 3141
• Five year periods
• c1 Federal employees (1,000)
• c2 Population (1,000)
• c3 Year
• c4 Time period indicator
• 0 for years before 1945
• 1 for years 1945 and after
• c5 Per capita federal employees

Right to work data from Carneige Mellon University Stat Lab used in Class 15 Notes.

The variables are:
1. State abbreviation (c1)
2. Percent of state employees in unions (c2)
3. Presence or absence of state employee right to work law 0 = No 1 = Yes (c3)
4. Presence or absence of general right to work law 0 = No 1 = Yes (c4)
5. Percent of workforce in unions (c5)
• Note: you may have to delete the state abbreviations
• N = 506
• This is the full version from the "Stat Library." It is too large for the Student Version of MINITAB.
• You will have to cut out or edit the heading information at the top of the file. It takes time but can be done.
• This is an abbreivated version for use in Assignment 9.
• It has only four variables: per capita crime rate, percent of "older" housing, percent of population classified as "lower class," and median value of owner-occupied housing.
• Shows the importance of recognizing outliers and using plots.
• Data for 50 States plus District of Columbia (N = 51)
• Variables
1. c1 Number of surgical procedures
2. c2 Number of surgical specialists/100,000 population
3. c3 Number of general surgeons/100,000
4. c4 State Population in 1990

Various Cross-national Social and Economic Indicators

• From Statistica Methods, Agresti and Finlay.
• For Assignment 10 the 18th case is Nigeria.
• Sample of 13 California counties; 14 Alabama counties.
• Variables are:
• c1: California: infant mortality, 1990.
• c2: California: Crimes known to police,per 100,000 population, 1994.
• c3: California: Active physicians per 100,100 population
• c4: Alabama: infant mortality, 1990.
• c5: Alabama: Crimes known to police,per 100,000 population
• c6: Alabama: Active physicians per 100,100 population.

Posc/Uapp 816 "Religon and Politics Data"

• Three variables from the 1996 American National Election Study
• Full sample N = 1713
• Sample of sample N = 750
• Use this when using the Student version of MINITAB
• Variables
• column 1 (c1) "Respondent agrees/Disagrees: mothers with children should work."
• column 2 (c2) "How frequently does respondent pray?"
• column 3 (c3) "Respondent's view on whether the Bible is the word of God."
Age and Height Datafor Posc/Uapp 816 - Class 9
• (c1) Age in months
• (c2) Height in centimeters

Smoking and Cancer Data for Posc/Uapp 816Class 11 - A and Assignment 6.

• These data come from the Data Story Library
• The raw data are here
• Data are for the counties in Oregon, New Jersey, Maryland, and Colorado.
• c1: Percent of county voters for Perot in 1992.
• c2: Percent of county voters for Reagan in 1984.
• c3 State indicator:
• 1 New Jersey
• 3 Maryland
• 4 Oregon
• Data from Agresti and Finlay, Statistical Methods for Social Sciences
• Variables:
• c1 Crimes per 1,000 population.
• c2 Income in thousands of dollars.
• Percent of residants with at least a high school education.
• Percent living in urban areas.
• Column 1: Percent of eligible electorate voting.
• Column 2: Year
• Very small, incompete data set. used in Class 22
• Variables for approximately 47 representatives.
• c1: Percent of district classified "rural."
• c2: Vote on measure to repeal ban on assault weapons:
• 1 = "no": vote against repealing the ban.
• 0 = "yes": vote to repeal
• c3: Vote on measure to increase minimum wage.
• 1 = "yes": vote to increase
• 0 = "no": vote against increase
• Party affliation
• 1 = Democrat
• 0 = Republican
• c5: ADA score: rating by Americans for Democratic Action, a "liberal" organization.
• Scores range from 0 (most conservative) to 100 (most liberal)

Applied statistics main page