This study is provided by ICPSR.
ICPSR provides leadership and training in data access, curation, and methods of analysis
for a diverse and expanding social science research community.

Puerto Rico Census Project, 1920 (ICPSR 4344)

Principal Investigator(s):
Palloni, Alberto, University of Wisconsin-Madison;
Winsborough, Halliman H., University of Wisconsin-Madison;
Scarano, Francisco, University of Wisconsin-Madison

Summary:

The data comprising the Puerto Rico Census Project, 1920
contain individual and household records drawn from the 1920 Puerto
Rican Population Census. The data include variables containing basic
demographic information such as age, sex, race, marital status, number
of children born and surviving, family size, place of birth,
immigration status, county and neighborhood of residence, urban/rural
status, and citizenship. The data also describe language proficiency,
literacy, school attendance, and disabilities (blind or deaf) of the
individuals. Other variables provide data on occupation, industry,
ownership of residence, status of mortgage, and farm ownership. The
variables in this dataset fall into four classes:
original input variables, coded variables, constructed variables, and
quality flag variables. The original input variables contain the raw
data collected by the enumerators. The coded variables were recoded
by the University of Wisconsin Survey Center (UWSC)
as part of the Puerto Rico Census Project. Constructed variables were
produced by UWSC to capture additional relevant information. For
example, one constructed variable measures literacy by combining
separate variables recording whether the individual could read and
whether they could write. Finally, quality flag variables were
created by UWSC to indicate whether it could be logically deduced that
individual records had been hand edited by the Census Office.
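
As an illustration only, a constructed literacy variable of the kind described above could be derived roughly as in the following Python sketch. The function name, the True/False/None input representation, and the four category labels are assumptions made for this example and are not the UWSC coding scheme; consult the study codebook for the actual definition.

```python
from typing import Optional


def construct_literacy(can_read: Optional[bool],
                       can_write: Optional[bool]) -> Optional[str]:
    """Combine two yes/no items into one literacy category.

    Hypothetical coding: the category labels below are illustrative,
    not the values used in the ICPSR 4344 data files.
    """
    # If either source item is missing, leave the constructed value missing.
    if can_read is None or can_write is None:
        return None
    if can_read and can_write:
        return "reads and writes"
    if can_read:
        return "reads only"
    if can_write:
        return "writes only"
    return "neither reads nor writes"
```

Returning a missing value when either input is missing is one possible design choice; a different rule could be equally defensible depending on how the source items were recorded.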

Access Notes

Data in this collection are available only to users at ICPSR member institutions.

Dataset(s)

WARNING: This study is over 150MB in size and may take several minutes to download on a typical internet connection.

Universe:
Population (individuals and households) living in Puerto
Rico in 1920.

Data Type(s):
census/enumeration data

Data Collection Notes:

The following variables have been masked to
prevent any breach of ICPSR confidentiality standards: NAMELAST,
NAMEFRST, ENUMNAME, and STREET.

A number of the original input
variables were initially captured as character variables. In the
processing of this study, many of these were converted to numeric
variables. Each such variable contains the word "original" in its
variable label to distinguish it from the coded numeric
variables produced by the University of Wisconsin Survey Center.

Methodology

Extent of Processing: ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of
disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major
statistical software formats as well as standard codebooks to accompany the data. In addition to
these procedures, ICPSR performed the following processing steps for this data collection: