The Collaborative Psychiatric Epidemiology Surveys (CPES) were initiated in recognition of the need for contemporary, comprehensive epidemiological data regarding the distributions, correlates and risk factors of mental disorders among the general population with special emphasis on minority groups. The primary objective of the CPES was to collect data about the prevalence of mental disorders, impairments associated with these disorders, and their treatment patterns from representative samples of majority and minority adult populations in the United States. Secondary goals were to obtain information about language use and ethnic disparities, support systems, discrimination and assimilation, in order to examine whether and how closely various mental health disorders are linked to social and cultural issues. To this end, CPES joins together three nationally representative surveys: the NATIONAL COMORBIDITY SURVEY REPLICATION (NCS-R), the NATIONAL SURVEY OF AMERICAN LIFE (NSAL), and the NATIONAL LATINO AND ASIAN AMERICAN STUDY (NLAAS). These surveys collectively provide the first national data with sufficient power to investigate cultural and ethnic influences on mental disorders. In this manner, CPES permits analysts to approach analysis of the combined dataset as though it were a single, nationally representative survey. Each of the CPES surveys has been documented in a comprehensive and flexible manner that promotes cross-survey linking of key data and scientific constructs.

The Collaborative Psychiatric Epidemiology Surveys (CPES) were initiated in recognition of the need for contemporary, comprehensive epidemiological data regarding the distributions, correlates and risk factors of mental disorders among the general population with special emphasis on minority groups. The primary objective of the CPES was to collect data about the prevalence of mental disorders, impairments associated with these disorders, and their treatment patterns from representative samples of majority and minority adult populations in the United States. Secondary goals were to obtain information about language use and ethnic disparities, support systems, discrimination and assimilation, in order to examine whether and how closely various mental health disorders are linked to social and cultural issues. To this end, CPES joins together three nationally representative surveys: the NATIONAL COMORBIDITY SURVEY REPLICATION (NCS-R), the NATIONAL SURVEY OF AMERICAN LIFE (NSAL), and the NATIONAL LATINO AND ASIAN AMERICAN STUDY (NLAAS). These surveys collectively provide the first national data with sufficient power to investigate cultural and ethnic influences on mental disorders. In this manner, CPES permits analysts to approach analysis of the combined dataset as though it were a single, nationally representative survey. Each of the CPES surveys has been documented in a comprehensive and flexible manner that promotes cross-survey linking of key data and scientific constructs.

Guidelines for Applying for Restricted Data

Before you begin an application you will need the following information to complete the form

General Requirements:

appointment at research institution; appointment must be under the jurisdiction of the receiving institution

degree requirements (possibly doctorate)

Must be submitted:

project description

IRB approval

approved security plan

roster of research and IT staff who can access or view the data or computer where data are hosted.

confidentiality pledges for all people on roster

Some require:

CV's

The restricted versions of this data collection may not be used for any purpose other than statistical reporting and analysis. Use of the restricted versions to learn the identity of any person or establishment is prohibited. To protect respondent privacy, the names and IDs are restricted from general dissemination. To obtain this file, researchers must agree to the terms and conditions of a Restricted Data Use Agreement in accordance with existing ICPSR servicing policies.

Any public-use data files in this collection are available for access by the general public.
Access does not require affiliation with an ICPSR member institution.

Dataset(s)

WARNING: This study is over 150MB in size and may take several minutes to download on a typical internet connection.

Universe:
The CPES universe was defined by the union of the following survey populations for the three component surveys: The NCS-R universe included English-speaking adults aged 18 years and older residing in households located in the coterminous United States. The NSAL universe included adults in the three target groups: Black Americans of African descent, Black Americans of Caribbean descent, and White Americans, who were aged 18 years and older residing in households located in the coterminous United States. The NLAAS universe included Latino American, Asian American, and non-Latino, non-Asian White American adults aged 18 and older residing in households located in the coterminous United States and the state of Hawaii.

Data Type(s):
survey data

Data Collection Notes:

The master CPES datasets are updated periodically as various, typically small, errors are detected. Diagnostic algorithms are periodically updated as well. This updating will continue in the future as needed and public users will be informed of these updates. It is important for public users to recognize that, because of these changes, it will not be possible to reproduce results reported in earlier publications.

The NATIONAL SURVEY OF AMERICAN LIFE (NSAL) is supported by the National Institute of Mental Health (NIMH U01-MH57716), with supplemental support from the Office of Behavioral and Social Sciences Research (OBSSR) at the National Institute of Health (NIH), and the University of Michigan. Collaborating investigators include James S. Jackson (Principal Investigator, University of Michigan, Survey Research Center), Harold W. Neighbors (Co-Principal Investigator, University of Michigan, Research Center for Group Dynamics), David R. Williams (Co-Principal Investigator, University of Michigan, Survey Research Center), Robert J. Taylor (Co-Principal Investigator, University of Michigan, Research Center for Group Dynamics), Cleopatra H. Caldwell (Co-Investigator, University of Michigan, Research Center for Group Dynamics), Steven J. Trierweiler (Co-Investigator, University of Michigan, Research Center for Group Dynamics), and Randolph M. Nesse (Co-Investigator, University of Michigan, Research Center for Group Dynamics.

The NATIONAL LATINO AND ASIAN AMERICAN STUDY (NLAAS) is supported by the National Institute of Mental Health (NIMH, U01-MH062209, and U01-MH62207), with supplemental support from the Office of Behavioral and Social Sciences Research (OBSSR) at the National Institute of Health (NIH), the Substance Abuse and Mental Health Services Agency (SAMHSA), and the Latino Research Program Project (P01-MH059876). Collaborating investigators include Margarita Alegria (Principal Investigator, Center for Multicultural Mental Health Research at Cambridge Health Alliance), David Takeuchi (Principal Investigator, University of Washington), Sergio Aguilar-Gaxiola (Co-Investigator, California State University-Fresno), Glorisa Canino (Co-Investigator, University of Puerto Rico), Naihua Duan (Co-Investigator, University of California-Los Angeles), Javier Escobar (Co-Investigator, Robert Wood Johnson Medical School), Peter Guarnaccia (Co-Investigator, Rutgers University), Alexander Ortega (Co-Investigator, University of California-Los Angeles), Patrick Shrout (Co-Investigator, New York University), William Vega (Co-Investigator, Robert Wood Johnson Medical School), and Mildred Vera (Co-Investigator, University of Puerto Rico).

Methodology

Sample:
The sample for all three surveys consisted of primary sampling units selected with probabilities proportional to size. Data collection for the three surveys was conducted in a total of 252 geographic areas or primary sampling units across the United States. Only 50 of these areas were shared by all three surveys, those representing the most densely populated areas of the country. These latter areas were selected into all three surveys with certainty. In addition, there were 52 areas unique to NSAL and 18 areas unique to NLAAS. These unique areas were introduced to reflect the particular racial and ethnic focus of those surveys. A more detailed discussion of the CPES sample design and that for the individual surveys can be found in the User Guide.

Weight:
See the User Guide for the technical report outlining the method used for integrating the design-based analysis weights and variance estimation codes for the three individual surveys to permit analysts to approach analysis of the combined dataset as though it were a single, nationally-representative survey.

Response Rates:
For NCS-R, a sample of 13,054 addresses was fielded. In 98.1 percent (11,222) of occupied housing units (11,443), interviewers were able to determine whether or not the household was eligible for inclusion in the study. A total of 10,622 addresses yielded an eligible household and 9,282 adult interviews were completed: 7,693 interviews with the main respondent and 1,589 interviews with a second adult in the household. An additional 554 interviews were collected using a shortened form of the instrument with a subsample of nonrespondents in an effort to assess nonresponse bias. The final weighted response rate for NCS-R, excluding the short form interviews, was 70.9 percent for primary respondents. The response rate for the second respondents was 80.4 percent. For NSAL, 11,634 eligible households were identified from 26,495 randomly sampled addresses. A total of 6,199 adult respondents were interviewed as part of NSAL (1,006 White respondents, 1,623 respondents of Caribbean descent, and 3,570 African American respondents) although the actual number of cases in the CPES data file is 6,082 (3,570 African American, 1,621 Afro-Caribbean, and 891 non-Hispanic White). An extremely small sample (n = 115) of White adults who were interviewed in households where the White subsample was less than 10 percent of the African American density stratum were excluded from the final dataset as well as two of the Afro-Caribbean interviews when it was later discovered that they were duplicate cases. The overall response rate for the core NSAL national sample was 71.5 percent. The Caribbean Supplement sample, which was designed to target areas with high concentrations of persons of Caribbean origin, yielded a weighted response rate of 76.4 percent. For NLAAS, from a total sample of 27,026 addresses, 4,345 eligible main respondents and 1,234 eligible second adult respondents were identified, and 3,620 main respondent and 1,029 second adult interviews were completed. The weighted response rate for NLAAS was 75.7 percent among main respondents (77.6 percent for Latinos, 69.3 percent for Asians). For second respondents, the final response rate was 80.3 percent (82.4 percent for Latinos, 73.7 percent for Asians).

Extent of Processing: ICPSR data undergo a confidentiality review and are altered when necessary to limit the risk of
disclosure. ICPSR also routinely creates ready-to-go data files along with setups in the major
statistical software formats as well as standard codebooks to accompany the data. In addition to
these procedures, ICPSR performed the following processing steps for this data collection:

Performed consistency checks.

Standardized missing values.

Checked for undocumented or out-of-range codes.

Version(s)

Original ICPSR Release: 2007-06-14

Version History:

2016-03-23 The public-use crosswalk and internal records were updated.

2015-12-09 Excel files listing all variables included in the public-use and restricted-use data sets have been added to the collection.

2015-05-28 Additional documentation files have been added to the collection. These files include material from the CPES Web site that were previously unavailable.

2014-04-23 2014-04-17 Internal records were updated.

2014-04-08 Internal records were updated.

2014-03-31 Internal records were updated.

2014-03-21 Internal records were updated.

2009-01-07 This second version of the restricted CPES files includes additional variables (that were not included in the original restricted files). An Excel file that lists the variables included in each file is also provided with this version.

2008-06-19 A supplementary file for NSAL public-use file has been added to the study. This file contains 10 variables along with 3 ID variables for merging with the full NSAL dataset. An Errata file has been also added to the PDF codebook.

2008-05-19 Restricted Versions of the National Comorbidity Survey Replication (NCS-R), National Survey of American Life (NSAL), and National Latino and Asian American Study (NLAAS) have been added to the collection.

2008-04-07 The data producers have provided updated datasets and CPES Processor Notes. They have resolved discrepancies, updated algorithms, and identified additional variables that can be harmonized. They have created new versions of the CPES files that contain thousands of these types of changes, many at the cell level. Many of these changes represent errors in the original data, therefore previous versions of the CPES file are obsolete and should not be used. The data producers were careful not to change any variable names so that you can use previously-developed code to run against these corrected and updated files to determine if any of these changes impact your analysis. As part of additional harmonization efforts across NCS-R, NLAAS, and NSAL, more variables have been recoded and dropped. The Recoded Variables and Dropped Variables sections of the Processor Notes provide detail on dropped and recoded variables. PS_FLAG (V09435) was added for NCS-R and linked to PS_FLAG for NLAAS and NSAL. CPESCASE (CPES Case ID) was regenerated, and VERSION (Version release number) was added. In the dataset released in July 2007, the Pharmacoepidemiology variables were not completely linked across NCS-R, NLAAS, and NSAL. This has been corrected. These variables include: Med1-reason stop taking: 1st to 6th mention, Med2-reason stop taking: 1st to 4th mention, Med3-reason stop taking: 1st and 2nd mention, Med1-problems took medicine for: 1st to 10th mention, Med2-problems took medicine for: 1st to 9th mention, Med3-problems took medicine for: 1st to 4th mention. Note that for "Med1-reason stop taking: 1st mention" the NCS-R variable (V03031) has not been linked to NLAAS and NSAL (V03011), this was overlooked in the harmonization, but users may link the NCS-R variable to the NLAAS and NSAL variables.

2007-12-18 The data producer provided an updated version of the Processor Notes including a new table called: "Constructed Variable Names: Differences between CPES Interactive Documentation and Datasets."