DESCRIPTION (provided by applicant): The Health and Retirement Study is one of the world's most important data resources for the study of aging. The basic longitudinal survey instrument has been supplemented with data from a variety of other sources includ
ing Social Security Administration records containing the detailed earnings history of the respondent. Under current HRS protocols, the use of the SSA data is restricted. Investigators must make special security arrangements to obtain these ?les, which the
y can not redistribute once they complete their analyses. These protocols are necessary to preserve the confidentiality of the underlying HRS and SSA micro data, which is essential to the continued willingness of respondents to participate in the study, bu
t they severely limit the usefulness of the SSA data. New statistical disclosure limitation methods have been developed that promise to provide much of the information in confidential micro data in a manner that permits much wider dissemination and use of
the protected data. This project is a Phase I feasibility study of applying these new methods, called synthetic data, to a subset of the variables in the SSA records that link to the general-use RAND-HRS data. The project has three main components: (1) por
t a general data synthesizer that was developed at the U.S. Census Bureau for use with SSA data linked to the Survey of Income and Program Participation for adaptation to the HRS/SSA link; (2) synthesize a few variables from the HRS/SSA link and test their
usefulness in statistical modeling; (3) perform studies of the statistical disclosure risk associated with linking synthetic SSA data to the RAND-HRS general-release ?le. If the confidentiality-protected data prove scientifically useful and if the statist
ical disclosure risk can be controlled, then Phase II of the research would synthesize the entire HRS/SSA data link. The project scientists will work with the HRS Data Release Protocol Committee and SSA to develop appropriate certifications of the statisti
cal disclosure avoidance provided by the methods. 1 7 Project Narrative Many users of the Health and Retirement Study general-release data ?les would benefit from some access to the restricted-release ?les without the requirement for special security arran
gements. Critical variables on the restricted- release data from the Social Security Administration earnings histories will be confidentiality-protected using powerful new methods that preserve privacy while allowing many important statistical analyses to
be performed combining the protected SSA and general-release HRS data. If the demonstration is successful, the methods will be extended to the entire HRS/SSA linked data in a future project.