Research

Archive Toolbar

Project 90 (Partial Data)

Project 90 was a prospective study of the influence of network structure
on the dynamics of HIV transmission in a community of high-risk heterosexuals.
The data was collected between 1988 and 1992 in Colorado Springs, CO, and
the project was funded by the Centers for Disease Control and Prevention (CDC).
For more details of the Project 90 study, please see the references below.

Stephen Muth and John Potterat kindly provided the data to Sharad Goel
and Matthew Salganik in 2007, and it was later used in their paper,
S. Goel and M. J. Salganik (2010) "Assessing respondent-driven sampling"
Proceedings of the National Academy of Sciences (PNAS).
The release of these data allows others to replicate the analyses of
Goel and Salganik.

Data Release

Included in this release are two tab-separated files, edges.tsv
and nodes.tsv, that describe the structure of the Project 90 network
and the individual-level attributes of study participants.

edges.tsv:
Each row indicates an edge in the network, specified by a pair
of node ids. Edges represent social, sexual, and/or drug affiliation.
Each edge is recorded twice. In other words, if there is an
edge between 12 and 15, there is also an edge between 15 and 12.
There are 43,288 edges in the file.

nodes.tsv:
Each row corresponds to a study participant. In addition to node id,
the following attributes are listed for each individual: