UCI Source Code Data Sets

Welcome to the UCI Source Code Data Sets

This page is a repository of various data sets
we have curated in our research in large scale analysis of source code.
These data sets are available for other researchers and individuals to
use. Please refer to the terms of usage that come with each data set
for any restrictions in usage.

Questions, Issues and More Information

If you publish material based on data sets obtained from this repository, then,
in your acknowledgments, please note the assistance you received by using this
repository. This will help others to obtain the same data sets and replicate
your experiments.
We suggest the following pseudo-APA reference format for referring to this repository: