Dana-Farber Repository for Machine Learning in Immunology

The Dana-Farber Repository for Machine Learning in Immunology (DFRMLI) bridges the gap between the immunology and computer science/machine learning communities by providing preprocessed and scaled immunological data sets suitable for machine learning applications. The data sets include major publicly available collections (from IEDB, CBS, and our group) as well as carefully selected independent validation data sets. The repository also includes recommendations for scaling and for comparing the performance of prediction systems. Some of the data sets in the DFRMLI were used in an earlier machine learning competition.