2012-13 Program on Statistical and Computational Methodology for Massive Datasets

This year-long SAMSI program focused on fundamental methodological questions of statistics, mathematics and computer science posed by massive datasets, with applications to astronomy, high energy physics, and the environment.

Serious challenges posed by massive datasets have to do with "scalability" and "data streaming". Techniques developed for small or moderate-sized datasets simply do not translate to modern massive data sets. Data acquisition rates on the order of gigabytes per second necessitate innovative approaches towards computing environments, analysis, and algorithms.

Research Working Groups

Working groups are at the very heart of the scientific activities at SAMSI. They consist of SAMSI visitors, postdoctoral fellows, graduate students, local faculty, and other scientists. The working groups met every week throughout the program year, to pursue the following research topics that were identified at the Planning Workshop and at the Opening Workshop, or subsequently chosen by the working group participants: