The ENVIREM dataset v1.0 is a set of 16 climatic and 2 topographic variables that can be used in modeling species' distributions. The strengths of this dataset include their close ties to ecological processes, and their availability at a global scale, at several spatial resolutions, and for several time periods. The underlying temperature and precipitation data that went into their construction comes from the WorldClim dataset ( www.worldclim.org), and the solar radiation data comes from the Consortium for Spatial Information ( www.cgiar-csi.org). The data are compatible with and expand the set of variables from WorldClim v1.4 ( www.worldclim.org).
For more information, please visit the project website: envirem.github.io

We provide the parameters used in Umbrella Sampling simulations reported in our study "Efficient Estimation of Binding Free Energies between Peptides and an MHC Class II Molecule Using Coarse-Grained Molecular Dynamics Simulations with a Weighted Histogram Analysis Method", namely the set positions and spring constants for each window in simulations. Two tables are provided. Table 1 lists the names of the peptides and their corresponding sequences. Table 2 lists the parameters. The abstract of our work is the following:
We estimate the binding free energy between peptides and an MHC class II molecule using molecular dynamics (MD) simulations with Weighted Histogram Analysis Method (WHAM). We show that, owing to its more thorough sampling in the available computational time, the binding free energy obtained by pulling the whole peptide using a coarse-grained (CG) force field (MARTINI) is less prone to significant error induced by biased-sampling than using an atomistic force field (AMBER). We further demonstrate that using CG MD to pull 3-4 residue peptide segments while leaving the remain-ing peptide segments in the binding groove and adding up the binding free energies of all peptide segments gives robust binding free energy estimations, which are in good agreement with the experimentally measured binding affinities for the peptide sequences studied. Our approach thus provides a promising and computationally efficient way to rapidly and relia-bly estimate the binding free energy between an arbitrary peptide and an MHC class II molecule.

The information and education environment refers to: 1) the presence of information infrastructures such as broadband Internet access and public libraries in a location; 2) a person’s proximity to information infrastructures and sources; 3) the distribution of information infrastructures, sources and in a specific location; and 4) exposure to specific messages (information content) within a specific location.
Coverage for all data: 10-county Detroit-Warren-Ann Arbor Combined Statistical Area.

Details of the microphone used for data collection, acoustic environment in which data was collected, and naming convention used are provided here.
1 - Microphones Used:
The microphones used to collect this dataset belong to 7 different trademarks. Table (1) illustrates the number of used Mics of different trademarks and models.
Table 1: Trademarks and models of Mics
Mic Trademark Mic Model # of Mics
Shure SM-58 3
Electro-Voice RE-20 2
Sennheiser MD-421 3
AKG C 451 2
AKG C 3000 B 2
Neumann KM184 2
Coles 4038 2
The t.bone MB88U 6
Total 22
2- Environment Description:
A brief description of the 6 environments in which the dataset was collected is presented here:
(i) Soundproof room: a small room (nearly 1.5m × 1.5m × 2m), which is closed and completely isolated. With an exception of a small window in the front side of the room which is made of glass, all the walls of the room are made of wood and covered by a layer of sponge from the inner side, and the floor is covered by carpet.
(ii) Class room: standard class room (6m × 5m × 3m).
(iii) Lab: small lab (4m × 4m × 3m). All the walls are made of glasses and the floor is covered by carpet. The lab contains 9 computers.
(iv) Stairs: is in the second floor. The place of recording is 3m × 5m
(v) Parking: is the college parking.
(vi) Garden: is an open space outside the buildings.
3- Naming Convention:
This set of rules were followed as a naming convention to give each file in the dataset a unique name:
(i) The file name is 19 characters long, and consists of 5 sections separated by underscores.
(ii) The first section is of 3 characters indicates the Microphone trademark.
(iii) The second section of 4 characters indicates the microphone model as in table (2).
(iv) The third section of 2 characters indicates a specific microphone within a set of microphones of the same trademark and model, since we have more than one microphone of the same trademark and model.
(v) The fourth section of 2 characters indicates the environment, where
Soundproof room --> 01
Class room --> 02
Lab --> 03
Stairs --> 04
Parking --> 05
Garden --> 06
(vi) The fifth section of 2 characters indicates the language, where
Arabic --> 01
English --> 02
Chinese --> 03
Indonesian --> 04
(vii) The sixth section of 2 characters indicates the speaker.
Table 2: Microphones Naming Criteria
Original Mic Trademark and model --> Naming Convenient
Shure SM-58 --> SHU_0058
Electro-Voice RE-20 --> ELE_0020
Sennheiser MD-421 --> SEN_0421
AKG C 451 --> AKG_0451
AKG C 3000 B --> AKG_3000
Neumann KM184 --> NEU_0184
Coles 4038 --> COL_4038
The t.bone MB88U --> TBO_0088
For example: SEN_0421_02_01_02_03 is an English file recorded by speaker number 3 in the soundproof room using microphone number 2 of Sennheiser MD-421

This dataset contains three data files used in: Blesh, J. 2017. Functional traits in cover crop mixtures: biological nitrogen fixation and multifunctionality. Journal of Applied Ecology. There are also three corresponding metadata files.
The file “Ecosystem_functions_soil_species.csv” contains data organized by farm, treatment, replicate block, and species combining the fall and spring sampling time points. These data include aboveground biomass, nitrogen and carbon content, and biological nitrogen fixation for the plant species. The dataset also includes measured soil characteristics for each farm site.
The file “Ecosystem_functions_soil_treatment.csv” contains data organized by farm, treatment, and replicate block for the fall and spring sampling time points combined. These data include aboveground biomass, nitrogen and carbon content, and biological nitrogen fixation aggregated by treatment. The dataset also includes measured soil characteristics for each farm site.
The file “Traits_unstandardized.csv” contains individual plant trait data, a subset of which were used to calculate an index of functional diversity after they were standardized to have zero mean and unit variance. These data are organized by farm, treatment, replicate block, and species.
The corresponding metadata files: “Ecosystem_functions_soil_species_metadata.csv”, “Ecosystem_functions_soil_treatment_metadata.csv”, and “Traits_unstandardized_metadata.csv” provide a detailed description of all variables in each dataset and any abbreviations used.
Note: On Dec 19th 2017, the format of the files was changed to csv to aid preservation. The following information was added to the three metadata files: the name of the data file the metadata refers to, an explanation as to the meaning of blank cells in the data file, a full citation to the paper where the author describes her findings and contact information for the author.

This dataset was compiled as an attempt to understand how natural resource managers and research ecologists in the Great Lakes region integrate the ecosystem services (ES) paradigm into their work. The following text is the adapted abstract from a thesis associated with this data.
Ecosystem services, or the benefits people obtain from ecosystems, have gained much momentum in natural resource management in recent decades as a relatively comprehensive approach to provide quantitative tools for improving decision-making and policy design. However, to date we know little about whether and how natural resource practitioners, from natural resource managers to research ecologists (hereafter managers and ecologists respectively), have adopted the ES paradigm into their respective work. Here, we addressed this knowledge gap by asking managers and ecologists about whether and how they have adopted the ES paradigm into their respective work.
First, we surveyed federal, state, provincial and tribal managers in the Great Lakes region about their perception and use of ES as well as the relevance of specific services to their work. Although results indicate that fewer than 31% of the managers said they currently consider economic values of ES, 79% of managers said they would use economic information on ES if they had access to it. Additionally, managers reported that ES-related information was generally inadequate for their resource management needs. We also assessed managers by dividing them into identifiable groups (e.g. managers working in different types of government agencies or administrative levels) to evaluate differential ES integration. Overall, results suggest a desire among managers to transition from considering ES concepts in their management practices to quantifying economic metrics, indicating a need for practical and accessible valuation techniques.
Due to a sample of opportunity at the USGS Great Lakes Science Center (GLSC), we also evaluated GLSC research ecologists’ integration of the ES paradigm because they play an important role by contributing requisite ecological knowledge for ES models. Managers and ecologists almost unanimously agreed that it was appropriate to consider ES in resource management and also showed convergence on the high priority ES. However, ecologists appeared to overestimate the adequacy of ES-related information they provide as managers reported the information was inadequate for their needs. This divergence may reflect an underrepresentation of ecological economists in this system who can aid in translating ecological models into estimates of human well-being.
As a note, the dataset for the research ecologists has had some data removed as it could be considered personally identifiable information due to the small sample size in that population. The surveys associated with both datasets have also been included in PDF format.
Curation Notes: Three files were added to the data set on Dec 21, 2017. Two csv files: "Ecosystem services and Research Ecologists - Data Index.csv" and "Ecosystem services and Research Managers - Data Index.csv" and one text file: "Ecosystem Services Adoption Readme.txt". The file names of the original four files were altered to replace an ampersand with the word "and".

This dataset contains three data files used in: King, A.E. and J. Blesh, 2017. Crop rotations for increased soil carbon: perenniality as a guiding principle. Ecological Applications. There are also three corresponding metadata files.
The file “CRMA 2017 Main.csv” contains data for the control and treatment rotations used to construct pairwise comparisons for meta-analysis, response ratios calculated for soil organic carbon concentration, and change in carbon input. The dataset also includes management, soil, and other environmental characteristics for each site.
The file “CRMA 2017 Diversity x Nitrogen.csv” contains data used to test whether N fertilizer inputs mediated the effect of functional diversity on SOC concentrations.
The file “CRMA Annual grain.csv” contains data used to test for effects of crop rotation species diversity (one vs. two species, or two vs. three species) on SOC concentrations and C input (i.e., for the “grain-only” rotations). The dataset also includes management, soil, and other environmental characteristics for each site.
The corresponding metadata files: “CRMA 2017 Main_metadata.csv”, “CRMA 2017 Diversity x Nitrogen_metadata.csv”, and “CRMA Annual grain _metadata.csv” provide a detailed description of all variables in each dataset.
Note: On Jan 12, 2018 the following information was added to the three metadata files: the name of the data file the metadata refers to, an explanation as to the meaning of blank cells in the data file, a full citation to the paper where the author describes her findings and contact information for the author.

Greenhouse gas (GHG) additions to Earth’s atmosphere initially reduce global outgoing longwave radiation (OLR), thereby warming the planet. In select environments with temperature inversions, however, increased GHG concentrations can actually increase local OLR. Negative top-of-atmosphere and effective radiative forcing (ERF) from this situation give the impression that local surface temperatures could cool in response to GHG increases. Here we consider an extreme scenario in which GHG concentrations are increased only within the warmest layers of winter near-surface inversions of the Arctic and Antarctic. We find, using a fully coupled Earth system model, that the underlying surface warms despite the GHG addition exerting negative ERF and cooling the troposphere in the vicinity of the GHG increase. This unique radiative forcing and thermal response is facilitated by the high stability of the polar winter atmosphere, which inhibits thermal mixing and amplifies the impact of surface radiative forcing on surface temperature. These findings also suggest that strategies to exploit negative ERF via injections of short-lived GHGs into inversion layers would likely be unsuccessful in cooling the planetary surface. and Note: A revised data description file was added to this work on April 11, 2018 containing additional information about the data set than was provided in the original description. Additional keywords and a full citation to the related article were added as well.

Note: The "Readme_Metadata" file was updated on March 15, 2018 to include a citation to the related article making use of this data and was reformatted to be presented as a pdf file rather than as a docx file. and This data set is comprised of synchrotron-based X-ray transmission and absorption spectroscopy data as well as X-ray diffraction patterns that were performed to characterize the best-preserved examples of nanoscale iron silicate mineral inclusions from 2.5 billion-year-old Banded Iron Formations (BIFs) and ferruginous cherts.

The global magnetosphere-ionosphere-thermosphere (M-I-T) system is intrinsically coupled and susceptible to external drivers such as solar wind dynamic pressure enhancements. In order to understand the large-scale dynamic processes in the M-I-T system due to the compression from the solar wind, the 17 March 2015 sudden commencement was studied in detail using global numerical models. This data set is comprised of the simulation data
generated from these models. and NOTE: The following changes were made to this dataset on March 28, 2018. First, two mp4 files were added. Second, the symbol representing "degree" was not rendering properly in the README file. The symbols were removed and replaced with the word "degree". Third, the metadata in the "methodology" and "description" fields were revised for content and clarity. On April 16, 2018 a citation to the corresponding article was added to the metadata record.

Detrital zircon age distributions provide robust insights into past sedimentary systems,
but these age distributions are often complex and multi-peaked, with sample sizes too
small to confidently resolve population distributions. This limited sampling hinders
existing quantitative methods for comparing detrital zircon age distributions, which
show systematic dependence on the sizes of compared samples. The proliferation of
detrital zircon studies motivates the development of more robust quantitative methods.
We present the first attempt, to our knowledge, to infer probability model ensembles
(PMEs) for samples of detrital zircon ages using a Bayesian method. Our method infers
the parent population age distribution from which a sample is drawn, using a Monte
Carlo approach to aggregate a representative set of probability models that is consistent
with the constraints that the sample data provide.
Using the PMEs inferred from sample data, we develop a new estimate of correspondence
between detrital zircon populations called Bayesian Population Correlation
(BPC). Tests of BPC on synthetic and real detrital zircon age data show that it is nearly
independent from sample size bias, unlike existing correspondence metrics. Robust
BPC uncertainties can be readily estimated, enhancing interpretive value.
When comparing two partially overlapping zircon age populations where the shared
proportion of each population is independently varied, BPC results conform almost perfectly
to expected values derived analytically from probability theory. This conformity
of experimental and analytical results permits direct inference of the shared proportions
of two detrital zircon age populations from BPC. We provide MATLAB scripts to facilitate the procedures we describe.

ABSTRACT: We present a statistical study of interchange injections in Saturn's inner and middle magnetosphere focusing on the dependence of occurrence rate and properties on radial distance, partial pressure, and local time distribution. Events are evaluated from over the entirety of the Cassini mission’s equatorial orbits between 2005 and 2016. We identified interchange events from CHarge Energy Mass Spectrometer (CHEMS) H+ data using a trained and tested automated algorithm, which has been compared with manual event identification for optimization. We provide estimates of interchange based on intensity, which we use to investigate current inconsistencies in local time occurrence rates. This represents the first automated detection method of interchange, estimation of injection event intensity, and comparison between interchange injection survey results. We find the peak rates of interchange occur between 7 - 9 Saturn radii and that this range coincides with the most intense events as defined by H+ partial particle pressure. We determine that nightside occurrence dominates as compared to the dayside injection rate, supporting the hypothesis of an inversely dependent instability growth rate on local Pedersen ionospheric conductivity. Additionally, we observe a slight preference for intense events on the dawn side, supporting a triggering mechanism related to large-scale injections from downtail reconnection. Our observed local time dependence paints a dynamic picture of interchange triggering due to both the large-scale injection driven process and ionospheric conductivity. Within this repository we provide a readme file (description of data file and usage) and the event list provided as a .txt file. The event list includes start and stop times, comparison to previous surveys, and the average location of events identified. Additional formats are available on request. , Further details on this method can be found in "Interchange Injections at Saturn: Statistical Survey of Energetic H+ Sudden Flux Intensifications" by Azari et al., 2018 in the Journal of Geophysical Research: Space Physics. https://doi.org/10.1029/2018JA025391. , and Curation notes: The files 'events_CORRECTED.txt' and 'readme_UPDATED.rtf' were added to this record July 31 and August 7, 2018 and supersede the previous data file 'events.txt' and readme file 'readme.rtf'. The file 'events.txt' uploaded May 9, 2018 should not be used, as this file contains a duplication error where the last column, InAllSurveys, is incorrectly presented as a duplicate of the column InAnySurvey. 'readme_UPDATED.rtf' contains additional context and updated references to the new data file.

Wind exposure is a key physical driver of coastal systems in aquatic environments influencing circulation and wave dynamics. A measure of wind exposure is fetch, the distance over which wind can travel across open water. In large lake systems, such as the Laurentian Great Lakes, estimating fetch has proved to be difficult due to their vast size and complex topobathymetry. Here we describe the development of two spatially discrete indicators of exposure to provide a more accurate indicator of influence of wind exposure in the nearshore of the Laurentian Great Lakes. We summarized wind data from offshore buoys and leveraged existing tools to calculate effective fetch and relative exposure index (effective fetch scaled by mean wind speed) at a 30 m grid cell resolution. We validated these models by comparing our exposure maps to the U.S. Army Corps of Engineers Wave Information Studies models and found general agreement. These exposure maps are available for public download for the years 2004-2014.

The nucleus accumbens (NAc) contains multiple subpopulations of medium spiny neurons (MSNs): one subpopulation expresses D1-type dopamine receptors, another expresses D2-type receptors, and a third expresses both. The relative roles in NAc of D1 neurons versus D2 neurons in appetitive motivation were assessed here. Specifically, we asked whether D1-Cre mice or D2-Cre mice would instrumentally seek optogenetic self-stimulation of those respective subpopulations in NAc, or instead avoid NAc laser stimulation. and Some statistical forms have been exported from SPSS for the purposes of accessibility to viewers. Please see the "readme" text for descriptions of each individual excel file.

This data is part of a large program to translate detection and interpretation of HFOs into clinical use. A zip file is included which contains hfo detections, metadata, and Matlab scripts. The matlab scripts analyze this input data and produce figures as in the referenced paper (note: the blind source separation method is stochastic, and so the figures may not be exactly the same). A file "README.txt" provides more detail about each individual file within the zip file.

Biomass burning produces smoke aerosols that are emitted into the atmosphere. Some smoke constituents, notably black carbon (BC), are highly effective light-absorbing aerosols (LAA). Emitted LAA can be transported to high albedo regions like the Greenland Ice Sheet (GrIS) and affect local snowmelt. In the summer, the effects of LAA in Greenland are uncertain. To explore how LAA affect GrIS snowmelt and surface energy flux in the summer, we conduct idealized global climate model simulations with perturbed aerosol amounts and properties in the GrIS snow and overlying atmosphere. The in-snow and atmospheric aerosol burdens we select range from background values measured on the GrIS to unrealistically high values. This helps us explore the linearity of snowmelt response and to achieve high signal-to-noise ratios. With LAA operating only in the atmosphere, we find no significant change in snowmelt due to the competing effects of surface dimming and tropospheric warming. Regardless of atmospheric LAA presence, in-snow BC-equivalent mixing ratios greater than ~60 ng/g produce statistically significant snowmelt increases over much of the GrIS. We find that net surface energy flux changes correspond well to snowmelt changes for all cases. The dominant component of surface energy flux change is solar energy flux, but sensible and longwave energy fluxes respond to temperature changes. Atmospheric LAA dampen the magnitude of solar radiation absorbed by in-snow LAA when both varieties are simulated. In general, the significant melt and surface energy flux changes we simulate occur with LAA quantities that have never been recorded in Greenland.

Raw SNP genotypes are provided in STRUCTURE format, with a maximum of one SNP reported per ddRAD locus. The files "caryco_SNP.str" and "caryov_SNP.str" are genotypes for Carya cordiformis and Carya ovata, respectively. The first column of each file is the individual name, the second column is the population (see original publication for information on population locations), and the remaining columns are genotypes of individual SNPs. Rows represent individuals, with the diploid genotypes contained on two lines per individual. Missing data are entered as "0" (zero). The first row is a header with a unique identifier for each SNP. and Occurrence records for each species are provided in the file "occs_carya.csv" and contain the latitude and longitude of each record.

Investigating minimum human reaction times is often confounded by the motivation, training, and state of arousal of the subjects. We used the reaction times of athletes competing in the shorter sprint events in the Athletics competitions in recent Olympics (2004-2016) to determine minimum human reaction times because there's little question as to their motivation, training, or state of arousal.
The reaction times of sprinters however are only available on the IAAF web page for each individual heat, in each event, at each Olympic. Therefore we compiled all these data into two separate excel sheets which can be used for further analyses.

This dataset contains all data used to generate the figures in The Cryosphere manuscript “Measuring Snow Specific Surface Area with 1.30 and 1.55 micro-meter Bidirectional Reflectance Factors,” by Adam Schneider, Mark Flanner, and Roger De Roo. These data support the theory, calibration, and application of the Near-Infrared Emitting and Reflectance Monitoring Dome (NERD), an instrument engineered to rapidly retrieve surface snow specific surface area in the field. Note that this deposit includes a microCT scan database for natural snowfall samples collected in New Hampshire during 2015-2017, comprised of raw tiff files as well as reconstructions, binarized reconstructions, and some 3D model reconstructions. and Running python scripts generally require that the following packages are installed: NumPy, SciPy, Matplotlib, Pandas, and ipdb (for debugging).

This work contains the experimental data and associated analysis that are described in the research publication entitled "Ultra-specific and Amplification-free Quantification of Mutant DNA by Single-molecule Kinetic Fingerprinting". This work contains multiple zip files, each of which represents one of the principal experiment groups presented in the publication. Each experiment group contains movie and analysis files corresponding to various experimental conditions related to that experiment group.

Percent Weight Change Data:
The model was run continuously on a daily time step for seasonal intervals (Spring: March thru May; Summer: June thru August; Fall: September thru November) as well as contiguously from Spring to Fall to assess total growth over the likely growing season (March thru November). CSV files represent the simulated weight change (%) of Bighead and Silver Carp for the respective time periods associated with the file name. Initial fish mass for each seasonal interval and growing season was 4350 g for Silver Carp and 5480 g for Bighead Carp. Maximum and mean total weight change (%) was determined for three depth ranges (near surface depths [NS]: 0 – 10 m; deep chlorophyll layer depths [DCL]: 10 - 50 m; and whole water column [WC]). Coordinates are in decimal degrees.
File naming convention: speciesSeasonWtChange (e.g. bigheadFallWtChange = % weight change of Bighead Carp from September through November)
,
Monthly Habitat Quality Data:
Rdata files contain matrices of Bighead or Silver carp growth rate potential as represented as a mass-proportional growth rate (gram of carp/gram of carp/day [g/g/d]) for the 15th day of each month. Habitats with growth rate potential >= 0 g/g/d were deemed suitable.
Matrix attributes:
Rows: Row numbers refer to the spatial node with 20 equally-spaced vertical layers.
Columns: Columns 1-20 refer to the growth rate potential value for each vertical layer of each node. Vertical layers are evenly spaced based on the total depth of the water column for each node. Depth for each node can be found in the grid attributes data file. Columns 21 ("meanG") and 22 ("Gmax") represent the average and maximum growth rate potential, respectively, of the fish across the whole water column for the corresponding node.
File naming convention: species_MonthNumber (e.g. silver_06 = Silver carp growth rate potential in June)
Spatial coordinates for each node can be found in the grid attributes data files.,
Grid attributes data:
This Rdata file provides the spatial reference data and other grid attributes. Coordinates are provided in UTM (x & y) and latitude and longitude (decimal degrees). Depth (meters) for each node is listed in this file.
, GRP Model code:
Details bioenergetics equations, foraging equation, functions for running the model on a monthly time-step and daily time step, and functions for basic analyses. Model is coded in R., and
The simulated input data (prey and temperature) used to run our model is not included in this data set. Instead we provide the model code, grid attributes, and outputs of the model.
The readRDS() function (R Base Package v.3.5.1) is required to read in .Rdata files in R.

The eastern coastal basins of Brazil are a series of small and isolated rivers that drain directly into the Atlantic Ocean. During the Pleistocene, sea-level retreat caused by glaciations exposed the continental shelf, resulting in enlarged paleodrainages that connected rivers that are isolated today. Using Geographic Information System (GIS), we infer the distribution of these paleodrainages, and their properties for the east Brazilian coast. Here, we publicly make available the shapefiles that demonstrate the paleodrainage structure along the Brazilian coast during the largest sea-level retreats in the Pleistocene, the riverine vectors during the same period and the coastal line for a drop of -125m in the sea.

There is a directory tree inside this zipped file. The main directory has the Adobe Illustrator plots of the figures in the paper, Space Weather journal manuscript # 2018SW002067, "Model evaluation guidelines for geomagnetic index predictions" by M. W. Liemohn and coauthors. The three subdirectories have the files for the individual models, the data to which they are compared, and the IDL code used to create the figure plots and metrics calculations. and Date coverage is specific to each model. The RAMSCB model covers January 2005, the WINDMI model all of 2014, and the UPOS model 1.5 solar cycles, from 1 October 2001 through 29 July 2013.

The specific focus of the project was radiative shocks, which develop when shock waves become so fast and hot that the radiation from the shocked matter dominates the energy transport. This in turn leads to changes in the shock structure. Radiative shocks are challenging to simulate, as they include phenomena on a range of spatial and temporal scales and involve two types of nonlinear physics Ð- hydrodynamics and radiation transport. Even so, the range of physics involved is narrow enough that one can hope to model all of it with sufficient fidelity to reproduce the data.
CRASH was focused on developing predictions for a sequence of experiments performed in Project Year 5, in which those experiments represented an extrapolation from all previously available data. The previous data involved driving radiative shocks within cylindrical structures, and mainly straight tubes. The Year 5 experiments drove a radiative shock down an elliptical tube. Our long-stated goal for these predictions was that the distribution of predicted values would overlap significantly with the observed distribution. We achieved this goal.
Achieving our goal required the conversion of an established space-weather code to model radiative shocks at high energy density. To obtain reasonable fidelity with respect to the experimental data required implementing a laser absorption package, in addition to a hydrodynamic solver, electron physics and heat conduction, and multigroup diffusive radiation transport. The dedicated experiments provided evidence of experimental variability, validation of the calculation of initial shock wave behavior, and validation data at many observation times using cylindrical shock tubes. Following this were preparatory experiments for and finally the execution of the Year 5 experiments. The predictive science research included a wide range of sensitivity studies to determine which variables were important and a sequence of predictive studies focused on specific issues and sets of data. This led ultimately to predictions of shock location for the Year 5 experiments.
A conclusion from this project is that the serious quantification of uncertainty in simulations is a dauntingly difficult and expensive prospect. Pre-existing codes are unlikely to have been built with attention to what will be needed to quantify their uncertainty. Pre-existing experimental results are even more unlikely to include a sufficiently detailed analysis of the experimental uncertainties. And this will also be true of most experiments that might be used to validate components of the simulation. The analysis of uncertainty in any one of the physical processes (and related physical constants) is a major effort. And addressing model form uncertainty is an even bigger challenge, that may in principle require development of complete, alternative simulation models. We made a start at all of this, and completed almost none of it. But by the end of a project, we finally had all the pieces in place and working that would have enabled a range of important studies and advances in relatively near-term years. But the sponsor terminated the program after only five years. For most of the participants this was a relatively minor development, although for a few of them it proved to be enormously disruptive. We believe that the cost to the nation, in work that was ready be done but now will not be, was much much larger.
The sketch of the target was produced using a drawing program based on the experimental dimensions.
The annotated photograph of the target was obtained using a visible-light camera.
The colorized radiographs were obtained via backilit-pinhole radiography of a radiative shock propagating down an elliptical tube, at 26 ns after the lasers driving the shock tube fired.
The graph showing lines and circles was produced by running many computer models, analyzing their statistical distribution, and measuring actual shock positions in the experiment.

The goal of the work is to elucidate the stability of a complex experimentally observed structure of proteins. We found that supercharged GFP molecules spontaneously assemble into a complex 16-mer structure that we term a protomer, and that under the right conditions an even larger assembly is observed. The protomer structure is very well defined, and we performed simulations to try and understand the mechanics underlying its behavior. In particular, we focused on understanding the role of electrostatics in this system and how varying salt concentrations would alter the stability of the structure, with the ultimate goal of predicting the effects of various mutations on the stability of the structure.
There are two separate projects included in this repository, but the two are closely linked. One, the candidate_structures folder, contains the atomistic outputs used to generate coarse-grained configurations. The actual coarse-grained simulations are in the rigid_protein folder, which pulls the atomistic coordinates from the other folder. All data is managed by signac and lives in the workspace directories, which contain various folders corresponding to different parameter combinations. The parameters associated with a given folder are stored in the signac_statepoint.json files within each subdirectory.
The atomistic data uses experimentally determined protein structures as a starting point; all of these are stored in the ConfigFiles folder. The primary output is the topology files generated from the PDBs by GROMACS; these topologies are then used to parametrize the Monte Carlo simulations. In some cases, atomistic simulations were actually run as well, and the outputs are stored alongside the topology files.
In the rigid_protein folder, the ConfigFiles folder contains MSMS, the software used to generate polyhedral representations of proteins from the PDBs in the candidate_structures folder. All of the actual polyhedral structures are also stored in the ConfigFiles folder. The actual simulation trajectories are stored as general simulation data (GSD) files within each subdirectory of the workspace, along with a single .pos file that contains the shape definition of the (nonconvex) polyhedron used to represent a protein. The logged quantities, such as energies and MC move sizes, are stored in .log files.
The logic for the simulations in the candidate_structures project is in the Python scripts project.py, operations.py, and scripts/init.py. The rigid_protein folder also includes the notebooks directory, which contains Jupyter notebooks used to perform analyses, as well as the Python scripts used to actually perform the simulations and manage the data space. In particular, the project.py, operations.py and scripts/init.py scripts contain most of the logic associated with the simulations.

Estimated phylogenetic relationships based on more than 18,000 loci in 93 individuals (full data) or 21 individuals (subset data) representing 19 described species and two putative undescribed species. Nine files are part of this dataset, including all input files to infer the phylogenetic reconstructions and the outputs obtained, in addition to a pruned tree used to infer the ancestral state reconstructions.

The NASA MAVEN (Mars Atmosphere and Volatile Evolution) spacecraft, which is currently in orbit around Mars, has been taking monthly measurements of the speed and direction of the winds in the upper atmosphere of Mars between about 140 to 240 km above the surface. The observed wind speeds and directions change with time and location, and sometimes fluctuate quickly. These measurements are compared to simulations from a computer model of the Mars atmosphere called M-GITM (Mars Global Ionosphere-Thermosphere Model), developed at U. of Michigan. This is the first comparison between direct measurements of the winds in the upper atmosphere of Mars and simulated winds and is important because it can help to inform us what physical processes are acting on the observed winds. Some wind measurements have similar wind speeds or directions to those predicted by the M-GITM model, but sometimes, there are large differences between the simulated and measured winds. The disagreements between wind observations and model simulations suggest that processes other than normal solar forcing may become relatively more important during these observations and alter the expected circulation pattern. Since the global circulation plays a role in the structure, variability, and evolution of the atmosphere, understanding the processes that drive the winds in the upper atmosphere of Mars provides key context for understanding how the atmosphere behaves as a whole system.
A basic version of the M-GITM code can be found on Github as follows:
https:/github.com/dpawlows/MGITM
and About 30 Neutral Gas and Ion Mass Spectrometer (NGIMS) wind campaigns (of 5 to 10 orbits each) have been conducted by the MAVEN team (Benna et al., 2019). Five of these campaigns are selected for detailed study (Roeten et al. 2019). The Mars conditions for these five campaigns have been used to launch corresponding M-GITM code simulations, yielding 3-D neutral wind fields for comparison to these NGIMS wind observations. The M-GITM datacubes used to extract the zonal and meridional neutral winds, along the trajectory of each orbit path between 140 and 240 km, are provided in this Deep Blue Data archive. README files are provided for each datacube, detailing the contents of each file. A general README file is also provided that summarizes the inputs and outputs of the M-GITM code simulations for this study.

This collection represents various raw data and analysis of cores extracted during the January 2009 mission of the research vessel Sproul in the Santa Barbara Basin., Cores included: box core SPR0901-04BC, box core SPR0901-unnamed, and Kasten core SPR0901-03KC. Core photos, physical properties and magnetic susceptibility from the multisensor track (MST), and the scanning X-ray fluorescence (XRF) data are included in the collection., and Cruise DOI: 10.7284/901089
This research is funded by NSF-OCE 0752093.

The data and the scripts are to show that seizure onset dynamics and evoked responses change over the progression of epileptogenesis defined in this intrahippocampal tetanus toxin rat model. All tests explored in this study can be repeated with the data and scripts included in this repository. and Dataset citation: Crisp, D.N., Cheung, W., Gliske, S.V., Lai, A., Freestone, D.R., Grayden, D.B., Cook, MJ., Stacey, W.C. (2019). Epileptogenesis modulates spontaneous and responsive brain state dynamics [Data set]. University of Michigan Deep Blue Data Repository. https://doi.org/10.7302/r6vg-9658

This collection represents various raw data and analysis of cores extracted during the November 2008 mission of R/V Melville in the Santa Barbara Basin., The core included is the jumbo piston core MV0811-14JC. Core photos, physical properties and magnetic susceptibility from the multisensor track (MST), and the scanning X-ray fluorescence (XRF) data are included in the collection., and Cruise DOI: 10.7284/903459
The research is funded by NSF OCE-1304327.

The work on accelerating authenticated boot for embedded system resulted in designing an algorithm in python to perform the random address generation and cryptographic MAC calculation.
The Sampled Boot schemes implemented in this package allow a significant reduction of the time
needed to authenticate firmware images during startup, while still retaining a high degree of trust.
This is particularly useful for automotive applications in which startup time constraints make secure boot a time prohibitive process. and Citation for this dataset: Nasser, A., Gumise, W. (2019). Authenticated Boot Acceleration Algorithm [Code and data]. University of Michigan Deep Blue Data Repository. https://doi.org/10.7302/yeh1-1x17

The relationship between words in a sentence often tell us more about the underlying semantic content of a document than its actual words, individually. Recent publications in the natural language processing arena, more specifically using word embeddings, try to incorporate semantic aspects into their word vector representation by considering the context of words and how they are distributed in a document collection. In this work, we propose two novel algorithms, called Flexible Lexical Chain II and Fixed Lexical Chain II that combine the semantic relations derived from lexical chains, prior knowledge from lexical databases, and the robustness of the distributional hypothesis in word embeddings into a single decoupled system. In short, our approach has three main contributions: (i) unsupervised techniques that fully integrate word embeddings and lexical chains; (ii) a more solid semantic representation that considers the latent relation between words in a document; and (iii) lightweight word embeddings models that can be extended to any natural language task. Knowledge-based systems that use natural language text can benefit from our approach to mitigate ambiguous semantic representations provided by traditional statistical approaches. The proposed techniques are tested against seven word embeddings algorithms using five different machine learning classifiers over six scenarios in the document classification task. Our results show that the integration between lexical chains and word embeddings representations sustain state-of-the-art results, even against more complex systems.
Github: https://github.com/truas/LexicalChain_Builder

Manganese in the sedimentary record has been interpreted by many as a powerful redox proxy for paleoenvironments, and yet very little work has been done to ensure that the manganese-rich minerals in the rock record are actually recording primary signals. In the accompanying manuscript, we present an in-depth characterization of the manganese mineralogy from two correlated regions recording the Transvaal Supergroup in South Africa with markedly different alteration histories to investigate if there can be post-depositional emplacement of manganese-rich minerals. The data uploaded here are X-ray absorption spectra of (1) manganese standard minerals that were useful in our analyses and (2) minerals from an important well-characterized sample that may be useful as comparative standards in future studies.

Raw Rheology data in supplement to the 2019 Macromolecules publication: "Assessing the Range of Validity of Current Tube Models Through Analysis of a Comprehensive Set of Star-Linear 1,4-Polybutadiene Polymer Blends"

The Liquid Metal Battery (LMB), a promising energy-storage device that contains liquid-metal interior, is studied numerically in the paper. The metal pad roll instability was modeled based on the open-source CFD software, OpenFOAM. It's based on the solver for simulations of incompressible multiphase flows multiphaseInterFoam modified to include the electromagnetic fields and account for the sharp variations of the electrical conductivity.

The main goal of this research was to identify potential molecular pathways that contribute to memory dysregulation and decline that persists long after illness or inflammation. We have previously established a subchronic immune challenge model that results in memory impairments months after the inflammatory challenge. This project aimed to determine whether memory impairments were accompanied by transcriptional dysregulation in memory related brain region (the hippocampus).
These data show the differential gene expression as log2fold change (and p-value) in males and females 3 months after immune challenge (Supp Tables 1 and 2); after a subsequent immune challenge (Supp Tables 3 and 4); the differential regulation of genes in males and females (Supp Table 5); genes differentially expressed in the hippocampus of males and females at baseline (Supp Table 6) and the differential regulation of those genes in males and females after immune challenge (Supp Tables 7,8).