Estimated phylogenetic relationships based on more than 18,000 loci in 93 individuals (full data) or 21 individuals (subset data) representing 19 described species and two putative undescribed species. Nine files are part of this dataset, including all input files to infer the phylogenetic reconstructions and the outputs obtained, in addition to a pruned tree used to infer the ancestral state reconstructions.

The modeling research conducted to produce this dataset focuses on the solar wind dynamic pressure drop events and how they affect the Earth's intrinsically coupled Magnetosphere, Ionosphere and Thermosphere systems. This study specifically focuses on the 11 June 2017 event, where the solar wind dynamic pressure dropped significantly following a period of higher pressure. We model the response to this pressure drop using University of Michigan Space Weather Modeling Framework ( http://csem.engin.umich.edu/tools/swmf/). The simulation results were created using BATS-R-US and GITM models. The observational data required for model comparisons were taken from OMNI ( https://omniweb.gsfc.nasa.gov) and CDAWeb ( https://cdaweb.gsfc.nasa.gov/sp_phys/) Databases.

The aim of the research to study the temporal evolution of the Sub-Auroral Polarizations Streams (SAPS) with multiple instruments. Aurora images are important to identify the electrodynamics in the high latitude ionosphere. Data used in this research were collected and provided by Dr. Jun Liang at University of Calgary, and analyzed by Zihan Wang for the article accompanying this deposit. The data was collected from all sky imagers (ASI) located in Saskatoon, Canada. and saskrainbow01.sav contains the rgb channels from ASI. It can be opened using IDL.
saskasiskymap.gm5 contains the sky map of the ASI. It shows the geographic location of the figure pixels.
aurora_diff.pro is the procedure to recover the data from saskrainbow01.sav and make plots.
draw_colorbar.pro is the procedure to add a colorbar to the plot.
mlon_smp_towang.pro and rebinsmpkeo_towang.pro can together make keogram plots.
Run mlon_smp_towang.pro first and then rebinsmpkeo_towang.pro.

The goal of the work is to elucidate the stability of a complex experimentally observed structure of proteins. We found that supercharged GFP molecules spontaneously assemble into a complex 16-mer structure that we term a protomer, and that under the right conditions an even larger assembly is observed. The protomer structure is very well defined, and we performed simulations to try and understand the mechanics underlying its behavior. In particular, we focused on understanding the role of electrostatics in this system and how varying salt concentrations would alter the stability of the structure, with the ultimate goal of predicting the effects of various mutations on the stability of the structure.
There are two separate projects included in this repository, but the two are closely linked. One, the candidate_structures folder, contains the atomistic outputs used to generate coarse-grained configurations. The actual coarse-grained simulations are in the rigid_protein folder, which pulls the atomistic coordinates from the other folder. All data is managed by signac and lives in the workspace directories, which contain various folders corresponding to different parameter combinations. The parameters associated with a given folder are stored in the signac_statepoint.json files within each subdirectory.
The atomistic data uses experimentally determined protein structures as a starting point; all of these are stored in the ConfigFiles folder. The primary output is the topology files generated from the PDBs by GROMACS; these topologies are then used to parametrize the Monte Carlo simulations. In some cases, atomistic simulations were actually run as well, and the outputs are stored alongside the topology files.
In the rigid_protein folder, the ConfigFiles folder contains MSMS, the software used to generate polyhedral representations of proteins from the PDBs in the candidate_structures folder. All of the actual polyhedral structures are also stored in the ConfigFiles folder. The actual simulation trajectories are stored as general simulation data (GSD) files within each subdirectory of the workspace, along with a single .pos file that contains the shape definition of the (nonconvex) polyhedron used to represent a protein. The logged quantities, such as energies and MC move sizes, are stored in .log files.
The logic for the simulations in the candidate_structures project is in the Python scripts project.py, operations.py, and scripts/init.py. The rigid_protein folder also includes the notebooks directory, which contains Jupyter notebooks used to perform analyses, as well as the Python scripts used to actually perform the simulations and manage the data space. In particular, the project.py, operations.py and scripts/init.py scripts contain most of the logic associated with the simulations.

The specific focus of the project was radiative shocks, which develop when shock waves become so fast and hot that the radiation from the shocked matter dominates the energy transport. This in turn leads to changes in the shock structure. Radiative shocks are challenging to simulate, as they include phenomena on a range of spatial and temporal scales and involve two types of nonlinear physics Ð- hydrodynamics and radiation transport. Even so, the range of physics involved is narrow enough that one can hope to model all of it with sufficient fidelity to reproduce the data.
CRASH was focused on developing predictions for a sequence of experiments performed in Project Year 5, in which those experiments represented an extrapolation from all previously available data. The previous data involved driving radiative shocks within cylindrical structures, and mainly straight tubes. The Year 5 experiments drove a radiative shock down an elliptical tube. Our long-stated goal for these predictions was that the distribution of predicted values would overlap significantly with the observed distribution. We achieved this goal.
Achieving our goal required the conversion of an established space-weather code to model radiative shocks at high energy density. To obtain reasonable fidelity with respect to the experimental data required implementing a laser absorption package, in addition to a hydrodynamic solver, electron physics and heat conduction, and multigroup diffusive radiation transport. The dedicated experiments provided evidence of experimental variability, validation of the calculation of initial shock wave behavior, and validation data at many observation times using cylindrical shock tubes. Following this were preparatory experiments for and finally the execution of the Year 5 experiments. The predictive science research included a wide range of sensitivity studies to determine which variables were important and a sequence of predictive studies focused on specific issues and sets of data. This led ultimately to predictions of shock location for the Year 5 experiments.
A conclusion from this project is that the serious quantification of uncertainty in simulations is a dauntingly difficult and expensive prospect. Pre-existing codes are unlikely to have been built with attention to what will be needed to quantify their uncertainty. Pre-existing experimental results are even more unlikely to include a sufficiently detailed analysis of the experimental uncertainties. And this will also be true of most experiments that might be used to validate components of the simulation. The analysis of uncertainty in any one of the physical processes (and related physical constants) is a major effort. And addressing model form uncertainty is an even bigger challenge, that may in principle require development of complete, alternative simulation models. We made a start at all of this, and completed almost none of it. But by the end of a project, we finally had all the pieces in place and working that would have enabled a range of important studies and advances in relatively near-term years. But the sponsor terminated the program after only five years. For most of the participants this was a relatively minor development, although for a few of them it proved to be enormously disruptive. We believe that the cost to the nation, in work that was ready be done but now will not be, was much much larger.
The sketch of the target was produced using a drawing program based on the experimental dimensions.
The annotated photograph of the target was obtained using a visible-light camera.
The colorized radiographs were obtained via backilit-pinhole radiography of a radiative shock propagating down an elliptical tube, at 26 ns after the lasers driving the shock tube fired.
The graph showing lines and circles was produced by running many computer models, analyzing their statistical distribution, and measuring actual shock positions in the experiment.

There is a directory tree inside this zipped file. The main directory has the Adobe Illustrator plots of the figures in the paper, Space Weather journal manuscript # 2018SW002067, "Model evaluation guidelines for geomagnetic index predictions" by M. W. Liemohn and coauthors. The three subdirectories have the files for the individual models, the data to which they are compared, and the IDL code used to create the figure plots and metrics calculations. and Date coverage is specific to each model. The RAMSCB model covers January 2005, the WINDMI model all of 2014, and the UPOS model 1.5 solar cycles, from 1 October 2001 through 29 July 2013.

This is the experimental data referenced in our manuscript entitled “SMALL-LABS: An algorithm for measuring single molecule intensity and position in the presence of obscuring backgrounds .” These live-cell single-molecule imaging movies were used as a test of the SMALL-LABS single-molecule image analysis algorithm.
The dataset comprises two movies; each one is provided both as a .tif stack and as an .avi file. The movie called “low_bg” has a standard low background, and the movie called “high_bg” includes a high fluorescent background produced by an external 488-nm laser.

The eastern coastal basins of Brazil are a series of small and isolated rivers that drain directly into the Atlantic Ocean. During the Pleistocene, sea-level retreat caused by glaciations exposed the continental shelf, resulting in enlarged paleodrainages that connected rivers that are isolated today. Using Geographic Information System (GIS), we infer the distribution of these paleodrainages, and their properties for the east Brazilian coast. Here, we publicly make available the shapefiles that demonstrate the paleodrainage structure along the Brazilian coast during the largest sea-level retreats in the Pleistocene, the riverine vectors during the same period and the coastal line for a drop of -125m in the sea.

Percent Weight Change Data:
The model was run continuously on a daily time step for seasonal intervals (Spring: March thru May; Summer: June thru August; Fall: September thru November) as well as contiguously from Spring to Fall to assess total growth over the likely growing season (March thru November). CSV files represent the simulated weight change (%) of Bighead and Silver Carp for the respective time periods associated with the file name. Initial fish mass for each seasonal interval and growing season was 4350 g for Silver Carp and 5480 g for Bighead Carp. Maximum and mean total weight change (%) was determined for three depth ranges (near surface depths [NS]: 0 – 10 m; deep chlorophyll layer depths [DCL]: 10 - 50 m; and whole water column [WC]). Coordinates are in decimal degrees.
File naming convention: speciesSeasonWtChange (e.g. bigheadFallWtChange = % weight change of Bighead Carp from September through November)
,
Monthly Habitat Quality Data:
Rdata files contain matrices of Bighead or Silver carp growth rate potential as represented as a mass-proportional growth rate (gram of carp/gram of carp/day [g/g/d]) for the 15th day of each month. Habitats with growth rate potential >= 0 g/g/d were deemed suitable.
Matrix attributes:
Rows: Row numbers refer to the spatial node with 20 equally-spaced vertical layers.
Columns: Columns 1-20 refer to the growth rate potential value for each vertical layer of each node. Vertical layers are evenly spaced based on the total depth of the water column for each node. Depth for each node can be found in the grid attributes data file. Columns 21 ("meanG") and 22 ("Gmax") represent the average and maximum growth rate potential, respectively, of the fish across the whole water column for the corresponding node.
File naming convention: species_MonthNumber (e.g. silver_06 = Silver carp growth rate potential in June)
Spatial coordinates for each node can be found in the grid attributes data files.,
Grid attributes data:
This Rdata file provides the spatial reference data and other grid attributes. Coordinates are provided in UTM (x & y) and latitude and longitude (decimal degrees). Depth (meters) for each node is listed in this file.
, GRP Model code:
Details bioenergetics equations, foraging equation, functions for running the model on a monthly time-step and daily time step, and functions for basic analyses. Model is coded in R., and
The simulated input data (prey and temperature) used to run our model is not included in this data set. Instead we provide the model code, grid attributes, and outputs of the model.
The readRDS() function (R Base Package v.3.5.1) is required to read in .Rdata files in R.

Nighttime and diurnal surveys in the lowland Peruvian Amazon of Los Amigos Biological Station were conducted in order to describe herpetological diversity at this site. As a result of these surveys, the predation event between a Pamphobeteus sp. and Marmosops sp. and the myiasis of Ranitomeye uakarii were observed. The video footage was recorded in order to document these interesting interactions between arthropod predators and parasites and vertebrate prey and hosts, and are included for publication in the short communication "Ecological interactions between arthropods and small vertebrates in a lowland Amazon rainforest" in the journal Amphibian and Reptile Conservation.

This work contains the experimental data and associated analysis that are described in the research publication entitled "Ultra-specific and Amplification-free Quantification of Mutant DNA by Single-molecule Kinetic Fingerprinting". This work contains multiple zip files, each of which represents one of the principal experiment groups presented in the publication. Each experiment group contains movie and analysis files corresponding to various experimental conditions related to that experiment group.

This dataset contains all data used to generate the figures in The Cryosphere manuscript “Measuring Snow Specific Surface Area with 1.30 and 1.55 micro-meter Bidirectional Reflectance Factors,” by Adam Schneider, Mark Flanner, and Roger De Roo. These data support the theory, calibration, and application of the Near-Infrared Emitting and Reflectance Monitoring Dome (NERD), an instrument engineered to rapidly retrieve surface snow specific surface area in the field. Note that this deposit includes a microCT scan database for natural snowfall samples collected in New Hampshire during 2015-2017, comprised of raw tiff files as well as reconstructions, binarized reconstructions, and some 3D model reconstructions. and Running python scripts generally require that the following packages are installed: NumPy, SciPy, Matplotlib, Pandas, and ipdb (for debugging).

Geological and geophysical observations reveal along-strike fault zone heterogeneity on major strike-slip faults, which can play a significant role in earthquake rupture propagation and termination. I present 2D dynamic rupture simulations to demonstrate rupture characteristics in such heterogeneous fault zone structure. The modeled rupture is nucleated in a damaged fault zone and propagates on a preexisting fault towards the zone of intact rocks. There is an intermediate range of nucleation lengths that only allow rupture to spontaneously propagate in the damaged fault zone but not in a homogeneous medium. I find that rupture with an intermediate nucleation length tends to stop when it reaches the zone of intact rocks, especially when the rupture propagation distance in the damaged fault zone is relatively short and when the damaged fault zone is relatively narrow or smooth in the fault-normal direction. Pronounced small-scale heterogeneity within the damaged fault zone also contributes to such early rupture termination. In asymmetric fault zones bisected by a bimaterial fault, rupture moving in the direction of slip of faster rocks tends to terminate under the same conditions as in symmetric fault zones, whereas rupture moving in the direction of slip of slower rocks can penetrate into the zone of intact rocks. Break-through rupture is allowed when a sufficiently-large asperity is located at the edge of the zone of intact rocks. The results suggest the along-strike fault zone heterogeneity can play a critical role in seismicity distribution.
The data set contains multiple folders of simulation results from the SEM2DPACK that demonstrate the above findings. The folder name includes the model parameters in each simulation as explained in the README file. The Flt01_sem2d.data file in the folder documents the slip, slip rate and stresses from each simulation. The files are also explained in the manual of SEM2DPACK ( http://web.gps.caltech.edu/~ampuero/soft/users_guide_sem2dpack.pdf). Please refer to section 4.6 in the manual and use “sem2d_read_fault.m” in the POST folder of SEM2DPACK to plot the results.

Investigating minimum human reaction times is often confounded by the motivation, training, and state of arousal of the subjects. We used the reaction times of athletes competing in the shorter sprint events in the Athletics competitions in recent Olympics (2004-2016) to determine minimum human reaction times because there's little question as to their motivation, training, or state of arousal.
The reaction times of sprinters however are only available on the IAAF web page for each individual heat, in each event, at each Olympic. Therefore we compiled all these data into two separate excel sheets which can be used for further analyses.

In this work , we study the problem of allocating limited security countermeasures to protect network data from cyber-attacks, for scenarios modeled by Bayesian attack graphs.
We consider multi-stage interactions between a network administrator and cybercriminals, formulated as a security game.
We propose parameterized heuristic strategies for the attacker and defender and provide detailed analysis of their time complexity.
Our heuristics exploit the topological structure of attack graphs and employ sampling methods to overcome the computational complexity in predicting opponent actions.
Due to the complexity of the game, we employ a simulation-based approach and perform empirical game analysis over an enumerated set of heuristic strategies.
Finally, we conduct experiments in various game settings to evaluate the performance of our heuristics in defending networks, in a manner that is robust to uncertainty about the security environment.

Raw SNP genotypes are provided in STRUCTURE format, with a maximum of one SNP reported per ddRAD locus. The files "caryco_SNP.str" and "caryov_SNP.str" are genotypes for Carya cordiformis and Carya ovata, respectively. The first column of each file is the individual name, the second column is the population (see original publication for information on population locations), and the remaining columns are genotypes of individual SNPs. Rows represent individuals, with the diploid genotypes contained on two lines per individual. Missing data are entered as "0" (zero). The first row is a header with a unique identifier for each SNP. and Occurrence records for each species are provided in the file "occs_carya.csv" and contain the latitude and longitude of each record.