Gene duplications can facilitate adaptation and may lead to interpopulation divergence, causing reproductive isolation. We used whole-genome resequencing data from 34 butterflies to detect duplications in two Heliconius species, Heliconius cydno and Heliconius melpomene. Taking advantage of three distinctive signals of duplication in short-read sequencing data, we identified 744 duplicated loci in H. cydno and H. melpomene and evaluated the accuracy of our approach using single-molecule sequencing. We have found that duplications overlap genes significantly less than expected at random in H...

Escherichia coli is both a widespread harmless gut commensal and a versatile pathogen of humans. Domestic animals are a well-known reservoir for pathogenic E. coli. However, studies of E. coli populations from wild animals that have been separated from human activities have been very limited. Here we obtained 580 isolates from intestinal contents of 116 wild marmots (Marmota himalayana) from the Qinghai-Tibet Plateau, China, with five isolates per animal. We selected 125 of the 580 isolates (hereinafter referred to as strains) for genome sequencing, based on unique pulsed-field gel electrophoresis patterns and at least one isolate per animal...

The evolution of next-generation sequencing (NGS) technology has led to the development of many different assembly algorithms, but few of them focus on assembling organelle genomes. These genomes are used in phylogenetic studies and food identification, and they are the most frequently deposited eukaryotic genomes in GenBank. Assembling organelle genomes from whole genome sequencing (WGS) data would be the most accurate and least laborious approach, but a tool specifically designed for this task is lacking. We developed a seed-and-extend algorithm that assembles organelle genomes from WGS data, starting from a related or distant single seed sequence...
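The seed-and-extend idea mentioned above can be illustrated with a toy sketch. This is not the tool's actual algorithm, which the abstract does not describe in detail; it is a minimal greedy extension under simplifying assumptions (error-free reads, single strand, rightward extension only). The function name `extend_seed` and the parameters `k` and `max_len` are hypothetical.

```python
from collections import Counter


def extend_seed(seed, reads, k=5, max_len=1000):
    """Greedily extend `seed` to the right using reads that contain
    the last k bases of the growing contig.

    Toy assumptions (not taken from the abstract): reads are error-free,
    on the same strand as the seed, and len(seed) >= k.
    """
    contig = seed
    while len(contig) < max_len:
        suffix = contig[-k:]
        votes = Counter()
        # Collect the base that follows every occurrence of the suffix.
        for read in reads:
            pos = read.find(suffix)
            while pos != -1:
                nxt = pos + k
                if nxt < len(read):
                    votes[read[nxt]] += 1
                pos = read.find(suffix, pos + 1)
        if not votes:
            break  # no read extends the contig any further
        (base, top), *rest = votes.most_common(2)
        if rest and rest[0][1] == top:
            break  # tie between two bases: stop rather than guess
        contig += base
    return contig


# Usage: simulate overlapping reads from a short made-up sequence.
genome = "ACGTTGCAAGGCTTAACCGGATTC"
reads = [genome[i:i + 10] for i in range(0, len(genome) - 10 + 1, 2)]
assembled = extend_seed(genome[:6], reads, k=5)
```

A real organelle assembler would also have to handle sequencing errors, both strands, repeats, and circularization, none of which this sketch attempts.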

A unique resource for systems pharmacology and genomic studies is the NCI-60 cancer cell line panel, which provides data for the largest publicly available library of compounds with cytotoxic activity (~21,000 compounds), including 108 FDA-approved and 70 clinical trial drugs as well as genomic data, including whole-exome sequencing, gene and microRNA transcripts, DNA copy number, and protein levels. Here we provide the first readily usable genome-wide DNA methylation database for the NCI-60, including 485,577 probes from the Infinium HumanMethylation450k BeadChip array, which yielded DNA methylation signatures for 17,559 genes integrated into our open access CellMiner version 2...

Murine syngeneic tumor models are critical to novel immuno-based therapy development but the molecular and immunological features of these models are still not clearly defined. The translational relevance of differences between the models is not fully understood, impeding appropriate preclinical model selection for target validation, and ultimately hindering drug development. Across a panel of commonly-used murine syngeneic tumor models, we showed variable responsiveness to immunotherapies. We employed array comparative genomic hybridization, whole-exome sequencing, exon microarray analysis, and flow cytometry to extensively characterize these models, which revealed striking differences that may underlie these contrasting response profiles...

We developed a multiplex real-time PCR assay that simultaneously detects M. pneumoniae and assigns it to the historically defined P1 types. Typing was based on the presence of short type-specific indels identified through whole genome sequencing. This assay was 100% specific compared to existing methods and may be useful during epidemiologic investigations.

OBJECTIVE To describe the investigation and control of a rare cluster of Klebsiella pneumoniae carbapenemase-producing Citrobacter freundii in a hospital in southern Florida. METHODS An epidemiologic investigation, review of infection prevention procedures, and molecular studies including whole genome sequencing were conducted. RESULTS An outbreak of K. pneumoniae carbapenemase-3-producing C. freundii was identified at a tertiary hospital in Florida in 2014. Of the 6 cases identified, 3 occurred in the same intensive care unit and were caused by the same clone...

BACKGROUND: Allele-specific expression (ASE) is differential expression of each of the two chromosomal alleles of an autosomal gene. We assessed ASE patterns in the human left atrium (LA, n = 62) and paired samples from the left ventricle (LV, n = 76) before and after ischemia, and tested the utility of differential ASE to identify genes associated with postoperative atrial fibrillation (poAF) and myocardial ischemia. METHODS: Following genotyping from whole blood and whole-genome sequencing of LA and LV samples, we called ASE from sequences overlapping heterozygous SNPs, applying rigorous quality control to minimize false ASE calling...
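As an illustration of what calling ASE at a heterozygous SNP can involve, the sketch below tests whether reference and alternate allele read counts deviate from the 50:50 ratio expected without allelic imbalance. The exact two-sided binomial test is a commonly used choice and is an assumption here; the abstract does not state which statistical test or quality-control filters the authors applied. The function name `ase_binomial_p` is hypothetical.

```python
from math import comb


def ase_binomial_p(ref_reads, alt_reads, p=0.5):
    """Exact two-sided binomial p-value for allelic imbalance at one SNP.

    Sums the probability of every outcome that is no more likely than the
    observed reference-read count under the null ratio `p` (assumed 0.5).
    Intended for modest read depths; very deep sites would need a
    log-space or library implementation.
    """
    n = ref_reads + alt_reads
    if n == 0:
        return 1.0  # no informative reads, no evidence of imbalance
    pmf = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    observed = pmf[ref_reads]
    # Small relative tolerance guards against floating-point ties.
    return min(1.0, sum(x for x in pmf if x <= observed * (1 + 1e-9)))


# Usage: balanced versus fully skewed allele counts at a 10-read site.
balanced = ase_binomial_p(5, 5)
skewed = ase_binomial_p(10, 0)
```

In practice such per-SNP p-values would still need multiple-testing correction and filtering for mapping bias before any gene is reported as showing ASE.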

BACKGROUND: Since the end of 2011, an outbreak of pseudorabies has affected Chinese pig herds that had been vaccinated with the commercial vaccine made from the Bartha K61 strain. It is now clear that the outbreak was caused by an emergent pseudorabies virus (PRV) variant. Although experimental data indicate that vaccines made from the PRV Bartha K61 strain can confer some cross-protection against PRV variants, suboptimal clinical protection and reduction of virus shedding were observed, making the control or eradication of this disease difficult...

BACKGROUND: Pistachio (Pistacia vera L.) is one of the most important nut crops in the world. There are about 11 wild species in the genus Pistacia, and they are important as rootstock seed sources for cultivated P. vera and as forest trees. Published information on the pistachio genome is limited. Therefore, a genome survey by next-generation sequencing is needed to characterize the genome structure of pistachio. Simple sequence repeat (SSR) markers are useful tools for germplasm characterization, genetic diversity analysis, and genetic linkage mapping, and may help to elucidate genetic relationships among pistachio cultivars and species...

BACKGROUND: Lung adenocarcinoma (LUAD) is the most common histologic subtype of lung cancer and has a high risk of distant metastasis at every disease stage. We aimed to characterize the genomic landscape of LUAD and identify mutation signatures associated with tumor progression. METHODS AND FINDINGS: We performed an integrative genomic analysis, incorporating whole exome sequencing (WES), determination of DNA copy number and DNA methylation, and transcriptome sequencing for 101 LUAD samples from the Environment And Genetics in Lung cancer Etiology (EAGLE) study...

BACKGROUND: Metastasis is the main cause of cancer patient deaths and remains a poorly characterized process. It is still unclear when in tumor progression the ability to metastasize arises and whether this ability is inherent to the primary tumor or is acquired well after primary tumor formation. Next-generation sequencing and analytical methods to define clonal heterogeneity provide a means for identifying genetic events and the temporal relationships between these events in the primary and metastatic tumors within an individual...

BACKGROUND: Inflammatory breast cancer (IBC) is a rare, aggressive form of breast cancer associated with HER2 amplification, with high risk of metastasis and an estimated median survival of 2.9 y. We performed an open-label, single-arm phase II clinical trial (ClinicalTrials.gov NCT01325428) to investigate the efficacy and safety of afatinib, an irreversible ErbB family inhibitor, alone and in combination with vinorelbine in patients with HER2-positive IBC. This trial included prospectively planned exome analysis before and after afatinib monotherapy...

Epigenetic processes have been implicated in addiction; yet, it remains unclear whether these represent a risk factor and/or a consequence of substance use. Here, we conducted what we believe is the first genome-wide, longitudinal study to investigate whether DNA methylation patterns in early life prospectively associate with substance use in adolescence. The sample comprised 244 youth (51% female) from the Avon Longitudinal Study of Parents and Children (ALSPAC), with repeated assessments of DNA methylation (Illumina 450k array; cord blood at birth, whole blood at age 7) and substance use (tobacco, alcohol and cannabis use; age 14-18)...

Mutation breeding is based on the induction of genetic variations; hence knowledge of the frequency and type of induced mutations is of paramount importance for the design and implementation of a mutation breeding program. Although γ-ray irradiation has been widely used since the 1960s in the breeding of about 200 economically important plant species, molecular elucidation of its genetic effects has so far been achieved largely by analysis of target genes or genomic regions. In the present study, the whole genomes of six γ-irradiated M2 rice plants were sequenced; a total of 144-188 million high-quality (Q>20) reads were generated for each M2 plant, resulting in genome coverage of >45 times for each plant...

Small colony variants (SCVs) of the human pathogen Staphylococcus aureus are associated with persistent infections. Phenotypically, SCVs are characterized by slow growth, and they can arise upon interruption of the electron transport chain, which consequently reduces membrane potential and thereby limits uptake of aminoglycosides (e.g., gentamicin). In this study, we have examined the pathways by which the fitness cost of SCVs can be ameliorated. Five gentamicin-resistant SCVs derived from S. aureus JE2 were independently selected on agar plates supplemented with gentamicin...

Breast cancer heterogeneity is evident at the clinical, histological and molecular level. High-throughput technologies have allowed the identification of intrinsic subtypes that capture transcriptional differences among tumors. A remaining question is whether these differences are associated with a particular transcriptional program that involves different connections between the same molecules; in other words, whether particular transcriptional network architectures can be linked to specific phenotypes. In this work we infer, construct and analyze transcriptional networks from whole-genome gene expression microarrays, by using an information theory approach...
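The abstract does not name the information-theoretic measure it uses. One measure widely used for inferring transcriptional networks is mutual information between the expression profiles of two genes, and the sketch below assumes that choice purely for illustration. It also assumes expression values have already been discretized into bins; the function name `mutual_information` and the example data are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(x, y):
    """Mutual information (in bits) between two equal-length sequences
    of discrete labels, estimated from empirical frequencies.

    Assumption: x and y are expression values for two genes across the
    same samples, already discretized (e.g. 0 = low, 1 = high).
    """
    n = len(x)
    if n == 0 or n != len(y):
        raise ValueError("x and y must be non-empty and of equal length")
    count_x = Counter(x)
    count_y = Counter(y)
    count_xy = Counter(zip(x, y))
    mi = 0.0
    for (a, b), c in count_xy.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with counts.
        mi += (c / n) * log2(c * n / (count_x[a] * count_y[b]))
    return mi


# Usage: a perfectly co-expressed pair versus an unrelated pair.
gene_a = [0, 0, 1, 1]
gene_b = [0, 0, 1, 1]
gene_c = [0, 1, 0, 1]
mi_dependent = mutual_information(gene_a, gene_b)
mi_independent = mutual_information(gene_a, gene_c)
```

In a network-inference setting, pairs whose mutual information exceeds a significance threshold would be connected by an edge; the choice of threshold and discretization strongly affects the resulting architecture.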

PURPOSE: Thyroid cancer is the most frequent malignancy of the endocrine system, and it has become the fastest growing type of cancer worldwide. Much remains unknown about the molecular mechanisms of thyroid cancer. Studies have found a relationship between ARAP3 and human cancer. However, the role of ARAP3 in thyroid cancer has not been well explained. This study aimed to investigate the role of the ARAP3 gene in papillary thyroid carcinoma. METHODS: Whole-exon and whole-genome sequencing of primary papillary thyroid carcinoma (PTC) samples and matched adjacent normal thyroid tissue samples was performed, and then bioinformatics analysis was carried out...