It is accepted that confusion regarding the description of genetic variants occurs when researchers do not use standard nomenclature. The Human Genome Organization Gene Nomenclature Committee contacted a panel of consultants, all working on the KAL1 gene, to propose an update of the nomenclature of the gene, as there was a convention in the literature of using the 'KAL1' symbol, when referring to the gene, but using the name 'anosmin-1' when referring to the protein. The new name, ANOS1, reflects protein name and is more transferrable across species...

Quality control (QC) is a critical step in RNA sequencing (RNA-seq). Yet, it is often ignored or conducted on a limited basis. Here, we present a multi-perspective strategy for QC of RNA-seq experiments. The QC of RNA-seq can be divided into four related stages: (1) RNA quality, (2) raw read data (FASTQ), (3) alignment and (4) gene expression. We illustrate the importance of conducting QC at each stage of an RNA-seq experiment and demonstrate our recommended RNA-seq QC strategy. Furthermore, we discuss the major and often neglected quality issues associated with the three major types of RNA-seq: mRNA, total RNA and small RNA...

Microbial diversity in unique environmental settings enables abrupt responses catalysed by altering the gene regulation and formation of gene clusters called operons. Operons increases bacterial adaptability, which in turn increases their survival. This review article presents the emergence of computational operon prediction methods for whole microbial genomes and metagenomes, and discusses their strengths and limitations. Most of the whole-genome operon prediction methods struggle to generalize on unrelated genomes...

Transcriptomic methods are now widely used in functional genomic research. The vast amount of information received from these studies comes along with the challenge of developing a precise picture of the functional consequences and the characteristic regulatory mechanisms. Here we assess recent studies in marine species and their adaptation to polar (and seasonal) cold and explore how they have been able to draw reliable conclusions from transcriptomic patterns on functional consequences in the organisms. Our analysis indicates that the interpretation of transcriptomic data suffers from insufficient understanding of the consequences for whole organism performance and fitness and comes with the risk of supporting only preliminary and superficial statements...

Marine species live in a wide diversity of environments and yet, because of their pelagic life stages, are thought to be well-connected: they have high migration rates that inhibit significant population structure. Recent innovations in sequencing technologies now provide information on nucleotide polymorphisms at thousands to tens of thousands of loci based on whole genomes, reduced representative portions of genomes (0.1-1%) or a majority of expressed mRNAs. Data from these genomic approaches are used to define and quantify single-nucleotide polymorphisms (SNPs)...

Fisheries genomics is an emerging field that advocates the application of genomic tools to address questions in fisheries management. Genomic approaches bring a new paradigm for fisheries management by making it possible to integrate adaptive diversity to understand fundamental aspects of fisheries resources. Hence, this review is focused on the relevance of genomic approaches to solve fisheries-specific questions. Particularly the detection of adaptive diversity (outlier loci) provides unprecedented opportunity to understand bio-complexity, increased power to trace processed sample origin to allow enforcement and the potential to understand the genetic basis of micro-evolutionary effects of fisheries-induced evolution and climate change...

Developments in chromosome conformation capture (3C) technologies have revealed that the three-dimensional organization of a genome leads widely separated functional elements to reside in close proximity. However, the mechanisms responsible for mediating long-range interactions are still not completely known. In this review, we firstly evaluate and compare the current seven 3C-based methods, summarize their advantages and discuss their limitations to our current understanding of genome structure. Then, software packages available to perform the analysis of 3C-based data are described...

Marine pollution is ubiquitous, and is one of the key factors influencing contemporary marine biodiversity worldwide. To protect marine biodiversity, how do we surveil, document and predict the short- and long-term impacts of pollutants on at-risk species? Modern genomics tools offer high-throughput, information-rich and increasingly cost-effective approaches for characterizing biological responses to environmental stress, and are important tools within an increasing sophisticated kit for surveiling and assessing impacts of pollutants on marine species...

Next-generation sequencing has catapulted healthcare into a revolutionary genomics era. One such technology, whole-exome sequencing, which targets the protein-coding regions of the genome, has proven success in identifying new causal mutations for diseases of previously unknown etiology. With a successful diagnostic rate approaching 25% for rare disease in recent studies, its clinical utility is becoming increasingly popular. However, the interpretation of whole-exome sequencing data requires expertise in genomic informatics and clinical medicine to ensure the accurate and safe reporting of findings back to the bedside...

A key aim in evolutionary biology is to deduce ancestral states to better understand the evolutionary origins of clades of interest and the diversification process(es) that has/have elaborated them. These ancestral deductions can hit difficulties when undetected loss events are misinterpreted as ancestral absences. With the ever-increasing amounts of animal genomic sequence data, we are gaining a much clearer view of the preponderance of differential gene losses across animal lineages. This has become particularly clear with recent progress in our understanding of the origins of the Hox/ParaHox developmental control genes relative to the earliest branching lineages of the animal kingdom: the sponges (Porifera), comb jellies (Ctenophora) and placozoans (Placozoa)...

The rapid development of programmable nuclease-based genome editing technologies has enabled targeted gene disruption and correction both in vitro and in vivo This revolution opens up the possibility of precise genome editing at target genomic sites to modulate gene function in animals and plants. Among several programmable nucleases, the type II clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated nuclease 9 (Cas9) system has progressed remarkably in recent years, leading to its widespread use in research, medicine and biotechnology...

DNA methylation is an epigenetic modification of cytosines that undergoes dynamic changes in a temporal, spatial and cell-type-specific manner. Recent advances in technology have permitted the profiling of high-throughput methylomes in large numbers of biological samples. Various computational tools have been developed to identify and analyze DNA methylation dynamics in a variety of critical biological processes. As DNA methylation is becoming increasingly viewed as a dynamic process, the mechanisms governing DNA methylation dynamics and its roles in the transcriptional regulatory network are of great interest...

Protein-DNA binding plays a central role in gene regulation and by that in all processes in the living cell. Novel experimental and computational approaches facilitate better understanding of protein-DNA binding preferences via high-throughput measurement of protein binding to a large number of DNA sequences and inference of binding models from them. Here we review the state of the art in measuring protein-DNA binding in vitro, emphasizing the advantages and limitations of different technologies. In addition, we describe models for representing protein-DNA binding preferences and key computational approaches to learn those from high-throughput data...

In the past few years, extensive progress has been made in the development of genome-editing technology. Among several genome-editing tools, the clustered regularly interspaced short palindrome repeat-associated Cas9 nuclease (CRISPR/Cas9) system is particularly widely used owing to the ease of sequence-specific nuclease construction and the highly efficient introduction of mutations. The CRISPR/Cas9 system was originally constructed to induce small insertion and deletion mutations, but various methods have been developed to introduce point mutations, deletions, insertions, chromosomal translocations and so on...

The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies...

Zebrafish is a powerful model for the study of vertebrate development, being amenable to a wide range of genetic and other manipulations to probe the molecular basis of development and its perturbation in disease. Over recent years, genome editing approaches have become increasingly used as an efficient and sophisticated approach to precisely engineer the zebrafish genome, which has further enhanced the utility of this organism. This review provides a practical overview of genome editing and its application in zebrafish research, including alternate strategies for introducing and screening for specific genetic changes...

The decreasing cost of performing genome-wide association studies has made genomics widely accessible. However, there is a paucity of guidance for best practice in conducting such analyses. For the results of a study to be valid and replicable, multiple biases must be addressed in the course of data preparation and analysis. In addition, standardizing methods across small, independent studies would increase comparability and the potential for effective meta-analysis. This article provides a discussion of important aspects of quality control, imputation and analysis of genome-wide data from a low-coverage microarray, as well as a straight-forward guide to performing a genome-wide association study...