Spliceosomal introns are a hallmark of eukaryotic genes that are hypothesized to play important roles in genome evolution but have poorly understood origins. Although most introns lack sequence homology to each other, new families of spliceosomal introns that are repeated hundreds of times in individual genomes have recently been discovered in a few organisms. The prevalence and conservation of these introner elements (IEs) or introner-like elements in other taxa, as well as their evolutionary relationships to regular spliceosomal introns, are still unknown. Here, we systematically investigate introns in the widespread marine green alga Micromonas and report new families of IEs, numerous intron presence-absence polymorphisms, and potential intron insertion hot-spots. The new families enabled identification of conserved IE secondary structure features and establishment of a novel general model for repetitive intron proliferation across genomes. Despite shared secondary structure, the IE families from each Micromonas lineage bear no obvious sequence similarity to those in the other lineages, suggesting that their appearance is intimately linked with the process of speciation. Two of the new IE families come from an Arctic culture (Micromonas Clade E2) isolated from a polar region where abundance of this alga is increasing due to climate induced changes. The same two families were detected in metagenomic data from Antarctica--a system where Micromonas has never before been reported. Strikingly high identity between the Arctic isolate and Antarctic coding sequences that flank the IEs suggests connectivity between populations in the two polar systems that we postulate occurs through deep-sea currents. Recovery of Clade E2 sequences in North Atlantic Deep Waters beneath the Gulf Stream supports this hypothesis. Our research illuminates the dynamic relationships between an unusual class of repetitive introns, genome evolution, speciation, and global distribution of this sentinel marine alga.

In marine ecosystems, viruses are major disrupters of the direct flow of carbon and nutrients to higher trophic levels. Although the genetic diversity of several eukaryotic phytoplankton virus groups has been characterized, their infection dynamics are less understood, such that the physiological and ecological implications of their diversity remain unclear. We compared genomes and infection phenotypes of the two most closely related cultured phycodnaviruses infecting the widespread picoprasinophyte Ostreococcus lucimarinus under standard- (1.3 divisions per day) and limited-light (0.41 divisions per day) nutrient replete conditions. OlV7 infection caused early arrest of the host cell cycle, coinciding with a significantly higher proportion of infected cells than OlV1-amended treatments, regardless of host growth rate. OlV7 treatments showed a near-50-fold increase of progeny virions at the higher host growth rate, contrasting with OlV1's 16-fold increase. However, production of OlV7 virions was more sensitive than OlV1 production to reduced host growth rate, suggesting fitness trade-offs between infection efficiency and resilience to host physiology. Moreover, although organic matter released from OlV1- and OlV7-infected hosts had broadly similar chemical composition, some distinct molecular signatures were observed. Collectively, these results suggest that current views on viral relatedness through marker and core gene analyses underplay operational divergence and consequences for host ecology.

BACKGROUND:Prasinophytes are widespread marine green algae that are related to plants. Cellular abundance of the prasinophyte Micromonas has reportedly increased in the Arctic due to climate-induced changes. Thus, studies of these unicellular eukaryotes are important for marine ecology and for understanding Viridiplantae evolution and diversification. RESULTS:We generated evidence-based Micromonas gene models using proteomics and RNA-Seq to improve prasinophyte genomic resources. First, sequences of four chromosomes in the 22 Mb Micromonas pusilla (CCMP1545) genome were finished. Comparison with the finished 21 Mb genome of Micromonas commoda (RCC299; named herein) shows they share ≤8,141 of ~10,000 protein-encoding genes, depending on the analysis method. Unlike RCC299 and other sequenced eukaryotes, CCMP1545 has two abundant repetitive intron types and a high percent (26 %) GC splice donors. Micromonas has more genus-specific protein families (19 %) than other genome sequenced prasinophytes (11 %). Comparative analyses using predicted proteomes from other prasinophytes reveal proteins likely related to scale formation and ancestral photosynthesis. Our studies also indicate that peptidoglycan (PG) biosynthesis enzymes have been lost in multiple independent events in select prasinophytes and plants. However, CCMP1545, polar Micromonas CCMP2099 and prasinophytes from other classes retain the entire PG pathway, like moss and glaucophyte algae. Surprisingly, multiple vascular plants also have the PG pathway, except the Penicillin-Binding Protein, and share a unique bi-domain protein potentially associated with the pathway. Alongside Micromonas experiments using antibiotics that halt bacterial PG biosynthesis, the findings highlight unrecognized phylogenetic complexity in PG-pathway retention and implicate a role in chloroplast structure or division in several extant Viridiplantae lineages. CONCLUSIONS:Extensive differences in gene loss and architecture between related prasinophytes underscore their divergence. PG biosynthesis genes from the cyanobacterial endosymbiont that became the plastid, have been selectively retained in multiple plants and algae, implying a biological function. Our studies provide robust genomic resources for emerging model algae, advancing knowledge of marine phytoplankton and plant evolution.

Giant viruses are remarkable for their large genomes, often rivaling those of small bacteria, and for having genes thought exclusive to cellular life. Most isolated to date infect nonmarine protists, leaving their strategies and prevalence in marine environments largely unknown. Using eukaryotic single-cell metagenomics in the Pacific, we discovered a Mimiviridae lineage of giant viruses, which infects choanoflagellates, widespread protistan predators related to metazoans. The ChoanoVirus genomes are the largest yet from pelagic ecosystems, with 442 of 862 predicted proteins lacking known homologs. They are enriched in enzymes for modifying organic compounds, including degradation of chitin, an abundant polysaccharide in oceans, and they encode 3 divergent type-1 rhodopsins (VirR) with distinct evolutionary histories from those that capture sunlight in cellular organisms. One (VirRDTS) is similar to the only other putative rhodopsin from a virus (PgV) with a known host (a marine alga). Unlike the algal virus, ChoanoViruses encode the entire pigment biosynthesis pathway and cleavage enzyme for producing the required chromophore, retinal. We demonstrate that the rhodopsin shared by ChoanoViruses and PgV binds retinal and pumps protons. Moreover, our 1.65-Å resolved VirRDTS crystal structure and mutational analyses exposed differences from previously characterized type-1 rhodopsins, all of which come from cellular organisms. Multiple VirR types are present in metagenomes from across surface oceans, where they are correlated with and nearly as abundant as a canonical marker gene from Mimiviridae Our findings indicate that light-dependent energy transfer systems are likely common components of giant viruses of photosynthetic and phagotrophic unicellular marine eukaryotes.