Abstract

The genomes of higher plants and animals are highly differentiated, and are composed of a relatively small number of genes and a large fraction of repetitive DNA. The bulk of this repetitive DNA constitutes transposable, and especially retrotransposable, elements. It has been hypothesized that most of these elements are heavily methylated relative to genes, but the evidence for this is controversial. We show here that repeat sequences in maize are largely excluded from genomic shotgun libraries by the selection of an appropriate host strain because of their sensitivity to bacterial restriction-modification systems. In contrast, unmethylated genic regions are preserved in these genetically filtered libraries if the insert size is less than the average size of genes. The representation of unique maize sequences not found in plant reference genomes is also greatly enriched. This demonstrates that repeats, and not genes, are the primary targets of methylation in maize. The use of restrictive libraries in genome shotgun sequencing in plant genomes should allow significant representation of genes, reducing the number of reactions required.