I’ve been guilty of teaching bean-bag genetics this semester. Bean-bag genetics treats individuals as a bag of irrelevant shape containing a collection of alleles (the “beans”) that are sorted and disseminated by the rules of Mendel, and at its worst, assigns one trait to one allele; it’s highly unrealistic. In my defense, it was necessary — first-year students struggle enough with the basic logic of elementary transmission genetics without adding great complications — and of course, in some contexts, such as population genetics, it is a usefulsimplification. It’s just anathema to anyone more interested in the physiological and developmental side of genetics.

The heart of the problem is that it ignores the issue of translating genotype into phenotype. If you’ve ever had a basic genetics course, it’s quite common to have been taught only one concept about the phenotype problem: that an allele is either dominant, in which case it is expressed as the phenotype, or it’s recessive, in which case it is completely ignored unless it’s the only allele present. This idea is so 19th century — it’s an approximation made in the complete absence of any knowledge of the nature of genes.

And the “one gene, one trait” model violates everything we do know about the phenotype and genotype. Every gene is pleiotropic — it influences multiple traits to varying degrees. Every trait is multigenic — multiple genes contribute to the expression of every phenotypic detail. The bean-bag model is totally inadequate for describing the relationship of genes to physiology and morphology. Instead of a bean-bag, I prefer to think of the genome as comparable to a power spectrum, an expression of the organism in a completely different domain. But I wrote about that previously, and I’ll make this explanation a little simpler.

Here’s the problem: you can’t always reliably predict the phenotype from the genotype. We have a skewed perspective on the problem, because historically, genetics has first searched for strong phenotypes, and then gone looking for the genetic cause. We’ve been effectively blind to many subtle phenotypic effects, simply because we don’t know how to find them. When we go the other way, and start by mutating known genes and then looking for changes in the phenotype, we’re often surprised to discover no detectable change. One of the classic examples is the work of Elkins (1990), who found that mutating a neural cell adhesion gene, Fasciclin I, did not generate any gross defects. Mutating another gene, a signal transduction gene called Abelson tyrosine kinase, similarly had no visible effects. Mutating the two together, though — and this is a major clue to how these strange absences of effect could work — did produce gross and obvious effects on nervous system development.

How can this be? As any geneticist will tell you, the background — the other alleles present in the organism — are important in defining the pattern of expression of a specific gene of interest. One simple possibility is that the genome contains redundancy: that a trait such as adhesion of axons in the nervous system or the amount of hair on the head can be the product of multiple genes, each doing pretty much the same thing, so knocking out one doesn’t have a strong effect, because there is a backup present.

Genetic interactions provide a general model for incomplete penetrance. Representation of a negative (synergistic) genetic interaction between two genes A and B.

So Steve Pinker could have seen that he has a defective Gene A, which is important in regulating hair, but maybe there’s another Gene B lurking in the system that we haven’t characterized yet, and which can compensate for a missing Gene A, and he has a particularly strong form of it. One explanation for a variable association between an allele and the phenotype, then, is that we simply don’t have all the information about the multigenic cause of the phenotype, and there are other genes that can contribute.

This doesn’t explain all of the observed phenomena, however. Identical twins who share the same complement of alleles also exhibit variability in the phenotype; we also have isogenic animal lines, where every individual has the same genetic complement, and they also show variability in phenotype. This is the problem of penetrance; penetrance is a genetics term that refers to the likelihood that an individual carrying an allele will actually express the phenotype associated with that allele…and it’s not always 100%.

Again, the explanation lies in the other genes present in the organism. No gene functions all by itself; its expression is dependent on a cloud of other proteins — transcription factors, enhancers, chaperones — all of which modulate the gene of interest. We also have to deal with statistical variation in the degree of expression of all those modulatory factors, which vary by chance from cell to cell, and so the actual degree of activation of a gene may follow a kind of bell curve distribution. In the cartoon below, the little diamonds represent these partners; sometimes, just by chance, they’ll be present in sufficiently high numbers to boost Gene B’s output enough to fully compensate for a defective Gene A; in other cases, just by chance, they’re too low in concentration to adequately compensate for the absence.

Genetic interactions provide a general model for incomplete penetrance. A model for incomplete penetrance based on variation in the activity of genetic interaction partners.

What the above cartoon illustrates is the concept of developmental noise, the idea that the cumulative total of statistical variation in gene expression during development can produce significant phenotypic variation in the absence of any differences in the genotype. Developmental noise is a phrase bruited about quite a bit, and there’s good reason to think it’s valid: we can see quantitative variation in gene expression with molecular techniques, for instance. But at the same time we have other concepts, like redundancy and canalization, that work to buffer variation and produce reliable outputs from developmental processes, so we don’t have many good examples where we can directly correlate subtle variation at the molecular level with clear morphological differences.

To test that, we have to go to simple animal models (it turns out that Steve Pinker is a rather intractable experimental animal). And here we have a very nice example in the nematode worm, C. elegans. In these experiments, the investigators were dealing with an isogenic strain — the genetic background was identical in all of the animals — raised in a uniform environment. They were looking at a mutant in the gene tbf9, which causes defects in muscle formation, but only 50% penetrance; that is, half the time, the mutants appeared completely normal, and the other half of the time they had grossly abnormal muscle development.

Genetic interactions provide a general model for incomplete penetrance. Inactivation of the gene tbx-9 in C. elegans results in an incompletely penetrant defect, with approximately half of embryos hatching with abnormal morphology (small arrow).

See the big red question mark? That’s the big question: can we trace the abnormal phenotype all the way back to random fluctuations in the expression of other genes in the animal? Yes, they can, otherwise it would never have been published in Nature and I wouldn’t be writing about it now.

In this case, they have a situation analogous to the Gene A/Gene B cartoons above. Gene B is tbx-9; Gene B is a related gene, a duplicate called tbx-8 which acts as a redundant copy. In the experiments below, they knock out tbx-9 with a mutation, and then measure the quantity of other genes in the system using a very precise technique of quantitative fluorescence. Below, I’ve reproduced the entirety of their summary figure, because it is awesome — I just love the idea of being able to count the number of molecules expressed in a developing system. In order to avoid overwhelming everyone, though, I’ll just describe a couple of the panels to give you the gist of the work.

First, just look at the top left panel, a. It’s a plot of the level of expression of the tbx-8 gene over time, where each line in the plot is a different animal. The lines in black are in the wild type animal, with fully functional copies of bothe tbx-8 and tbx-9, and you should be able to see that there’s a fair amount of variation in expression, about two-fold, in different individuals. The lines in green are from animals mutant for tbx-9; it’s messy, but statistically what happens when tbx-9 is knocked out, more tbx-8 gene product is produced.

Panel e, just below it, shows the complementary experiment: the expression of tbx-9 is shown for both wild type (black) and animals with tbx-8 knocked out. Here, the difference is very clear: tbx-9 levels are greatly elevated in the absence of tbx-8. This shows that tbx-8 and tbx-9 are actually tied together in a regulatory relationship where levels of one rise in response to reduced levels of the other, and vice versa.

Now skip over to the right, to panel c. All of the lines in this plot are of tbx-8 expression in tbx-9 mutants, and again you see a wide variation in levels of gene expression. In addition, the lines are color-coded by whether the worm developed normally (blue), or had the mutant phenotype (red). The answer: worms with low tbx-8 levels were more likely to have the abnormal phenotype than those with high levels.

Panel g, just below it, is the complementary analysis of tbx-9 levels in tbx-8 mutants, and it gives the same answer.

Obviously, though, there is still a lot of variability unaccounted for; having relatively high levels of one or the other of the tbx genes didn’t automatically mean the worm developed a wild-type phenotype. There’s got to be something more that is varying. Look way back to the second cartoon I showed, with the little diamonds representing the cloud of transcription factors and chaperone proteins that modulate gene expression. Could there also be correlated variation there? And yes, there is. The authors looked at a chaperone protein called daf-21 that is associated with the tbx system, and found, in mutants for tbx-9, that elevated levels of daf-21 were associated with wildtype morphology (in blue), while lowered levels of daf-21 were associated with the mutant phenotype.

(Click for larger image)Expression of daf-21 reporter in a tbx-9(ok2473) mutant background. Embryos that hatch into phenotypically WT worms (blue) have higher expression than those hatching with a morphological defect (red) at the comma stage (P=1.9×10-3).

I know what you’re thinking: there isn’t a perfect correlation between high daf-21 levels and wildtype morphology either. But when they do double-label experiments, and take into account both daf-21 and tbx-8 levels in tbx-9 mutants, they found that 92% of the animals with greater than median levels of expression of bothdaf-21 and tbx-8 had wildtype morphology. It’s still not perfect, but it’s pretty darned good, and besides, it’s no surprise that there are probably other modulatory factors with statistical variation lurking in the system.

What should you learn from this? Developmental noise is real, and is a product of statistical variation in the degree of expression of multiple genetic components that contribute to a phenotype. We can measure that molecular variation in living, developing systems and correlate it phenotypic outcomes. None of this is surprising; we expect that the process of gene expression is going to be a bit noisy, especially in these transcriptional regulators that are present in low concentration in the cell, anyway. But the other cool thing we can observe here is that having multiple noisy systems that interact with each other can produce a more reliable, robust signal and contribute to the fidelity of developmental outcomes.