If you’ve ever put a pair of headphones in your pocket, you’ll know how difficult it is to keep a long cord in a bundle without getting it hopelessly tangled and knotted. You’ll also start to appreciate the monumental challenge that our cells face when packaging our DNA. At 2 metres in length, the human genome is longer than the average human. But in every one of our cells, the genome needs to fit inside the nucleus, a tiny compartment just 6 millionths of a metre long. How does it do it?

One of the secrets behind this monumental feat of folding has just been revealed by research that shows the human genome’s three-dimensional structure. A team of scientists led by Erez Lieberman-Aiden and Nynke van Berkum found that our genome folds into a shape called a “fractal globule”, where the long strands of DNA are densely packed but without a single knot. It’s an awe-inspiring feat of space-saving and keeps DNA accessible. When a particular gene is needed, the DNA it sits on can be easily unpacked.

Lieberman explains, “The best way to think about it is that it looks like a pack of ramen noodles when you just start cooking them: really dense, but totally unentangled, so you can pull out a noodle or a bunch of noodles without disrupting the rest.” Previously, scientists suggested that the genome folds into a more tangled structure called the “equilibrium globule”, which is more like ramen noodles post-cooking – a massive knotted mess from which single noodles are difficult to extract.

Until now, the fractal globule was a theoretical shape that existed only in the minds of mathematicians. This is the first time that it has been observed in reality. The shape was first described by a mathematician Guiseppe Peano in 1890 and in 1988, Alexander Grosberg proposed that a long molecule might spontaneously fold into such a shape under the right conditions. Still, it took till this week for anyone to observe a fractal globule in reality. “[Peano] had no idea that it described any actual object in the universe,” says Lieberman-Aiden, “but it turns out it describes the genome!”

Some of the other tricks that cells use to fold the genome are well documented. At the most basic level, DNA is wrapped around proteins called histones, like a series of beads on a string. These are then twisted around each other to form a wider filament, like the individual strands of a piece of rope. Beyond that, things become less clear but this new study shows what happens at these higher levels.

Imagine a series of beads on a string. You gather clumps of beads and crumple them together into a globule, carefully avoiding any knots or crossovers. Every row of, say, five beads gets crumpled into a globule, every row of five globules gets crumpled together, and so on and so forth. The final result is a single ball – a “globule-of-globules-of-globules”.

Lieberman-Aiden developed a technique called Hi-C that simultaneously analyses adjacent DNA across the entire genome, in order to reveal its 3-D shape. It relies on formaldehyde to immobilise pieces of DNA that sit next to each other, effectively freezing the genome and forming cross-links between adjacent strands. The DNA is then shredded and the cross-linked fragments are isolated, sequenced and mapped onto the reference copy of the human genome. The result is a library of all the DNA strands that were neighbours in the nucleus, which can be analysed to understand how the genome must be folded.

The technique confirmed that parts of the genome that would sit far apart if it was fully stretched out are actually very close to each other in space. Because of the complicated molecular origami that goes on inside the nucleus, around three quarters of the close-contact sequences identified by the Hi-C method are actually distant ones.

The research also confirmed that the nucleus is divided into two territories – an “ON” compartment where DNA is rich in genes, highly active and loosely packed, and an “OFF” compartment where DNA is gene-poor, largely inactive and densely packed for storage. Individual chromosomes snake in and out of these two compartments and when a given gene is activated, it moves from one to the other. It’s not clear what defines the boundaries between these two compartments, but Lieberman-Aiden suspects that these boundaries are very sharp.

As an example, Lieberman-Aiden use glow-in-the-dark molecules to tag four stretches of DNA called L1, L2, L3 and L4. They lie one after the other on chromosome 14, but in the nucleus, they pair up differently. L1 and L3 are typically found in the “ON” compartment and are always closer to each other than L2. Meanwhile, L2 and L4 are closer to each other than L3, and are usually found in OFF territory.

“A huge question in biology is how all the different cells in the body perform totally different functions when all of them have the same genome,” says Lieberman-Aiden. “This work suggests that the spatial arrangement of the genome in a particular nucleus is a big part of why different cells do different things.”

PPS: You may remember Erez from the irregular verbs paper that I recently reposted. Many thanks to Erez for the heads-up about the paper and the awesome ramen noodle analogy.

Comments (7)

Interesting! Question: The technique confirmed that parts of the genome that would sit far apart if it was fully stretched out are actually very close to each other in space.
Is this determined by the DNA itself somehow, or is it semi-random? In other words, if two sections are adjacent in one cell, are they adjacent in all cells of the same type? And in relatives? I’m just wondering, if this is the case, could that play an as-of-yet unrecognized role in genetics?

Christina — see the work of Wendy Bickmore and related groups, who have suggested that chromosomal translocations between completely different chromosomes happens more often between regions of these chromosomes that co-occur in 3-D space more frequently.

Christina, the compressed but knot-free structure of DNA strand can not be semi-random, as e.g. the Hamilton and Peano fractal curves arise as a result of an intrinsic recursive algorithm (see Wiki for as much detail as you care to immerse yourself). The beauty of this result is, that the long-suspected fractality of DNA (e.g. the Hamilton concept first advocated by Alexander Grosberg two decades ago in Moscow and later in New York University) will leave little doubt, just as you say, that fractal structure results in fractal function – e.g. as FractoGene conceived from fractal model of brain cell, assuming recursive iteration to DNA in 1989, unfolding in The Principle of Recursive Genome Function by 2008.

Hi Ed, the comment about the BBC article, it made me go to their site and have a look and it was as you said 😉
I put a feedback there with a reference to your blog (as I am no expert on this) and now it’s the right picture there…
I donno if it was the feedback or if they found it by some other means.. anyways.. thanks for such an intereseting blog.. I just happened to stumble upon it and LOVED it