Looping the loop to fold — and regulate — genes
Take a blank sheet of paper and fold it. If you are skilled in the ancient Japanese art of origami, that paper can become a crane, an insect or a warrior. The difference lies in the folds.
Each cell in the human body faces a similar problem. The genome inside each cell of the body is identical, but the body needs the cells to fulfill different tasks. An immune cell fights off infection; a cone cell helps the eye detect light; the heart’s myocytes beat endlessly.
Appearing online in the journal Cell, a report by researchers at Baylor College of Medicine, Rice University, the Broad Institute of MIT and Harvard, and Harvard University describes the results of a five-year effort to map, in unprecedented detail, how the 2-meter-long human genome folds inside the nucleus of a cell. Their results show that the cell, like a microscopic origamist, modulates its function by folding the genome into an almost limitless variety of shapes. A centerpiece of the new study is the first reliable catalog of loops spanning the entire human genome. For decades, scientists have examined the regions in the close vicinity of a gene to understand how it is regulated. But as the genome folds, sequences far from a gene loop back and come into contact with those nearby elements.
Looping has been a blind spot for modern biology
“For over a century, scientists have known that DNA forms loops inside of cells, and that knowing where the loops are is incredibly important,” said co-first author Suhas Rao, a researcher at the Center for Genome Architecture at Baylor. “But mapping the positions of all those loops was long thought to be an insurmountable challenge.”
The researchers showed that the 3 billion DNA letters of the human genome are partitioned into roughly 10,000 loops, a surprisingly small number. (Prior work on loops had suggested that the genome contains over a million.)
“In the early days of human genome sequencing, scientists believed that humans had hundreds of thousands of genes. The genome project revealed far fewer genes than everyone was expecting,” said Dr. Erez Lieberman Aiden, senior author of the study, director of the Center for Genome Architecture, and an assistant professor of molecular and human genetics at Baylor College of Medicine and in the departments of computer science and computational and applied mathematics at Rice University. “The fact that there are so few loops is a similar surprise.”
DNA loops essential
The team’s research showed that, although few in number, DNA loops play an essential role in nearly every process inside the cell. That’s because many loops have genes at one end. When the loop forms, the gene turns on.
“Folding drives function,” said co-first author Miriam Huntley, a Ph.D. student in the Harvard School of Engineering and Applied Sciences working with Aiden.
At the other end of these loops, far away from the genes that they regulate, lie hitherto unknown genetic switches buried deep in so-called junk DNA. “Our maps of looping revealed thousands of hidden switches that scientists didn’t know about before,” said Huntley. “In the case of genes that can cause cancer or other diseases, knowing where these switches are is vital.”
Rules for loops
The team also discovered a series of rules about how and where loops can form. “If DNA were a shoestring, you could make a loop anywhere. But within the cell, the formation of loops is highly constrained,” said Rao. “The loops we see almost all span fewer than 2 million genetic letters; they rarely overlap; and they are almost always associated with a single protein, called CTCF.” CTCF is known to be involved in the regulation of the 3D structure of chromatin, the building block of chromosomes.