Max Planck Institute of Multidisciplinary Sciences
The production of proteins in the cells of higher organisms is a complex process involving many steps. First, the genetic information for a protein is re written from DNA into a working copy, the precursor messenger RNA (pre mRNA). However, pre-mRNAs contain regions that do not contain information used for the production of proteins (the so-called “introns”). These regions must be precisely cut out and the remaining regions, which contain usable information (the “exons”), linked together. This maturation process is termed "pre-mRNA splicing”. Only mature mRNAs, that are transported from the cell nucleus into the cytoplasm, can be used by the ribosome as a template for the production of proteins.
The presence of exons and introns is a great advantage for an organism, as different combinations of exons from a given pre-mRNA species can be chosen to be included in the mature mRNA product. In this way, mRNAs corresponding to many different proteins can be made from a single gene. This so-called alternative splicing represents an additional level at which gene expression can be regulated, and leads to an enormous increase in the genetic capacity of higher eucaryotes. This explains why humans manage with only just over 20,000 protein-encoding genes in their genomes. Understanding splicing at the molecular level is of great medical relevance, as aberrant pre-mRNA splicing is the basis or a severity modifier of a plethora of human diseases.
The pre-mRNA splicing reaction takes place in two steps. Both involve phosphoester-transfer reactions, and both are carried out by a macromolecular machine, the spliceosome. Spliceosomes consist of well over 100 proteins and five small RNA molecules (the snRNAs U1, U2, U4, U5 and U6) and thus consist largely of protein. Many of the spliceosome's components are organised into smaller, stable sub-complexes. For example, about 50 of the spliceosomal proteins are stably bound to the snRNAs, forming RNA–protein particles (termed small nuclear ribonucleoproteins or snRNPs) which include the U1 and U2 snRNPs and the U4/U6.U5 tri-snRNP.
Spliceosomes do not exist in the cell nucleus as complete, pre-formed complexes. Rather, a new spliceosome is built up from its components around each intron that requires excision (Figure 1). First, the U1 and U2 snRNPs recognize and bind the 5'ss and of the pre-mRNA. The resulting complex is termed the A complex. Subsequent binding of the U4/U6.U5 tri snRNP leads to the formation of the so-called B complex. However, this multi-megadalton complex still has no catalytically active site. The subsequent catalytic activation of the spliceosome involves dramatic structural rearrangements that lead to changes in the conformations of its snRNAs and also its biochemical composition. During this process, a complex network of RNA–RNA interactions is formed between the pre mRNA and the snRNAs U2, U5 and U6. This network forms the heart of the spliceosome's catalytic center (Figure 2). The catalytically activated spliceosome is now ready to perform the first step of the splicing reaction. The product of this first step is the C complex, which then catalyses the second step. After this, the excised intron and remaining snRNPs are separated from the mature (spliced) mRNA, and the snRNPs are actively released to take part in a new round of splicing.
Both the snRNAs and the spliceosomal proteins are essential for the function of the spliceosome. They are involved in the recognition of the pre-mRNA's splice sites and in the formation of the spliceosome's catalytic center. Furthermore, a number of energy-requiring enzymes – the so-called RNA helicases – play decisive roles in the stepwise structural rearrangements of the spliceosome (Figure 1).
The primary goal of our research is to understand the structure and the function of the splicing machinery. One main question that we wish to address is how the structural rearrangements of the spliceosome during its work cycle are directed and regulated. Another is what is the nature of the catalytic center of the spliceosome – for example, does it consist only of RNA components (like a ribozyme), or do RNA and protein both contribute to catalysis (as in an RNP enzyme)? To answer these questions we are using an integrated experimental approach that involves a broad palette of methods. We are using biochemical and molecular-genetic methods to study the functions of the proteins and snRNA molecules in splicing, mainly by focussing on the spliceosomes of human cells and those of baker's yeast. At the same time we are using electron cryomicroscopy, X-ray crystallography, mass spectrometry, and fluorescence spectroscopy to investigate the spatial organization and the structural dynamics of isolated spliceosomes.
Structure and function of spliceosomes Eukaryotic pre-mRNAs contain non-coding regions (introns) which need to be removed before the mRNA can be used for the synthesis of proteins. This so-called splicing process is catalysed in the cell's nucleus by the spliceosome, a highly complex and dynamic molecular machine. It is composed of numerous protein and RNA components and it is assembled anew on each intron to be removed from an RNA transcript. Using approaches from biochemistry, molecular biology, genetics and structural biology, we study the complex catalytic work cycle of the spliceosome to understand its structure and function. (in German) more