Cre recombinase
Cre recombinase is a tyrosine recombinase enzyme derived from the P1 bacteriophage. The enzyme uses a topoisomerase I-like mechanism to carry out site-specific recombination events. It is a member of the integrase family of site-specific recombinases and catalyses site-specific recombination between two DNA recognition sites (loxP sites). Each 34 base pair (bp) loxP recognition site consists of two 13 bp palindromic sequences that flank an 8 bp spacer region. The products of Cre-mediated recombination at loxP sites depend on the location and relative orientation of the loxP sites. Two separate DNA species that both contain loxP sites can undergo fusion as a result of Cre-mediated recombination. DNA sequences found between two loxP sites are said to be "floxed". In this case the products of Cre-mediated recombination depend on the orientation of the loxP sites: DNA between two loxP sites oriented in the same direction is excised as a circular loop of DNA, whilst DNA between two loxP sites in opposite orientations is inverted. The enzyme requires no additional cofactors or accessory proteins for its function.
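The dependence of the recombination product on loxP orientation can be illustrated with a small string-level sketch. It is a toy model only, not a description of the enzyme mechanism. The loxP sequence used is the commonly cited wild-type sequence, which is not given in the text above, and the function names (revcomp, find_loxp_sites, recombine) are hypothetical names chosen for this illustration. The sketch assumes a linear DNA molecule carrying exactly two loxP sites.

```python
# Toy model of Cre-mediated recombination on a linear DNA string.
# Assumption: commonly cited wild-type loxP sequence
# (13 bp repeat + 8 bp asymmetric spacer + 13 bp inverted repeat).
LOXP = "ATAACTTCGTATA" + "GCATACAT" + "TATACGAAGTTAT"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_loxp_sites(dna):
    """Return (start, orientation) for every loxP site; +1 forward, -1 reverse."""
    n = len(LOXP)
    reverse_site = revcomp(LOXP)
    sites = []
    for i in range(len(dna) - n + 1):
        window = dna[i:i + n]
        if window == LOXP:
            sites.append((i, +1))
        elif window == reverse_site:
            sites.append((i, -1))
    return sites


def recombine(dna):
    """Return (outcome, linear_product, circular_product_or_None)."""
    sites = find_loxp_sites(dna)
    if len(sites) != 2:
        raise ValueError("this sketch expects exactly two loxP sites")
    (s1, o1), (s2, o2) = sites
    n = len(LOXP)
    if o1 == o2:
        # Same orientation: the floxed DNA leaves as a circle carrying one
        # loxP site, and one loxP site stays behind on the linear product.
        linear = dna[:s1 + n] + dna[s2 + n:]
        circle = dna[s1 + n:s2 + n]
        return "excision", linear, circle
    # Opposite orientation: the intervening DNA is inverted in place.
    inner = dna[s1 + n:s2]
    return "inversion", dna[:s1 + n] + revcomp(inner) + dna[s2:], None
```

For example, recombine("GGGG" + LOXP + "ACGTTT" + LOXP + "CCCC") reports an excision, while replacing the second site with revcomp(LOXP) reports an inversion of the ACGTTT segment.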
The enzyme plays important roles in the life cycle of the P1 bacteriophage such as cyclization of the linear genome and resolution of dimeric chromosomes that form after DNA replication.
Cre recombinase is a widely used tool in the field of molecular biology. The enzyme's unique and specific recombination system is exploited to manipulate genes and chromosomes in many areas of research, such as gene knockout or knock-in studies. The enzyme's ability to operate efficiently in a wide range of cellular environments enables the Cre-Lox recombination system to be used in a large number of organisms, making it a particularly useful tool in scientific research.
Discovery
Studies carried out in 1981 by Sternberg and Hamilton demonstrated that the bacteriophage P1 had a unique site-specific recombination system. EcoRI fragments of the P1 bacteriophage genome were generated and cloned into lambda vectors. A 6.5 kb EcoRI fragment was found to permit efficient recombination events. The mechanism of these recombination events was known to be unique as they occurred in the absence of bacterial RecA and RecBCD proteins. The components of this recombination system were elucidated using deletion mutagenesis studies. These studies showed that a P1 gene product and a recombination site were both required for efficient recombination events to occur. The P1 gene product was named Cre and the recombination site was named loxP. The Cre protein was purified in 1983 and was found to be a 35,000 Da protein. The recombinase activity of the purified protein requires neither high-energy cofactors such as ATP nor accessory proteins. Early studies also demonstrated that Cre binds to non-specific DNA sequences whilst having a 20-fold higher affinity for loxP sequences. Results of early DNA footprinting studies also suggested that Cre molecules bind loxP sites as dimers.
Tyrosine recombinase family members:
S. cerevisiae Flp recombinase
Bacterial XerC recombinase
Bacterial XerD recombinase
λ integrase protein
HP1 integrase protein
Structure
Cre recombinase consists of 343 amino acids that form two distinct domains. The amino terminal domain encompasses residues 20–129 and contains five alpha helical segments linked by a series of short loops. Helices A and E are involved in the formation of the recombinase tetramer, with the C terminus region of helix E known to form contacts with the C terminal domain of adjacent subunits. Helices B and D form direct contacts with the major groove of the loxP DNA. These two helices are thought to make three direct contacts to DNA bases at the loxP site. The carboxy terminal domain of the enzyme consists of amino acids 132–341 and harbours the active site of the enzyme. The overall structure of this domain closely resembles the catalytic domain of other enzymes of the same family, such as λ integrase and HP1 integrase. This domain is predominantly helical in structure, with nine distinct helices. The terminal helix protrudes from the main body of the carboxy domain and is thought to play a role in mediating interactions with other subunits. Crystal structures demonstrate that this terminal N helix buries its hydrophobic surface into an acceptor pocket of an adjacent Cre subunit. The effect of the two-domain structure is to form a C-shaped clamp that grasps the DNA from opposite sides.
Active site
The active site of the Cre enzyme consists of the conserved catalytic triad residues Arg 173, His 289 and Arg 292, together with the conserved nucleophile Tyr 324 and Trp 315. Unlike some recombinase enzymes such as Flp recombinase, Cre does not form a shared active site between separate subunits; all the residues that contribute to the active site are found on a single subunit. Consequently, when two Cre molecules bind at a single loxP site, two active sites are present. Cre-mediated recombination requires the formation of a synapse in which two Cre-loxP complexes associate to form what is known as the synapse tetramer, in which four distinct active sites are present. Tyr 324 acts as a nucleophile to form a covalent 3’-phosphotyrosine linkage to the DNA substrate. The scissile phosphate is coordinated by the side chains of the three amino acid residues of the catalytic triad. The indole nitrogen of Trp 315 also forms a hydrogen bond to this scissile phosphate. The nucleophilic attack by Tyr 324 cleaves the DNA and frees a 5’ hydroxyl group. This process occurs in the active sites of two of the four recombinase subunits present in the synapse tetramer. If the 5’ hydroxyl groups attack the 3’-phosphotyrosine linkage, one pair of the DNA strands will exchange to form a Holliday junction intermediate.
Applications
Role in bacteriophage P1
Cre recombinase plays important roles in the life cycle of the P1 bacteriophage. Upon infection of a cell, the Cre-loxP system is used to circularize the P1 DNA. Cre is also used to resolve the dimeric lysogenic P1 DNA that forms after replication of the phage DNA.
Use in research
The simplicity and robustness of the Cre-loxP system has enabled scientists to exploit the Cre enzyme in order to manipulate DNA both in vivo and in vitro. See Cre-Lox Recombination for more details. The Cre enzyme can be expressed in many different organisms, such as plants, bacteria, mammals and yeast. In 1992 Cre was expressed and found to be functional in a mouse host. Promoter regions can be manipulated to allow precise temporal control of Cre enzyme expression. As the enzyme has a specific 34 bp DNA substrate, the genome of the organism would have to be 10^18 bp in length for there to be a likely occurrence of a loxP site. As mammalian genomes are on average in the region of 3 × 10^9 bp, there is a very low chance of finding an endogenous loxP site. For Cre to be functional in a foreign host, exogenous loxP sites must be engineered. This allows precise control over the activity of the Cre enzyme in test organisms.
Independently, Joe Z. Tsien pioneered the use of the Cre-loxP system in neuroscience research to achieve cell type- and region-specific gene manipulation in the adult brain, where hundreds of distinct neuron types may exist and nearly all neurons are in a post-mitotic state. Tsien and his colleagues demonstrated that Cre-mediated recombination can occur in the post-mitotic pyramidal neurons of the adult mouse forebrain. The clear demonstration of its usefulness in precisely defining the complex relationship between specific cells/circuits and behaviors for brain research prompted the NIH to initiate the NIH Blueprint for Neuroscience Research Cre-driver mouse projects in early 2000. To date, NIH Blueprint for Neuroscience Research Cre projects have created several hundred Cre driver mouse lines, which are currently used by the worldwide neuroscience community.
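The argument in this section that a mammalian genome is very unlikely to contain a loxP site by chance can be checked with a rough back-of-the-envelope calculation. The sketch below assumes a uniformly random genome in which each of the four bases is equally likely and independent, which real genomes are not, and it requires an exact match to all 34 positions. The function name and its parameters are hypothetical. Because the result depends on these assumptions, it is an order-of-magnitude illustration and is not intended to reproduce the genome length quoted in the text.

```python
def expected_chance_sites(genome_bp, site_bp=34, both_strands=True):
    """Expected number of exact chance matches to one fixed site of site_bp bases.

    Assumes independent, equally likely bases (a simplification).
    """
    # Probability that one position matches a specific sequence of site_bp bases.
    p_match = 0.25 ** site_bp
    # Number of start positions at which a full-length site fits.
    positions = max(genome_bp - site_bp + 1, 0)
    # An asymmetric site can occur on either strand.
    strands = 2 if both_strands else 1
    return positions * p_match * strands


# Illustration for a genome of about 3 × 10^9 bp.
mammalian_estimate = expected_chance_sites(3_000_000_000)
```

Under these assumptions the expected number of chance sites in a genome of this size is far below one, which is consistent with the need to engineer exogenous loxP sites.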