RNA editing


RNA editing is a molecular process through which some cells can make discrete changes to specific nucleotide sequences within an RNA molecule after it has been generated by RNA polymerase. It occurs in all living organisms, and is one of the most evolutionarily conserved properties of RNAs. RNA editing may include the insertion, deletion, and base substitution of nucleotides within the RNA molecule. RNA editing is relatively rare, and common forms of RNA processing are not usually considered editing. It can affect the activity, localization, and stability of RNAs, and has been linked with human diseases.
RNA editing has been observed in some tRNA, rRNA, mRNA, or miRNA molecules of eukaryotes and their viruses, archaea, and prokaryotes. RNA editing occurs in the cell nucleus and cytosol, as well as within mitochondria and plastids. In vertebrates, editing is rare and usually consists of a small number of changes to the sequence of the affected molecules. In other organisms, extensive editing can occur; in some cases the majority of nucleotides in an mRNA sequence may result from editing. More than 160 types of RNA modifications have been described so far.
RNA-editing processes show great molecular diversity, and some appear to be evolutionarily recent acquisitions that arose independently. The diversity of RNA editing phenomena includes nucleobase modifications such as cytidine to uridine and adenosine to inosine deaminations, as well as non-template nucleotide additions and insertions. RNA editing in mRNAs effectively alters the amino acid sequence of the encoded protein so that it differs from that predicted by the genomic DNA sequence.

Detection of RNA editing

Next generation sequencing

Many recent studies have developed conventional or specialised next generation RNA sequencing methods to identify diverse post-transcriptional modifications of RNA molecules and to determine the transcriptome-wide landscape of RNA modifications. Examples of specialised methods are MeRIP-seq, m6A-seq, methylation-iCLIP, m6A-CLIP, Pseudo-seq, Ψ-seq, CeU-seq, Aza-IP and RiboMeth-seq. Application of these methods has identified various modifications within coding and non-coding genes at single-nucleotide or very high resolution.

Mass Spectrometry

Mass spectrometry is a way to qualitatively detect and quantify RNA modifications. More often than not, modifications cause an increase in mass for a given nucleoside. This gives a characteristic readout for the nucleoside and the modified counterpart. Moreover, mass spectrometry allows the investigation of modification dynamics by labeling RNA molecules with stable heavy isotopes in vivo. Because heavy-isotope-labeled nucleosides show a defined mass increase, they can be distinguished from their respective unlabeled isotopomers by mass spectrometry. This method, called NAIL-MS (nucleic acid isotope labeling coupled mass spectrometry), enables a variety of approaches to investigate RNA modification dynamics.
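The mass shifts described above can be worked through with simple arithmetic. The following is a minimal sketch, assuming standard monoisotopic atomic masses and, purely for illustration, complete 15N labeling of adenosine; the choice of m6A as the example modification and of 15N as the heavy isotope are assumptions made here, not details of any particular NAIL-MS protocol.

```python
# Monoisotopic masses (in daltons) of the most abundant isotopes.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
N15_MASS = 15.0001089  # heavy nitrogen isotope (assumed label for this example)


def monoisotopic_mass(formula):
    """Sum the isotope masses for a formula given as {element: count}."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


adenosine = {"C": 10, "H": 13, "N": 5, "O": 4}
m6a = {"C": 11, "H": 15, "N": 5, "O": 4}  # adenosine plus one methyl group (net +CH2)

# Mass gained through methylation of adenosine to m6A.
methyl_shift = monoisotopic_mass(m6a) - monoisotopic_mass(adenosine)

# Mass gained when all five nitrogens of adenosine are 15N instead of 14N.
n15_shift = adenosine["N"] * (N15_MASS - MONOISOTOPIC["N"])

print(f"adenosine: {monoisotopic_mass(adenosine):.4f} Da")
print(f"m6A shift: +{methyl_shift:.4f} Da")
print(f"15N-labeled adenosine shift: +{n15_shift:.4f} Da")
```

The two shifts differ by several daltons, which is why a modification and an isotope label can be told apart in the same spectrum. Modifications that are isomers of the parent nucleoside, such as pseudouridine, produce no mass shift and fall outside this simple calculation.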

Types of RNA

Messenger RNA modification

Functional experiments have recently revealed many novel roles of RNA modifications. For example, m6A has been predicted to affect protein translation and localization, mRNA stability, alternative polyA choice and stem cell pluripotency. Pseudouridylation of nonsense codons suppresses translation termination both in vitro and in vivo, suggesting that RNA modification may provide a new way to expand the genetic code. Importantly, many modification enzymes are dysregulated or genetically mutated in many disease types. For example, genetic mutations in pseudouridine synthases cause mitochondrial myopathy, sideroblastic anemia and dyskeratosis congenita.

Transfer RNA modifications

Transfer RNA (tRNA) is the most abundantly modified type of RNA. Modifications in tRNA play crucial roles in maintaining translation efficiency through supporting structure, anticodon-codon interactions, and interactions with enzymes.
Because the genetic code is degenerate, anticodon modifications are necessary to properly decode mRNA. In particular, the wobble position of the anticodon determines how the codons are read. For example, in eukaryotes an adenosine at position 34 of the anticodon can be converted to inosine, a modified nucleoside that is able to base-pair with cytidine, adenosine, and uridine.
Another commonly modified position in tRNA is the one adjacent to the anticodon. Position 37 is often hypermodified with bulky chemical modifications. These modifications prevent frameshifting and increase anticodon-codon binding stability through stacking interactions.
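The effect of inosine at the wobble position can be illustrated with a small sketch. It assumes a deliberately simplified pairing model: Watson-Crick pairs at anticodon positions 35 and 36, and at position 34 either a Watson-Crick pair or the three inosine pairings named above. Other wobble pairs, such as G-U, are left out, and the function name `codons_read` is made up for this example.

```python
# Watson-Crick partners for the four standard RNA bases.
WATSON_CRICK = {"A": "U", "U": "A", "G": "C", "C": "G"}

# Partners allowed at the wobble position (34): Watson-Crick pairs plus
# inosine, which pairs with C, A and U. Simplified model, see lead-in.
POSITION_34 = dict(WATSON_CRICK, I="CAU")


def codons_read(anticodon):
    """Return the codons (5'->3') readable by an anticodon written 5'->3'.

    Anticodon positions 34, 35 and 36 pair with codon positions 3, 2 and 1.
    Positions 35 and 36 are assumed to hold standard bases.
    """
    pos34, pos35, pos36 = anticodon
    first, second = WATSON_CRICK[pos36], WATSON_CRICK[pos35]
    return [first + second + third for third in POSITION_34[pos34]]


print(codons_read("AGC"))  # unmodified adenosine at position 34
print(codons_read("IGC"))  # inosine at position 34
```

With adenosine at position 34 the anticodon reads a single codon, whereas with inosine the same tRNA reads three codons that differ only in their third base.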

Ribosomal RNA modification

Ribosomal RNA modifications are made throughout ribosome synthesis. The modifications primarily affect the structure of the rRNA and thereby protect translational efficiency.

Types of changes

Editing by insertion or deletion

RNA editing through the addition and deletion of uracil has been found in kinetoplasts from the mitochondria of Trypanosoma brucei.
Because this may involve a large fraction of the sites in a gene, it is sometimes called "pan-editing" to distinguish it from topical editing of one or a few sites.
Pan-editing starts with the base-pairing of the unedited primary transcript with a guide RNA, which contains complementary sequences to the regions around the insertion/deletion points. The newly formed double-stranded region is then enveloped by an editosome, a large multi-protein complex that catalyzes the editing. The editosome opens the transcript at the first mismatched nucleotide and starts inserting uridines. The inserted uridines will base-pair with the guide RNA, and insertion will continue as long as A or G is present in the guide RNA and will stop when a C or U is encountered. The inserted nucleotides cause a frameshift, and result in a translated protein that differs from its gene.
The mechanism of the editosome involves an endonucleolytic cut at the mismatch point between the guide RNA and the unedited transcript. The next step is catalyzed by one of the enzymes in the complex, a terminal U-transferase, which adds Us from UTP at the 3' end of the mRNA. The opened ends are held in place by other proteins in the complex. Another enzyme, a U-specific exoribonuclease, removes the unpaired Us. After editing has made the mRNA complementary to the gRNA, an RNA ligase rejoins the ends of the edited mRNA transcript. As a consequence, the editosome can edit only in a 3' to 5' direction along the primary RNA transcript. The complex can act on only a single guide RNA at a time. Therefore, an RNA transcript requiring extensive editing will need more than one guide RNA and editosome complex.
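Because this form of editing only inserts or deletes uridines, an edited transcript must match its unedited precursor once every U is ignored. The sketch below checks that constraint and reports where Us were gained or lost. It models only the outcome, not the guide RNA pairing or the enzymatic steps of the editosome, and the two short sequences and all function names are invented for illustration.

```python
import re


def strip_u(rna):
    """Remove every U, leaving the bases that U-indel editing cannot change."""
    return rna.replace("U", "")


def u_runs(rna):
    """Lengths of the U runs before, between and after the non-U bases."""
    return [len(run) for run in re.split("[ACG]", rna)]


def u_changes(unedited, edited):
    """Net Us inserted (+) or deleted (-) in each slot between non-U bases."""
    if strip_u(unedited) != strip_u(edited):
        raise ValueError("sequences differ by more than U insertions/deletions")
    return [after - before for before, after in zip(u_runs(unedited), u_runs(edited))]


# Toy sequences, not taken from any real transcript.
unedited = "AGAUUCG"
edited = "AUGUAUCG"

changes = u_changes(unedited, edited)
net = sum(changes)
print(changes)
print("frameshift" if net % 3 else "reading frame preserved")
```

In this toy case two Us are inserted and one is deleted, so the transcript grows by one nucleotide. A net change that is not a multiple of three shifts the downstream reading frame, in line with the frameshift described above.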

Editing by deamination

C-to-U editing

The editing involves cytidine deaminase that deaminates a cytidine base into a uridine base. An example of C-to-U editing is with the apolipoprotein B gene in humans. Apo B100 is expressed in the liver and apo B48 is expressed in the intestines. In the intestines, the mRNA has a CAA sequence edited to be UAA, a stop codon, thus producing the shorter B48 form.
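The consequence of the CAA-to-UAA edit can be shown by translating a transcript before and after editing. This is a minimal sketch using the standard genetic code; the 15-nucleotide sequence is a toy example, not the real apolipoprotein B mRNA, and the helper names are made up.

```python
from itertools import product

# Standard genetic code, codons ordered with bases U, C, A, G ("*" marks stop).
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        amino_acid = CODON_TABLE[mrna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def edit_c_to_u(mrna, position):
    """Deaminate the cytidine at a 0-based position to uridine."""
    if mrna[position] != "C":
        raise ValueError("C-to-U editing requires a cytidine at the site")
    return mrna[:position] + "U" + mrna[position + 1:]


# Toy transcript: AUG GAU CAA ACU UGA
unedited = "AUGGAUCAAACUUGA"
edited = edit_c_to_u(unedited, 6)  # CAA -> UAA

print(translate(unedited))  # full-length toy protein
print(translate(edited))    # truncated at the newly created stop codon
```

A single edited base shortens the product, which is the same logic by which the intestinal transcript yields the shorter B48 form.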
C-to-U editing often occurs in the mitochondrial RNA of flowering plants. Different plants have different degrees of C-to-U editing; for example, eight editing events occur in mitochondria of the moss Funaria hygrometrica, whereas over 1,700 editing events occur in the lycophyte Isoetes engelmannii. C-to-U editing is performed by members of the pentatricopeptide repeat (PPR) protein family. Angiosperms have large PPR families, acting as trans-factors for cis-elements lacking a consensus sequence; Arabidopsis has around 450 members in its PPR family. There have been a number of discoveries of PPR proteins in both plastids and mitochondria.

A-to-I editing

Adenosine-to-inosine modifications contribute to nearly 90% of all editing events in RNA. The deamination of adenosine is catalyzed by the double-stranded RNA-specific adenosine deaminase, which typically acts on pre-mRNAs. The deamination of adenosine to inosine disrupts and destabilizes the dsRNA base pairing, therefore rendering that particular dsRNA less able to produce siRNA, which interferes with the RNAi pathway.
The wobble base pairing causes deaminated RNA to adopt a distinct structure, which may be related to the inhibition of the initiation step of RNA translation. Studies have shown that I-RNA recruits methylases that are involved in the formation of heterochromatin and that this chemical modification heavily interferes with miRNA target sites. There is active research into the importance of A-to-I modifications and their purpose in the novel concept of epitranscriptomics, in which modifications are made to RNA that alter its function. A long-established consequence of A-to-I editing in mRNA is the interpretation of I as G, leading to a functional A-to-G substitution, e.g. in the interpretation of the genetic code by ribosomes. Newer studies, however, have weakened this correlation by showing that Is can also be decoded by the ribosome as As and Us. Furthermore, it was shown that Is lead to the stalling of ribosomes on I-rich mRNA.
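The functional A-to-G substitution can be sketched by decoding codons with inosine treated as guanosine. The example assumes only this canonical reading and ignores the alternative decoding as A or U mentioned above; the three codons were picked for illustration and the function names are made up.

```python
from itertools import product

# Standard genetic code, codons ordered with bases U, C, A, G ("*" marks stop).
BASES = "UCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def edit_a_to_i(codon, position):
    """Deaminate the adenosine at a 0-based position to inosine."""
    if codon[position] != "A":
        raise ValueError("only adenosine can be deaminated to inosine")
    return codon[:position] + "I" + codon[position + 1:]


def read_codon(codon):
    """Decode one codon, treating inosine (I) as guanosine (G)."""
    return CODON_TABLE[codon.replace("I", "G")]


for codon, position in [("CAG", 1), ("AUA", 2), ("CAA", 2)]:
    edited = edit_a_to_i(codon, position)
    print(codon, read_codon(codon), "->", edited, read_codon(edited))
```

The first two edits change the encoded amino acid, while the third is synonymous, so not every A-to-I event in a coding region alters the protein.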
The development of high-throughput sequencing in recent years has allowed for the development of extensive databases for different modifications and edits of RNA. RADAR was developed in 2013 to catalog the vast variety of A-to-I sites and tissue-specific levels present in humans, mice, and flies. The addition of novel sites and overall edits to the database is ongoing. The level of editing for specific editing sites, e.g. in the filamin A transcript, is tissue-specific. The efficiency of mRNA splicing is a major factor controlling the level of A-to-I RNA editing.
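An editing level for a single site is commonly expressed as the fraction of sequencing reads that carry the edited base. The sketch below assumes that inosine appears as G in sequencing reads, so the level is the G count divided by the combined A and G count. The tissue names and read counts are invented purely to show the calculation and are not taken from RADAR or any dataset.

```python
def editing_level(a_reads, g_reads):
    """Fraction of reads supporting the edited base (G) at one site."""
    total = a_reads + g_reads
    if total == 0:
        raise ValueError("site has no A or G coverage")
    return g_reads / total


# Invented (A reads, G reads) counts for one site in two hypothetical tissues.
hypothetical_counts = {"tissue_1": (80, 20), "tissue_2": (30, 70)}

levels = {
    tissue: editing_level(a_reads, g_reads)
    for tissue, (a_reads, g_reads) in hypothetical_counts.items()
}

for tissue, level in levels.items():
    print(f"{tissue}: {level:.0%} edited")
```

Comparing such fractions across samples is one simple way to express the tissue specificity described above; a real analysis would also need to account for sequencing errors and genomic variants.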

Alternative mRNA editing

Alternative U-to-C mRNA editing was first reported in WT1 transcripts, and non-classic G-to-A mRNA changes were first observed in HNRNPK transcripts in both malignant and normal colorectal samples. The latter changes were also later seen alongside non-classic U-to-C alterations in brain cell TPH2 transcripts. Although reverse amination might be the simplest explanation for U-to-C changes, transamination and transglycosylation mechanisms have been proposed for plant U-to-C editing events in mitochondrial transcripts. A recent study reported novel G-to-A mRNA changes in WT1 transcripts at two hotspots, proposing APOBEC3A as the enzyme implicated in this class of alternative mRNA editing. It was also shown that alternative mRNA changes were associated with canonical WT1 splicing variants, indicating their functional significance.

RNA editing in plant mitochondria and plastids

Previous studies have shown that the only types of RNA editing seen in plant mitochondria and plastids are C-to-U and U-to-C conversions. RNA-editing sites are found mainly in the coding regions of mRNA, introns, and other non-translated regions. In fact, RNA editing can restore the functionality of tRNA molecules. The editing sites are found primarily upstream of mitochondrial or plastid RNAs. While the specific positions for C-to-U RNA editing events have been fairly well studied in both the mitochondrion and plastid, the identity and organization of all proteins comprising the editosome have yet to be established. Members of the expansive PPR protein family have been shown to function as trans-acting factors for RNA sequence recognition. Specific members of the MORF family are also required for proper editing at several sites. As some of these MORF proteins have been shown to interact with members of the PPR family, it is possible that MORF proteins are components of the editosome complex. An enzyme responsible for the trans- or deamination of the RNA transcript remains elusive, though it has been proposed that the PPR proteins may serve this function as well.
RNA editing is essential for the normal functioning of the plant's translation and respiration activity. Editing can restore the essential base-pairing sequences of tRNAs, restoring functionality. It has also been linked to the production of RNA-edited proteins that are incorporated into the polypeptide complexes of the respiration pathway. Therefore, it is highly probable that polypeptides synthesized from unedited RNAs would not function properly and hinder the activity of both mitochondria and plastids.
C-to-U RNA editing can create start and stop codons, but it cannot destroy existing start and stop codons, because the start codon AUG and the stop codons UAA, UAG, and UGA contain no cytidine. A cryptic start codon is created when the codon ACG is edited to be AUG.
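This asymmetry can be checked by enumerating all 64 codons. The sketch below assumes the standard start codon AUG and the three standard stop codons, considers only edits of a single cytidine per codon, and uses made-up variable names.

```python
from itertools import product

CODONS = ["".join(codon) for codon in product("UCAG", repeat=3)]
START = {"AUG"}
STOPS = {"UAA", "UAG", "UGA"}


def single_c_to_u_edits(codon):
    """Yield every codon reachable by editing exactly one C to U."""
    for i, base in enumerate(codon):
        if base == "C":
            yield codon[:i] + "U" + codon[i + 1:]


# Codons that become a start or stop codon after one C-to-U edit.
creates_start = sorted(
    c for c in CODONS if any(e in START for e in single_c_to_u_edits(c))
)
creates_stop = sorted(
    c for c in CODONS if any(e in STOPS for e in single_c_to_u_edits(c))
)

# Start or stop codons that contain a C and could therefore be edited away.
destroyable = sorted(c for c in START | STOPS if list(single_c_to_u_edits(c)))

print("creates start:", creates_start)
print("creates stop:", creates_stop)
print("destroyable start/stop codons:", destroyable)
```

The enumeration recovers ACG as the only precursor of a start codon and finds no start or stop codon that a C-to-U edit could remove.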

RNA editing in viruses

RNA editing in viruses is used for stability and generation of protein variants. Viral RNAs are transcribed by a virus-encoded RNA-dependent RNA polymerase, which is prone to pausing and "stuttering" at certain nucleotide combinations. In addition, up to several hundred non-templated As are added by the polymerase at the 3' end of nascent mRNA. These As help stabilize the mRNA. Furthermore, the pausing and stuttering of the RNA polymerase allows the incorporation of one or two Gs or As upstream of the translational codon. The addition of the non-templated nucleotides shifts the reading frame, which generates a different protein.
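The frame shift caused by non-templated nucleotides can be sketched by splitting a transcript into codons before and after the insertion. The sequence and insertion site are invented for illustration and do not correspond to any viral gene; the helper names are likewise made up.

```python
def codons(mrna):
    """Split an mRNA into complete codons, reading from the first base."""
    return [mrna[i:i + 3] for i in range(0, len(mrna) - 2, 3)]


def insert_nontemplated(mrna, site, bases):
    """Insert extra bases before the 0-based position `site`."""
    return mrna[:site] + bases + mrna[site:]


# Toy transcript: AUG AAA GGG CCC UUU
template = "AUGAAAGGGCCCUUU"
site = 6  # start of the G run in this toy example

for extra in ["", "G", "GG", "GGG"]:
    product_rna = insert_nontemplated(template, site, extra)
    print(f"+{len(extra)} G: {' '.join(codons(product_rna))}")
```

Adding one or two Gs changes every codon downstream of the insertion, whereas adding three would keep the original frame and only add one codon.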

Origin and evolution of RNA editing

The RNA-editing system seen in animals may have evolved from mononucleotide deaminases, which have led to larger gene families that include the apobec-1 and adar genes. These genes share close identity with the bacterial deaminases involved in nucleotide metabolism. The adenosine deaminase of E. coli cannot deaminate a nucleoside in RNA; the enzyme's reaction pocket is too small for the RNA strand to bind to. However, this active site is widened by amino acid changes in the corresponding human analog genes, APOBEC1 and ADAR, allowing deamination.
The gRNA-mediated pan-editing in trypanosome mitochondria, involving templated insertion of U residues, is an entirely different biochemical reaction. The enzymes involved have been shown in other studies to be recruited and adapted from different sources. But the specificity of nucleotide insertion via the interaction between the gRNA and mRNA is similar to the tRNA editing processes in animal and Acanthamoeba mitochondria. Eukaryotic ribose methylation of rRNAs by guide RNA molecules is a similar form of modification.
Thus, RNA editing evolved more than once. Several adaptive rationales for editing have been suggested. Editing is often described as a mechanism of correction or repair to compensate for defects in gene sequences. However, in the case of gRNA-mediated editing, this explanation does not seem possible because if a defect happens first, there is no way to generate an error-free gRNA-encoding region, which presumably arises by duplication of the original gene region. This thinking leads to an evolutionary proposal called "constructive neutral evolution" in which the order of steps is reversed, with the gratuitous capacity for editing preceding the "defect".

RNA editing may be involved in RNA degradation

A study looked at the involvement of RNA editing in RNA degradation. The researchers specifically looked at the interaction between ADAR and UPF1, an enzyme involved in the nonsense-mediated mRNA decay pathway. They found that ADAR and UPF1 are both present within the supraspliceosome, where they form a complex that leads to the down-regulation of specific genes. The exact mechanism and the pathways in which the two proteins are involved are unknown at this time; the research has shown only that they form a complex and down-regulate specific genes.