Matches in SemOpenAlex for { <https://semopenalex.org/work/W2040372310> ?p ?o ?g. }
- W2040372310 endingPage "46831" @default.
- W2040372310 startingPage "46826" @default.
- W2040372310 abstract "A bacterial ribonucleotide reductase gene was found to encode four inteins and three group II introns in the oceanic N2-fixing cyanobacterium Trichodesmium erythraeum. The 13,650-bp ribonucleotide reductase gene is divided into eight extein- or exon-coding sequences that together encode a 768-amino acid mature ribonucleotide reductase protein, with 83% of the gene sequence encoding introns and inteins. The four inteins are encoded on the second half of the gene, and each has conserved sequence motifs for a protein-splicing domain and an endonuclease domain. These four inteins, together with known inteins, define five intein insertion sites in ribonucleotide reductase homologues. Two of the insertion sites are 10 amino acids apart and next to key catalytic residues of the enzyme. Protein-splicing activities of all four inteins were demonstrated in Escherichia coli. The four inteins coexist with three group II introns encoded on the first half of the same gene, which suggests a breakdown of the presumed barrier against intron insertion in this bacterial conserved protein-coding gene. A bacterial ribonucleotide reductase gene was found to encode four inteins and three group II introns in the oceanic N2-fixing cyanobacterium Trichodesmium erythraeum. The 13,650-bp ribonucleotide reductase gene is divided into eight extein- or exon-coding sequences that together encode a 768-amino acid mature ribonucleotide reductase protein, with 83% of the gene sequence encoding introns and inteins. The four inteins are encoded on the second half of the gene, and each has conserved sequence motifs for a protein-splicing domain and an endonuclease domain. These four inteins, together with known inteins, define five intein insertion sites in ribonucleotide reductase homologues. Two of the insertion sites are 10 amino acids apart and next to key catalytic residues of the enzyme. Protein-splicing activities of all four inteins were demonstrated in Escherichia coli. The four inteins coexist with three group II introns encoded on the first half of the same gene, which suggests a breakdown of the presumed barrier against intron insertion in this bacterial conserved protein-coding gene. Inteins are intervening protein sequences embedded in precursor proteins and catalyze a protein-splicing reaction that excises the intein and joins the flanking sequences (N- and C-exteins) with a peptide bond (1Perler F.B. Davis E.O. Dean G.E. Gimble F.S. Jack W.E. Neff N. Noren C.J. Thorner J. Belfort M. Nucleic Acids Res. 1994; 22: 1125-1127Crossref PubMed Scopus (315) Google Scholar, 2Xu M.Q. Perler F.B. EMBO J. 1996; 15: 5146-5153Crossref PubMed Scopus (251) Google Scholar, 3Perler F.B. Xu M.Q. Paulus H. Curr. Opin. Chem. Biol. 1997; 1: 292-299Crossref PubMed Scopus (138) Google Scholar, 4Paulus H. Annu. Rev. Biochem. 2000; 69: 447-496Crossref PubMed Scopus (380) Google Scholar). Inteins and intein-like sequences have been found in various host proteins in bacteria, archaea, and eukaryotes (5Perler F.B. Nucleic Acids Res. 2002; 30: 383-384Crossref PubMed Scopus (317) Google Scholar, 6Amitai G. Belenkiy O. Dassa B. Shainskaya A. Pietrokovski S. Mol. Microbiol. 2003; 47: 61-73Crossref PubMed Scopus (46) Google Scholar). A typical intein is ∼400 amino acid long and has two structural domains, with its N- and C-terminal parts forming the protein-splicing domain and its middle part forming the endonuclease domain (7Duan X. Gimble F.S. Quiocho F.A. Cell. 1997; 89: 555-564Abstract Full Text Full Text PDF PubMed Scopus (234) Google Scholar, 8Ichiyanagi K. Ishino Y. Ariyoshi M. Komori K. Morikawa K. J. Mol. Biol. 2000; 300: 889-901Crossref PubMed Scopus (97) Google Scholar). The protein-splicing function ensures self-removal of intein from the host protein, and the endonuclease function initiates an intein homing (gene conversion) process and promotes maintenance and spread of the intein coding sequence (9Cooper A.A. Stevens T.H. Trends Biochem. Sci. 1995; 20: 351-356Abstract Full Text PDF PubMed Scopus (103) Google Scholar, 10Gimble F.S. FEMS Microbiol. Lett. 2000; 185: 99-107Crossref PubMed Google Scholar, 11Belfort M. Roberts R.J. Nucleic Acids Res. 1997; 25: 3379-3388Crossref PubMed Scopus (395) Google Scholar). Inteins appear similar in overall structure and splicing mechanism, although they generally show low levels of sequence conservation (12Perler F.B. Cell. 1998; 92: 1-4Abstract Full Text Full Text PDF PubMed Scopus (169) Google Scholar, 13Gogarten J.P. Senejani A.G. Zhaxybayeva O. Olendzenski L. Hilario E. Annu. Rev. Microbiol. 2002; 56: 263-287Crossref PubMed Scopus (167) Google Scholar, 14Liu X.Q. Annu. Rev. Genet. 2000; 34: 61-76Crossref PubMed Scopus (127) Google Scholar). Group II introns have properties of both catalytic RNAs and retroelements and are the presumed progenitors of eukaryotic nuclear spliceosomal introns (15Lambowitz A.M. Belfort M. Annu. Rev. Biochem. 1993; 62: 587-622Crossref PubMed Scopus (534) Google Scholar, 16Michel F. Ferat J.L. Annu. Rev. Biochem. 1995; 64: 435-461Crossref PubMed Scopus (490) Google Scholar, 17Bonen L. Vogel J. Trends Genet. 2001; 17: 322-331Abstract Full Text Full Text PDF PubMed Scopus (235) Google Scholar, 18Toro N. Environ. Microbiol. 2003; 5: 143-151Crossref PubMed Scopus (74) Google Scholar, 19Cavalier-Smith T. Trends Genet. 1991; 7: 145-148Abstract Full Text PDF PubMed Scopus (331) Google Scholar). But unlike spliceosomal introns that are abundant in protein-coding genes, bacterial group II introns are strongly excluded from protein-coding genes outside mobile DNA (20Edgell D.R. Belfort M. Shub D.A. J. Bacteriol. 2000; 182: 5281-5289Crossref PubMed Scopus (120) Google Scholar, 21Dai L. Zimmerly S. Nucleic Acids Res. 2002; 30: 1091-1102Crossref PubMed Scopus (146) Google Scholar). Group II intron RNAs fold into a conserved structure with six domains, and most bacterial group II introns also encode a reverse-transcriptase-like (RTL) 1The abbreviations used are: RTLreverse transcriptase-likeRIRribonucleotide reductaseTerTrichodesmium erythraeum.1The abbreviations used are: RTLreverse transcriptase-likeRIRribonucleotide reductaseTerTrichodesmium erythraeum. protein involved in intron mobility (22Zimmerly S. Hausner G. Wu X. Nucleic Acids Res. 2001; 29: 1238-1250Crossref PubMed Scopus (155) Google Scholar). Group II intron mobility includes efficient retrohoming (site-specific insertion) into homologous intron-less allele and less frequent retrotransposition into non-cognate sites (23Belfort M. Perlman P.S. J. Biol. Chem. 1995; 270: 30237-30240Abstract Full Text Full Text PDF PubMed Scopus (195) Google Scholar, 24Belfort M. Derbyshire V. Cousineau B. Lambowitz A.M. Craig N. Craigie R. Gellert M. Lambowitz A.M. Mobile DNA II. American Society for Microbiology Press, Washington, DC2002: 761-783Google Scholar). Both retrohoming and retrotransposition are initiated and catalyzed by a ribonucleoprotein complex consisting of the excised intron RNA and the intron-encoded RTL protein (25Lambowitz A.M. Caprara M.G. Zimmerly S. Perlman P.S. Gesteland R.F. Cech T.R. Atkins J.F. The RNA World. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY1999: 451-485Google Scholar, 26Ichiyanagi K. Beauregard A. Lawrence S. Smith D. Cousineau B. Belfort M. Mol. Microbiol. 2002; 46: 1259-1272Crossref PubMed Scopus (92) Google Scholar). Group II introns are rare in bacterial (and archaeal) protein genes and were found mostly in mobile DNAs such as transposon-like insertion sequence elements (21Dai L. Zimmerly S. Nucleic Acids Res. 2002; 30: 1091-1102Crossref PubMed Scopus (146) Google Scholar). Nearly half of bacterial group II introns are outside of any gene (27Dai L. Toor N. Olson R. Keeping A. Zimmerly S. Nucleic Acids Res. 2003; 31: 424-426Crossref PubMed Scopus (96) Google Scholar), despite the ∼90% gene density of bacterial genomes, suggesting selection against insertion into genes. Among bacterial group II introns inside genes, few are inside conserved protein-coding genes, indicating extremely strong selection against insertion into “housekeeping” protein genes (20Edgell D.R. Belfort M. Shub D.A. J. Bacteriol. 2000; 182: 5281-5289Crossref PubMed Scopus (120) Google Scholar, 21Dai L. Zimmerly S. Nucleic Acids Res. 2002; 30: 1091-1102Crossref PubMed Scopus (146) Google Scholar). It was suggested that bacterial group II introns function primarily as retroelements, not as introns (21Dai L. Zimmerly S. Nucleic Acids Res. 2002; 30: 1091-1102Crossref PubMed Scopus (146) Google Scholar). reverse transcriptase-like ribonucleotide reductase Trichodesmium erythraeum. reverse transcriptase-like ribonucleotide reductase Trichodesmium erythraeum. Only a few completely sequenced bacterial genomes encode both intein and intron. The only known example of coexistence of an intein and a group I intron in the same gene, a bacteriophage ribonucleotide reductase gene (28Lazarevic V. Soldo B. Dusterhoft A. Hilbert H. Mauel C. Karamata D. Proc. Natl. Acad. Sci. U. S. A. 1998; 95: 1692-1697Crossref PubMed Scopus (60) Google Scholar), was likened to lightning striking twice in the same place; it was reasoned to be an unlikely event of pure chance (29Derbyshire V. Belfort M. Proc. Natl. Acad. Sci. U. S. A. 1998; 95: 1356-1357Crossref PubMed Scopus (37) Google Scholar). Coexistence of intein and group II intron in one gene has not been reported before. Here we report the coexistence of four inteins and three group II introns in a bacterial ribonucleotide reductase gene. To our knowledge, this is the first report of coexistence of multiple inteins and multiple introns (any intron) in the same gene, which has interesting implications regarding the evolution and possible function of inteins and introns. Gene Cloning and Sequence Analysis—GenBank searches, sequence alignments, and intron RNA folding were performed using the BLAST search program (30Altschul S.F. Madden T.L. Schaffer A.A. Zhang J. Zhang Z. Miller W. Lipman D.J. Nucleic Acids Res. 1997; 25: 3389-3402Crossref PubMed Scopus (59441) Google Scholar), the ClustalW program (31Higgins D.G. Thompson J.D. Gibson T.J. Methods Enzymol. 1996; 266: 383-402Crossref PubMed Scopus (1288) Google Scholar), and the MFOLD program (32Zuker M. Nucleic Acids Res. 2003; 31: 1-10Crossref PubMed Scopus (10157) Google Scholar), respectively. Parts of the ribonucleotide reductase gene were amplified by doing PCR from total genomic DNA of Trichodesmium erythraeum (Ter) strain IMS101, using the high fidelity Pfu DNA polymerase. DNA fragments were cloned in a pDrive plasmid vector (Qiagen), and DNA sequences were determined through automated DNA sequencing. Protein-splicing Analysis in Escherichia coli Cells—To construct gene expression plasmids, intein coding sequences were inserted in the previously described pMST plasmid (33Wu H. Xu M.Q. Liu X.Q. Biochim. Biophys. Acta. 1998; 1387: 422-432Crossref PubMed Scopus (153) Google Scholar) between XhoI and AgeI sites, replacing the Ssp DnaB intein coding sequence of pMST. Protein production in E. coli cells, gel electrophoresis, and Western blot analysis were performed as described previously (33Wu H. Xu M.Q. Liu X.Q. Biochim. Biophys. Acta. 1998; 1387: 422-432Crossref PubMed Scopus (153) Google Scholar). Briefly, cells containing the expression plasmid were grown in liquid LB medium at 37 °C to late log phase (A600, 0.5). Isopropyl-1-thio-β-d-galactopyranoside was added to a final concentration of 0.8 mm to induce production of the recombinant protein, and the induction was continued at 37 °C for 3 h or at 25 °C overnight. Cells were then harvested, and cellular proteins were dissolved in SDS- and dithiothreitol-containing gel loading buffer in a boiling water bath before electrophoresis in SDS-polyacrylamide gel. Western blots were carried out using anti-thioredoxin antibody (Invitrogen) and the enhanced chemiluminescence detection kit (ECL). Intensity of protein band was estimated by using a gel documentation system (Gel Doc 1000 coupled with Molecular Analyst software, Bio-Rad). Identification of Ter RIR Gene Encoding Multiple Introns and Inteins—A search for new intein sequences led us to a DNA sequence in the GenBank (accession no. AABK02000003) that had been deposited by the U.S. Department of Energy Joint Genome Institute and was from the genome of the oceanic N2-fixing cyanobacterium T. erythraeum strain IMS101. Our initial analysis indicated that this DNA sequence has a ribonucleotide reductase (RIR) gene containing inteins as well as introns. Subsequent DNA cloning and sequence determination confirmed this analysis and also corrected a small deletion error in the above GenBank sequence. This revealed the complete T. erythraeum RIR gene as illustrated in Fig. 1. The 13,560-bp long Ter RIR gene is divided into eight exon- or extein-coding sequences by the presence of three intron- and four intein-coding sequences. The eight extein/exon-coding sequences are 252, 141, 306, 126, 396, 30, 390, and 663 bp long, respectively, and together they encode a 768-amino acid mature RIR protein that is highly similar to homologous but intein/intron-less RIR of other bacteria (Fig. 2A). Specifically, the mature RIR sequence of T. erythraeum is 69% identical and 80% similar to its homologue in the cyanobacterium Nostoc sp. strain PCC7120 (Nsp), and this level of RIR sequence identity is normal among cyanobacterial species. The Nsp RIR (also known as Asp RNR) has been functionally characterized as a class II B12-dependent ribonucleotide reductase (34Gleason F.K. Olszewski N.E. J. Bacteriol. 2002; 184: 6544-6550Crossref PubMed Scopus (18) Google Scholar). The mature RIR sequence of T. erythraeum is also 24% identical and 40% similar to the RIR sequence of the distantly related bacterium Lactobacillus leichmannii (Lle). The Lle RIR is also a class II (coenzyme B12-dependent) ribonucleoside triphosphate reductase with a known crystal structure (35Sintchak M.D. Arjara G. Kellogg B.A. Stubbe J. Drennan C.L. Nat. Struct. Biol. 2002; 9: 293-300Crossref PubMed Scopus (169) Google Scholar). The 13,560-bp Ter RIR gene is much larger than other bacterial RIR genes because of the presence of intein and intron coding sequences, with just 17% of its coding sequences for exteins and exons and the remaining 83% for inteins and introns.Fig. 2Protein sequence comparisons.A, predicted mature RIR protein sequence of Ter is aligned with RIR sequences of Nostoc sp. strain PCC7120 (Nsp) and Lactobacillus leichmannii (Lle). Only sequences proximal to the insertion sites of the Ter introns and inteins are shown, and amino acid positions in the Ter sequence are numbered. Intron and intein insertion sites (I1 through I7) in the Ter sequence are marked with arrowheads, and the corresponding intron and intein names are shown in parentheses. The Nsp RIR sequence also has an intein at the I1 insertion site. B, four intein sequences predicted from the Ter RIR gene are aligned, and putative intein sequence motifs (blocks A through H) are underlined. Less conserved sequences are omitted, the numbers of omitted residues are shown in parentheses, and the length of each sequence is shown at the end. Symbols represent gaps introduced to optimize the alignment.View Large Image Figure ViewerDownload Hi-res image Download (PPT) Identification and Sequence Analysis of the Inteins—The four intein-coding sequences are located on the second half of the gene (Fig. 1), are named Ter RIR-1, Ter RIR-2, Ter RIR-3, and Ter RIR-4 intein, according to nomenclature of known inteins, and are 394, 373, 323, and 381 amino acids long, respectively. Insertion positions of the four inteins are before amino acid residues Ser-276, Cys-408, Cys-418, and Thr-548 of the mature RIR protein, respectively. Boundaries of the four inteins were readily identified by comparison of extein sequences with intein-less homologues such as Nsp RIR (Fig. 2A) and by location of conserved intein sequence blocks A and G marking the ends of each intein (Fig. 2B). Each of the four inteins begins with a nucleophilic residue (Cys), ends with Asn, and is followed immediately by another nucleophilic residue (Ser, Cys, or Thr), as expected for the protein-splicing function of the intein (2Xu M.Q. Perler F.B. EMBO J. 1996; 15: 5146-5153Crossref PubMed Scopus (251) Google Scholar). Each intein also has sequence motifs for protein-splicing function (sequence blocks A, B, F, and G) as well as putative sequence motifs for intein endonuclease function (sequence blocks C, E, and H) (36Pietrokovski S. Protein Sci. 1994; 3: 2340-2350Crossref PubMed Scopus (147) Google Scholar). The Ter RIR-1 intein and the Ter RIR-2 intein are also homologous to previously identified Nsp RIR intein and Pfu RIR1–2 intein, respectively, based on significant (>30%) sequence similarities and the same corresponding insertion site (data not shown). Protein-splicing function has been demonstrated for both the Nsp RIR intein (34Gleason F.K. Olszewski N.E. J. Bacteriol. 2002; 184: 6544-6550Crossref PubMed Scopus (18) Google Scholar) and the Pfu RIR1–2 intein (37Komori K. Fujita N. Ichiyanagi K. Shinagawa H. Morikawa K. Ishino Y. Nucleic Acids Res. 1999; 27: 4167-4174Crossref PubMed Scopus (29) Google Scholar). The four Ter RIR inteins showed only low levels (less than 15%) of sequence identity among themselves, and their insertion sites have no apparent sequence similarity, indicating that they are not closely related. Insertion sites of Ter RIR-2 and Ter RIR-3 inteins are just 10 amino acids apart and inside the catalytic center of ribonucleotide reductase, according to sequence comparison to the structurally determined Lle ribonucleotide reductase (35Sintchak M.D. Arjara G. Kellogg B.A. Stubbe J. Drennan C.L. Nat. Struct. Biol. 2002; 9: 293-300Crossref PubMed Scopus (169) Google Scholar) (Fig. 2A). The Ter RIR-2 intein is inserted at Cys-408 that acts with metallocofactor to generate the active-site thiyl radical (S*), whereas the Ter RIR-3 intein is inserted at Cys-418 (Ter numbering) that forms a redox-active disulfide with another Cys residue. The four Ter RIR inteins, together with known RIR inteins in other organisms, define five distinct intein insertion sites in the RIR protein. Protein-splicing Activity of the Inteins—To determine whether the four inteins are functional, we tested their protein-splicing activity in E. coli cells. For each intein, a plasmidborne fusion gene was constructed to produce a fusion protein in which the intein (plus 5 amino acids of its native extein sequence on each side) was fused to an N-terminal maltose binding protein and a C-terminal thioredoxin (Fig. 3). Similar fusion protein constructs had been used in previous studies of other inteins (33Wu H. Xu M.Q. Liu X.Q. Biochim. Biophys. Acta. 1998; 1387: 422-432Crossref PubMed Scopus (153) Google Scholar, 38Liu X.Q. Yang J. J. Biol. Chem. 2003; 278: 26315-26318Abstract Full Text Full Text PDF PubMed Scopus (18) Google Scholar), so that the protein-splicing products can readily be identified using SDS-polyacrylamide gel electrophoresis and Western blotting. Precursor protein, spliced protein, and excised intein were identified by their predicted sizes, and the first two were further identified using anti-thioredoxin antibody. To estimate the efficiency of protein splicing, the intensity of individual protein bands on Western blots was measured to estimate the amount of that protein, and protein-splicing efficiency was calculated as the amount of spliced protein divided by the sum of spliced protein and precursor protein. At 37 °C, the Ter RIR-1 intein and the Ter RIR-3 intein showed efficient protein splicing, as indicated by the strong bands corresponding to the spliced protein and the excised intein. In contrast, the Ter RIR-2 intein and Ter RIR-4 intein showed little or no protein splicing at 37 °C. At 25 °C, which is closer to the natural growth condition of T. erythraeum, all four inteins showed efficient protein splicing. The protein-splicing efficiency was estimated to be 100%, 76%, 100%, and 95% for Ter RIR-1, -2, -3, and -4 intein, respectively. Identification and Analysis of the Introns—The three introns are located in the first half of the gene and are named T.er.I2, T.er.I3, and T.er.I4 intron, according to nomenclature of known introns; they are 2,524, 1,757, and 2,554 nucleotides long, respectively (Fig. 1). Boundaries of these introns were defined by comparison of exon sequences with intron-less homologues such as Nsp RIR and by sequence comparison to known introns. Each intron can be folded into a typical secondary structure of group II introns, which includes the six helical domains and the exon-binding sequences. Predicted secondary structure of the T.er.I2 intron is shown in Fig. 4A. The T.er.I3 and T.er.I4 introns are very similar to the previously identified T.er.I1 intron both in sequence (Fig. 4B) and in predicted secondary structure (not shown). The T.er.I1 intron is listed as T.sp.I1 intron in the database for mobile group II introns and located in an intergenic sequence of this organism. These three introns are over 80% identical to each other in the ∼680-nucleotide folded RNA sequence and over 70% identical in overall sequence, although the T.er.I3 intron lacks a 754-nucleotide sequence present in the other two introns. These three introns are ∼50% identical to the T.er.I2 intron. The T.er.I2 intron, T.er.I4 intron, and the intergenic T.er.I1 intron each encode an RTL protein in domain IV, and their RTL protein sequences are over 60% similar to each other (data not shown). But the T.er.I2 RTL lacks a domain Zn, and the T.er.I1 RTL coding sequence has at least two single-base deletions causing frameshifts and is likely not functional. The T. erythraeum RIR gene was found to encode four inteins and three group II introns, which predicts three RNA splicing events followed by four protein-splicing events in producing the mature ribonucleotide reductase (Fig. 1). The Ter RIR gene is believed to be functional in vivo despite its seven intervening sequences, based on the following considerations. The Ter RIR gene, after excluding its intein and intron coding sequences, predicts a complete and normal looking class II ribonucleotide reductase. The non-intron parts of the Ter RIR gene, totaling 6,717 bp, are maintained as open reading frames, whereas its non-coding frames have numerous termination codons, which would be extremely unlikely if the Ter RIR gene were not functional. All of the four inteins are functional in protein splicing when tested in E. coli and are predicted to be functional in vivo. The group II introns are also predicted to be functional in vivo, based on their normal looking structure and high levels of sequence identity to known group II introns. Ribonucleotide reductase is an essential housekeeping enzyme in all organisms, and there are at least three known classes of ribonucleotide reductase with very different amino acid sequences for different classes. We can not exclude the possible existence of an additional copy or class of RIR gene in T. erythraeum, although no other class II RIR gene was found through a Blast search of the nearly complete genome sequence of this organism. Four inteins in the Ter RIR protein represent the largest number of inteins found so far in one host protein, and their coexistence with three group II introns in the same gene adds to the novelty. A gene encoding a single intein and a single intron has been reported before (28Lazarevic V. Soldo B. Dusterhoft A. Hilbert H. Mauel C. Karamata D. Proc. Natl. Acad. Sci. U. S. A. 1998; 95: 1692-1697Crossref PubMed Scopus (60) Google Scholar), but that gene is a bacteriophage ribonucleotide reductase gene not similar to the Ter RIR gene, and that single intron is a group I intron frequently found in bacteriophage genes (20Edgell D.R. Belfort M. Shub D.A. J. Bacteriol. 2000; 182: 5281-5289Crossref PubMed Scopus (120) Google Scholar, 39Edgell D.R. Fast N.M. Doolittle W.F. Curr. Biol. 1996; 6: 385-388Abstract Full Text Full Text PDF PubMed Scopus (12) Google Scholar). The Ter RIR gene is apparently a chromosomal gene, not a phage or plasmid gene, because it was found on a large sequence contig (∼225 kb) of the incomplete Ter genome sequence that encodes numerous other known chromosomal genes. It is interesting to see the high levels of sequence identity among the T.er.I3, T.er.I4, and the intergenic T.er.I1 introns. If these introns are related through relatively recent retrotranspositions, the T.er.I4 intron may be the closest to the presumed progenitor, because it alone encodes an intact RTL protein for retrotransposition. It may also be possible for RTL protein of T.er.I4 intron to act on the other introns in RNA splicing and in retrotransposition. Furthermore, T.er.I3 and T.er.I4 introns in the same gene may possibly cause exon skipping during splicing, as was found with certain bacterial phage group I introns (40Landthaler M. Shub D.A. Proc. Natl. Acad. Sci. U. S. A. 1999; 96: 7005-7010Crossref PubMed Scopus (63) Google Scholar). It is novel to see three group II introns in one bacterial gene, especially in a chromosomal conserved housekeeping proteincoding gene, although multiple group I introns in one gene have been reported previously for bacterial phage and prophage protein-coding genes (40Landthaler M. Shub D.A. Proc. Natl. Acad. Sci. U. S. A. 1999; 96: 7005-7010Crossref PubMed Scopus (63) Google Scholar, 41Lazarevic V. Nucleic Acids Res. 2001; 29: 3212-3218Crossref PubMed Scopus (30) Google Scholar). Bacterial and archaeal group II introns are strongly excluded from conserved protein-coding genes, even in organisms with relatively large numbers of introns. For example, among the 28 group II introns in the cyanobacterium Thermosynechococcus elongatus (42Nakamura Y. Kaneko T. Sato S. Ikeuchi M. Katoh H. Sasamoto S. Watanabe A. Iriguchi M. Kawashima K. Kimura T. Kishida Y. Kiyokawa C. Kohara M. Matsumoto M. Matsuno A. Nakazaki N. Shimpo S. Sugimoto M. Takeuchi C. Yamada M. Tabata S. DNA Res. 2002; 9: 123-130Crossref PubMed Scopus (259) Google Scholar), 26 are outside of gene or inside mobile elements (insertion sequence, other group II introns), and 1 is inside a tRNA gene. The remaining intron is inside a hypothetical protein gene but in reverse orientation to transcription and therefore not functional. Preliminary searches in the nearly complete genome sequence of T. erythraeum did not reveal exceptional abundance of intron or intron in protein genes. Therefore, the presence of three group II introns in the ribonucleotide reductase gene represents a striking departure from known patterns of bacterial intron distribution. It is extremely unlikely that by pure chance the four inteins and three group II introns inserted in the T. erythraeum ribonucleotide reductase gene, considering the rarity of intein and intron. Ribonucleotide reductase and other DNA metabolism proteins are more likely than other proteins to have intein and phage group I intron, which has generated many discussions regarding possible reasons (12Perler F.B. Cell. 1998; 92: 1-4Abstract Full Text Full Text PDF PubMed Scopus (169) Google Scholar, 13Gogarten J.P. Senejani A.G. Zhaxybayeva O. Olendzenski L. Hilario E. Annu. Rev. Microbiol. 2002; 56: 263-287Crossref PubMed Scopus (167) Google Scholar, 14Liu X.Q. Annu. Rev. Genet. 2000; 34: 61-76Crossref PubMed Scopus (127) Google Scholar, 20Edgell D.R. Belfort M. Shub D.A. J. Bacteriol. 2000; 182: 5281-5289Crossref PubMed Scopus (120) Google Scholar, 29Derbyshire V. Belfort M. Proc. Natl. Acad. Sci. U. S. A. 1998; 95: 1356-1357Crossref PubMed Scopus (37) Google Scholar, 43Pietrokovski S. Trends Genet. 2001; 17: 465-472Abstract Full Text Full Text PDF PubMed Scopus (99) Google Scholar). In these highly conserved genes, an intein or group I intron may spread more easily in a population, because the recognition sequences of the intein-contained or group I intron-encoded homing endonucleases would be better preserved. It may also be more difficult for these genes to lose an intein or intron, because any imprecise deletion of the intein or intron will likely inactivate the essential gene. Consistent with this argument, the four Ter RIR inteins are inserted in the more conserved parts of the RIR protein, and two of them (Ter RIR-2 and Ter RIR-3) are next to critical catalytic residues of the enzyme. It is more difficult to explain the existence of three group II introns in the Ter RIR gene, considering the apparently strong selection against intron insertions in bacterial conserved protein genes (20Edgell D.R. Belfort M. Shub D.A. J. Bacteriol. 2000; 182: 5281-5289Crossref PubMed Scopus (120) Google Scholar, 21Dai L. Zimmerly S. Nucleic Acids Res. 2002; 30: 1091-1102Crossref PubMed Scopus (146) Google Scholar, 29Derbyshire V. Belfort M. Proc. Natl. Acad. Sci. U. S. A. 1998; 95: 1356-1357Crossref PubMed Scopus (37) Google Scholar). In bacteria, protein translation starts before completion of transcription and may, therefore, interfere with RNA splicing (20Edgell D.R. Belfort M. Shub D.A. J. Bacteriol. 2000; 182: 5281-5289Crossref PubMed Scopus (120) Google Scholar, 44Ohman-Heden M. Ahgren-Stalhandske A. Hahne S. Sjoberg B.M. Mol. Microbiol. 1993; 7: 975-982Crossref PubMed Scopus (17) Google Scholar), and it remains to be determined whether or how the Ter RIR gene avoided this problem. Interestingly, there is a translation termination codon just 3, 15, and 18 nt after the beginning of the T.er.I2, T.er.I3, and T.er.I4 introns, respectively, which may help to pause translation before RNA splicing is complete, as has been suggested for some group I introns (20Edgell D.R. Belfort M. Shub D.A. J. Bacteriol. 2000; 182: 5281-5289Crossref PubMed Scopus (120) Google Scholar, 44Ohman-Heden M. Ahgren-Stalhandske A. Hahne S. Sjoberg B.M. Mol. Microbiol. 1993; 7: 975-982Crossref PubMed Scopus (17) Google Scholar). For whatever reason, barriers against intron insertion were broken in the Ter RIR gene, and multiple insertions through retrotransposition could have happened within a relatively short period of time. This may be reminiscent of what happened in the proliferation of introns in eukaryotes (45Cho G. Doolittle R.F. J. Mol. Evol. 1997; 44: 573-584Crossref PubMed Scopus (69) Google Scholar, 46Stoltzfus A. Logsdon Jr., J.M. Palmer J.D. Doolittle W.F. Proc. Natl. Acad. Sci. U. S. A. 1997; 94: 10739-10744Crossref PubMed Scopus (140) Google Scholar, 47Cho Y. Qiu Y.L. Kuhlman P. Palmer J.D. Proc. Natl. Acad. Sci. U. S. A. 1998; 95: 14244-14249Crossref PubMed Scopus (218) Google Scholar). Another interesting possibility is that the multiple inteins and introns may help to regulate the production of ribonucleotide reductase in T. erythraeum. This enzyme catalyzes the conversion of ribonucleotides to deoxyribonucleotides, and its production is intricately regulated in other organisms in response to DNA synthesis, cell cycle, and DNA damages (48Elledge S.J. Zhou Z. Allen J.B. Navas T.A. BioEssays. 1993; 15: 333-339Crossref PubMed Scopus (209) Google Scholar). A regulatory role of the inteins and introns would confer selective advantage on the host organism and explain the existence of the inteins and introns against all odds. We thank Dr. J. B. Waterbury for the gift of T. erythraeum cells used in this study." @default.
- W2040372310 created "2016-06-24" @default.
- W2040372310 creator A5005954873 @default.
- W2040372310 creator A5018544698 @default.
- W2040372310 creator A5088724320 @default.
- W2040372310 date "2003-11-01" @default.
- W2040372310 modified "2023-10-13" @default.
- W2040372310 title "Four Inteins and Three Group II Introns Encoded in a Bacterial Ribonucleotide Reductase Gene" @default.
- W2040372310 cites W1502972645 @default.
- W2040372310 cites W1525759724 @default.
- W2040372310 cites W1964117837 @default.
- W2040372310 cites W1965429549 @default.
- W2040372310 cites W1969542242 @default.
- W2040372310 cites W1979074409 @default.
- W2040372310 cites W1991210664 @default.
- W2040372310 cites W1996662739 @default.
- W2040372310 cites W2000389439 @default.
- W2040372310 cites W2001043647 @default.
- W2040372310 cites W2011588721 @default.
- W2040372310 cites W2012391532 @default.
- W2040372310 cites W2019977292 @default.
- W2040372310 cites W2025583051 @default.
- W2040372310 cites W2038349521 @default.
- W2040372310 cites W2038772648 @default.
- W2040372310 cites W2041508914 @default.
- W2040372310 cites W2044350546 @default.
- W2040372310 cites W2047944454 @default.
- W2040372310 cites W2050188885 @default.
- W2040372310 cites W2073353609 @default.
- W2040372310 cites W2074338586 @default.
- W2040372310 cites W2083501365 @default.
- W2040372310 cites W2087024519 @default.
- W2040372310 cites W2089488870 @default.
- W2040372310 cites W2091232945 @default.
- W2040372310 cites W2095919862 @default.
- W2040372310 cites W2112988974 @default.
- W2040372310 cites W2114271633 @default.
- W2040372310 cites W2114910106 @default.
- W2040372310 cites W2124144156 @default.
- W2040372310 cites W2125268636 @default.
- W2040372310 cites W2125569321 @default.
- W2040372310 cites W2126528399 @default.
- W2040372310 cites W2129426695 @default.
- W2040372310 cites W2132582054 @default.
- W2040372310 cites W2141157874 @default.
- W2040372310 cites W2142075012 @default.
- W2040372310 cites W2145035916 @default.
- W2040372310 cites W2158714788 @default.
- W2040372310 cites W2161374501 @default.
- W2040372310 cites W2164178481 @default.
- W2040372310 cites W2165899507 @default.
- W2040372310 cites W2176375858 @default.
- W2040372310 cites W4232607672 @default.
- W2040372310 cites W4234596903 @default.
- W2040372310 doi "https://doi.org/10.1074/jbc.m309575200" @default.
- W2040372310 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/12975359" @default.
- W2040372310 hasPublicationYear "2003" @default.
- W2040372310 type Work @default.
- W2040372310 sameAs 2040372310 @default.
- W2040372310 citedByCount "24" @default.
- W2040372310 countsByYear W20403723102013 @default.
- W2040372310 countsByYear W20403723102014 @default.
- W2040372310 countsByYear W20403723102015 @default.
- W2040372310 crossrefType "journal-article" @default.
- W2040372310 hasAuthorship W2040372310A5005954873 @default.
- W2040372310 hasAuthorship W2040372310A5018544698 @default.
- W2040372310 hasAuthorship W2040372310A5088724320 @default.
- W2040372310 hasBestOaLocation W20403723101 @default.
- W2040372310 hasConcept C104292427 @default.
- W2040372310 hasConcept C104317684 @default.
- W2040372310 hasConcept C4710235 @default.
- W2040372310 hasConcept C512185932 @default.
- W2040372310 hasConcept C54355233 @default.
- W2040372310 hasConcept C54458228 @default.
- W2040372310 hasConcept C64962909 @default.
- W2040372310 hasConcept C67705224 @default.
- W2040372310 hasConcept C86803240 @default.
- W2040372310 hasConcept C94671646 @default.
- W2040372310 hasConcept C98892162 @default.
- W2040372310 hasConceptScore W2040372310C104292427 @default.
- W2040372310 hasConceptScore W2040372310C104317684 @default.
- W2040372310 hasConceptScore W2040372310C4710235 @default.
- W2040372310 hasConceptScore W2040372310C512185932 @default.
- W2040372310 hasConceptScore W2040372310C54355233 @default.
- W2040372310 hasConceptScore W2040372310C54458228 @default.
- W2040372310 hasConceptScore W2040372310C64962909 @default.
- W2040372310 hasConceptScore W2040372310C67705224 @default.
- W2040372310 hasConceptScore W2040372310C86803240 @default.
- W2040372310 hasConceptScore W2040372310C94671646 @default.
- W2040372310 hasConceptScore W2040372310C98892162 @default.
- W2040372310 hasIssue "47" @default.
- W2040372310 hasLocation W20403723101 @default.
- W2040372310 hasOpenAccess W2040372310 @default.
- W2040372310 hasPrimaryLocation W20403723101 @default.
- W2040372310 hasRelatedWork W146779788 @default.
- W2040372310 hasRelatedWork W148194743 @default.
- W2040372310 hasRelatedWork W2006882919 @default.
- W2040372310 hasRelatedWork W2041926743 @default.