Matches in SemOpenAlex for { <https://semopenalex.org/work/W2043317687> ?p ?o ?g. }
- W2043317687 endingPage "30014" @default.
- W2043317687 startingPage "30010" @default.
- W2043317687 abstract "A wide variety of peptidases associate with vital biological pathways, but the origin and evolution of their tremendous diversity are poorly defined. Application of the MEROPS classification to a comprehensive set of genomes yields a simple pattern of peptidase distribution and provides insight into the organization of proteolysis in all forms of life. Unexpectedly, a near ubiquitous core set of peptidases is shown to contain more types than those unique to higher multicellular organisms. From this core group, an array of eukaryote-specific peptidases evolved to yield well known intracellular and extracellular processes. The paucity of peptidase families unique to higher metazoa suggests gains in proteolytic network complexity required a limited number of biochemical inventions. These findings provide a framework for deeper investigation into the evolutionary forces that shaped each peptidase family and a roadmap to develop a timeline for their expansion as an interconnected system. A wide variety of peptidases associate with vital biological pathways, but the origin and evolution of their tremendous diversity are poorly defined. Application of the MEROPS classification to a comprehensive set of genomes yields a simple pattern of peptidase distribution and provides insight into the organization of proteolysis in all forms of life. Unexpectedly, a near ubiquitous core set of peptidases is shown to contain more types than those unique to higher multicellular organisms. From this core group, an array of eukaryote-specific peptidases evolved to yield well known intracellular and extracellular processes. The paucity of peptidase families unique to higher metazoa suggests gains in proteolytic network complexity required a limited number of biochemical inventions. These findings provide a framework for deeper investigation into the evolutionary forces that shaped each peptidase family and a roadmap to develop a timeline for their expansion as an interconnected system. Proteolysis is a requirement for life. Some peptidases recycle polypeptides into their constitutive amino acids, whereas others catalyze selective polypeptide cleavage for post-translational modification. Approximately 700 peptidases are present in the human genome, collectively termed the degradome (1Lopez-Otin C. Overall C.M. Nat. Rev. 2002; 3: 509-519Crossref Scopus (627) Google Scholar), mediating diverse processes such as extracellular matrix remodeling, immunity, development, protein processing, cell signaling, and apoptosis (2Puente X.S. Sanchez L.M. Overall C.M. Lopez-Otin C. Nat. Rev. Genet. 2003; 4: 544-558Crossref PubMed Scopus (742) Google Scholar). The MEROPS data base classifies peptidases into clans based on structural similarity or sequence features suggesting such homology (3Rawlings N.D. Morton F.R. Kok C.Y. Kong J. Barrett A.J. Nucleic Acids Res. 2008; 36: D320-D325Crossref PubMed Scopus (515) Google Scholar, 4Barrett A.J. Rawlings N.D. Biol. Chem. 2007; 388: 1151-1157Crossref PubMed Scopus (31) Google Scholar). Families divide clans by common ancestry. Subfamilies have common structure yet unclear ancestry. Individual peptidases are termed species. For example, clan PA peptidases contain eight families each bearing a classical two β-barrel architecture observed in chymotrypsin with either cysteine or serine acting as catalytic nucleophile (5Page M.J. Di Cera E. Cell Mol. Life Sci. 2008; 65: 1220-1236Crossref PubMed Scopus (289) Google Scholar). The blood clotting peptidase thrombin (6Di Cera E. Mol. Aspects Med. 2008; 29: 203-254Crossref PubMed Scopus (247) Google Scholar) is one species in the large S1A family of clan PA (7Krem M.M. Di Cera E. EMBO J. 2001; 20: 3036-3045Crossref PubMed Scopus (160) Google Scholar). Classification provides context to decipher peptidase function in the natural world. Different classes of peptidases associate with vital biological pathways. Metallopeptidases dictate structural composition surrounding cells in multicellular organisms (8Mott J.D. Werb Z. Curr. Opin. Cell Biol. 2004; 16: 558-564Crossref PubMed Scopus (871) Google Scholar). Cysteine peptidases mediate regulated cell death (9Cohen G.M. Biochem. J. 1997; 326: 1-16Crossref PubMed Scopus (4124) Google Scholar). Serine peptidases enable our immune response (10Beutler B. Mol. Immunol. 2004; 40: 845-859Crossref PubMed Scopus (864) Google Scholar). Threonine peptidases function as the central conduit for protein recycling in many organisms (11Coux O. Tanaka K. Goldberg A.L. Annu. Rev. Biochem. 1996; 65: 801-847Crossref PubMed Scopus (2232) Google Scholar). Aspartic peptidases broker viral infection (12Kohl N.E. Emini E.A. Schleif W.A. Davis L.J. Heimbach J.C. Dixon R.A. Scolnick E.M. Sigal I.S. Proc. Natl. Acad. Sci. U. S. A. 1988; 85: 4686-4690Crossref PubMed Scopus (1277) Google Scholar). However, each of these examples is a small subset within each catalytic type. To clarify peptidase diversity, we apply the MEROPS classification on a comprehensive set of genomes to visualize, compare, and contrast the proteolytic underpinnings of the biosphere. From this analysis, a map of peptidase distribution is drawn to provide a framework for decoding function, architecture, and evolution of proteolysis in vivo. Remarkably, a simple pattern is shown to underlie limited proteolysis in all forms of life. Annotated protein sequences from whole genome sequence data were obtained from public ftp servers of the NCBI, DOE JGI, and select genome project homepages on the worldwide web. ClustalW was used to prepare sequence alignments from peptidase domains obtained from the MEROPS data base (3Rawlings N.D. Morton F.R. Kok C.Y. Kong J. Barrett A.J. Nucleic Acids Res. 2008; 36: D320-D325Crossref PubMed Scopus (515) Google Scholar, 13Thompson J.D. Higgins D.G. Gibson T.J. Nucleic Acids Res. 1994; 22: 4673-4680Crossref PubMed Scopus (55635) Google Scholar). The HMMER package by Eddy (14Eddy S.R. Bioinformatics (Oxf.). 1998; 14: 755-763Crossref PubMed Scopus (4043) Google Scholar) was used to build hidden Markov models (HMM) 2The abbreviation used is: HMM, hidden Markov models. from the ClustalW prepared alignments and used to search protein sequence data using default cutoff values. HMMs were constructed for 154 peptidase families. Families were excluded from the census if they contained only a few sequences, typically less than 20, or if from viral origin. All of the data in the MEROPS data base were queried with each peptidase HMM and returned greater than 80% data base coverage indicating utility of the models (supplemental Fig. S1). An iterated search was applied on 128 genomes from all three kingdoms of life. The complete list of genomes examined, their taxonomy, and peptidase estimates are provided in supplemental Table S1. The HMMs applied capture peptidase diversity in the biosphere, yet the numbers of peptidases noted in each organism are only an estimate. Results were parsed with BioPerl (15Stajich J.E. Block D. Boulez K. Brenner S.E. Chervitz S.A. Dagdigian C. Fuellen G. Gilbert J.G. Korf I. Lapp H. Lehvaslaiho H. Matsalla C. Mungall C.J. Osborne B.I. Pocock M.R. Schattner P. Senger M. Stein L.D. Stupka E. Wilkinson M.D. Birney E. Genome Res. 2002; 12: 1611-1618Crossref PubMed Scopus (1245) Google Scholar). Cluster 3.0 and Treeview software was used to cluster census data after conversion into a binary matrix using the complete linkage algorithm and visualize the data as shown in Fig. 2 (16Eisen M.B. Spellman P.T. Brown P.O. Botstein D. Proc. Natl. Acad. Sci. U. S. A. 1998; 95: 14863-14868Crossref PubMed Scopus (13204) Google Scholar). HMMs are useful indicators of peptidase content in a genome sequence (17Eddy S.R. Curr. Opin. Struct. Biol. 1996; 6: 361-365Crossref PubMed Scopus (848) Google Scholar, 18Karplus K. Barrett C. Hughey R. Bioinformatics (Oxf.). 1998; 14: 846-856Crossref PubMed Scopus (907) Google Scholar). As many peptidase families lack known fold, we rely upon MEROPS annotation for each domain. HMMs were constructed for 154 peptidase families using the HMMER software package to probe 128 eukaryotic, eubacterial, and archaeal genomes. Results are provided in supplementary Table S1. Genomes examined represent every order of organism for which complete genome sequence data are available. Due to the tremendous diversity of peptidase families the resulting census captures trends in degradome composition and is not an exact measure. Variation in polypeptide sequence, distant phylogenetic relationships, gene loss, horizontal gene transfer, inactive peptidase sequences, and quality of the genome annotation makes precise assignment difficult (19Doolittle R.F. Curr. Opin. Struct. Biol. 2005; 15: 248-253Crossref PubMed Scopus (50) Google Scholar). Hence, the tabulated numbers are only approximates of the true diversity. The HMMs distill over two million protein sequences into more than 38,000 peptidases. Plotting number of each peptidase family estimated in each genome reveals the landscape of degradome composition in known forms of cellular life (Fig. 1). Select peptidase families underwent significant expansion in copy number per genome and of these the S1A family of trypsins is most prevalent. A landscape of genome-wide peptidase diversity is visualized from the census that enables a framework for understanding limited proteolysis in vivo. Serine peptidases are the most abundant class in the dataset followed by metalloproteases. Typically 100 peptidases are found in genomes of bacteria. In contrast, archaeal genomes present half that many. Explosive growth in multiple peptidase families is evident in eukaryotic organisms and the difference in degradome composition between eubacteria and eukarya is striking. Considerable work remains to detail how changes in polypeptide sequence associate with the functional gains in each of these families and further refine accuracy of the census. To derive a concise picture of peptidase diversity, we reduced dimensionality of the data. Peptidase census results were converted into a binary matrix obviating the need for weighting functions and other advanced statistical measures. Clustering peptidase families from the binary dataset by the organisms that contain them greatly clarifies peptidase diversity (Fig. 2). The clustering algorithm, commonly applied to microarray data, segregates organisms into the three kingdoms of life and trends with accepted classification yet is entirely based upon degradome content (20Woese C.R. Kandler O. Wheelis M.L. Proc. Natl. Acad. Sci. U. S. A. 1990; 87: 4576-4579Crossref PubMed Scopus (4492) Google Scholar, 21Cavalier-Smith T. Intl. J. Syst. Evolut. Microbiol. 2002; 52: 297-354Crossref PubMed Scopus (759) Google Scholar, 22Doolittle W.F. Science. 1999; 284: 2124-2129Crossref PubMed Scopus (1305) Google Scholar). Despite presenting few peptidases in their genome, the archaea group is distinctively apart from the eubacteria. Inclusion of multiple genomes from parasitic organisms within the dataset complicates phylogenetic inference from the dataset due to significant gene loss (23Yang S. Doolittle R.F. Bourne P.E. Proc. Natl. Acad. Sci. U. S. A. 2005; 102: 373-378Crossref PubMed Scopus (184) Google Scholar, 24Ochman H. Moran N.A. Science. 2001; 292: 1096-1099Crossref PubMed Scopus (405) Google Scholar). Nevertheless, a clear pattern of peptidase distribution emerges accounting for all of the HMMs examined and nearly 90% of the families in the MEROPS data base of which 10% are viral peptidases not examined in this study. A nearly ubiquitous core of 16 peptidase families encompasses proteolytic requirements of cellular life in a modern environment (Table 1). Given extensive distribution it is unlikely gene transfer between organisms explains this observation. Highly conserved peptidase families play central metabolic and protein processing roles such folate and glutathione metabolism, de novo purine biosynthesis, and signal peptide processing (25Yao R. Schneider E. Ryan T.J. Galivan J. Proc. Natl. Acad. Sci. U. S. A. 1996; 93: 10134-10138Crossref PubMed Scopus (79) Google Scholar, 26Suzuki H. Kumagai H. J. Biol. Chem. 2002; 277: 43536-43543Abstract Full Text Full Text PDF PubMed Scopus (116) Google Scholar, 27Brannigan J.A. Dodson G. Duggleby H.J. Moody P.C. Smith J.L. Tomchick D.R. Murzin A.G. Nature. 1995; 378: 416-419Crossref PubMed Scopus (544) Google Scholar, 28Becker A.B. Roth R.A. Proc. Natl. Acad. Sci. U. S. A. 1992; 89: 3835-3839Crossref PubMed Scopus (141) Google Scholar). Importance of intracellular protein homeostasis is evident by the presence of nine families with known digestive function (29Varshavsky A. Proc. Natl. Acad. Sci. U. S. A. 1996; 93: 12142-12149Crossref PubMed Scopus (720) Google Scholar). Rhomboid and FtsH peptidases cleave substrates on or in a phospholipid bilayer and are highly conserved (30Urban S. Lee J.R. Freeman M. Cell. 2001; 107: 173-182Abstract Full Text Full Text PDF PubMed Scopus (489) Google Scholar, 31Karata K. Inagawa T. Wilkinson A.J. Tatsuta T. Ogura T. J. Biol. Chem. 1999; 274: 26225-26232Abstract Full Text Full Text PDF PubMed Scopus (175) Google Scholar). The prolyl aminopeptidase family is associated with a variety of other chemistries outside of proteolysis including epoxide hydrolase and monoglyceride lipase activities (32Holmquist M. Curr. Protein Peptide Sci. 2000; 1: 209-235Crossref PubMed Scopus (482) Google Scholar). The M22 family bear homology with the DNAK/Hsp70 family of chaperones (33Aravind L. Koonin E.V. J. Mol. Biol. 1999; 287: 1023-1040Crossref PubMed Scopus (370) Google Scholar). Hence, the 16 near ubiquitous families need not have been active peptidases at their origin nor throughout the dataset. The basal degradome was expanded upon through gains in intra- and extracellular peptidase activity.TABLE 1The nearly ubiquitous core of peptidase families Sixteen peptidase families can be identified in the genomes of all forms of cellular life. The known functions of these peptidases appear to encompass all the requirements for complex proteolysis capable of digestion and protein processing. Several organisms have lost one or more of these peptidase families.FamilyClanCommon nameBiological functionC26PCγ-Glutamyl hydrolaseFolate metabolismC44PBAmidophosphoribosyltransferaseAdenosine biosynthesisM1MAAminopeptidase NProtein turnoverM16BMEPitrilysinSignal peptide processingM20AMHGlutamate carboxypeptidaseProtein turnoverM22MKHsp70/DnaK homologChaperoneM24AMGMethionyl aminopeptidaseProtein turnoverM24BMGAminopeptidase PProtein turnoverM41MAFtsH peptidaseMembrane protein processingS14SKClp peptidaseProtein turnoverS16SJLonA peptidaseProtein turnoverS1BPAHtrA peptidaseProtein turnoverS33SCProlyl aminopeptidaseTurnover and epoxide hydrolaseS54STRhomboidMembrane protein processingS8ASCSubtilisinProtein turnoverT3PBγ-GlutamyltranferaseGlutathione metabolism Open table in a new tab Intracellular peptidase inventions in eukaryotic organisms bring about four major cellular advances in eukaryotes (Table 2). First, ubiquitin- and SUMO-mediated protein turnover emerged through invention of six peptidase families (34Hochstrasser M. Nat. Cell Biol. 2000; 2: E153-E157Crossref PubMed Scopus (367) Google Scholar). The autophagins are also critical for autophagy, a cellular process requiring a complex cytoskeleton. Second, calpains are both unique to eukarya and to known peptidases for their dependence upon Ca2+ for activity (35Goll D.E. Thompson V.F. Li H. Wei W. Cong J. Physiol. Rev. 2003; 83: 731-801Crossref PubMed Scopus (2356) Google Scholar). Calpains are largely cytosolic and critical for cytoskeletal reorganization, endosome recycling, and membrane fusion (36Evans J.S. Turner M.D. J. Neurochem. 2007; 103: 849-859Crossref PubMed Scopus (29) Google Scholar). Third, two peptidase families are involved in structure and cell division in the nucleus. Separases divides sister chromatids during mitosis or meiosis through cleavage of cohesions and nucleoporins form an integral component of the pore separating the nucleus from the cytoplasm (37Uhlmann F. Lottspeich F. Nasmyth K. Nature. 1999; 400: 37-42Crossref PubMed Scopus (754) Google Scholar, 38Liu Z. Makaroff C.A. Plant Cell. 2006; 18: 1213-1225Crossref PubMed Scopus (61) Google Scholar). Last, M76 peptidases are required for processing of mitochondrial proteins including respiratory chain complexes (39Osman C. Wilmes C. Tatsuta T. Langer T. Mol. Biol. Cell. 2007; 18: 627-635Crossref PubMed Scopus (115) Google Scholar). On the basis of these intracellular advances, extracellular proteolysis expanded with the emergence of multicellularity.TABLE 2Eukaryotic peptidase families Thirty-four peptidase families are widely distributed in eukaryotic organisms playing numerous roles in intra- and extracellular biology. The six families of peptidases involved in ubiquitin- and SUMO-mediated protein turnover are most notable given their vital role in protein homeostasis.FamilyClanCommon nameBiological functionA1AAAPepsinProtein turnover and processingA22AADPresenillin peptidaseCell signallingA22BADIMPASProtein processingC12CAUbiquitin C-terminal hydrolaseProtein turnoverC13CDLegumainProtein processingC14BCDMetacaspaseProtein processingC15CFPyroglutamyl-peptidaseBioactive peptide processingC19CAUbiquitin-specific peptidaseProtein turnoverC1ACAPapainProtein turnover and processingC1BCABleomycin hydrolaseProtein processingC48CESENP peptidaseProtein turnoverC50CDSeparaseCell divisionC54CAAutophaginAutophagyC65CAOtubainProtein turnoverC78CAUbiquitin-fold modifier peptidaseProtein turnoverM10AMAMatrix metallopeptidaseExtracellular matrix remodelingM14AMCCarboxypeptidase A1Protein turnover and processingM14BMCCarboxypeptidase EProtein turnover and processingM16AMEPitrilysinProtein processingM16CMEMitochondrial processing peptidaseProtein processingM18MHAminopeptidase 1Protein turnoverM49M-Dipeptidyl-peptidase IIIProtein turnoverM67AMPPoh1 peptidaseProtein turnoverM67CMPAMSH deubiquitinating peptidaseProtein turnoverM76M-Atp23 peptidaseProtein processingM8MALeishmanolysinProtein processingS10SCCarboxypeptidase YProtein processingS26BSFSignalaseProtein processingS28SCPro-Xaa carboxypeptidaseProtein processingS59SPNucleoporinNuclear pore formationS8BSBSubtilisinProtein turnoverS9BSCDipeptidyl-peptidase IVProtein processingS9DSCDipeptidyl-peptidase VProtein processingT1APBProteasome catalytic subunitProtein turnover Open table in a new tab A number of peptidase families involved in extracellular biology evolved in eukarya to broker key regulatory processes and set the stage for further expansion (Table 2). Invention of novel selectivity mechanisms fostered gains in the complexity of extracellular biology. Selectivity advances include the removal of a single basic amino acid from the C terminus of a substrate (M14B family) (40Reznik S.E. Fricker L.D. Cell Mol. Life Sci. 2001; 58: 1790-1804Crossref PubMed Scopus (134) Google Scholar), cleavage of substrates following a penultimate proline residue (S28 family) (41Leiting B. Pryor K.D. Wu J.K. Marsilio F. Patel R.A. Craik C.S. Ellman J.A. Cummings R.T. Thornberry N.A. Biochem. J. 2003; 371: 525-532Crossref PubMed Google Scholar), and steric hindrance to a dipeptidyl peptidase-selective active site by an associated protein domain (S9B family) (42Rea D. Fulop V. Cell Biochem. Biophys. 2006; 44: 349-365Crossref PubMed Scopus (74) Google Scholar). The unusual selectivity of these novel peptidases promoted expansion of cellular communication through processing small bioactive peptides. Of eukaryotic inventions only a handful are specific to higher metazoa. Remarkably, only 10 peptidase families are restricted to the presence in non-yeast metazoa implying late evolutionary discovery (Table 3). Of these, the C14A caspases are notable for their central role in apoptosis (43Nicholson D.W. Thornberry N.A. Trends Biochem. Sci. 1997; 22: 299-306Abstract Full Text PDF PubMed Scopus (2180) Google Scholar). The C14 family is the clearest example of phylogenetic distinction in a peptidase clan. Peptidases in the C14A subfamily are found in eukaryotes, whereas C14B peptidases are not. Therefore an ancestral C14B peptidase is the root of all C14A peptidases. Two additions to the ubiquitin-dependent protein recycling machinery are present in higher organisms, the Cezanne deubiquitinylating peptidases and CylD protein (44Kovalenko A. Chable-Bessia C. Cantarella G. Israel A. Wallach D. Courtois G. Nature. 2003; 424: 801-805Crossref PubMed Scopus (848) Google Scholar, 45Evans P.C. Smith T.S. Lai M.J. Williams M.G. Burke D.F. Heyninck K. Kreike M.M. Beyaert R. Blundell T.L. Kilshaw P.J. J. Biol. Chem. 2003; 278: 23180-23186Abstract Full Text Full Text PDF PubMed Scopus (138) Google Scholar). Diversity of function is embodied by the M12A family of astacins, which act in processes ranging from hormonal control to extracellular matrix remodeling (46Stocker W. Grams F. Baumann U. Reinemer P. Gomis-Ruth F.X. McKay D.B. Bode W. Protein Sci. 1995; 4: 823-840Crossref PubMed Scopus (634) Google Scholar). Early association with a disintegrin domain propelled massive and rapid expansion of adamalysins to shape extracellular matrix composition and other signaling functions (47Black R.A. Rauch C.T. Kozlosky C.J. Peschon J.J. Slack J.L. Wolfson M.F. Castner B.J. Stocking K.L. Reddy P. Srinivasan S. Nelson N. Boiani N. Schooley K.A. Gerhart M. Davis R. Fitzner J.N. Johnson R.S. Paxton R.J. March C.J. Cerretti D.P. Nature. 1997; 385: 729-733Crossref PubMed Scopus (2705) Google Scholar). Of late emerging peptidases, pappalysins are noteworthy given the origin of the name of this family, pregnancy-associated plasma protein-A (48Boldt H.B. Overgaard M.T. Laursen L.S. Weyer K. Sottrup-Jensen L. Oxvig C. Biochem. J. 2001; 358: 359-367Crossref PubMed Scopus (152) Google Scholar). The small number of peptidase inventions by animals is mirrored in plants and yeasts. Only the A1B and M11 peptidase families appear limited to plants, whereas seven families are unique to yeasts. Together, these observations suggest the conserved degradome among eukaryotes required few gains to enable complex multicellularity.TABLE 3Peptidase inventions in non-yeast metazoa Ten peptidase families are unique to higher organisms. Each peptidase family has been shown to play a role in key, and largely, extracellular processes. The most notable exception are the caspases, the executioners of cell death. Both the M12A and M12B peptidase families have diverse functions and underwent rapid expansion in higher metazoa.FamilyClanCommon nameBiological functionC14ACDCaspaseApoptosisC46CHHedgehogDevelopmentC64CACezanne peptidaseProtein turnoverC67CACylD proteinProtein turnoverM12AMAAstacinExtracellular biologyM12BMAAdamalysinExtracellular biologyM2MAAngiotensin-converting enzymeBioactive peptide processingM43BMAPappalysinBioactive peptide processingM50AMMS2P peptidaseSignal peptide processingS60SRLactoferrinIron metabolism Open table in a new tab Although most peptidase families can be grouped through organism-level description, several cannot due to sporadic distribution, poor identification by the HMMs applied, and rare occurrence. In certain instances, the MEROPS classification may require adjustment. For example, several larger peptidase clans with many families are likely distantly related. The common ancestor of these families lived over a billion years ago whose traces have been obliterated by the passage of time (49Feng D.F. Cho G. Doolittle R.F. Proc. Natl. Acad. Sci. U. S. A. 1997; 94: 13028-13033Crossref PubMed Scopus (325) Google Scholar). Hence, phylogenetic reconstruction is challenging and demands both additional sequence information and close attention to reconcile convergent from divergent evolution. It is reasonable that the majority of peptidase families arise from divergent evolution as both the structure and enzymatic mechanism is conserved. For example, the M16 family contains three subfamilies with structural homology and weak sequence similarity (50Shen Y. Joachimiak A. Rosner M.R. Tang W.J. Nature. 2006; 443: 870-874Crossref PubMed Scopus (280) Google Scholar, 51Taylor A.B. Smith B.S. Kitada S. Kojima K. Miyaura H. Otwinowski Z. Ito A. Deisenhofer J. Structure. 2001; 9: 615-625Abstract Full Text Full Text PDF PubMed Scopus (186) Google Scholar, 52Johnson K.A. Bhushan S. Stahl A. Hallberg B.M. Frohn A. Glaser E. Eneqvist T. EMBO J. 2006; 25: 1977-1986Crossref PubMed Scopus (81) Google Scholar). When combined with organism-level distribution, M16B predates both M16C and M16A and is the ancestral family. Such a simple pattern is not readily discerned with most peptidase families. Of these, the S1A peptidases have contentious origin despite widespread abundance. Peptidase families with mixed distribution are spread throughout the clustered data. Dispersal of these 32 peptidase families result from the diversity of both peptidases and genomes examined, gene transfer between organisms, and misidentification. Both free-living and parasitic lifestyles are also encompassed by the dataset and further complicate interpretation of the results. Of mixed peptidase families, the tricorn peptidase, C-terminal processing peptidase, and HslV component of the HslUV peptidase are notable given their importance toward intracellular protein homeostasis yet overlapping and redundant functionality (53Wu W.F. Zhou Y. Gottesman S. J. Bacteriol. 1999; 181: 3681-3687Crossref PubMed Google Scholar, 54Glas R. Bogyo M. McMaster J.S. Gaczynska M. Ploegh H.L. Nature. 1998; 392: 618-622Crossref PubMed Scopus (238) Google Scholar). Two abundant peptidases in higher organisms, the S1A family of trypsins and S9A prolyl oligopeptidases also show a widely dispersed pattern. These peptidase families require further attention to address the interplay between gene transfer and loss. Analysis of peptidase diversity spanning the complete biosphere introduces limitations on the results. HMMs used are derived from MEROPS annotation that return 80% coverage of the data base (supplemental Fig. S1). Usage of protein not the nucleotide sequence data continues bias of the genome annotation and quality of the underlying sequence data (55Devos D. Valencia A. Trends Genet. 2001; 17: 429-431Abstract Full Text Full Text PDF PubMed Scopus (225) Google Scholar). Pseudogenes are not considered. Due to the broad nature of the HMMs applied, some peptidases identified may be non-peptidase homologs, particularly in the S33 family. Hence, the total numbers of peptidases counted are useful estimates that differ from previous studies (56Puente X.S. Sanchez L.M. Gutierrez-Fernandez A. Velasco G. Lopez-Otin C. Biochem. Soc. Trans. 2005; 33: 331-334Crossref PubMed Scopus (111) Google Scholar, 57Puente X.S. Gutierrez-Fernandez A. Ordonez G.R. Hillier L.W. Lopez-Otin C. Genomics. 2005; 86: 638-647Crossref PubMed Scopus (51) Google Scholar, 58Lopez-Otin, C., and Bond, J. S. (July 23, 2008) J. Biol. Chem., DOI 10.1074/jbc.R800035200Google Scholar). These issues encourage revisions to the framework. Despite these limitations, clear trends emerge in the evolution of peptidase diversity. Identification of a near ubiquitous core of peptidase families provides a working definition of a basal degradome and a starting point to appreciate evolutionary expansion of proteolytic networks. Near ubiquitous peptidase families do not increase the copy number per genome over time and do not expand into different functional niches. Non-expanding, highly conserved peptidase families broker intracellular processes and absence in certain organisms suggests they can be replaced. Therefore, although the core set of peptidases fulfill vital roles they are neither malleable to the forces of molecular evolution nor location independent. Such observations are all the more remarkable given exposure to billions of years of natural selection. Degradome expansion demanded novel mechanisms of limited proteolysis particularly in extracellular environments. Eukaryotes met this requirement through a variety of protein innovations yet a surprising paucity of these novel families is unique to higher metazoa. Limited peptidase inventions in higher organisms suggest network organization, refining properties of select peptidases and enhanced spatial and temporal regulation were sufficient to generate the wide diversity of proteolysis in life. Trypsins, the S1A family, are the most abundant member in the human degradome, such as those of all non-yeast metazoa and many lower eukaryotes. Emergence of the S1A peptidase family predates major speciation events in multicellular eukaryotes, yet the origin of this family remains unclear as with several other peptidase families due to mixed distribution. Visualization of degradome diversity provides a roadmap to develop a timeline for the evolution of proteolysis at the systems level. Download .zip (.13 MB) Help with zip files" @default.
- W2043317687 created "2016-06-24" @default.
- W2043317687 creator A5042932390 @default.
- W2043317687 creator A5046701540 @default.
- W2043317687 date "2008-10-01" @default.
- W2043317687 modified "2023-10-18" @default.
- W2043317687 title "Evolution of Peptidase Diversity" @default.
- W2043317687 cites W119139793 @default.
- W2043317687 cites W1607874074 @default.
- W2043317687 cites W1638194466 @default.
- W2043317687 cites W1942214720 @default.
- W2043317687 cites W1964911761 @default.
- W2043317687 cites W1967860838 @default.
- W2043317687 cites W1969995117 @default.
- W2043317687 cites W1974224443 @default.
- W2043317687 cites W1983345237 @default.
- W2043317687 cites W1989992559 @default.
- W2043317687 cites W1991063206 @default.
- W2043317687 cites W2005359497 @default.
- W2043317687 cites W2006373747 @default.
- W2043317687 cites W2007810290 @default.
- W2043317687 cites W2020969008 @default.
- W2043317687 cites W2022114463 @default.
- W2043317687 cites W2029454644 @default.
- W2043317687 cites W2030776545 @default.
- W2043317687 cites W2032838501 @default.
- W2043317687 cites W2033309752 @default.
- W2043317687 cites W2035393691 @default.
- W2043317687 cites W2035847639 @default.
- W2043317687 cites W2045165720 @default.
- W2043317687 cites W2046974910 @default.
- W2043317687 cites W2049110705 @default.
- W2043317687 cites W2053006426 @default.
- W2043317687 cites W2053684471 @default.
- W2043317687 cites W2055264790 @default.
- W2043317687 cites W2061222946 @default.
- W2043317687 cites W2062919894 @default.
- W2043317687 cites W2069508712 @default.
- W2043317687 cites W2074881372 @default.
- W2043317687 cites W2088749908 @default.
- W2043317687 cites W2093035634 @default.
- W2043317687 cites W2094079380 @default.
- W2043317687 cites W2096457861 @default.
- W2043317687 cites W2097536945 @default.
- W2043317687 cites W2098558111 @default.
- W2043317687 cites W2106882534 @default.
- W2043317687 cites W2107190860 @default.
- W2043317687 cites W2110283328 @default.
- W2043317687 cites W2127115092 @default.
- W2043317687 cites W2133155521 @default.
- W2043317687 cites W2133334674 @default.
- W2043317687 cites W2140244239 @default.
- W2043317687 cites W2150926065 @default.
- W2043317687 cites W2150934173 @default.
- W2043317687 cites W2151831732 @default.
- W2043317687 cites W2158665004 @default.
- W2043317687 cites W2166865790 @default.
- W2043317687 cites W2169009504 @default.
- W2043317687 cites W2169436026 @default.
- W2043317687 cites W2178503716 @default.
- W2043317687 cites W3147254695 @default.
- W2043317687 cites W4211160326 @default.
- W2043317687 cites W4251174927 @default.
- W2043317687 cites W4253913636 @default.
- W2043317687 doi "https://doi.org/10.1074/jbc.m804650200" @default.
- W2043317687 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2573091" @default.
- W2043317687 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/18768474" @default.
- W2043317687 hasPublicationYear "2008" @default.
- W2043317687 type Work @default.
- W2043317687 sameAs 2043317687 @default.
- W2043317687 citedByCount "93" @default.
- W2043317687 countsByYear W20433176872012 @default.
- W2043317687 countsByYear W20433176872013 @default.
- W2043317687 countsByYear W20433176872014 @default.
- W2043317687 countsByYear W20433176872015 @default.
- W2043317687 countsByYear W20433176872016 @default.
- W2043317687 countsByYear W20433176872017 @default.
- W2043317687 countsByYear W20433176872018 @default.
- W2043317687 countsByYear W20433176872019 @default.
- W2043317687 countsByYear W20433176872020 @default.
- W2043317687 countsByYear W20433176872021 @default.
- W2043317687 countsByYear W20433176872022 @default.
- W2043317687 countsByYear W20433176872023 @default.
- W2043317687 crossrefType "journal-article" @default.
- W2043317687 hasAuthorship W2043317687A5042932390 @default.
- W2043317687 hasAuthorship W2043317687A5046701540 @default.
- W2043317687 hasBestOaLocation W20433176871 @default.
- W2043317687 hasConcept C144024400 @default.
- W2043317687 hasConcept C19165224 @default.
- W2043317687 hasConcept C2781316041 @default.
- W2043317687 hasConcept C78458016 @default.
- W2043317687 hasConcept C86803240 @default.
- W2043317687 hasConceptScore W2043317687C144024400 @default.
- W2043317687 hasConceptScore W2043317687C19165224 @default.
- W2043317687 hasConceptScore W2043317687C2781316041 @default.
- W2043317687 hasConceptScore W2043317687C78458016 @default.
- W2043317687 hasConceptScore W2043317687C86803240 @default.
- W2043317687 hasIssue "44" @default.