Matches in SemOpenAlex for { <https://semopenalex.org/work/W2949203875> ?p ?o ?g. }
- W2949203875 abstract "While eukaryotic noncoding RNAs have recently received intense scrutiny, it is becoming clear that bacterial transcription is at least as pervasive. Bacterial small RNAs and antisense RNAs (sRNAs) are often assumed to be noncoding, due to their lack of long open reading frames (ORFs). However, there are numerous examples of sRNAs encoding for small proteins, whether or not they also have a regulatory role at the RNA level.Here, we apply flexible machine learning techniques based on sequence features and comparative genomics to quantify the prevalence of sRNA ORFs under natural selection to maintain protein-coding function in 14 phylogenetically diverse bacteria. Importantly, we quantify uncertainty in our predictions, and follow up on them using mass spectrometry proteomics and comparison to datasets including ribosome profiling.A majority of annotated sRNAs have at least one ORF between 10 and 50 amino acids long, and we conservatively predict that 409±191.7 unannotated sRNA ORFs are under selection to maintain coding (mean estimate and 95% confidence interval), an average of 29 per species considered here. This implies that overall at least 10.3±0.5% of sRNAs have a coding ORF, and in some species around 20% do. 165±69 of these novel coding ORFs have some antisense overlap to annotated ORFs. As experimental validation, many of our predictions are translated in published ribosome profiling data and are identified via mass spectrometry shotgun proteomics. B. subtilis sRNAs with coding ORFs are enriched for high expression in biofilms and confluent growth, and S. pneumoniae sRNAs with coding ORFs are involved in virulence. sRNA coding ORFs are enriched for transmembrane domains and many are predicted novel components of type I toxin/antitoxin systems.We predict over two dozen new protein-coding genes per bacterial species, but crucially also quantified the uncertainty in this estimate. Our predictions for sRNA coding ORFs, along with predicted novel type I toxins and tools for sorting and visualizing genomic context, are freely available in a user-friendly format at http://disco-bac.web.pasteur.fr. We expect these easily-accessible predictions to be a valuable tool for the study not only of bacterial sRNAs and type I toxin-antitoxin systems, but also of bacterial genetics and genomics." @default.
- W2949203875 created "2019-06-27" @default.
- W2949203875 creator A5004385228 @default.
- W2949203875 creator A5004627077 @default.
- W2949203875 creator A5036917740 @default.
- W2949203875 creator A5059640936 @default.
- W2949203875 creator A5069672356 @default.
- W2949203875 creator A5076028970 @default.
- W2949203875 creator A5084173757 @default.
- W2949203875 date "2017-07-21" @default.
- W2949203875 modified "2023-10-10" @default.
- W2949203875 title "Common and phylogenetically widespread coding for peptides by bacterial small RNAs" @default.
- W2949203875 cites W1956770701 @default.
- W2949203875 cites W1963945004 @default.
- W2949203875 cites W1964557084 @default.
- W2949203875 cites W1969422740 @default.
- W2949203875 cites W1977241485 @default.
- W2949203875 cites W1981870052 @default.
- W2949203875 cites W1986733693 @default.
- W2949203875 cites W2024354549 @default.
- W2949203875 cites W2026570544 @default.
- W2949203875 cites W2027839322 @default.
- W2949203875 cites W2042191361 @default.
- W2949203875 cites W2043301766 @default.
- W2949203875 cites W2044750677 @default.
- W2949203875 cites W2046374424 @default.
- W2949203875 cites W2057382328 @default.
- W2949203875 cites W2062640511 @default.
- W2949203875 cites W2062896692 @default.
- W2949203875 cites W2068386909 @default.
- W2949203875 cites W2071746658 @default.
- W2949203875 cites W2073708599 @default.
- W2949203875 cites W2075619582 @default.
- W2949203875 cites W2077460182 @default.
- W2949203875 cites W2082047304 @default.
- W2949203875 cites W2087934409 @default.
- W2949203875 cites W2090327616 @default.
- W2949203875 cites W2090364092 @default.
- W2949203875 cites W2092035637 @default.
- W2949203875 cites W2098508116 @default.
- W2949203875 cites W2102619694 @default.
- W2949203875 cites W2103222799 @default.
- W2949203875 cites W2105114815 @default.
- W2949203875 cites W2105199251 @default.
- W2949203875 cites W2106350024 @default.
- W2949203875 cites W2110335151 @default.
- W2949203875 cites W2116955103 @default.
- W2949203875 cites W2117253407 @default.
- W2949203875 cites W2117725451 @default.
- W2949203875 cites W2128580612 @default.
- W2949203875 cites W2131892542 @default.
- W2949203875 cites W2133990480 @default.
- W2949203875 cites W2139658526 @default.
- W2949203875 cites W2140257956 @default.
- W2949203875 cites W2140874466 @default.
- W2949203875 cites W2148771849 @default.
- W2949203875 cites W2152770371 @default.
- W2949203875 cites W2159468050 @default.
- W2949203875 cites W2166494042 @default.
- W2949203875 cites W2170747616 @default.
- W2949203875 cites W2189955651 @default.
- W2949203875 cites W2322749552 @default.
- W2949203875 cites W342631587 @default.
- W2949203875 cites W4246535589 @default.
- W2949203875 cites W582950784 @default.
- W2949203875 doi "https://doi.org/10.1186/s12864-017-3932-y" @default.
- W2949203875 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/5521070" @default.
- W2949203875 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/28732463" @default.
- W2949203875 hasPublicationYear "2017" @default.
- W2949203875 type Work @default.
- W2949203875 sameAs 2949203875 @default.
- W2949203875 citedByCount "27" @default.
- W2949203875 countsByYear W29492038752018 @default.
- W2949203875 countsByYear W29492038752019 @default.
- W2949203875 countsByYear W29492038752020 @default.
- W2949203875 countsByYear W29492038752021 @default.
- W2949203875 countsByYear W29492038752022 @default.
- W2949203875 crossrefType "journal-article" @default.
- W2949203875 hasAuthorship W2949203875A5004385228 @default.
- W2949203875 hasAuthorship W2949203875A5004627077 @default.
- W2949203875 hasAuthorship W2949203875A5036917740 @default.
- W2949203875 hasAuthorship W2949203875A5059640936 @default.
- W2949203875 hasAuthorship W2949203875A5069672356 @default.
- W2949203875 hasAuthorship W2949203875A5076028970 @default.
- W2949203875 hasAuthorship W2949203875A5084173757 @default.
- W2949203875 hasBestOaLocation W29492038751 @default.
- W2949203875 hasConcept C104317684 @default.
- W2949203875 hasConcept C167625842 @default.
- W2949203875 hasConcept C182325514 @default.
- W2949203875 hasConcept C2780530800 @default.
- W2949203875 hasConcept C46111723 @default.
- W2949203875 hasConcept C47289529 @default.
- W2949203875 hasConcept C54355233 @default.
- W2949203875 hasConcept C67705224 @default.
- W2949203875 hasConcept C70721500 @default.
- W2949203875 hasConcept C86803240 @default.
- W2949203875 hasConcept C88478588 @default.
- W2949203875 hasConceptScore W2949203875C104317684 @default.
- W2949203875 hasConceptScore W2949203875C167625842 @default.
- W2949203875 hasConceptScore W2949203875C182325514 @default.