Matches in SemOpenAlex for { <https://semopenalex.org/work/W3206316099> ?p ?o ?g. }
- W3206316099 abstract "Paralogs formed through gene duplication and isoforms formed through alternative splicing have been important processes for increasing protein diversity and maintaining cellular homeostasis. Despite their recognized importance and the advent of large-scale genomic and transcriptomic analyses, paradoxically, accurate annotations of all gene loci to allow the identification of paralogs and isoforms remain surprisingly incomplete. In particular, the global analysis of the transcriptome of a non-model organism for which there is no reference genome is especially challenging.To reliably discriminate between the paralogs and isoforms in RNA-seq data, we redefined the pre-existing sequence features (sequence similarity, inverse count of consecutive identical or non-identical blocks, and match-mismatch fraction) previously derived from full-length cDNAs and EST sequences and described newly discovered genomic and transcriptomic features (twilight zone of protein sequence alignment and expression level difference). In addition, the effectiveness and relevance of the proposed features were verified with two widely used support vector machine (SVM) and random forest (RF) models. From nine RNA-seq datasets, all AUC (area under the curve) scores of ROC (receiver operating characteristic) curves were over 0.9 in the RF model and significantly higher than those in the SVM model.In this study, using an RF model with five proposed RNA-seq features, we implemented our method called Paralogs and Isoforms Classifier based on Machine-learning approaches (PIC-Me) and showed that it outperformed an existing method. Finally, we envision that our tool will be a valuable computational resource for the genomics community to help with gene annotation and will aid in comparative transcriptomics and evolutionary genomics studies, especially those on non-model organisms." @default.
- W3206316099 created "2021-10-25" @default.
- W3206316099 creator A5024138203 @default.
- W3206316099 creator A5036941709 @default.
- W3206316099 creator A5061950123 @default.
- W3206316099 date "2021-10-01" @default.
- W3206316099 modified "2023-10-10" @default.
- W3206316099 title "PIC-Me: paralogs and isoforms classifier based on machine-learning approaches" @default.
- W3206316099 cites W1807937952 @default.
- W3206316099 cites W1913751214 @default.
- W3206316099 cites W1971231926 @default.
- W3206316099 cites W1981239255 @default.
- W3206316099 cites W1990813498 @default.
- W3206316099 cites W1991402780 @default.
- W3206316099 cites W1999574084 @default.
- W3206316099 cites W2014770059 @default.
- W3206316099 cites W2028279880 @default.
- W3206316099 cites W2048512322 @default.
- W3206316099 cites W2057874793 @default.
- W3206316099 cites W2059338023 @default.
- W3206316099 cites W2062561058 @default.
- W3206316099 cites W2063086867 @default.
- W3206316099 cites W2069724339 @default.
- W3206316099 cites W2079269132 @default.
- W3206316099 cites W2098433852 @default.
- W3206316099 cites W2099451735 @default.
- W3206316099 cites W2100459781 @default.
- W3206316099 cites W2101220662 @default.
- W3206316099 cites W2109322710 @default.
- W3206316099 cites W2124985265 @default.
- W3206316099 cites W2126419817 @default.
- W3206316099 cites W2127774996 @default.
- W3206316099 cites W2131271579 @default.
- W3206316099 cites W2132023322 @default.
- W3206316099 cites W2137176543 @default.
- W3206316099 cites W2142678478 @default.
- W3206316099 cites W2145100181 @default.
- W3206316099 cites W2147098528 @default.
- W3206316099 cites W2151462466 @default.
- W3206316099 cites W2153833484 @default.
- W3206316099 cites W2154431984 @default.
- W3206316099 cites W2155804510 @default.
- W3206316099 cites W2156340505 @default.
- W3206316099 cites W2158948118 @default.
- W3206316099 cites W2166085889 @default.
- W3206316099 cites W2169711373 @default.
- W3206316099 cites W2407523666 @default.
- W3206316099 cites W2469916682 @default.
- W3206316099 cites W2588289967 @default.
- W3206316099 cites W2612976808 @default.
- W3206316099 cites W2789597829 @default.
- W3206316099 cites W2790734286 @default.
- W3206316099 cites W2947780575 @default.
- W3206316099 cites W3136918052 @default.
- W3206316099 doi "https://doi.org/10.1186/s12859-021-04229-x" @default.
- W3206316099 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/8529730" @default.
- W3206316099 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/34674638" @default.
- W3206316099 hasPublicationYear "2021" @default.
- W3206316099 type Work @default.
- W3206316099 sameAs 3206316099 @default.
- W3206316099 citedByCount "0" @default.
- W3206316099 crossrefType "journal-article" @default.
- W3206316099 hasAuthorship W3206316099A5024138203 @default.
- W3206316099 hasAuthorship W3206316099A5036941709 @default.
- W3206316099 hasAuthorship W3206316099A5061950123 @default.
- W3206316099 hasBestOaLocation W32063160991 @default.
- W3206316099 hasConcept C104317684 @default.
- W3206316099 hasConcept C105565629 @default.
- W3206316099 hasConcept C107397762 @default.
- W3206316099 hasConcept C119857082 @default.
- W3206316099 hasConcept C12267149 @default.
- W3206316099 hasConcept C141231307 @default.
- W3206316099 hasConcept C150194340 @default.
- W3206316099 hasConcept C154945302 @default.
- W3206316099 hasConcept C162317418 @default.
- W3206316099 hasConcept C169258074 @default.
- W3206316099 hasConcept C189206191 @default.
- W3206316099 hasConcept C194583182 @default.
- W3206316099 hasConcept C41008148 @default.
- W3206316099 hasConcept C53345823 @default.
- W3206316099 hasConcept C54355233 @default.
- W3206316099 hasConcept C54458228 @default.
- W3206316099 hasConcept C67705224 @default.
- W3206316099 hasConcept C70721500 @default.
- W3206316099 hasConcept C86803240 @default.
- W3206316099 hasConcept C95371953 @default.
- W3206316099 hasConcept C95623464 @default.
- W3206316099 hasConceptScore W3206316099C104317684 @default.
- W3206316099 hasConceptScore W3206316099C105565629 @default.
- W3206316099 hasConceptScore W3206316099C107397762 @default.
- W3206316099 hasConceptScore W3206316099C119857082 @default.
- W3206316099 hasConceptScore W3206316099C12267149 @default.
- W3206316099 hasConceptScore W3206316099C141231307 @default.
- W3206316099 hasConceptScore W3206316099C150194340 @default.
- W3206316099 hasConceptScore W3206316099C154945302 @default.
- W3206316099 hasConceptScore W3206316099C162317418 @default.
- W3206316099 hasConceptScore W3206316099C169258074 @default.
- W3206316099 hasConceptScore W3206316099C189206191 @default.
- W3206316099 hasConceptScore W3206316099C194583182 @default.
- W3206316099 hasConceptScore W3206316099C41008148 @default.