Matches in SemOpenAlex for { <https://semopenalex.org/work/W3012055802> ?p ?o ?g. }
- W3012055802 endingPage "621" @default.
- W3012055802 startingPage "612" @default.
- W3012055802 abstract "Genes are termed to be essential if their loss of function compromises viability or results in profound loss of fitness. On the genome scale, these genes can be determined experimentally employing RNAi or knockout screens, but this is very resource intensive. Computational methods for essential gene prediction can overcome this drawback, particularly when intrinsic (e.g. from the protein sequence) as well as extrinsic features (e.g. from transcription profiles) are considered. In this work, we employed machine learning to predict essential genes in Drosophila melanogaster. A total of 27,340 features were generated based on a large variety of different aspects comprising nucleotide and protein sequences, gene networks, protein-protein interactions, evolutionary conservation and functional annotations. Employing cross-validation, we obtained an excellent prediction performance. The best model achieved in D. melanogaster a ROC-AUC of 0.90, a PR-AUC of 0.30 and a F1 score of 0.34. Our approach considerably outperformed a benchmark method in which only features derived from the protein sequences were used (P < 0.001). Investigating which features contributed to this success, we found all categories of features, most prominently network topological, functional and sequence-based features. To evaluate our approach we performed the same workflow for essential gene prediction in human and achieved an ROC-AUC = 0.97, PR-AUC = 0.73, and F1 = 0.64. In summary, this study shows that using our well-elaborated assembly of features covering a broad range of intrinsic and extrinsic gene and protein features enabled intelligent systems to predict well the essentiality of genes in an organism." @default.
- W3012055802 created "2020-03-23" @default.
- W3012055802 creator A5005195211 @default.
- W3012055802 creator A5012302618 @default.
- W3012055802 creator A5031343408 @default.
- W3012055802 creator A5051035768 @default.
- W3012055802 creator A5077171389 @default.
- W3012055802 creator A5084196031 @default.
- W3012055802 date "2020-01-01" @default.
- W3012055802 modified "2023-10-07" @default.
- W3012055802 title "Essential gene prediction in Drosophila melanogaster using machine learning approaches based on sequence and functional features" @default.
- W3012055802 cites W1578104218 @default.
- W3012055802 cites W1831050183 @default.
- W3012055802 cites W1966327575 @default.
- W3012055802 cites W1966716734 @default.
- W3012055802 cites W1974482866 @default.
- W3012055802 cites W1977743896 @default.
- W3012055802 cites W1987553882 @default.
- W3012055802 cites W1989191870 @default.
- W3012055802 cites W1998025025 @default.
- W3012055802 cites W2003390994 @default.
- W3012055802 cites W2005740247 @default.
- W3012055802 cites W2011584557 @default.
- W3012055802 cites W2051001559 @default.
- W3012055802 cites W2075216460 @default.
- W3012055802 cites W2097401587 @default.
- W3012055802 cites W2105295234 @default.
- W3012055802 cites W2107903949 @default.
- W3012055802 cites W2113140044 @default.
- W3012055802 cites W2119412782 @default.
- W3012055802 cites W2122349387 @default.
- W3012055802 cites W2130790725 @default.
- W3012055802 cites W2148143831 @default.
- W3012055802 cites W2153091158 @default.
- W3012055802 cites W2155723007 @default.
- W3012055802 cites W2157076315 @default.
- W3012055802 cites W2158714788 @default.
- W3012055802 cites W2164339920 @default.
- W3012055802 cites W2167364244 @default.
- W3012055802 cites W2171464043 @default.
- W3012055802 cites W2184815069 @default.
- W3012055802 cites W2200257500 @default.
- W3012055802 cites W2296755350 @default.
- W3012055802 cites W2334956771 @default.
- W3012055802 cites W2340497515 @default.
- W3012055802 cites W2404769538 @default.
- W3012055802 cites W2537679995 @default.
- W3012055802 cites W2543582635 @default.
- W3012055802 cites W2751584274 @default.
- W3012055802 cites W2796976120 @default.
- W3012055802 cites W2801813992 @default.
- W3012055802 cites W2805661657 @default.
- W3012055802 cites W2898204371 @default.
- W3012055802 cites W2898311987 @default.
- W3012055802 cites W2900569176 @default.
- W3012055802 cites W2904672199 @default.
- W3012055802 cites W2914170009 @default.
- W3012055802 cites W2917858269 @default.
- W3012055802 cites W2944545583 @default.
- W3012055802 cites W2949280944 @default.
- W3012055802 cites W2951756882 @default.
- W3012055802 cites W2971136217 @default.
- W3012055802 cites W4246444062 @default.
- W3012055802 doi "https://doi.org/10.1016/j.csbj.2020.02.022" @default.
- W3012055802 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/7096750" @default.
- W3012055802 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/32257045" @default.
- W3012055802 hasPublicationYear "2020" @default.
- W3012055802 type Work @default.
- W3012055802 sameAs 3012055802 @default.
- W3012055802 citedByCount "26" @default.
- W3012055802 countsByYear W30120558022020 @default.
- W3012055802 countsByYear W30120558022021 @default.
- W3012055802 countsByYear W30120558022022 @default.
- W3012055802 countsByYear W30120558022023 @default.
- W3012055802 crossrefType "journal-article" @default.
- W3012055802 hasAuthorship W3012055802A5005195211 @default.
- W3012055802 hasAuthorship W3012055802A5012302618 @default.
- W3012055802 hasAuthorship W3012055802A5031343408 @default.
- W3012055802 hasAuthorship W3012055802A5051035768 @default.
- W3012055802 hasAuthorship W3012055802A5077171389 @default.
- W3012055802 hasAuthorship W3012055802A5084196031 @default.
- W3012055802 hasBestOaLocation W30120558021 @default.
- W3012055802 hasConcept C10010492 @default.
- W3012055802 hasConcept C104317684 @default.
- W3012055802 hasConcept C105565629 @default.
- W3012055802 hasConcept C119857082 @default.
- W3012055802 hasConcept C13280743 @default.
- W3012055802 hasConcept C141231307 @default.
- W3012055802 hasConcept C154945302 @default.
- W3012055802 hasConcept C167625842 @default.
- W3012055802 hasConcept C185798385 @default.
- W3012055802 hasConcept C205649164 @default.
- W3012055802 hasConcept C2775905019 @default.
- W3012055802 hasConcept C2776998849 @default.
- W3012055802 hasConcept C2780104201 @default.
- W3012055802 hasConcept C41008148 @default.
- W3012055802 hasConcept C54355233 @default.
- W3012055802 hasConcept C60644358 @default.