Matches in SemOpenAlex for { <https://semopenalex.org/work/W2786142325> ?p ?o ?g. }
- W2786142325 endingPage "i303" @default.
- W2786142325 startingPage "i295" @default.
- W2786142325 abstract "The effective representation of proteins is a crucial task that directly affects the performance of many bioinformatics problems. Related proteins usually bind to similar ligands. Chemical characteristics of ligands are known to capture the functional and mechanistic properties of proteins suggesting that a ligand based approach can be utilized in protein representation. In this study, we propose SMILESVec, a SMILES-based method to represent ligands and a novel method to compute similarity of proteins by describing them based on their ligands. The proteins are defined utilizing the word-embeddings of the SMILES strings of their ligands. The performance of the proposed protein description method is evaluated in protein clustering task using TransClust and MCL algorithms. Two other protein representation methods that utilize protein sequence, BLAST and ProtVec, and two compound fingerprint based protein representation methods are compared. We showed that ligand-based protein representation, which uses only SMILES strings of the ligands that proteins bind to, performs as well as protein-sequence based representation methods in protein clustering. The results suggest that ligand-based protein description can be an alternative to the traditional sequence or structure based representation of proteins and this novel approach can be applied to different bioinformatics problems such as prediction of new protein-ligand interactions and protein function annotation." @default.
- W2786142325 created "2018-02-23" @default.
- W2786142325 creator A5030252794 @default.
- W2786142325 creator A5061247461 @default.
- W2786142325 creator A5077786612 @default.
- W2786142325 date "2018-06-27" @default.
- W2786142325 modified "2023-10-16" @default.
- W2786142325 title "A novel methodology on distributed representations of proteins using their interacting ligands" @default.
- W2786142325 cites W1501531009 @default.
- W2786142325 cites W1601495365 @default.
- W2786142325 cites W1663797894 @default.
- W2786142325 cites W1984794455 @default.
- W2786142325 cites W1988037271 @default.
- W2786142325 cites W1992427214 @default.
- W2786142325 cites W2001424887 @default.
- W2786142325 cites W2005851379 @default.
- W2786142325 cites W2008381136 @default.
- W2786142325 cites W2036887984 @default.
- W2786142325 cites W2038252445 @default.
- W2786142325 cites W2041947516 @default.
- W2786142325 cites W2055043387 @default.
- W2786142325 cites W2058720919 @default.
- W2786142325 cites W2070930739 @default.
- W2786142325 cites W2080189382 @default.
- W2786142325 cites W2085277871 @default.
- W2786142325 cites W2096541451 @default.
- W2786142325 cites W2110340112 @default.
- W2786142325 cites W2124166542 @default.
- W2786142325 cites W2136134567 @default.
- W2786142325 cites W2143986312 @default.
- W2786142325 cites W2145957695 @default.
- W2786142325 cites W2152950651 @default.
- W2786142325 cites W2153341679 @default.
- W2786142325 cites W2155478691 @default.
- W2786142325 cites W2157253968 @default.
- W2786142325 cites W2161072217 @default.
- W2786142325 cites W2162011385 @default.
- W2786142325 cites W2256119113 @default.
- W2786142325 cites W2297764176 @default.
- W2786142325 cites W2340754995 @default.
- W2786142325 cites W2442910829 @default.
- W2786142325 cites W2462891382 @default.
- W2786142325 cites W2487850358 @default.
- W2786142325 cites W2489720698 @default.
- W2786142325 cites W2493614923 @default.
- W2786142325 cites W2547584528 @default.
- W2786142325 cites W2559007573 @default.
- W2786142325 cites W2592692577 @default.
- W2786142325 cites W2622236128 @default.
- W2786142325 cites W2777416523 @default.
- W2786142325 cites W2952610991 @default.
- W2786142325 cites W363319864 @default.
- W2786142325 doi "https://doi.org/10.1093/bioinformatics/bty287" @default.
- W2786142325 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/6022674" @default.
- W2786142325 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/29949957" @default.
- W2786142325 hasPublicationYear "2018" @default.
- W2786142325 type Work @default.
- W2786142325 sameAs 2786142325 @default.
- W2786142325 citedByCount "29" @default.
- W2786142325 countsByYear W27861423252018 @default.
- W2786142325 countsByYear W27861423252019 @default.
- W2786142325 countsByYear W27861423252020 @default.
- W2786142325 countsByYear W27861423252021 @default.
- W2786142325 countsByYear W27861423252022 @default.
- W2786142325 countsByYear W27861423252023 @default.
- W2786142325 crossrefType "journal-article" @default.
- W2786142325 hasAuthorship W2786142325A5030252794 @default.
- W2786142325 hasAuthorship W2786142325A5061247461 @default.
- W2786142325 hasAuthorship W2786142325A5077786612 @default.
- W2786142325 hasBestOaLocation W27861423251 @default.
- W2786142325 hasConcept C10010492 @default.
- W2786142325 hasConcept C103278499 @default.
- W2786142325 hasConcept C104317684 @default.
- W2786142325 hasConcept C109095088 @default.
- W2786142325 hasConcept C115961682 @default.
- W2786142325 hasConcept C116569031 @default.
- W2786142325 hasConcept C14036430 @default.
- W2786142325 hasConcept C154945302 @default.
- W2786142325 hasConcept C162324750 @default.
- W2786142325 hasConcept C167625842 @default.
- W2786142325 hasConcept C170493617 @default.
- W2786142325 hasConcept C17744445 @default.
- W2786142325 hasConcept C187736073 @default.
- W2786142325 hasConcept C199539241 @default.
- W2786142325 hasConcept C207060522 @default.
- W2786142325 hasConcept C2776359362 @default.
- W2786142325 hasConcept C2778112365 @default.
- W2786142325 hasConcept C2780451532 @default.
- W2786142325 hasConcept C2986374874 @default.
- W2786142325 hasConcept C41008148 @default.
- W2786142325 hasConcept C54355233 @default.
- W2786142325 hasConcept C55493867 @default.
- W2786142325 hasConcept C60644358 @default.
- W2786142325 hasConcept C70721500 @default.
- W2786142325 hasConcept C73555534 @default.
- W2786142325 hasConcept C86803240 @default.
- W2786142325 hasConcept C94625758 @default.
- W2786142325 hasConceptScore W2786142325C10010492 @default.