Matches in SemOpenAlex for { <https://semopenalex.org/work/W3014207239> ?p ?o ?g. }
- W3014207239 endingPage "6" @default.
- W3014207239 startingPage "1" @default.
- W3014207239 abstract "With the advent of natural language processing (NLP) techniques empowered by deep learning approaches, more detailed relationships between words have been unraveled. Word2Vec is quite robust in discovering contextual and semantic relationships. The genome, being a long text, is subject to similar studies to unravel yet-to-be-discovered relationships between DNA k-mers. Dna2vec applies the Word2Vec approach to the whole genome so that DNA k-mers are represented as vectors. Cosine similarity queries on DNA vectors reveal unusual relationships between DNA k-mers. In this study, we examined DNA sequence based prediction of mutation susceptibility. Initially, we generated word vectors for the human and mouse genomes via dna2vec. Separately, we retrieved the coordinates of common and all mutations from dbSNP. For each coordinate, we extracted the 8-nucleotide k-mers intersecting the mutation and aggregated the results such that the number of mutations for each 8-mer was tabulated. These results were combined with the dna2vec cosine similarity data. Our results showed that, for a given k-mer, the k-mer with the highest cosine similarity coincides with the k-mer with the highest mutation count. In other words, the neighbor with the highest cosine similarity for a k-mer was also seen to be the neighbor overlapping the mutation count. In our study, the dna2vec vs. mutation overlap was 80% for human and 70% for mouse. In conclusion, dna2vec and other word embedding approaches can be used to reveal mutation or variation characteristics of genomes without sequencing or experimental data, solely using the genome sequence itself. This might pave the way for understanding the underlying mechanism or dynamics of mutations in genomes." @default.
- W3014207239 created "2020-04-10" @default.
- W3014207239 creator A5010415587 @default.
- W3014207239 date "2020-03-20" @default.
- W3014207239 modified "2023-10-01" @default.
- W3014207239 title "Assessment of Mutation Susceptibility in DNA Sequences with Word Vectors" @default.
- W3014207239 cites W1241017059 @default.
- W3014207239 cites W2074231493 @default.
- W3014207239 cites W2166434810 @default.
- W3014207239 cites W2188648436 @default.
- W3014207239 cites W2250189634 @default.
- W3014207239 cites W2408291668 @default.
- W3014207239 cites W2460442863 @default.
- W3014207239 cites W2485374661 @default.
- W3014207239 cites W2500830883 @default.
- W3014207239 cites W2534538876 @default.
- W3014207239 cites W2560645892 @default.
- W3014207239 cites W2740751204 @default.
- W3014207239 cites W2741447225 @default.
- W3014207239 cites W2963639656 @default.
- W3014207239 cites W2963864161 @default.
- W3014207239 cites W2963923670 @default.
- W3014207239 cites W3104097132 @default.
- W3014207239 doi "https://doi.org/10.38016/jista.674910" @default.
- W3014207239 hasPublicationYear "2020" @default.
- W3014207239 type Work @default.
- W3014207239 sameAs 3014207239 @default.
- W3014207239 citedByCount "2" @default.
- W3014207239 countsByYear W30142072392021 @default.
- W3014207239 countsByYear W30142072392022 @default.
- W3014207239 crossrefType "journal-article" @default.
- W3014207239 hasAuthorship W3014207239A5010415587 @default.
- W3014207239 hasBestOaLocation W30142072391 @default.
- W3014207239 hasConcept C103278499 @default.
- W3014207239 hasConcept C104317684 @default.
- W3014207239 hasConcept C115961682 @default.
- W3014207239 hasConcept C135763542 @default.
- W3014207239 hasConcept C141231307 @default.
- W3014207239 hasConcept C153180895 @default.
- W3014207239 hasConcept C153209595 @default.
- W3014207239 hasConcept C154945302 @default.
- W3014207239 hasConcept C197077220 @default.
- W3014207239 hasConcept C2524010 @default.
- W3014207239 hasConcept C2776461190 @default.
- W3014207239 hasConcept C2780762811 @default.
- W3014207239 hasConcept C33923547 @default.
- W3014207239 hasConcept C35794970 @default.
- W3014207239 hasConcept C41008148 @default.
- W3014207239 hasConcept C41608201 @default.
- W3014207239 hasConcept C501734568 @default.
- W3014207239 hasConcept C51679486 @default.
- W3014207239 hasConcept C54355233 @default.
- W3014207239 hasConcept C552990157 @default.
- W3014207239 hasConcept C70721500 @default.
- W3014207239 hasConcept C86803240 @default.
- W3014207239 hasConcept C90805587 @default.
- W3014207239 hasConceptScore W3014207239C103278499 @default.
- W3014207239 hasConceptScore W3014207239C104317684 @default.
- W3014207239 hasConceptScore W3014207239C115961682 @default.
- W3014207239 hasConceptScore W3014207239C135763542 @default.
- W3014207239 hasConceptScore W3014207239C141231307 @default.
- W3014207239 hasConceptScore W3014207239C153180895 @default.
- W3014207239 hasConceptScore W3014207239C153209595 @default.
- W3014207239 hasConceptScore W3014207239C154945302 @default.
- W3014207239 hasConceptScore W3014207239C197077220 @default.
- W3014207239 hasConceptScore W3014207239C2524010 @default.
- W3014207239 hasConceptScore W3014207239C2776461190 @default.
- W3014207239 hasConceptScore W3014207239C2780762811 @default.
- W3014207239 hasConceptScore W3014207239C33923547 @default.
- W3014207239 hasConceptScore W3014207239C35794970 @default.
- W3014207239 hasConceptScore W3014207239C41008148 @default.
- W3014207239 hasConceptScore W3014207239C41608201 @default.
- W3014207239 hasConceptScore W3014207239C501734568 @default.
- W3014207239 hasConceptScore W3014207239C51679486 @default.
- W3014207239 hasConceptScore W3014207239C54355233 @default.
- W3014207239 hasConceptScore W3014207239C552990157 @default.
- W3014207239 hasConceptScore W3014207239C70721500 @default.
- W3014207239 hasConceptScore W3014207239C86803240 @default.
- W3014207239 hasConceptScore W3014207239C90805587 @default.
- W3014207239 hasIssue "1" @default.
- W3014207239 hasLocation W30142072391 @default.
- W3014207239 hasLocation W30142072392 @default.
- W3014207239 hasOpenAccess W3014207239 @default.
- W3014207239 hasPrimaryLocation W30142072391 @default.
- W3014207239 hasRelatedWork W2786259169 @default.
- W3014207239 hasRelatedWork W2810280135 @default.
- W3014207239 hasRelatedWork W2920825344 @default.
- W3014207239 hasRelatedWork W2954487701 @default.
- W3014207239 hasRelatedWork W3100525534 @default.
- W3014207239 hasRelatedWork W3152530645 @default.
- W3014207239 hasRelatedWork W3166527962 @default.
- W3014207239 hasRelatedWork W3175251692 @default.
- W3014207239 hasRelatedWork W4280524156 @default.
- W3014207239 hasRelatedWork W4319989520 @default.
- W3014207239 hasVolume "3" @default.
- W3014207239 isParatext "false" @default.
- W3014207239 isRetracted "false" @default.
- W3014207239 magId "3014207239" @default.
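
The abstract in this record describes two steps that are easy to misread in prose: tabulating how many mutations fall inside each 8-mer, and finding a k-mer's nearest neighbor by cosine similarity. The following is a minimal sketch of one possible reading of those steps, not the authors' implementation. All function names, the toy sequence, and the toy vectors are hypothetical; a real pipeline would take vectors from a trained dna2vec model and mutation coordinates from dbSNP.

```python
import math
from collections import Counter

K = 8  # k-mer length used in the abstract


def kmers_overlapping(seq, pos, k=K):
    """Return every k-mer of seq whose window covers 0-based position pos."""
    first = max(0, pos - k + 1)      # leftmost window start that still covers pos
    last = min(pos, len(seq) - k)    # rightmost window start that fits in seq
    return [seq[s:s + k] for s in range(first, last + 1)]


def tabulate_mutations(seq, positions, k=K):
    """Count, for each k-mer, how many (mutation, window) overlaps it has."""
    counts = Counter()
    for pos in positions:
        counts.update(kmers_overlapping(seq, pos, k))
    return counts


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_neighbor(query, vectors):
    """Return the key other than query whose vector is most similar to query's."""
    return max(
        (name for name in vectors if name != query),
        key=lambda name: cosine(vectors[query], vectors[name]),
    )


if __name__ == "__main__":
    # Hypothetical toy data, for illustration only.
    toy_seq = "ACGTACGTACGT"
    toy_positions = [5]
    print(tabulate_mutations(toy_seq, toy_positions).most_common(1))

    toy_vectors = {"A": [1.0, 0.0], "B": [0.9, 0.1], "C": [0.0, 1.0]}
    print(nearest_neighbor("A", toy_vectors))
```

Under this reading, the overlap figures reported in the abstract would correspond to the fraction of k-mers for which the nearest neighbor by cosine similarity matches the neighbor selected by mutation count; how exactly that match was scored is not stated in this record.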