Matches in SemOpenAlex for { <https://semopenalex.org/work/W2783279235> ?p ?o ?g. }
- W2783279235 abstract "Neural embeddings are a popular set of methods for representing words, phrases or text as a low dimensional vector (typically 50-500 dimensions). However, it is difficult to interpret these dimensions in a meaningful manner, and creating neural embeddings requires extensive training and tuning of multiple parameters and hyperparameters. We present here a simple unsupervised method for representing words, phrases or text as a low dimensional vector, in which the meaning and relative importance of dimensions is transparent to inspection. We have created a near-comprehensive vector representation of words, and selected bigrams, trigrams and abbreviations, using the set of titles and abstracts in PubMed as a corpus. This vector is used to create several novel implicit word-word and text-text similarity metrics. The implicit word-word similarity metrics correlate well with human judgement of word pair similarity and relatedness, and outperform or equal all other reported methods on a variety of biomedical benchmarks, including several implementations of neural embeddings trained on PubMed corpora. Our implicit word-word metrics capture different aspects of word-word relatedness than word2vec-based metrics and are only partially correlated (rho = ~0.5-0.8 depending on task and corpus). The vector representations of words, bigrams, trigrams, abbreviations, and PubMed title+abstracts are all publicly available from this http URL for release under CC-BY-NC license. Several public web query interfaces are also available at the same site, including one which allows the user to specify a given word and view its most closely related terms according to direct co-occurrence as well as different implicit similarity metrics." @default.
- W2783279235 created "2018-01-26" @default.
- W2783279235 creator A5033859037 @default.
- W2783279235 creator A5044759535 @default.
- W2783279235 date "2018-01-05" @default.
- W2783279235 modified "2023-09-27" @default.
- W2783279235 title "Unsupervised Low-Dimensional Vector Representations for Words, Phrases and Text that are Transparent, Scalable, and produce Similarity Metrics that are Complementary to Neural Embeddings" @default.
- W2783279235 cites W1964879903 @default.
- W2783279235 cites W1971220772 @default.
- W2783279235 cites W1990524510 @default.
- W2783279235 cites W2025365931 @default.
- W2783279235 cites W202767273 @default.
- W2783279235 cites W2084377579 @default.
- W2783279235 cites W2090987348 @default.
- W2783279235 cites W2117463461 @default.
- W2783279235 cites W2251157338 @default.
- W2783279235 cites W2251491951 @default.
- W2783279235 cites W2318087363 @default.
- W2783279235 cites W2335990354 @default.
- W2783279235 cites W2413669350 @default.
- W2783279235 cites W2509406088 @default.
- W2783279235 cites W2509472880 @default.
- W2783279235 cites W2515248967 @default.
- W2783279235 cites W2515910099 @default.
- W2783279235 cites W2516925101 @default.
- W2783279235 cites W2518378939 @default.
- W2783279235 cites W2585718388 @default.
- W2783279235 cites W2735288945 @default.
- W2783279235 cites W2742152190 @default.
- W2783279235 cites W2743327679 @default.
- W2783279235 cites W2769063188 @default.
- W2783279235 cites W2772528510 @default.
- W2783279235 cites W2949547296 @default.
- W2783279235 hasPublicationYear "2018" @default.
- W2783279235 type Work @default.
- W2783279235 sameAs 2783279235 @default.
- W2783279235 citedByCount "0" @default.
- W2783279235 crossrefType "posted-content" @default.
- W2783279235 hasAuthorship W2783279235A5033859037 @default.
- W2783279235 hasAuthorship W2783279235A5044759535 @default.
- W2783279235 hasConcept C103278499 @default.
- W2783279235 hasConcept C108757681 @default.
- W2783279235 hasConcept C115961682 @default.
- W2783279235 hasConcept C137546455 @default.
- W2783279235 hasConcept C153180895 @default.
- W2783279235 hasConcept C154945302 @default.
- W2783279235 hasConcept C177264268 @default.
- W2783279235 hasConcept C199360897 @default.
- W2783279235 hasConcept C204321447 @default.
- W2783279235 hasConcept C23123220 @default.
- W2783279235 hasConcept C2524010 @default.
- W2783279235 hasConcept C2776461190 @default.
- W2783279235 hasConcept C2777462759 @default.
- W2783279235 hasConcept C2780762811 @default.
- W2783279235 hasConcept C33923547 @default.
- W2783279235 hasConcept C41008148 @default.
- W2783279235 hasConcept C41608201 @default.
- W2783279235 hasConcept C90805587 @default.
- W2783279235 hasConceptScore W2783279235C103278499 @default.
- W2783279235 hasConceptScore W2783279235C108757681 @default.
- W2783279235 hasConceptScore W2783279235C115961682 @default.
- W2783279235 hasConceptScore W2783279235C137546455 @default.
- W2783279235 hasConceptScore W2783279235C153180895 @default.
- W2783279235 hasConceptScore W2783279235C154945302 @default.
- W2783279235 hasConceptScore W2783279235C177264268 @default.
- W2783279235 hasConceptScore W2783279235C199360897 @default.
- W2783279235 hasConceptScore W2783279235C204321447 @default.
- W2783279235 hasConceptScore W2783279235C23123220 @default.
- W2783279235 hasConceptScore W2783279235C2524010 @default.
- W2783279235 hasConceptScore W2783279235C2776461190 @default.
- W2783279235 hasConceptScore W2783279235C2777462759 @default.
- W2783279235 hasConceptScore W2783279235C2780762811 @default.
- W2783279235 hasConceptScore W2783279235C33923547 @default.
- W2783279235 hasConceptScore W2783279235C41008148 @default.
- W2783279235 hasConceptScore W2783279235C41608201 @default.
- W2783279235 hasConceptScore W2783279235C90805587 @default.
- W2783279235 hasLocation W27832792351 @default.
- W2783279235 hasOpenAccess W2783279235 @default.
- W2783279235 hasPrimaryLocation W27832792351 @default.
- W2783279235 hasRelatedWork W2090708902 @default.
- W2783279235 hasRelatedWork W2172003973 @default.
- W2783279235 hasRelatedWork W2341132943 @default.
- W2783279235 hasRelatedWork W2387546565 @default.
- W2783279235 hasRelatedWork W2528236455 @default.
- W2783279235 hasRelatedWork W2751494470 @default.
- W2783279235 hasRelatedWork W2766034339 @default.
- W2783279235 hasRelatedWork W2803805392 @default.
- W2783279235 hasRelatedWork W2911514212 @default.
- W2783279235 hasRelatedWork W2937960099 @default.
- W2783279235 hasRelatedWork W2963148156 @default.
- W2783279235 hasRelatedWork W2963912736 @default.
- W2783279235 hasRelatedWork W2964279163 @default.
- W2783279235 hasRelatedWork W2969453628 @default.
- W2783279235 hasRelatedWork W2978368133 @default.
- W2783279235 hasRelatedWork W2990098834 @default.
- W2783279235 hasRelatedWork W3025337893 @default.
- W2783279235 hasRelatedWork W31399744 @default.
- W2783279235 hasRelatedWork W3142516437 @default.
- W2783279235 hasRelatedWork W3176848562 @default.
- W2783279235 isParatext "false" @default.