Matches in SemOpenAlex for { <https://semopenalex.org/work/W4247538456> ?p ?o ?g. }
Showing items 1 to 78 of 78 with 100 items per page.
- W4247538456 abstract "BACKGROUND: Biomedical semantic indexing is a very useful support tool for human curators in their efforts to index and catalog the biomedical literature. OBJECTIVE: The aim of this study was to describe a system that automatically assigns Medical Subject Headings (MeSH) to biomedical articles from MEDLINE. METHODS: Our approach relies on the assumption that similar documents should be classified by similar MeSH terms. Although previous work has already exploited document similarity by using a k-nearest neighbors algorithm, we represent documents as document vectors by search engine indexing and then compute the similarity between documents using cosine similarity. Once the most similar documents for a given input document are retrieved, we rank their MeSH terms to choose the most suitable set for the input document. To do this, we define a scoring function that takes into account the frequency of the term in the set of retrieved documents and the similarity between the input document and each retrieved document. In addition, we implement guidelines proposed by human curators to annotate MEDLINE articles; in particular, the heuristic that says if 3 MeSH terms are proposed to classify an article and they share the same ancestor, they should be replaced by this ancestor. The representation of the MeSH thesaurus as a graph database allows us to employ graph search algorithms to quickly and easily capture hierarchical relationships such as the lowest common ancestor between terms. RESULTS: Our experiments show promising results with an F1 of 69% on the test dataset. CONCLUSIONS: To the best of our knowledge, this is the first work that combines search and graph database technologies for the task of biomedical semantic indexing. Due to its horizontal scalability, ElasticSearch is a viable solution for indexing large collections of documents (such as the bibliographic database MEDLINE). Moreover, the use of graph search algorithms for accessing MeSH information could provide a support tool for cataloging MEDLINE abstracts in real time." @default.
- W4247538456 created "2022-05-12" @default.
- W4247538456 creator A5009969418 @default.
- W4247538456 creator A5051530256 @default.
- W4247538456 creator A5079791321 @default.
- W4247538456 date "2017-06-12" @default.
- W4247538456 modified "2023-09-23" @default.
- W4247538456 title "Search and Graph Database Technologies for Biomedical Semantic Indexing: Experimental Analysis (Preprint)" @default.
- W4247538456 cites W1019512417 @default.
- W4247538456 cites W1751470192 @default.
- W4247538456 cites W1981208470 @default.
- W4247538456 cites W2043309298 @default.
- W4247538456 cites W2115024484 @default.
- W4247538456 cites W2129144539 @default.
- W4247538456 cites W2142955026 @default.
- W4247538456 cites W2152143870 @default.
- W4247538456 cites W2332215074 @default.
- W4247538456 cites W3122990092 @default.
- W4247538456 doi "https://doi.org/10.2196/preprints.7059" @default.
- W4247538456 hasPublicationYear "2017" @default.
- W4247538456 type Work @default.
- W4247538456 citedByCount "0" @default.
- W4247538456 crossrefType "posted-content" @default.
- W4247538456 hasAuthorship W4247538456A5009969418 @default.
- W4247538456 hasAuthorship W4247538456A5051530256 @default.
- W4247538456 hasAuthorship W4247538456A5079791321 @default.
- W4247538456 hasBestOaLocation W42475384562 @default.
- W4247538456 hasConcept C103278499 @default.
- W4247538456 hasConcept C115961682 @default.
- W4247538456 hasConcept C116738811 @default.
- W4247538456 hasConcept C124101348 @default.
- W4247538456 hasConcept C130318100 @default.
- W4247538456 hasConcept C132525143 @default.
- W4247538456 hasConcept C154945302 @default.
- W4247538456 hasConcept C177264268 @default.
- W4247538456 hasConcept C199360897 @default.
- W4247538456 hasConcept C204321447 @default.
- W4247538456 hasConcept C23123220 @default.
- W4247538456 hasConcept C2778698081 @default.
- W4247538456 hasConcept C2780762811 @default.
- W4247538456 hasConcept C41008148 @default.
- W4247538456 hasConcept C73555534 @default.
- W4247538456 hasConcept C75165309 @default.
- W4247538456 hasConcept C80444323 @default.
- W4247538456 hasConceptScore W4247538456C103278499 @default.
- W4247538456 hasConceptScore W4247538456C115961682 @default.
- W4247538456 hasConceptScore W4247538456C116738811 @default.
- W4247538456 hasConceptScore W4247538456C124101348 @default.
- W4247538456 hasConceptScore W4247538456C130318100 @default.
- W4247538456 hasConceptScore W4247538456C132525143 @default.
- W4247538456 hasConceptScore W4247538456C154945302 @default.
- W4247538456 hasConceptScore W4247538456C177264268 @default.
- W4247538456 hasConceptScore W4247538456C199360897 @default.
- W4247538456 hasConceptScore W4247538456C204321447 @default.
- W4247538456 hasConceptScore W4247538456C23123220 @default.
- W4247538456 hasConceptScore W4247538456C2778698081 @default.
- W4247538456 hasConceptScore W4247538456C2780762811 @default.
- W4247538456 hasConceptScore W4247538456C41008148 @default.
- W4247538456 hasConceptScore W4247538456C73555534 @default.
- W4247538456 hasConceptScore W4247538456C75165309 @default.
- W4247538456 hasConceptScore W4247538456C80444323 @default.
- W4247538456 hasLocation W42475384561 @default.
- W4247538456 hasLocation W42475384562 @default.
- W4247538456 hasOpenAccess W4247538456 @default.
- W4247538456 hasPrimaryLocation W42475384561 @default.
- W4247538456 hasRelatedWork W1601902 @default.
- W4247538456 hasRelatedWork W1626643 @default.
- W4247538456 hasRelatedWork W17041259 @default.
- W4247538456 hasRelatedWork W2371595 @default.
- W4247538456 hasRelatedWork W2621597 @default.
- W4247538456 hasRelatedWork W412939 @default.
- W4247538456 hasRelatedWork W5783831 @default.
- W4247538456 hasRelatedWork W6745161 @default.
- W4247538456 hasRelatedWork W7981713 @default.
- W4247538456 hasRelatedWork W5208458 @default.
- W4247538456 isParatext "false" @default.
- W4247538456 isRetracted "false" @default.
- W4247538456 workType "article" @default.
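
The METHODS part of the abstract above describes ranking the MeSH terms of the most similar documents by a score that reflects both term frequency and document similarity. The abstract does not give the formula, so the following is only a minimal sketch under an assumption: each term's score is the sum of the cosine similarities of the retrieved documents that carry it. The function names `cosine_similarity` and `rank_mesh_terms`, the sparse-dict vector format, and the `top_n` parameter are hypothetical and not taken from the work itself.

```python
import math
from collections import defaultdict


def cosine_similarity(a, b):
    """Cosine similarity between two sparse vectors given as {token: weight} dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


def rank_mesh_terms(query_vec, neighbours, top_n=3):
    """Rank MeSH terms found on retrieved documents.

    neighbours: list of (document_vector, mesh_terms) pairs.
    ASSUMED scoring (not stated in the abstract): a term's score is the sum of
    the similarities of the retrieved documents annotated with it, so terms
    that are frequent and attached to close documents rank highest.
    """
    scores = defaultdict(float)
    for vec, terms in neighbours:
        sim = cosine_similarity(query_vec, vec)
        for term in set(terms):  # count each term once per document
            scores[term] += sim
    # Sort by descending score, breaking ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:top_n]]


# Toy data, invented purely for illustration.
query = {"a": 1.0, "b": 1.0}
neighbours = [
    ({"a": 1.0, "b": 1.0}, ["Humans", "Neoplasms"]),
    ({"a": 1.0}, ["Humans"]),
    ({"c": 1.0}, ["Mice"]),
]
top_terms = rank_mesh_terms(query, neighbours, top_n=2)
```

Summing similarities is one simple way to combine the two signals the abstract names; a weighted or normalized variant would fit the same description equally well.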