Matches in SemOpenAlex for { <https://semopenalex.org/work/W3090995764> ?p ?o ?g. }
Showing items 1 to 95 of
95
with 100 items per page.
- W3090995764 abstract "It is a growing trend among researchers to make their data publicly available for experimental reproducibility and data reusability. Sharing data with fellow researchers helps in increasing the visibility of the work. On the other hand, there are researchers who are inhibited by the lack of data resources. To overcome this challenge, many repositories and knowledge bases have been established to date to ease data sharing. Further, in the past two decades, there has been an exponential increase in the number of datasets added to these dataset repositories. However, most of these repositories are domain-specific, and none of them can recommend datasets to researchers/users. Naturally, it is challenging for a researcher to keep track of all the relevant repositories for potential use. Thus, a dataset recommender system that recommends datasets to a researcher based on previous publications can enhance their productivity and expedite further research. This work adopts an information retrieval (IR) paradigm for dataset recommendation. We hypothesize that two fundamental differences exist between dataset recommendation and PubMed-style biomedical IR beyond the corpus. First, instead of keywords, the query is the researcher, embodied by his or her publications. Second, to filter the relevant datasets from non-relevant ones, researchers are better represented by a set of interests, as opposed to the entire body of their research. This second approach is implemented using a non-parametric clustering technique. These clusters are used to recommend datasets for each researcher using the cosine similarity between the vector representations of publication clusters and datasets. The maximum normalized discounted cumulative gain at 10 (NDCG@10), precision at 10 (p@10) partial and p@10 strict of 0.89, 0.78 and 0.61, respectively, were obtained using the proposed method after manual evaluation by five researchers. As per the best of our knowledge, this is the first study of its kind on content-based dataset recommendation. We hope that this system will further promote data sharing, offset the researchers' workload in identifying the right dataset and increase the reusability of biomedical datasets. Database URL: http://genestudy.org/recommends/#/." @default.
- W3090995764 created "2020-10-08" @default.
- W3090995764 creator A5040613229 @default.
- W3090995764 creator A5045155778 @default.
- W3090995764 creator A5046709245 @default.
- W3090995764 date "2020-01-01" @default.
- W3090995764 modified "2023-09-25" @default.
- W3090995764 title "A content-based dataset recommendation system for researchers—a case study on Gene Expression Omnibus (GEO) repository" @default.
- W3090995764 cites W1834382163 @default.
- W3090995764 cites W2158952538 @default.
- W3090995764 cites W2341468445 @default.
- W3090995764 cites W2442340835 @default.
- W3090995764 cites W2605454765 @default.
- W3090995764 cites W2774661587 @default.
- W3090995764 cites W2783207633 @default.
- W3090995764 cites W2785664611 @default.
- W3090995764 cites W2786146603 @default.
- W3090995764 cites W2786484491 @default.
- W3090995764 cites W2790574847 @default.
- W3090995764 cites W2794901446 @default.
- W3090995764 cites W2806740783 @default.
- W3090995764 cites W2914132662 @default.
- W3090995764 cites W2950031835 @default.
- W3090995764 cites W2951403620 @default.
- W3090995764 cites W2963806188 @default.
- W3090995764 cites W3009868230 @default.
- W3090995764 cites W3104308188 @default.
- W3090995764 doi "https://doi.org/10.1093/database/baaa064" @default.
- W3090995764 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/7659921" @default.
- W3090995764 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/33002137" @default.
- W3090995764 hasPublicationYear "2020" @default.
- W3090995764 type Work @default.
- W3090995764 sameAs 3090995764 @default.
- W3090995764 citedByCount "13" @default.
- W3090995764 countsByYear W30909957642020 @default.
- W3090995764 countsByYear W30909957642021 @default.
- W3090995764 countsByYear W30909957642022 @default.
- W3090995764 countsByYear W30909957642023 @default.
- W3090995764 crossrefType "journal-article" @default.
- W3090995764 hasAuthorship W3090995764A5040613229 @default.
- W3090995764 hasAuthorship W3090995764A5045155778 @default.
- W3090995764 hasAuthorship W3090995764A5046709245 @default.
- W3090995764 hasBestOaLocation W30909957641 @default.
- W3090995764 hasConcept C103278499 @default.
- W3090995764 hasConcept C106131492 @default.
- W3090995764 hasConcept C115961682 @default.
- W3090995764 hasConcept C124101348 @default.
- W3090995764 hasConcept C136764020 @default.
- W3090995764 hasConcept C154945302 @default.
- W3090995764 hasConcept C177264268 @default.
- W3090995764 hasConcept C199360897 @default.
- W3090995764 hasConcept C23123220 @default.
- W3090995764 hasConcept C2522767166 @default.
- W3090995764 hasConcept C2780762811 @default.
- W3090995764 hasConcept C31972630 @default.
- W3090995764 hasConcept C41008148 @default.
- W3090995764 hasConcept C557471498 @default.
- W3090995764 hasConcept C73555534 @default.
- W3090995764 hasConceptScore W3090995764C103278499 @default.
- W3090995764 hasConceptScore W3090995764C106131492 @default.
- W3090995764 hasConceptScore W3090995764C115961682 @default.
- W3090995764 hasConceptScore W3090995764C124101348 @default.
- W3090995764 hasConceptScore W3090995764C136764020 @default.
- W3090995764 hasConceptScore W3090995764C154945302 @default.
- W3090995764 hasConceptScore W3090995764C177264268 @default.
- W3090995764 hasConceptScore W3090995764C199360897 @default.
- W3090995764 hasConceptScore W3090995764C23123220 @default.
- W3090995764 hasConceptScore W3090995764C2522767166 @default.
- W3090995764 hasConceptScore W3090995764C2780762811 @default.
- W3090995764 hasConceptScore W3090995764C31972630 @default.
- W3090995764 hasConceptScore W3090995764C41008148 @default.
- W3090995764 hasConceptScore W3090995764C557471498 @default.
- W3090995764 hasConceptScore W3090995764C73555534 @default.
- W3090995764 hasFunder F4320308129 @default.
- W3090995764 hasFunder F4320332161 @default.
- W3090995764 hasLocation W30909957641 @default.
- W3090995764 hasLocation W30909957642 @default.
- W3090995764 hasLocation W30909957643 @default.
- W3090995764 hasOpenAccess W3090995764 @default.
- W3090995764 hasPrimaryLocation W30909957641 @default.
- W3090995764 hasRelatedWork W2000822082 @default.
- W3090995764 hasRelatedWork W2020422879 @default.
- W3090995764 hasRelatedWork W2071071438 @default.
- W3090995764 hasRelatedWork W2360653256 @default.
- W3090995764 hasRelatedWork W2517270107 @default.
- W3090995764 hasRelatedWork W2576320324 @default.
- W3090995764 hasRelatedWork W4220978606 @default.
- W3090995764 hasRelatedWork W4316658050 @default.
- W3090995764 hasRelatedWork W4366674473 @default.
- W3090995764 hasRelatedWork W88463392 @default.
- W3090995764 hasVolume "2020" @default.
- W3090995764 isParatext "false" @default.
- W3090995764 isRetracted "false" @default.
- W3090995764 magId "3090995764" @default.
- W3090995764 workType "article" @default.