Matches in SemOpenAlex for { <https://semopenalex.org/work/W4387358658> ?p ?o ?g. }
- W4387358658 abstract "Every week thousands of biomedical research papers are published, a portion of which contain supporting tables with data about genes, transcripts, variants, and proteins. For example, supporting tables may contain differentially expressed genes and proteins from transcriptomics and proteomics assays, targets of transcription factors from ChIP-seq experiments, hits from genome-wide CRISPR screens, or genes identified to harbor mutations from GWAS. Because these gene sets are commonly buried in the supplemental tables of research publications, they are not widely available for search and reuse. Rummagene, available from https://rummagene.com, is a web server application that provides access to hundreds of thousands of human and mouse gene sets extracted from supporting materials of publications listed on PubMed Central (PMC). To create Rummagene, we first developed a softbot that extracts human and mouse gene sets from supporting tables of PMC publications. So far, the softbot has scanned 5,448,589 PMC articles to find 121,237 articles that contain 642,389 gene sets. These gene sets are served for enrichment analysis, free text, and table title search. Users of Rummagene can submit their own gene sets to find matching gene sets ranked by their overlap with the input gene set. In addition to providing the extracted gene sets for search, we investigated the massive corpus of these gene sets for statistical patterns. We show that the number of gene sets reported in publications is rapidly increasing, containing both short sets that are highly enriched in highly studied genes, and long sets from omics profiling. We also demonstrate that the gene sets in Rummagene can be used for transcription factor and kinase enrichment analyses, and for gene function predictions. By combining gene set similarity with abstract similarity, Rummagene can be used to find surprising relationships between unexpected biological processes, concepts, and named entities. Finally, by overlaying the Rummagene gene set space with the Enrichr gene set space we can discover areas of biological and biomedical knowledge unique to each resource." @default.
- W4387358658 created "2023-10-06" @default.
- W4387358658 creator A5007301355 @default.
- W4387358658 creator A5012061047 @default.
- W4387358658 creator A5031360208 @default.
- W4387358658 creator A5032679807 @default.
- W4387358658 creator A5032951414 @default.
- W4387358658 creator A5060461473 @default.
- W4387358658 date "2023-10-05" @default.
- W4387358658 modified "2023-10-12" @default.
- W4387358658 title "Rummagene: Mining Gene Sets from Supporting Materials of PMC Publications" @default.
- W4387358658 cites W1965092590 @default.
- W4387358658 cites W1993087844 @default.
- W4387358658 cites W2004444667 @default.
- W4387358658 cites W2010502047 @default.
- W4387358658 cites W2047017978 @default.
- W4387358658 cites W2052250375 @default.
- W4387358658 cites W2098023402 @default.
- W4387358658 cites W2103017472 @default.
- W4387358658 cites W2106810132 @default.
- W4387358658 cites W2123454534 @default.
- W4387358658 cites W2128386749 @default.
- W4387358658 cites W2144211451 @default.
- W4387358658 cites W2145627818 @default.
- W4387358658 cites W2302501749 @default.
- W4387358658 cites W2345356016 @default.
- W4387358658 cites W2537679995 @default.
- W4387358658 cites W2550535012 @default.
- W4387358658 cites W2559028527 @default.
- W4387358658 cites W2606715885 @default.
- W4387358658 cites W2612467560 @default.
- W4387358658 cites W2763052433 @default.
- W4387358658 cites W2800392236 @default.
- W4387358658 cites W2807733000 @default.
- W4387358658 cites W2889326414 @default.
- W4387358658 cites W2904372598 @default.
- W4387358658 cites W2945310727 @default.
- W4387358658 cites W2945883683 @default.
- W4387358658 cites W2999208018 @default.
- W4387358658 cites W3103145119 @default.
- W4387358658 cites W3106188259 @default.
- W4387358658 cites W3132458063 @default.
- W4387358658 cites W3143715979 @default.
- W4387358658 cites W3160199866 @default.
- W4387358658 cites W3187696456 @default.
- W4387358658 cites W3212334377 @default.
- W4387358658 cites W3216633781 @default.
- W4387358658 cites W4206418540 @default.
- W4387358658 cites W4225865462 @default.
- W4387358658 cites W4284895446 @default.
- W4387358658 cites W4383498494 @default.
- W4387358658 doi "https://doi.org/10.1101/2023.10.03.560783" @default.
- W4387358658 hasPublicationYear "2023" @default.
- W4387358658 type Work @default.
- W4387358658 citedByCount "0" @default.
- W4387358658 crossrefType "posted-content" @default.
- W4387358658 hasAuthorship W4387358658A5007301355 @default.
- W4387358658 hasAuthorship W4387358658A5012061047 @default.
- W4387358658 hasAuthorship W4387358658A5031360208 @default.
- W4387358658 hasAuthorship W4387358658A5032679807 @default.
- W4387358658 hasAuthorship W4387358658A5032951414 @default.
- W4387358658 hasAuthorship W4387358658A5060461473 @default.
- W4387358658 hasBestOaLocation W43873586581 @default.
- W4387358658 hasConcept C104317684 @default.
- W4387358658 hasConcept C124101348 @default.
- W4387358658 hasConcept C141231307 @default.
- W4387358658 hasConcept C23123220 @default.
- W4387358658 hasConcept C41008148 @default.
- W4387358658 hasConcept C514705636 @default.
- W4387358658 hasConcept C54355233 @default.
- W4387358658 hasConcept C58642233 @default.
- W4387358658 hasConcept C59822182 @default.
- W4387358658 hasConcept C60644358 @default.
- W4387358658 hasConcept C62177273 @default.
- W4387358658 hasConcept C70721500 @default.
- W4387358658 hasConcept C86803240 @default.
- W4387358658 hasConceptScore W4387358658C104317684 @default.
- W4387358658 hasConceptScore W4387358658C124101348 @default.
- W4387358658 hasConceptScore W4387358658C141231307 @default.
- W4387358658 hasConceptScore W4387358658C23123220 @default.
- W4387358658 hasConceptScore W4387358658C41008148 @default.
- W4387358658 hasConceptScore W4387358658C514705636 @default.
- W4387358658 hasConceptScore W4387358658C54355233 @default.
- W4387358658 hasConceptScore W4387358658C58642233 @default.
- W4387358658 hasConceptScore W4387358658C59822182 @default.
- W4387358658 hasConceptScore W4387358658C60644358 @default.
- W4387358658 hasConceptScore W4387358658C62177273 @default.
- W4387358658 hasConceptScore W4387358658C70721500 @default.
- W4387358658 hasConceptScore W4387358658C86803240 @default.
- W4387358658 hasLocation W43873586581 @default.
- W4387358658 hasOpenAccess W4387358658 @default.
- W4387358658 hasPrimaryLocation W43873586581 @default.
- W4387358658 hasRelatedWork W1515070932 @default.
- W4387358658 hasRelatedWork W1894796423 @default.
- W4387358658 hasRelatedWork W1977739016 @default.
- W4387358658 hasRelatedWork W1990946056 @default.
- W4387358658 hasRelatedWork W2079327011 @default.
- W4387358658 hasRelatedWork W2086525401 @default.
- W4387358658 hasRelatedWork W2159799774 @default.
- W4387358658 hasRelatedWork W4243726184 @default.
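
The abstract above says that submitted gene sets are matched against stored gene sets and ranked by overlap. The record does not say how Rummagene scores matches, so the following is only a minimal sketch of overlap-based ranking under stated assumptions: the function name rank_by_overlap, the toy library, the table names, and the use of a Jaccard index as a tie-breaker are all hypothetical and are not taken from the Rummagene service.

```python
from typing import Dict, List, Set, Tuple


def rank_by_overlap(
    query: Set[str],
    library: Dict[str, Set[str]],
    top_n: int = 5,
) -> List[Tuple[str, int, float]]:
    """Rank library gene sets by their overlap with the query gene set.

    Returns (set_name, overlap_size, jaccard_index) tuples sorted by
    overlap size, then by Jaccard index, both descending.
    This scoring is an assumption for illustration only.
    """
    # Normalize case so "tp53" and "TP53" count as the same symbol.
    query_norm = {gene.upper() for gene in query}
    results = []
    for name, genes in library.items():
        genes_norm = {gene.upper() for gene in genes}
        overlap = len(query_norm & genes_norm)
        if overlap == 0:
            continue  # skip sets that share no genes with the query
        union = len(query_norm | genes_norm)
        results.append((name, overlap, overlap / union))
    results.sort(key=lambda r: (r[1], r[2]), reverse=True)
    return results[:top_n]


# Hypothetical toy library; set names and contents are illustrative only.
toy_library = {
    "table_A": {"TP53", "MDM2", "CDKN1A", "BAX"},
    "table_B": {"TP53", "EGFR"},
    "table_C": {"GAPDH", "ACTB"},
}
ranked = rank_by_overlap({"tp53", "mdm2", "bax"}, toy_library)
for name, overlap, jaccard in ranked:
    print(f"{name}: overlap={overlap}, jaccard={jaccard:.2f}")
```

Raw overlap favors long gene sets, which is why the sketch also reports a size-normalized score. A statistical test of enrichment would be a natural replacement for either measure.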