Matches in SemOpenAlex for { <https://semopenalex.org/work/W7520102> ?p ?o ?g. }
- W7520102 abstract "Continued rapid advancements in genomic, proteomic and metabolomic technologies demand computer-aided methods and tools to efficiently and timely process large amount of data, extract meaningful information, and interpret data into knowledge. While numerous algorithms and systems have been developed for information extraction (i.e. profiling analysis), biological interpretation still largely relies on biologists' domain knowledge, as well as collecting and analyzing functional information from various public databases. The goal of this project was to build a text clustering-based software system, called GeneNarrator, for functional analysis of genes (microarray experiments). GeneNarrator automatically collected MEDLINE citations for a list of genes as the source of functional information. A two-step clustering approach was designed to process the citations. The first-step (text) clustering grouped the citations into hierarchical topics. The second-step (gene) clustering grouped the genes based on the similarities of their occurrences across the clusters resulting from step one. Hence, we planned to demonstrate how, instead of manually collecting and tediously sifting through potentially thousands of citations, biologists can be presented with dozens of topics as a summarization of the citations, and gene (groups) mapped to the topics. In order to improve the first-step text clustering part of the system, several strategies were explored, including different vector space models (BOW-based or concept-based) for text representation, vector space dimensionality reduction (document frequency filtering), and multi clustering. The most improvement came from multi-clustering. The clusterings were evaluated in terms of self-consistency and agreement with a manually constructed gold standard dataset using a newly proposed metric, normalized mutual information." @default.
- W7520102 created "2016-06-24" @default.
- W7520102 creator A5015473533 @default.
- W7520102 creator A5039067378 @default.
- W7520102 creator A5085417026 @default.
- W7520102 date "2018-08-13" @default.
- W7520102 modified "2023-09-24" @default.
- W7520102 title "Improving text clustering for functional analysis of genes" @default.
- W7520102 cites W1483363854 @default.
- W7520102 cites W1493454437 @default.
- W7520102 cites W1496224574 @default.
- W7520102 cites W1540711569 @default.
- W7520102 cites W1553333136 @default.
- W7520102 cites W1559013041 @default.
- W7520102 cites W1574901103 @default.
- W7520102 cites W1595303882 @default.
- W7520102 cites W1602918385 @default.
- W7520102 cites W1604131382 @default.
- W7520102 cites W1651093245 @default.
- W7520102 cites W1661470104 @default.
- W7520102 cites W1672197616 @default.
- W7520102 cites W173051488 @default.
- W7520102 cites W1736751365 @default.
- W7520102 cites W1751470192 @default.
- W7520102 cites W1956559956 @default.
- W7520102 cites W1970113371 @default.
- W7520102 cites W1992419399 @default.
- W7520102 cites W1996764654 @default.
- W7520102 cites W1998154529 @default.
- W7520102 cites W2005422315 @default.
- W7520102 cites W2018201696 @default.
- W7520102 cites W2025887562 @default.
- W7520102 cites W2030644393 @default.
- W7520102 cites W2030871208 @default.
- W7520102 cites W2033403400 @default.
- W7520102 cites W2040058125 @default.
- W7520102 cites W2045340002 @default.
- W7520102 cites W2045399211 @default.
- W7520102 cites W2046571239 @default.
- W7520102 cites W2050318399 @default.
- W7520102 cites W2070412788 @default.
- W7520102 cites W2081980673 @default.
- W7520102 cites W2088327421 @default.
- W7520102 cites W2097089247 @default.
- W7520102 cites W2097366050 @default.
- W7520102 cites W2098162425 @default.
- W7520102 cites W2100288886 @default.
- W7520102 cites W2105106534 @default.
- W7520102 cites W2105948726 @default.
- W7520102 cites W2106093878 @default.
- W7520102 cites W2107344625 @default.
- W7520102 cites W2107556259 @default.
- W7520102 cites W2111360319 @default.
- W7520102 cites W2111874188 @default.
- W7520102 cites W2114535528 @default.
- W7520102 cites W2114582436 @default.
- W7520102 cites W2115159360 @default.
- W7520102 cites W2117225622 @default.
- W7520102 cites W2133111499 @default.
- W7520102 cites W2135893370 @default.
- W7520102 cites W2137297622 @default.
- W7520102 cites W2139549820 @default.
- W7520102 cites W2141815298 @default.
- W7520102 cites W2142063400 @default.
- W7520102 cites W2149942697 @default.
- W7520102 cites W2149953769 @default.
- W7520102 cites W2153922613 @default.
- W7520102 cites W2157261429 @default.
- W7520102 cites W2160135458 @default.
- W7520102 cites W2162161511 @default.
- W7520102 cites W2162307046 @default.
- W7520102 cites W2166023446 @default.
- W7520102 cites W2169974160 @default.
- W7520102 cites W2171667218 @default.
- W7520102 cites W2171920878 @default.
- W7520102 cites W2175050464 @default.
- W7520102 cites W2338565478 @default.
- W7520102 cites W2383581414 @default.
- W7520102 cites W52245848 @default.
- W7520102 cites W66669671 @default.
- W7520102 cites W87960219 @default.
- W7520102 cites W929420331 @default.
- W7520102 doi "https://doi.org/10.31274/rtd-180813-4375" @default.
- W7520102 hasPublicationYear "2018" @default.
- W7520102 type Work @default.
- W7520102 sameAs 7520102 @default.
- W7520102 citedByCount "0" @default.
- W7520102 crossrefType "dissertation" @default.
- W7520102 hasAuthorship W7520102A5015473533 @default.
- W7520102 hasAuthorship W7520102A5039067378 @default.
- W7520102 hasAuthorship W7520102A5085417026 @default.
- W7520102 hasBestOaLocation W75201021 @default.
- W7520102 hasConcept C111919701 @default.
- W7520102 hasConcept C124101348 @default.
- W7520102 hasConcept C154945302 @default.
- W7520102 hasConcept C170858558 @default.
- W7520102 hasConcept C177937566 @default.
- W7520102 hasConcept C184509293 @default.
- W7520102 hasConcept C187191949 @default.
- W7520102 hasConcept C23123220 @default.