Matches in SemOpenAlex for { <https://semopenalex.org/work/W200338504> ?p ?o ?g. }
Showing items 1 to 81 of 81 with 100 items per page.
- W200338504 abstract "The Gene Ontology (GO) database annotates a large number of genes according to their functions (the biological processes, molecular functions and cellular components in which they are involved). However, it is far from complete, and so there is a need for techniques that automatically assign GO functional categories to genes based on integration of available data. The present work describes one such technique, that uses a combination of sequence similarity and a similarity measure based on mutual information applied to cross-experiment microarray gene expression analysis. First of all, in order to test the relevance of sequence similarity for gene function inference, similarity searches of genes belonging to the same GO (from here on we will use “GO” as a shorthand for “GO category”, as well as for the “Gene Ontology” as a whole) were done across the human genome. A BLAST attachment value (BAV) for each GO was defined as the sum of the e-value exponents found between pairs of genes in the GO, divided by the sum of all e-value exponents found between genes in the GO and genes outside the GO. Next, to assess the “expression based similarity” of human genes, we used a dataset (GDS181) from GEO, a gene expression and molecular data repository maintained by the NCBI, providing gene expression profiles from 85 different tissues, organs, and cell lines in the normal physiological state. The dataset contains 12,625 probes, and we used 9,725 of them associated to genes with identifiable GO relationships. For each gene in the dataset, we calculated the Mutual Information (MI) between its expression values measured across all tissues and the corresponding values for the other genes. In order to calculate MI, the gene expression values were discretized, meaning that each one was replaced by one of K symbols. 
The symbol replacing an expression value was calculated by first normalizing the values into [0,1], and then partitioning this interval into K equally sized subintervals. The normalization was done on a per-gene basis. After experimenting with several different values of K, a value of K=3 was chosen for all further experiments. Using a similar procedure to the one used for calculating the BAV, a MI attachment value (MiAV) was obtained. For each GO, the MiAV was defined as the sum of the MI expression values found for all pairs of genes in the GO, divided by the sum of all MI values between genes in the GO and genes outside the GO. Then, our gene function inference (GFI) process proceeds as follows. Given a gene for which one wants to know the function, one begins by comparing it with all other genes, using both BLAST and expression data. Then, given a GO, one may calculate the values Bs and Ms, representing the maximum similarity found between the query gene and any gene inside the GO, using BLAST or MI, respectively. Those values plus attachments are used in the following equation for estimating the pertinence of a gene to a GO:" @default.
- W200338504 created "2016-06-24" @default.
- W200338504 creator A5001217898 @default.
- W200338504 creator A5005497971 @default.
- W200338504 creator A5011631137 @default.
- W200338504 creator A5030606120 @default.
- W200338504 creator A5038877365 @default.
- W200338504 creator A5063430399 @default.
- W200338504 date "2006-01-01" @default.
- W200338504 modified "2023-09-27" @default.
- W200338504 title "Inferring Gene Ontology Category Membership via Gene Expression and Sequence Similarity Data Analysis." @default.
- W200338504 hasPublicationYear "2006" @default.
- W200338504 type Work @default.
- W200338504 sameAs 200338504 @default.
- W200338504 citedByCount "0" @default.
- W200338504 crossrefType "journal-article" @default.
- W200338504 hasAuthorship W200338504A5001217898 @default.
- W200338504 hasAuthorship W200338504A5005497971 @default.
- W200338504 hasAuthorship W200338504A5011631137 @default.
- W200338504 hasAuthorship W200338504A5030606120 @default.
- W200338504 hasAuthorship W200338504A5038877365 @default.
- W200338504 hasAuthorship W200338504A5063430399 @default.
- W200338504 hasConcept C103278499 @default.
- W200338504 hasConcept C104317684 @default.
- W200338504 hasConcept C114009990 @default.
- W200338504 hasConcept C115961682 @default.
- W200338504 hasConcept C124101348 @default.
- W200338504 hasConcept C141231307 @default.
- W200338504 hasConcept C150194340 @default.
- W200338504 hasConcept C154945302 @default.
- W200338504 hasConcept C197077220 @default.
- W200338504 hasConcept C2776214188 @default.
- W200338504 hasConcept C2987395477 @default.
- W200338504 hasConcept C33498276 @default.
- W200338504 hasConcept C41008148 @default.
- W200338504 hasConcept C54355233 @default.
- W200338504 hasConcept C70721500 @default.
- W200338504 hasConcept C86803240 @default.
- W200338504 hasConceptScore W200338504C103278499 @default.
- W200338504 hasConceptScore W200338504C104317684 @default.
- W200338504 hasConceptScore W200338504C114009990 @default.
- W200338504 hasConceptScore W200338504C115961682 @default.
- W200338504 hasConceptScore W200338504C124101348 @default.
- W200338504 hasConceptScore W200338504C141231307 @default.
- W200338504 hasConceptScore W200338504C150194340 @default.
- W200338504 hasConceptScore W200338504C154945302 @default.
- W200338504 hasConceptScore W200338504C197077220 @default.
- W200338504 hasConceptScore W200338504C2776214188 @default.
- W200338504 hasConceptScore W200338504C2987395477 @default.
- W200338504 hasConceptScore W200338504C33498276 @default.
- W200338504 hasConceptScore W200338504C41008148 @default.
- W200338504 hasConceptScore W200338504C54355233 @default.
- W200338504 hasConceptScore W200338504C70721500 @default.
- W200338504 hasConceptScore W200338504C86803240 @default.
- W200338504 hasLocation W2003385041 @default.
- W200338504 hasOpenAccess W200338504 @default.
- W200338504 hasPrimaryLocation W2003385041 @default.
- W200338504 hasRelatedWork W1818600713 @default.
- W200338504 hasRelatedWork W1942191628 @default.
- W200338504 hasRelatedWork W1985563591 @default.
- W200338504 hasRelatedWork W2001903569 @default.
- W200338504 hasRelatedWork W2048148485 @default.
- W200338504 hasRelatedWork W2057731521 @default.
- W200338504 hasRelatedWork W2062677609 @default.
- W200338504 hasRelatedWork W2070047930 @default.
- W200338504 hasRelatedWork W2127316014 @default.
- W200338504 hasRelatedWork W2130189476 @default.
- W200338504 hasRelatedWork W2143197938 @default.
- W200338504 hasRelatedWork W2160875291 @default.
- W200338504 hasRelatedWork W2164891186 @default.
- W200338504 hasRelatedWork W2168639800 @default.
- W200338504 hasRelatedWork W2606112621 @default.
- W200338504 hasRelatedWork W756431481 @default.
- W200338504 hasRelatedWork W85880394 @default.
- W200338504 hasRelatedWork W2600135273 @default.
- W200338504 hasRelatedWork W2842484345 @default.
- W200338504 hasRelatedWork W2992704331 @default.
- W200338504 isParatext "false" @default.
- W200338504 isRetracted "false" @default.
- W200338504 magId "200338504" @default.
- W200338504 workType "article" @default.
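
The abstract above describes discretizing each gene's expression values into K equal-width symbols (after per-gene normalization into [0,1]), computing Mutual Information (MI) between genes, and forming an attachment value per GO category as a ratio of within-category to cross-category similarity. The following is a minimal sketch of those three steps, not the authors' implementation. The function names (`discretize`, `mutual_information`, `attachment_value`) are hypothetical. It also makes assumptions the abstract does not specify: MI is computed in bits from empirical symbol frequencies, within-category pairs are counted once (unordered), and a gene with constant expression maps to symbol 0.

```python
import math
from collections import Counter


def discretize(values, k=3):
    """Normalize one gene's values into [0, 1] and map each to one of k equal-width bins."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Assumption: a constant profile carries no information; use a single symbol.
        return [0] * len(values)
    symbols = []
    for v in values:
        x = (v - lo) / (hi - lo)
        # x == 1.0 would index bin k, so clamp it into the last bin (k - 1).
        symbols.append(min(int(x * k), k - 1))
    return symbols


def mutual_information(a, b):
    """Empirical mutual information (in bits) between two equal-length symbol sequences."""
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    count_ab = Counter(zip(a, b))
    mi = 0.0
    for (x, y), c in count_ab.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts.
        mi += (c / n) * math.log2(c * n / (count_a[x] * count_b[y]))
    return mi


def attachment_value(similarity, members, all_genes):
    """Sum of similarities over pairs inside a category, divided by the sum
    of similarities between category members and genes outside it."""
    inside = sorted(set(members))
    member_set = set(inside)
    outside = [g for g in all_genes if g not in member_set]
    # Assumption: each unordered within-category pair is counted once.
    inside_sum = sum(
        similarity(g, h) for i, g in enumerate(inside) for h in inside[i + 1:]
    )
    outside_sum = sum(similarity(g, h) for g in inside for h in outside)
    if outside_sum == 0:
        raise ValueError("no similarity between the category and outside genes")
    return inside_sum / outside_sum
```

With `similarity` defined as the MI between two discretized expression profiles, `attachment_value` corresponds to the MiAV described in the abstract; passing a function that returns BLAST e-value exponents would correspond to the BAV.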