Matches in SemOpenAlex for { <https://semopenalex.org/work/W2029478210> ?p ?o ?g. }
- W2029478210 abstract "The fungal pathogen Fusarium graminearum (telomorph Gibberella zeae) is the causal agent of several destructive crop diseases, where a set of genes usually work in concert to cause diseases to crops. To function appropriately, the F. graminearum proteins inside one cell should be assigned to different compartments, i.e. subcellular localizations. Therefore, the subcellular localizations of F. graminearum proteins can provide insights into protein functions and pathogenic mechanisms of this destructive pathogen fungus. Unfortunately, there are no subcellular localization information for F. graminearum proteins available now. Computational approaches provide an alternative way to predicting F. graminearum protein subcellular localizations due to the expensive and time-consuming biological experiments in lab. In this paper, we developed a novel predictor, namely FGsub, to predict F. graminearum protein subcellular localizations from the primary structures. First, a non-redundant fungi data set with subcellular localization annotation is collected from UniProtKB database and used as training set, where the subcellular locations are classified into 10 groups. Subsequently, Support Vector Machine (SVM) is trained on the training set and used to predict F. graminearum protein subcellular localizations for those proteins that do not have significant sequence similarity to those in training set. The performance of SVMs on training set with 10-fold cross-validation demonstrates the efficiency and effectiveness of the proposed method. In addition, for F. graminearum proteins that have significant sequence similarity to those in training set, BLAST is utilized to transfer annotations of homologous proteins to uncharacterized F. graminearum proteins so that the F. graminearum proteins are annotated more comprehensively. In this work, we present FGsub to predict F. graminearum protein subcellular localizations in a comprehensive manner. We make four fold contributions to this filed. First, we present a new algorithm to cope with imbalance problem that arises in protein subcellular localization prediction, which can solve imbalance problem and avoid false positive results. Second, we design an ensemble classifier which employs feature selection to further improve prediction accuracy. Third, we use BLAST to complement machine learning based methods, which enlarges our prediction coverage. Last and most important, we predict the subcellular localizations of 12786 F. graminearum proteins, which provide insights into protein functions and pathogenic mechanisms of this destructive pathogen fungus." @default.
- W2029478210 created "2016-06-24" @default.
- W2029478210 creator A5012283696 @default.
- W2029478210 creator A5019033812 @default.
- W2029478210 creator A5049435754 @default.
- W2029478210 creator A5091371305 @default.
- W2029478210 date "2010-09-01" @default.
- W2029478210 modified "2023-09-25" @default.
- W2029478210 title "FGsub: Fusarium graminearum protein subcellular localizations predicted from primary structures" @default.
- W2029478210 cites W1511433968 @default.
- W2029478210 cites W1751678145 @default.
- W2029478210 cites W1963496351 @default.
- W2029478210 cites W2006211612 @default.
- W2029478210 cites W2006782375 @default.
- W2029478210 cites W2012352014 @default.
- W2029478210 cites W2020194549 @default.
- W2029478210 cites W2025131366 @default.
- W2029478210 cites W2082605863 @default.
- W2029478210 cites W2103466939 @default.
- W2029478210 cites W2107146119 @default.
- W2029478210 cites W2110104044 @default.
- W2029478210 cites W2110659301 @default.
- W2029478210 cites W2111127373 @default.
- W2029478210 cites W2121651202 @default.
- W2029478210 cites W2124603595 @default.
- W2029478210 cites W2129483074 @default.
- W2029478210 cites W2132160618 @default.
- W2029478210 cites W2135322638 @default.
- W2029478210 cites W2136930152 @default.
- W2029478210 cites W2140602911 @default.
- W2029478210 cites W2145957695 @default.
- W2029478210 cites W2146620899 @default.
- W2029478210 cites W2152458080 @default.
- W2029478210 cites W2156125289 @default.
- W2029478210 cites W2160072419 @default.
- W2029478210 cites W2160979370 @default.
- W2029478210 cites W2162024319 @default.
- W2029478210 cites W2170700642 @default.
- W2029478210 cites W4211000692 @default.
- W2029478210 cites W4211237594 @default.
- W2029478210 doi "https://doi.org/10.1186/1752-0509-4-s2-s12" @default.
- W2029478210 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2982686" @default.
- W2029478210 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/20840726" @default.
- W2029478210 hasPublicationYear "2010" @default.
- W2029478210 type Work @default.
- W2029478210 sameAs 2029478210 @default.
- W2029478210 citedByCount "10" @default.
- W2029478210 countsByYear W20294782102012 @default.
- W2029478210 countsByYear W20294782102013 @default.
- W2029478210 countsByYear W20294782102014 @default.
- W2029478210 countsByYear W20294782102015 @default.
- W2029478210 crossrefType "journal-article" @default.
- W2029478210 hasAuthorship W2029478210A5012283696 @default.
- W2029478210 hasAuthorship W2029478210A5019033812 @default.
- W2029478210 hasAuthorship W2029478210A5049435754 @default.
- W2029478210 hasAuthorship W2029478210A5091371305 @default.
- W2029478210 hasBestOaLocation W20294782101 @default.
- W2029478210 hasConcept C10010492 @default.
- W2029478210 hasConcept C104317684 @default.
- W2029478210 hasConcept C12267149 @default.
- W2029478210 hasConcept C140051345 @default.
- W2029478210 hasConcept C154945302 @default.
- W2029478210 hasConcept C167625842 @default.
- W2029478210 hasConcept C202264299 @default.
- W2029478210 hasConcept C2776879804 @default.
- W2029478210 hasConcept C2778867309 @default.
- W2029478210 hasConcept C2781266966 @default.
- W2029478210 hasConcept C41008148 @default.
- W2029478210 hasConcept C54355233 @default.
- W2029478210 hasConcept C70721500 @default.
- W2029478210 hasConcept C86803240 @default.
- W2029478210 hasConceptScore W2029478210C10010492 @default.
- W2029478210 hasConceptScore W2029478210C104317684 @default.
- W2029478210 hasConceptScore W2029478210C12267149 @default.
- W2029478210 hasConceptScore W2029478210C140051345 @default.
- W2029478210 hasConceptScore W2029478210C154945302 @default.
- W2029478210 hasConceptScore W2029478210C167625842 @default.
- W2029478210 hasConceptScore W2029478210C202264299 @default.
- W2029478210 hasConceptScore W2029478210C2776879804 @default.
- W2029478210 hasConceptScore W2029478210C2778867309 @default.
- W2029478210 hasConceptScore W2029478210C2781266966 @default.
- W2029478210 hasConceptScore W2029478210C41008148 @default.
- W2029478210 hasConceptScore W2029478210C54355233 @default.
- W2029478210 hasConceptScore W2029478210C70721500 @default.
- W2029478210 hasConceptScore W2029478210C86803240 @default.
- W2029478210 hasIssue "S2" @default.
- W2029478210 hasLocation W20294782101 @default.
- W2029478210 hasLocation W20294782102 @default.
- W2029478210 hasLocation W20294782103 @default.
- W2029478210 hasLocation W20294782104 @default.
- W2029478210 hasOpenAccess W2029478210 @default.
- W2029478210 hasPrimaryLocation W20294782101 @default.
- W2029478210 hasRelatedWork W1966526845 @default.
- W2029478210 hasRelatedWork W2029478210 @default.
- W2029478210 hasRelatedWork W2036790151 @default.
- W2029478210 hasRelatedWork W2063149544 @default.
- W2029478210 hasRelatedWork W2077919959 @default.
- W2029478210 hasRelatedWork W2105523601 @default.
- W2029478210 hasRelatedWork W2132704278 @default.
- W2029478210 hasRelatedWork W2730472814 @default.