Matches in SemOpenAlex for { <https://semopenalex.org/work/W1945816218> ?p ?o ?g. }
Showing items 1 to 71 of 71 with 100 items per page.
- W1945816218 abstract "This thesis seeks to establish if the use of negation in Inductive Rule Learning (IRL) for text classification is effective. Text classification is a widely researched topic in the domain of data mining. There have been many techniques directed at text classification; one of them is IRL, widely chosen because of its simplicity, comprehensibility and interpretability by humans. IRL is a process whereby rules in the form of $antecedent -> conclusion$ are learnt to build a classifier. Thus, the learnt classifier comprises a set of rules, which are used to perform classification. To learn a rule, words from pre-labelled documents, known as features, are selected to be used as conjunctions in the rule antecedent. These rules typically do not include any negated features in their antecedent; although in some cases, as demonstrated in this thesis, the inclusion of negation is required and beneficial for the text classification task. With respect to the use of negation in IRL, two issues need to be addressed: (i) the identification of the features to be negated and (ii) the improvisation of rule refinement strategies to generate rules both with and without negation. To address the first issue, feature space division is proposed, whereby the feature space containing features to be used for rule refinement is divided into three sub-spaces to facilitate the identification of the features which can be advantageously negated. To address the second issue, eight rule refinement strategies are proposed, which are able to generate rules both with and without negation. Typically, single keywords which are deemed significant to differentiate between classes are selected to be used in the text representation in the text classification task. Phrases have also been proposed because they are considered to be semantically richer than single keywords.
Therefore, with respect to the work conducted in this thesis, three different types of phrases ($n$-gram phrases, keyphrases and fuzzy phrases) are extracted to be used as the text representation in addition to the use of single keywords. To establish the effectiveness of the use of negation in IRL, the eight proposed rule refinement strategies are compared with one another, using keywords and the three different types of phrases as the text representation, to determine whether the best strategy is one which generates rules with negation or without negation. Two types of classification tasks are conducted: binary classification and multi-class classification. The best strategy in the proposed IRL mechanism is compared to five existing text classification techniques with respect to binary classification: (i) the Sequential Minimal Optimization (SMO) algorithm, (ii) Naive Bayes (NB), (iii) JRip, (iv) OlexGreedy and (v) OlexGA from the Waikato Environment for Knowledge Analysis (WEKA) machine learning workbench. In the multi-class classification task, the proposed IRL mechanism is compared to the Total From Partial Classification (TFPC) algorithm. The datasets used in the experiments include three text datasets (20 Newsgroups, Reuters-21578 and the Small Animal Veterinary Surveillance Network (SAVSNET) dataset) and five UCI Machine Learning Repository tabular datasets. The results obtained from the experiments showed that the strategies which generated rules with negation were more effective when the keyword representation was used and less prominent when the phrase representations were used. Strategies which generated rules with negation also performed better with respect to binary classification compared to multi-class classification. In comparison with the other machine learning techniques selected, the proposed IRL mechanism was shown to generally outperform all the compared techniques and was competitive with SMO." @default.
- W1945816218 created "2016-06-24" @default.
- W1945816218 creator A5068898351 @default.
- W1945816218 date "2012-06-01" @default.
- W1945816218 modified "2023-09-26" @default.
- W1945816218 title "An investigation into the use of negation in Inductive Rule Learning for text classification" @default.
- W1945816218 hasPublicationYear "2012" @default.
- W1945816218 type Work @default.
- W1945816218 sameAs 1945816218 @default.
- W1945816218 citedByCount "0" @default.
- W1945816218 crossrefType "dissertation" @default.
- W1945816218 hasAuthorship W1945816218A5068898351 @default.
- W1945816218 hasConcept C116834253 @default.
- W1945816218 hasConcept C119857082 @default.
- W1945816218 hasConcept C138496976 @default.
- W1945816218 hasConcept C149271511 @default.
- W1945816218 hasConcept C154945302 @default.
- W1945816218 hasConcept C15744967 @default.
- W1945816218 hasConcept C199360897 @default.
- W1945816218 hasConcept C204321447 @default.
- W1945816218 hasConcept C2185349 @default.
- W1945816218 hasConcept C2779382394 @default.
- W1945816218 hasConcept C2781067378 @default.
- W1945816218 hasConcept C2781256819 @default.
- W1945816218 hasConcept C41008148 @default.
- W1945816218 hasConcept C59822182 @default.
- W1945816218 hasConcept C86803240 @default.
- W1945816218 hasConcept C95623464 @default.
- W1945816218 hasConceptScore W1945816218C116834253 @default.
- W1945816218 hasConceptScore W1945816218C119857082 @default.
- W1945816218 hasConceptScore W1945816218C138496976 @default.
- W1945816218 hasConceptScore W1945816218C149271511 @default.
- W1945816218 hasConceptScore W1945816218C154945302 @default.
- W1945816218 hasConceptScore W1945816218C15744967 @default.
- W1945816218 hasConceptScore W1945816218C199360897 @default.
- W1945816218 hasConceptScore W1945816218C204321447 @default.
- W1945816218 hasConceptScore W1945816218C2185349 @default.
- W1945816218 hasConceptScore W1945816218C2779382394 @default.
- W1945816218 hasConceptScore W1945816218C2781067378 @default.
- W1945816218 hasConceptScore W1945816218C2781256819 @default.
- W1945816218 hasConceptScore W1945816218C41008148 @default.
- W1945816218 hasConceptScore W1945816218C59822182 @default.
- W1945816218 hasConceptScore W1945816218C86803240 @default.
- W1945816218 hasConceptScore W1945816218C95623464 @default.
- W1945816218 hasLocation W19458162181 @default.
- W1945816218 hasOpenAccess W1945816218 @default.
- W1945816218 hasPrimaryLocation W19458162181 @default.
- W1945816218 hasRelatedWork W141548771 @default.
- W1945816218 hasRelatedWork W1508807071 @default.
- W1945816218 hasRelatedWork W1532837156 @default.
- W1945816218 hasRelatedWork W1549533884 @default.
- W1945816218 hasRelatedWork W1553199198 @default.
- W1945816218 hasRelatedWork W1576941664 @default.
- W1945816218 hasRelatedWork W1634742235 @default.
- W1945816218 hasRelatedWork W1653270119 @default.
- W1945816218 hasRelatedWork W1666379114 @default.
- W1945816218 hasRelatedWork W1965229441 @default.
- W1945816218 hasRelatedWork W1991520024 @default.
- W1945816218 hasRelatedWork W2061664277 @default.
- W1945816218 hasRelatedWork W2097633543 @default.
- W1945816218 hasRelatedWork W2126975755 @default.
- W1945816218 hasRelatedWork W2136588307 @default.
- W1945816218 hasRelatedWork W2158256570 @default.
- W1945816218 hasRelatedWork W2188562394 @default.
- W1945816218 hasRelatedWork W2288034745 @default.
- W1945816218 hasRelatedWork W2886642366 @default.
- W1945816218 hasRelatedWork W2900652827 @default.
- W1945816218 isParatext "false" @default.
- W1945816218 isRetracted "false" @default.
- W1945816218 magId "1945816218" @default.
- W1945816218 workType "dissertation" @default.
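
The abstract of W1945816218 describes rules of the form antecedent -> conclusion, where the antecedent is a conjunction of features that may include negated features. A minimal sketch of how such a rule could be applied to a document is given below. The `Rule` class, the `classify` helper, the first-match ordering and the example words are all illustrative assumptions; they are not taken from the thesis and do not reproduce its rule refinement strategies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical rule: all `required` words present, no `negated` word present."""
    required: frozenset
    negated: frozenset
    label: str

    def covers(self, features):
        # The antecedent is a conjunction: every required feature must occur
        # and none of the negated features may occur.
        return self.required <= features and not (self.negated & features)


def classify(rules, document, default="unknown"):
    # Assumption: features are lower-cased whitespace tokens and the
    # first matching rule decides the class.
    features = set(document.lower().split())
    for rule in rules:
        if rule.covers(features):
            return rule.label
    return default


# Illustrative rules only; the words and labels are made up.
rules = [
    Rule(frozenset({"apple"}), frozenset({"pie", "orchard"}), "technology"),
    Rule(frozenset({"apple"}), frozenset(), "food"),
]

print(classify(rules, "apple releases new phone"))
print(classify(rules, "apple pie recipe"))
```

In this sketch the negated features in the first rule stop it from firing on documents about food, so the second, more general rule handles them. This is one simple way to picture why negation in the antecedent can help separate classes that share a keyword.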