Matches in SemOpenAlex for { <https://semopenalex.org/work/W3207518904> ?p ?o ?g. }
- W3207518904 endingPage "116051" @default.
- W3207518904 startingPage "116051" @default.
- W3207518904 abstract "The e-government platform not only enables the government department to publish policy texts online, but also makes it easier for users to access the policy, especially for the convenience of understanding the policies by reading the keywords. For a given policy text, keywords take up only a small proportion, which can be seen as an unbalanced data set. Therefore, in this paper, we try to design automatic keyword extraction method of policy text with unbalanced data set. In order to achieve this goal, we firstly propose a new ensemble oversampling method to synthesize new data. In this case, we sample data from the training set by using Bagging method. During each sampling process, we train a logistic regression model to classify the training set. Based on the predicted probabilities, we utilize the classification confidence to divide training set into three regions by using three-way decisions (3WD). Then, we implement different strategies to synthesize new data. Besides, for keyword extraction of policy text, we conduct a series of experiments by using the classical supervised and unsupervised methods. In our experiment results, we can find that both in the public data sets and manual data sets, our sampling method can achieve better performance of F-measure and G-mean indexes, no matter what the supervised machine learning method is. This can also explain the advantage of 3WD. Different regions have different strategies to synthesize new data." @default.
- W3207518904 created "2021-10-25" @default.
- W3207518904 creator A5041033174 @default.
- W3207518904 creator A5059386507 @default.
- W3207518904 creator A5069907313 @default.
- W3207518904 creator A5088064886 @default.
- W3207518904 date "2022-02-01" @default.
- W3207518904 modified "2023-10-01" @default.
- W3207518904 title "Exploring ensemble oversampling method for imbalanced keyword extraction learning in policy text based on three-way decisions and SMOTE" @default.
- W3207518904 cites W1490343430 @default.
- W3207518904 cites W1509979276 @default.
- W3207518904 cites W1993220166 @default.
- W3207518904 cites W2000206219 @default.
- W3207518904 cites W2024935628 @default.
- W3207518904 cites W2029729099 @default.
- W3207518904 cites W2064418625 @default.
- W3207518904 cites W2070813883 @default.
- W3207518904 cites W2118978333 @default.
- W3207518904 cites W2132791018 @default.
- W3207518904 cites W2148143831 @default.
- W3207518904 cites W2168508521 @default.
- W3207518904 cites W2191253925 @default.
- W3207518904 cites W2317515691 @default.
- W3207518904 cites W2343346879 @default.
- W3207518904 cites W2531607313 @default.
- W3207518904 cites W2562319768 @default.
- W3207518904 cites W2593914038 @default.
- W3207518904 cites W2600072788 @default.
- W3207518904 cites W2766296277 @default.
- W3207518904 cites W2772993019 @default.
- W3207518904 cites W2792234535 @default.
- W3207518904 cites W2800788706 @default.
- W3207518904 cites W2887596728 @default.
- W3207518904 cites W2896206046 @default.
- W3207518904 cites W2913485292 @default.
- W3207518904 cites W2953722276 @default.
- W3207518904 cites W2981905647 @default.
- W3207518904 cites W2985875790 @default.
- W3207518904 cites W2987225788 @default.
- W3207518904 cites W2992548114 @default.
- W3207518904 cites W2996241447 @default.
- W3207518904 cites W3005687093 @default.
- W3207518904 cites W3014524176 @default.
- W3207518904 cites W3016385481 @default.
- W3207518904 cites W3025327496 @default.
- W3207518904 cites W3026092610 @default.
- W3207518904 cites W3030406084 @default.
- W3207518904 cites W3033266910 @default.
- W3207518904 cites W3035802597 @default.
- W3207518904 cites W3045116697 @default.
- W3207518904 cites W3046771924 @default.
- W3207518904 doi "https://doi.org/10.1016/j.eswa.2021.116051" @default.
- W3207518904 hasPublicationYear "2022" @default.
- W3207518904 type Work @default.
- W3207518904 sameAs 3207518904 @default.
- W3207518904 citedByCount "11" @default.
- W3207518904 countsByYear W32075189042022 @default.
- W3207518904 countsByYear W32075189042023 @default.
- W3207518904 crossrefType "journal-article" @default.
- W3207518904 hasAuthorship W3207518904A5041033174 @default.
- W3207518904 hasAuthorship W3207518904A5059386507 @default.
- W3207518904 hasAuthorship W3207518904A5069907313 @default.
- W3207518904 hasAuthorship W3207518904A5088064886 @default.
- W3207518904 hasConcept C106131492 @default.
- W3207518904 hasConcept C111919701 @default.
- W3207518904 hasConcept C119857082 @default.
- W3207518904 hasConcept C124101348 @default.
- W3207518904 hasConcept C136389625 @default.
- W3207518904 hasConcept C140779682 @default.
- W3207518904 hasConcept C154945302 @default.
- W3207518904 hasConcept C177264268 @default.
- W3207518904 hasConcept C185592680 @default.
- W3207518904 hasConcept C197323446 @default.
- W3207518904 hasConcept C198531522 @default.
- W3207518904 hasConcept C199360897 @default.
- W3207518904 hasConcept C2776257435 @default.
- W3207518904 hasConcept C31258907 @default.
- W3207518904 hasConcept C31972630 @default.
- W3207518904 hasConcept C41008148 @default.
- W3207518904 hasConcept C43617362 @default.
- W3207518904 hasConcept C50644808 @default.
- W3207518904 hasConcept C98045186 @default.
- W3207518904 hasConceptScore W3207518904C106131492 @default.
- W3207518904 hasConceptScore W3207518904C111919701 @default.
- W3207518904 hasConceptScore W3207518904C119857082 @default.
- W3207518904 hasConceptScore W3207518904C124101348 @default.
- W3207518904 hasConceptScore W3207518904C136389625 @default.
- W3207518904 hasConceptScore W3207518904C140779682 @default.
- W3207518904 hasConceptScore W3207518904C154945302 @default.
- W3207518904 hasConceptScore W3207518904C177264268 @default.
- W3207518904 hasConceptScore W3207518904C185592680 @default.
- W3207518904 hasConceptScore W3207518904C197323446 @default.
- W3207518904 hasConceptScore W3207518904C198531522 @default.
- W3207518904 hasConceptScore W3207518904C199360897 @default.
- W3207518904 hasConceptScore W3207518904C2776257435 @default.
- W3207518904 hasConceptScore W3207518904C31258907 @default.
- W3207518904 hasConceptScore W3207518904C31972630 @default.
- W3207518904 hasConceptScore W3207518904C41008148 @default.