Matches in SemOpenAlex for { <https://semopenalex.org/work/W3025740135> ?p ?o ?g. }
Showing items 1 to 92 of 92 with 100 items per page.
- W3025740135 endingPage "91226" @default.
- W3025740135 startingPage "91213" @default.
- W3025740135 abstract "In recent years, unethical behavior in the cyber-environment has been revealed. The presence of offensive language on social media platforms and automatic detection of such language is becoming a major challenge in modern society. The complexity of natural language constructs makes this task even more challenging. Until now, most of the research has focused on resource-rich languages like English. Roman Urdu and Urdu are two scripts of writing the Urdu language on social media. The Roman script uses the English language characters while the Urdu script uses Urdu language characters. Urdu and Hindi languages are similar with the only difference in their writing script but the Roman scripts of both languages are similar. This study is about the detection of offensive language from the user's comments presented in a resource-poor language Urdu. We propose the first offensive dataset of Urdu containing user-generated comments from social media. We use individual and combined n-grams techniques to extract features at character-level and word-level. We apply seventeen classifiers from seven machine learning techniques to detect offensive language from both Urdu and Roman Urdu text comments. Experiments show that the regression-based models using character n-grams show superior performance to process the Urdu language. Character-level tri-gram outperforms the other word and character n-grams. LogitBoost and SimpleLogistic outperform the other models and achieve 99.2% and 95.9% values of F-measure on Roman Urdu and Urdu datasets respectively. Our designed dataset is publicly available on GitHub for future research." @default.
- W3025740135 created "2020-05-21" @default.
- W3025740135 creator A5006388286 @default.
- W3025740135 creator A5025341361 @default.
- W3025740135 creator A5032840064 @default.
- W3025740135 creator A5034516128 @default.
- W3025740135 creator A5065810383 @default.
- W3025740135 date "2020-01-01" @default.
- W3025740135 modified "2023-10-17" @default.
- W3025740135 title "Automatic Detection of Offensive Language for Urdu and Roman Urdu" @default.
- W3025740135 cites W1871142974 @default.
- W3025740135 cites W2007148919 @default.
- W3025740135 cites W2044173330 @default.
- W3025740135 cites W2082610780 @default.
- W3025740135 cites W2129176157 @default.
- W3025740135 cites W2133990480 @default.
- W3025740135 cites W2160685721 @default.
- W3025740135 cites W2181854537 @default.
- W3025740135 cites W2311430799 @default.
- W3025740135 cites W2409439155 @default.
- W3025740135 cites W2513138008 @default.
- W3025740135 cites W2612186323 @default.
- W3025740135 cites W2792441346 @default.
- W3025740135 cites W2792670840 @default.
- W3025740135 cites W2806420696 @default.
- W3025740135 cites W2808869054 @default.
- W3025740135 cites W2811056307 @default.
- W3025740135 cites W2851904645 @default.
- W3025740135 cites W2889171966 @default.
- W3025740135 cites W2891636837 @default.
- W3025740135 cites W2901224794 @default.
- W3025740135 cites W2901236145 @default.
- W3025740135 cites W2902248801 @default.
- W3025740135 cites W2953692061 @default.
- W3025740135 cites W2954691966 @default.
- W3025740135 cites W2962977603 @default.
- W3025740135 cites W2963481894 @default.
- W3025740135 cites W2966292608 @default.
- W3025740135 cites W2969026449 @default.
- W3025740135 cites W3008993296 @default.
- W3025740135 doi "https://doi.org/10.1109/access.2020.2994950" @default.
- W3025740135 hasPublicationYear "2020" @default.
- W3025740135 type Work @default.
- W3025740135 sameAs 3025740135 @default.
- W3025740135 citedByCount "54" @default.
- W3025740135 countsByYear W30257401352020 @default.
- W3025740135 countsByYear W30257401352021 @default.
- W3025740135 countsByYear W30257401352022 @default.
- W3025740135 countsByYear W30257401352023 @default.
- W3025740135 crossrefType "journal-article" @default.
- W3025740135 hasAuthorship W3025740135A5006388286 @default.
- W3025740135 hasAuthorship W3025740135A5025341361 @default.
- W3025740135 hasAuthorship W3025740135A5032840064 @default.
- W3025740135 hasAuthorship W3025740135A5034516128 @default.
- W3025740135 hasAuthorship W3025740135A5065810383 @default.
- W3025740135 hasBestOaLocation W30257401351 @default.
- W3025740135 hasConcept C138885662 @default.
- W3025740135 hasConcept C176856949 @default.
- W3025740135 hasConcept C2777350258 @default.
- W3025740135 hasConcept C33923547 @default.
- W3025740135 hasConcept C41008148 @default.
- W3025740135 hasConcept C41895202 @default.
- W3025740135 hasConcept C42475967 @default.
- W3025740135 hasConceptScore W3025740135C138885662 @default.
- W3025740135 hasConceptScore W3025740135C176856949 @default.
- W3025740135 hasConceptScore W3025740135C2777350258 @default.
- W3025740135 hasConceptScore W3025740135C33923547 @default.
- W3025740135 hasConceptScore W3025740135C41008148 @default.
- W3025740135 hasConceptScore W3025740135C41895202 @default.
- W3025740135 hasConceptScore W3025740135C42475967 @default.
- W3025740135 hasFunder F4320321001 @default.
- W3025740135 hasLocation W30257401351 @default.
- W3025740135 hasLocation W30257401352 @default.
- W3025740135 hasOpenAccess W3025740135 @default.
- W3025740135 hasPrimaryLocation W30257401351 @default.
- W3025740135 hasRelatedWork W119993593 @default.
- W3025740135 hasRelatedWork W2481063371 @default.
- W3025740135 hasRelatedWork W2546719740 @default.
- W3025740135 hasRelatedWork W2901224794 @default.
- W3025740135 hasRelatedWork W3025740135 @default.
- W3025740135 hasRelatedWork W3099150152 @default.
- W3025740135 hasRelatedWork W3108035982 @default.
- W3025740135 hasRelatedWork W3113424104 @default.
- W3025740135 hasRelatedWork W3189446895 @default.
- W3025740135 hasRelatedWork W4200076228 @default.
- W3025740135 hasVolume "8" @default.
- W3025740135 isParatext "false" @default.
- W3025740135 isRetracted "false" @default.
- W3025740135 magId "3025740135" @default.
- W3025740135 workType "article" @default.