Matches in SemOpenAlex for { <https://semopenalex.org/work/W2510188630> ?p ?o ?g. }
Showing items 1 to 94 of
94
with 100 items per page.
- W2510188630 abstract "Abstract This paper presents an efficient text mining method focusing on extraction and updating of unknown words (unknown foreign words) to improve data classification and POS tags. Proposed methods can also help to improve the accuracy of mining frequent pattern and association rules from unstructured (textual) data. Many researches have been done by numerous scholars on estimation and segmentation for unknown words, but, they are limited to grammatical and linguistic rules with limited vocabulary. In our project we have consider the fact, that no language is free from the influence of foreign languages, especially, country like Korea where there is a rapid improvement in the area of culture and media and the frequent usage of these foreign languages, resulted in mixing up different languages, their style along with slangs and also abbreviated words in daily life and conversation. The main characteristic of our system is to find such unknown foreign words and update them to appropriate words, which depends on available information through dictionaries. We have also explained the essential natural language processing (NLP) tools used for data processing. Our proposed method used simple but efficient techniques, first it converts the data into structured form, using data preprocessing techniques. In this phase data passes through different stages, such as, cleaning, integration and selection of important data, and then it gets organized into databases structure for further analysis and processing. This database consists of different kinds of dictionaries, our system heavily based on dictionaries. We have manually created various kinds of dictionaries for different kinds of unknown foreign words processing and analysis with the help of our team members. Our proposed methods for discovering and updating foreign unknown word, first discovers the foreign word using morphological analysis with the help of automatically and manually created dictionaries, then suffix trimming and word segmentation, next our algorithm checks for its different written pattern using dictionaries according to its spelling and synonym word in native language (Korean) and also, updates the POS tags. We have tested on different collection of data from economics news, beauty & fashion and college student blogs, the results have shown great efficiency and improvement, and they were adequate enough to research further." @default.
- W2510188630 created "2016-09-16" @default.
- W2510188630 creator A5013026056 @default.
- W2510188630 creator A5074946816 @default.
- W2510188630 date "2016-01-01" @default.
- W2510188630 modified "2023-09-23" @default.
- W2510188630 title "Lexicon-corpus Based Korean Unknown Foreign Word Extraction and Updating Using Syllable Identification" @default.
- W2510188630 cites W1558333962 @default.
- W2510188630 cites W1597161471 @default.
- W2510188630 cites W1982666937 @default.
- W2510188630 cites W2053029294 @default.
- W2510188630 cites W2113169868 @default.
- W2510188630 cites W2138294949 @default.
- W2510188630 cites W2141093894 @default.
- W2510188630 cites W2149794389 @default.
- W2510188630 cites W2155894186 @default.
- W2510188630 cites W2166559705 @default.
- W2510188630 cites W2222807470 @default.
- W2510188630 doi "https://doi.org/10.1016/j.proeng.2016.07.445" @default.
- W2510188630 hasPublicationYear "2016" @default.
- W2510188630 type Work @default.
- W2510188630 sameAs 2510188630 @default.
- W2510188630 citedByCount "0" @default.
- W2510188630 crossrefType "journal-article" @default.
- W2510188630 hasAuthorship W2510188630A5013026056 @default.
- W2510188630 hasAuthorship W2510188630A5074946816 @default.
- W2510188630 hasBestOaLocation W25101886301 @default.
- W2510188630 hasConcept C10551718 @default.
- W2510188630 hasConcept C109089402 @default.
- W2510188630 hasConcept C116834253 @default.
- W2510188630 hasConcept C138885662 @default.
- W2510188630 hasConcept C154945302 @default.
- W2510188630 hasConcept C188338183 @default.
- W2510188630 hasConcept C195807954 @default.
- W2510188630 hasConcept C204321447 @default.
- W2510188630 hasConcept C2777601683 @default.
- W2510188630 hasConcept C2778121359 @default.
- W2510188630 hasConcept C28490314 @default.
- W2510188630 hasConcept C34736171 @default.
- W2510188630 hasConcept C41008148 @default.
- W2510188630 hasConcept C41895202 @default.
- W2510188630 hasConcept C59822182 @default.
- W2510188630 hasConcept C81917197 @default.
- W2510188630 hasConcept C86803240 @default.
- W2510188630 hasConcept C89600930 @default.
- W2510188630 hasConcept C90805587 @default.
- W2510188630 hasConcept C98501671 @default.
- W2510188630 hasConceptScore W2510188630C10551718 @default.
- W2510188630 hasConceptScore W2510188630C109089402 @default.
- W2510188630 hasConceptScore W2510188630C116834253 @default.
- W2510188630 hasConceptScore W2510188630C138885662 @default.
- W2510188630 hasConceptScore W2510188630C154945302 @default.
- W2510188630 hasConceptScore W2510188630C188338183 @default.
- W2510188630 hasConceptScore W2510188630C195807954 @default.
- W2510188630 hasConceptScore W2510188630C204321447 @default.
- W2510188630 hasConceptScore W2510188630C2777601683 @default.
- W2510188630 hasConceptScore W2510188630C2778121359 @default.
- W2510188630 hasConceptScore W2510188630C28490314 @default.
- W2510188630 hasConceptScore W2510188630C34736171 @default.
- W2510188630 hasConceptScore W2510188630C41008148 @default.
- W2510188630 hasConceptScore W2510188630C41895202 @default.
- W2510188630 hasConceptScore W2510188630C59822182 @default.
- W2510188630 hasConceptScore W2510188630C81917197 @default.
- W2510188630 hasConceptScore W2510188630C86803240 @default.
- W2510188630 hasConceptScore W2510188630C89600930 @default.
- W2510188630 hasConceptScore W2510188630C90805587 @default.
- W2510188630 hasConceptScore W2510188630C98501671 @default.
- W2510188630 hasLocation W25101886301 @default.
- W2510188630 hasOpenAccess W2510188630 @default.
- W2510188630 hasPrimaryLocation W25101886301 @default.
- W2510188630 hasRelatedWork W114050236 @default.
- W2510188630 hasRelatedWork W1192346563 @default.
- W2510188630 hasRelatedWork W1547448759 @default.
- W2510188630 hasRelatedWork W1574373040 @default.
- W2510188630 hasRelatedWork W2014912547 @default.
- W2510188630 hasRelatedWork W2081664963 @default.
- W2510188630 hasRelatedWork W2094265589 @default.
- W2510188630 hasRelatedWork W213502119 @default.
- W2510188630 hasRelatedWork W2250542595 @default.
- W2510188630 hasRelatedWork W2488290647 @default.
- W2510188630 hasRelatedWork W2535603471 @default.
- W2510188630 hasRelatedWork W2595034312 @default.
- W2510188630 hasRelatedWork W2776837312 @default.
- W2510188630 hasRelatedWork W2784050058 @default.
- W2510188630 hasRelatedWork W2804946693 @default.
- W2510188630 hasRelatedWork W2899065432 @default.
- W2510188630 hasRelatedWork W3088743876 @default.
- W2510188630 hasRelatedWork W34052887 @default.
- W2510188630 hasRelatedWork W393951510 @default.
- W2510188630 hasRelatedWork W843948033 @default.
- W2510188630 isParatext "false" @default.
- W2510188630 isRetracted "false" @default.
- W2510188630 magId "2510188630" @default.
- W2510188630 workType "article" @default.