Matches in SemOpenAlex for { <https://semopenalex.org/work/W2005229640> ?p ?o ?g. }
Showing items 1 to 84 of 84 with 100 items per page.
- W2005229640 abstract "Identifying the coding type of Chinese text is a major and challenging issue in Chinese web content audit and analysis. In this paper we develop a novel algorithm based on the theory of Kolmogorov complexity to identify the coding type of the Chinese characters in a given text segment. An array of text compressors is used as filters to evaluate the information distance between the text under examination and training corpora coded in different coding types. According to Kolmogorov theory, this information distance can be used to decide the coding type. A particular compression algorithm is used to minimize computational complexity by separating the code book training stage from the compression stage. Finally, we present experimental results that confirm the accuracy and performance of the algorithm. The results also show that the algorithm is especially efficient on short text segments compared with n-gram algorithms." @default.
- W2005229640 created "2016-06-24" @default.
- W2005229640 creator A5040329242 @default.
- W2005229640 creator A5049920844 @default.
- W2005229640 creator A5052913836 @default.
- W2005229640 creator A5062314947 @default.
- W2005229640 date "2010-09-01" @default.
- W2005229640 modified "2023-09-22" @default.
- W2005229640 title "Chinese coding type identification based on Kolmogorov complexity theory" @default.
- W2005229640 cites W2103070807 @default.
- W2005229640 cites W2107913424 @default.
- W2005229640 cites W2128859735 @default.
- W2005229640 cites W2144221002 @default.
- W2005229640 cites W2155615465 @default.
- W2005229640 cites W2157090926 @default.
- W2005229640 doi "https://doi.org/10.1109/icnidc.2010.5657789" @default.
- W2005229640 hasPublicationYear "2010" @default.
- W2005229640 type Work @default.
- W2005229640 sameAs 2005229640 @default.
- W2005229640 citedByCount "0" @default.
- W2005229640 crossrefType "proceedings-article" @default.
- W2005229640 hasAuthorship W2005229640A5040329242 @default.
- W2005229640 hasAuthorship W2005229640A5049920844 @default.
- W2005229640 hasAuthorship W2005229640A5052913836 @default.
- W2005229640 hasAuthorship W2005229640A5062314947 @default.
- W2005229640 hasConcept C105795698 @default.
- W2005229640 hasConcept C113709454 @default.
- W2005229640 hasConcept C11413529 @default.
- W2005229640 hasConcept C116834253 @default.
- W2005229640 hasConcept C124101348 @default.
- W2005229640 hasConcept C154945302 @default.
- W2005229640 hasConcept C179518139 @default.
- W2005229640 hasConcept C179799912 @default.
- W2005229640 hasConcept C204321447 @default.
- W2005229640 hasConcept C2779341405 @default.
- W2005229640 hasConcept C33923547 @default.
- W2005229640 hasConcept C41008148 @default.
- W2005229640 hasConcept C52622258 @default.
- W2005229640 hasConcept C59822182 @default.
- W2005229640 hasConcept C80444323 @default.
- W2005229640 hasConcept C86803240 @default.
- W2005229640 hasConceptScore W2005229640C105795698 @default.
- W2005229640 hasConceptScore W2005229640C113709454 @default.
- W2005229640 hasConceptScore W2005229640C11413529 @default.
- W2005229640 hasConceptScore W2005229640C116834253 @default.
- W2005229640 hasConceptScore W2005229640C124101348 @default.
- W2005229640 hasConceptScore W2005229640C154945302 @default.
- W2005229640 hasConceptScore W2005229640C179518139 @default.
- W2005229640 hasConceptScore W2005229640C179799912 @default.
- W2005229640 hasConceptScore W2005229640C204321447 @default.
- W2005229640 hasConceptScore W2005229640C2779341405 @default.
- W2005229640 hasConceptScore W2005229640C33923547 @default.
- W2005229640 hasConceptScore W2005229640C41008148 @default.
- W2005229640 hasConceptScore W2005229640C52622258 @default.
- W2005229640 hasConceptScore W2005229640C59822182 @default.
- W2005229640 hasConceptScore W2005229640C80444323 @default.
- W2005229640 hasConceptScore W2005229640C86803240 @default.
- W2005229640 hasLocation W20052296401 @default.
- W2005229640 hasOpenAccess W2005229640 @default.
- W2005229640 hasPrimaryLocation W20052296401 @default.
- W2005229640 hasRelatedWork W1514337702 @default.
- W2005229640 hasRelatedWork W1541776932 @default.
- W2005229640 hasRelatedWork W1583079388 @default.
- W2005229640 hasRelatedWork W1601718731 @default.
- W2005229640 hasRelatedWork W1791004188 @default.
- W2005229640 hasRelatedWork W1938301739 @default.
- W2005229640 hasRelatedWork W1979150118 @default.
- W2005229640 hasRelatedWork W1996039391 @default.
- W2005229640 hasRelatedWork W2022077460 @default.
- W2005229640 hasRelatedWork W2036377381 @default.
- W2005229640 hasRelatedWork W2051264459 @default.
- W2005229640 hasRelatedWork W2069905857 @default.
- W2005229640 hasRelatedWork W2085022170 @default.
- W2005229640 hasRelatedWork W2087791076 @default.
- W2005229640 hasRelatedWork W2128720666 @default.
- W2005229640 hasRelatedWork W2128859735 @default.
- W2005229640 hasRelatedWork W2144634917 @default.
- W2005229640 hasRelatedWork W2151961172 @default.
- W2005229640 hasRelatedWork W2222059330 @default.
- W2005229640 hasRelatedWork W651911422 @default.
- W2005229640 isParatext "false" @default.
- W2005229640 isRetracted "false" @default.
- W2005229640 magId "2005229640" @default.
- W2005229640 workType "article" @default.
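
The abstract above describes deciding a coding type by comparing, through compression, the text under examination with training corpora in each candidate coding. A minimal sketch of that general idea follows. It is not the paper's algorithm: it assumes the normalized compression distance (NCD) as a practical stand-in for information distance and uses Python's `zlib` as the compressor, whereas the paper uses its own compressor with a separate code book training stage. The names `SAMPLE_TEXT`, `build_corpora`, `ncd`, and `identify_encoding` are hypothetical, and the tiny training text is for illustration only.

```python
import zlib
from typing import Dict, Mapping

# Hypothetical training text, restricted to characters that exist in
# Big5, GBK, and UTF-8 so that every encoding step succeeds.
SAMPLE_TEXT = (
    "中文文字的方法是重要的，本文用中文文字表示信息。"
    "信息的表示方法是本文的重要要求。"
)


def compressed_size(data: bytes) -> int:
    """Length of the zlib-compressed form, a rough proxy for complexity."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance between two byte strings."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def build_corpora(text: str) -> Dict[str, bytes]:
    """Encode the same training text once per candidate coding type."""
    return {name: text.encode(name) for name in ("big5", "gbk", "utf-8")}


def identify_encoding(segment: bytes, corpora: Mapping[str, bytes]) -> str:
    """Return the coding type whose corpus is closest to the segment."""
    return min(corpora, key=lambda name: ncd(segment, corpora[name]))


if __name__ == "__main__":
    corpora = build_corpora(SAMPLE_TEXT)
    unknown = SAMPLE_TEXT[:12].encode("gbk")  # pretend the coding is unknown
    print(identify_encoding(unknown, corpora))
```

In this sketch the segment shares long byte runs only with the corpus in its own coding, so concatenating the two compresses better and the distance drops. A realistic setup would need much larger corpora and test segments that do not literally appear in the training text.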