Matches in SemOpenAlex for { <https://semopenalex.org/work/W2107078199> ?p ?o ?g. }
- W2107078199 endingPage "57" @default.
- W2107078199 startingPage "51" @default.
- W2107078199 abstract "In this paper, we proposed a Chinese word segmentation model for micro-blog text. Although Conditional Random Fields (CRFs) models have been presented to deal with word segmentation, this is still the first time to apply it for the segmentation in the domain of Chinese micro-blog. Different from the genres of common articles, micro-blog has gradually become a new literary with the development of Internet. However, the unavailable of microblog training data has been the obstacle to develop a good segmenter based on trainable models. Considering the linguistic characteristics of the text, we proposed some methods to make the CRFs models suitable for segmentation in the domain of micro-blog. Several experiments have been conducted with different settings and then an optimal tagging method and feature templates have been designed. The proposed model has been implemented for the Second CIPS-SIGHAN Joint Conference on Chinese Language Processing Bakeoff (Bakeoff-2012) and achieves a very high Fmeasure of 93.38% within the test set of 5,000 micro-blog sentences. One of our main contri" @default.
- W2107078199 created "2016-06-24" @default.
- W2107078199 creator A5010462322 @default.
- W2107078199 creator A5025832925 @default.
- W2107078199 creator A5067109077 @default.
- W2107078199 creator A5088191810 @default.
- W2107078199 date "2012-12-01" @default.
- W2107078199 modified "2023-10-01" @default.
- W2107078199 title "CRFs-Based Chinese Word Segmentation for Micro-Blog with Small-Scale Data" @default.
- W2107078199 cites W1773803948 @default.
- W2107078199 cites W2113116527 @default.
- W2107078199 cites W2147880316 @default.
- W2107078199 cites W2163377725 @default.
- W2107078199 cites W2252264945 @default.
- W2107078199 cites W2358307482 @default.
- W2107078199 cites W2402385743 @default.
- W2107078199 cites W2467575451 @default.
- W2107078199 cites W2785522575 @default.
- W2107078199 cites W2795155754 @default.
- W2107078199 cites W287031571 @default.
- W2107078199 cites W3140899690 @default.
- W2107078199 hasPublicationYear "2012" @default.
- W2107078199 type Work @default.
- W2107078199 sameAs 2107078199 @default.
- W2107078199 citedByCount "5" @default.
- W2107078199 countsByYear W21070781992013 @default.
- W2107078199 countsByYear W21070781992015 @default.
- W2107078199 countsByYear W21070781992016 @default.
- W2107078199 countsByYear W21070781992018 @default.
- W2107078199 crossrefType "journal-article" @default.
- W2107078199 hasAuthorship W2107078199A5010462322 @default.
- W2107078199 hasAuthorship W2107078199A5025832925 @default.
- W2107078199 hasAuthorship W2107078199A5067109077 @default.
- W2107078199 hasAuthorship W2107078199A5088191810 @default.
- W2107078199 hasConcept C110875604 @default.
- W2107078199 hasConcept C134306372 @default.
- W2107078199 hasConcept C136764020 @default.
- W2107078199 hasConcept C137293760 @default.
- W2107078199 hasConcept C138885662 @default.
- W2107078199 hasConcept C143275388 @default.
- W2107078199 hasConcept C152565575 @default.
- W2107078199 hasConcept C154945302 @default.
- W2107078199 hasConcept C162324750 @default.
- W2107078199 hasConcept C16910744 @default.
- W2107078199 hasConcept C169903167 @default.
- W2107078199 hasConcept C17744445 @default.
- W2107078199 hasConcept C187736073 @default.
- W2107078199 hasConcept C199360897 @default.
- W2107078199 hasConcept C199539241 @default.
- W2107078199 hasConcept C204321447 @default.
- W2107078199 hasConcept C2775953691 @default.
- W2107078199 hasConcept C2776401178 @default.
- W2107078199 hasConcept C2776650193 @default.
- W2107078199 hasConcept C2779135771 @default.
- W2107078199 hasConcept C2780451532 @default.
- W2107078199 hasConcept C33923547 @default.
- W2107078199 hasConcept C36503486 @default.
- W2107078199 hasConcept C41008148 @default.
- W2107078199 hasConcept C41895202 @default.
- W2107078199 hasConcept C518677369 @default.
- W2107078199 hasConcept C89600930 @default.
- W2107078199 hasConcept C90805587 @default.
- W2107078199 hasConcept C98501671 @default.
- W2107078199 hasConceptScore W2107078199C110875604 @default.
- W2107078199 hasConceptScore W2107078199C134306372 @default.
- W2107078199 hasConceptScore W2107078199C136764020 @default.
- W2107078199 hasConceptScore W2107078199C137293760 @default.
- W2107078199 hasConceptScore W2107078199C138885662 @default.
- W2107078199 hasConceptScore W2107078199C143275388 @default.
- W2107078199 hasConceptScore W2107078199C152565575 @default.
- W2107078199 hasConceptScore W2107078199C154945302 @default.
- W2107078199 hasConceptScore W2107078199C162324750 @default.
- W2107078199 hasConceptScore W2107078199C16910744 @default.
- W2107078199 hasConceptScore W2107078199C169903167 @default.
- W2107078199 hasConceptScore W2107078199C17744445 @default.
- W2107078199 hasConceptScore W2107078199C187736073 @default.
- W2107078199 hasConceptScore W2107078199C199360897 @default.
- W2107078199 hasConceptScore W2107078199C199539241 @default.
- W2107078199 hasConceptScore W2107078199C204321447 @default.
- W2107078199 hasConceptScore W2107078199C2775953691 @default.
- W2107078199 hasConceptScore W2107078199C2776401178 @default.
- W2107078199 hasConceptScore W2107078199C2776650193 @default.
- W2107078199 hasConceptScore W2107078199C2779135771 @default.
- W2107078199 hasConceptScore W2107078199C2780451532 @default.
- W2107078199 hasConceptScore W2107078199C33923547 @default.
- W2107078199 hasConceptScore W2107078199C36503486 @default.
- W2107078199 hasConceptScore W2107078199C41008148 @default.
- W2107078199 hasConceptScore W2107078199C41895202 @default.
- W2107078199 hasConceptScore W2107078199C518677369 @default.
- W2107078199 hasConceptScore W2107078199C89600930 @default.
- W2107078199 hasConceptScore W2107078199C90805587 @default.
- W2107078199 hasConceptScore W2107078199C98501671 @default.
- W2107078199 hasLocation W21070781991 @default.
- W2107078199 hasOpenAccess W2107078199 @default.
- W2107078199 hasPrimaryLocation W21070781991 @default.
- W2107078199 hasRelatedWork W1516793345 @default.
- W2107078199 hasRelatedWork W1631260214 @default.
- W2107078199 hasRelatedWork W1971678616 @default.