Matches in SemOpenAlex for { <https://semopenalex.org/work/W2756894272> ?p ?o ?g. }
- W2756894272 endingPage "212" @default.
- W2756894272 startingPage "203" @default.
- W2756894272 abstract "Many of the existing Named Entity Recognition (NER) solutions are built based on news corpus data with proper syntax. These solutions might not lead to highly accurate results when being applied to noisy, user generated data, e.g., tweets, which can feature sloppy spelling, concept drift, and limited contextualization of terms and concepts due to length constraints. The models described in this paper are based on linear chain conditional random fields (CRFs), use the BIEOU encoding scheme, and leverage random feature dropout for up-sampling the training data. The considered features include word clusters and pre-trained distributed word representations, updated gazetteer features, and global context predictions. The latter feature allows for ingesting the meaning of new or rare tokens into the system via unsupervised learning and for alleviating the need to learn lexicon based features, which usually tend to be high dimensional. In this paper, we report on the solution [ST] we submitted to the WNUT 2016 NER shared task. We also present an improvement over our original submission [SI], which we built by using semi-supervised learning on labelled training data and pre-trained resourced constructed from unlabelled tweet data. Our ST solution achieved an F1 score of 1.2% higher than the baseline (35.1% F1) for the task of extracting 10 entity types. The SI resulted in an increase of 8.2% in F1 score over the base-line (7.08% over ST). Finally, the SI model’s evaluation on the test data achieved a F1 score of 47.3% (~1.15% increase over the 2nd best submitted solution). Our experimental setup and results are available as a standalone twitter NER tool at https://github.com/napsternxg/TwitterNER." @default.
- W2756894272 created "2017-10-06" @default.
- W2756894272 creator A5021358528 @default.
- W2756894272 creator A5025085845 @default.
- W2756894272 date "2016-12-01" @default.
- W2756894272 modified "2023-09-24" @default.
- W2756894272 title "Semi-supervised Named Entity Recognition in noisy-text" @default.
- W2756894272 cites W130850236 @default.
- W2756894272 cites W1485997076 @default.
- W2756894272 cites W1990334093 @default.
- W2756894272 cites W2004763266 @default.
- W2756894272 cites W2010657328 @default.
- W2756894272 cites W2018789714 @default.
- W2756894272 cites W2048679005 @default.
- W2756894272 cites W2095705004 @default.
- W2756894272 cites W2111557120 @default.
- W2756894272 cites W2121227244 @default.
- W2756894272 cites W2132655161 @default.
- W2756894272 cites W2141099517 @default.
- W2756894272 cites W2143933463 @default.
- W2756894272 cites W2147880316 @default.
- W2756894272 cites W2153579005 @default.
- W2756894272 cites W2158049734 @default.
- W2756894272 cites W2158139315 @default.
- W2756894272 cites W2158899491 @default.
- W2756894272 cites W2168386304 @default.
- W2756894272 cites W2168596788 @default.
- W2756894272 cites W2250539671 @default.
- W2756894272 cites W2250729567 @default.
- W2756894272 cites W2251567709 @default.
- W2756894272 cites W2913389685 @default.
- W2756894272 cites W2963489789 @default.
- W2756894272 hasPublicationYear "2016" @default.
- W2756894272 type Work @default.
- W2756894272 sameAs 2756894272 @default.
- W2756894272 citedByCount "4" @default.
- W2756894272 countsByYear W27568942722019 @default.
- W2756894272 countsByYear W27568942722020 @default.
- W2756894272 crossrefType "proceedings-article" @default.
- W2756894272 hasAuthorship W2756894272A5021358528 @default.
- W2756894272 hasAuthorship W2756894272A5025085845 @default.
- W2756894272 hasConcept C119857082 @default.
- W2756894272 hasConcept C138885662 @default.
- W2756894272 hasConcept C148524875 @default.
- W2756894272 hasConcept C152565575 @default.
- W2756894272 hasConcept C153083717 @default.
- W2756894272 hasConcept C153180895 @default.
- W2756894272 hasConcept C154945302 @default.
- W2756894272 hasConcept C162324750 @default.
- W2756894272 hasConcept C16910744 @default.
- W2756894272 hasConcept C187736073 @default.
- W2756894272 hasConcept C199360897 @default.
- W2756894272 hasConcept C204321447 @default.
- W2756894272 hasConcept C2524010 @default.
- W2756894272 hasConcept C2775953691 @default.
- W2756894272 hasConcept C2776145597 @default.
- W2756894272 hasConcept C2776401178 @default.
- W2756894272 hasConcept C2778121359 @default.
- W2756894272 hasConcept C2779135771 @default.
- W2756894272 hasConcept C2780451532 @default.
- W2756894272 hasConcept C33923547 @default.
- W2756894272 hasConcept C41008148 @default.
- W2756894272 hasConcept C41895202 @default.
- W2756894272 hasConcept C90805587 @default.
- W2756894272 hasConceptScore W2756894272C119857082 @default.
- W2756894272 hasConceptScore W2756894272C138885662 @default.
- W2756894272 hasConceptScore W2756894272C148524875 @default.
- W2756894272 hasConceptScore W2756894272C152565575 @default.
- W2756894272 hasConceptScore W2756894272C153083717 @default.
- W2756894272 hasConceptScore W2756894272C153180895 @default.
- W2756894272 hasConceptScore W2756894272C154945302 @default.
- W2756894272 hasConceptScore W2756894272C162324750 @default.
- W2756894272 hasConceptScore W2756894272C16910744 @default.
- W2756894272 hasConceptScore W2756894272C187736073 @default.
- W2756894272 hasConceptScore W2756894272C199360897 @default.
- W2756894272 hasConceptScore W2756894272C204321447 @default.
- W2756894272 hasConceptScore W2756894272C2524010 @default.
- W2756894272 hasConceptScore W2756894272C2775953691 @default.
- W2756894272 hasConceptScore W2756894272C2776145597 @default.
- W2756894272 hasConceptScore W2756894272C2776401178 @default.
- W2756894272 hasConceptScore W2756894272C2778121359 @default.
- W2756894272 hasConceptScore W2756894272C2779135771 @default.
- W2756894272 hasConceptScore W2756894272C2780451532 @default.
- W2756894272 hasConceptScore W2756894272C33923547 @default.
- W2756894272 hasConceptScore W2756894272C41008148 @default.
- W2756894272 hasConceptScore W2756894272C41895202 @default.
- W2756894272 hasConceptScore W2756894272C90805587 @default.
- W2756894272 hasLocation W27568942721 @default.
- W2756894272 hasOpenAccess W2756894272 @default.
- W2756894272 hasPrimaryLocation W27568942721 @default.
- W2756894272 hasRelatedWork W11838848 @default.
- W2756894272 hasRelatedWork W141372029 @default.
- W2756894272 hasRelatedWork W2072150326 @default.
- W2756894272 hasRelatedWork W2087923637 @default.
- W2756894272 hasRelatedWork W2149710647 @default.
- W2756894272 hasRelatedWork W2405861896 @default.
- W2756894272 hasRelatedWork W2572043993 @default.
- W2756894272 hasRelatedWork W2610748790 @default.