Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385572914> ?p ?o ?g. }
Showing items 1 to 83 of
83
with 100 items per page.
- W4385572914 abstract "The availability of high quality training data is still a bottleneck for the practical utilization of information extraction models, despite the breakthroughs in zero and few-shot learning techniques. This is further exacerbated for industry applications, where new tasks, domains, and specific use cases keep arising, which makes it impractical to depend on manually annotated data. Therefore, weak and distant supervision emerged as popular approaches to bootstrap training, utilizing labeling functions to guide the annotation process. Weakly-supervised annotation of training data is fast and efficient, however, it results in many irrelevant and out-of-context matches. This is a challenging problem that can degrade the performance in downstream models, or require a manual data cleaning step that can incur significant overhead. In this paper we present a prototype-based filtering approach, that can be utilized to denoise weakly supervised training data. The system is very simple, unsupervised, scalable, and requires little manual intervention, yet results in significant precision gains. We apply the technique in the task of attribute value extraction in e-commerce websites, and achieve up to 9% gain in precision for the downstream models, with a minimal drop in recall." @default.
- W4385572914 created "2023-08-05" @default.
- W4385572914 creator A5066820530 @default.
- W4385572914 creator A5089887413 @default.
- W4385572914 date "2022-01-01" @default.
- W4385572914 modified "2023-09-24" @default.
- W4385572914 title "Prototype-Representations for Training Data Filtering in Weakly-Supervised Information Extraction" @default.
- W4385572914 doi "https://doi.org/10.18653/v1/2022.emnlp-industry.47" @default.
- W4385572914 hasPublicationYear "2022" @default.
- W4385572914 type Work @default.
- W4385572914 citedByCount "0" @default.
- W4385572914 crossrefType "proceedings-article" @default.
- W4385572914 hasAuthorship W4385572914A5066820530 @default.
- W4385572914 hasAuthorship W4385572914A5089887413 @default.
- W4385572914 hasBestOaLocation W43855729141 @default.
- W4385572914 hasConcept C111919701 @default.
- W4385572914 hasConcept C119857082 @default.
- W4385572914 hasConcept C124101348 @default.
- W4385572914 hasConcept C144024400 @default.
- W4385572914 hasConcept C149635348 @default.
- W4385572914 hasConcept C151730666 @default.
- W4385572914 hasConcept C154945302 @default.
- W4385572914 hasConcept C162324750 @default.
- W4385572914 hasConcept C187736073 @default.
- W4385572914 hasConcept C195807954 @default.
- W4385572914 hasConcept C21547014 @default.
- W4385572914 hasConcept C2776145971 @default.
- W4385572914 hasConcept C2776207758 @default.
- W4385572914 hasConcept C2776321320 @default.
- W4385572914 hasConcept C2779343474 @default.
- W4385572914 hasConcept C2779903281 @default.
- W4385572914 hasConcept C2779960059 @default.
- W4385572914 hasConcept C2780451532 @default.
- W4385572914 hasConcept C2780513914 @default.
- W4385572914 hasConcept C36289849 @default.
- W4385572914 hasConcept C41008148 @default.
- W4385572914 hasConcept C48044578 @default.
- W4385572914 hasConcept C77088390 @default.
- W4385572914 hasConcept C81669768 @default.
- W4385572914 hasConcept C86803240 @default.
- W4385572914 hasConcept C98045186 @default.
- W4385572914 hasConceptScore W4385572914C111919701 @default.
- W4385572914 hasConceptScore W4385572914C119857082 @default.
- W4385572914 hasConceptScore W4385572914C124101348 @default.
- W4385572914 hasConceptScore W4385572914C144024400 @default.
- W4385572914 hasConceptScore W4385572914C149635348 @default.
- W4385572914 hasConceptScore W4385572914C151730666 @default.
- W4385572914 hasConceptScore W4385572914C154945302 @default.
- W4385572914 hasConceptScore W4385572914C162324750 @default.
- W4385572914 hasConceptScore W4385572914C187736073 @default.
- W4385572914 hasConceptScore W4385572914C195807954 @default.
- W4385572914 hasConceptScore W4385572914C21547014 @default.
- W4385572914 hasConceptScore W4385572914C2776145971 @default.
- W4385572914 hasConceptScore W4385572914C2776207758 @default.
- W4385572914 hasConceptScore W4385572914C2776321320 @default.
- W4385572914 hasConceptScore W4385572914C2779343474 @default.
- W4385572914 hasConceptScore W4385572914C2779903281 @default.
- W4385572914 hasConceptScore W4385572914C2779960059 @default.
- W4385572914 hasConceptScore W4385572914C2780451532 @default.
- W4385572914 hasConceptScore W4385572914C2780513914 @default.
- W4385572914 hasConceptScore W4385572914C36289849 @default.
- W4385572914 hasConceptScore W4385572914C41008148 @default.
- W4385572914 hasConceptScore W4385572914C48044578 @default.
- W4385572914 hasConceptScore W4385572914C77088390 @default.
- W4385572914 hasConceptScore W4385572914C81669768 @default.
- W4385572914 hasConceptScore W4385572914C86803240 @default.
- W4385572914 hasConceptScore W4385572914C98045186 @default.
- W4385572914 hasLocation W43855729141 @default.
- W4385572914 hasOpenAccess W4385572914 @default.
- W4385572914 hasPrimaryLocation W43855729141 @default.
- W4385572914 hasRelatedWork W2087937280 @default.
- W4385572914 hasRelatedWork W2157013742 @default.
- W4385572914 hasRelatedWork W2170649215 @default.
- W4385572914 hasRelatedWork W2361361118 @default.
- W4385572914 hasRelatedWork W2384224704 @default.
- W4385572914 hasRelatedWork W2394022102 @default.
- W4385572914 hasRelatedWork W2560526572 @default.
- W4385572914 hasRelatedWork W3087165870 @default.
- W4385572914 hasRelatedWork W3120511008 @default.
- W4385572914 hasRelatedWork W4295301636 @default.
- W4385572914 isParatext "false" @default.
- W4385572914 isRetracted "false" @default.
- W4385572914 workType "article" @default.