Matches in SemOpenAlex for { <https://semopenalex.org/work/W2000577984> ?p ?o ?g. }
Showing items 1 to 80 of 80 with 100 items per page.
- W2000577984 endingPage "3" @default.
- W2000577984 startingPage "3" @default.
- W2000577984 abstract "Previous work demonstrated that Web counts can be used to approximate bigram counts, suggesting that Web-based frequencies should be useful for a wide variety of Natural Language Processing (NLP) tasks. However, only a limited number of tasks have so far been tested using Web-scale data sets. The present article overcomes this limitation by systematically investigating the performance of Web-based models for several NLP tasks, covering both syntax and semantics, both generation and analysis, and a wider range of n-grams and parts of speech than have been previously explored. For the majority of our tasks, we find that simple, unsupervised models perform better when n-gram counts are obtained from the Web rather than from a large corpus. In some cases, performance can be improved further by using backoff or interpolation techniques that combine Web counts and corpus counts. However, unsupervised Web-based models generally fail to outperform supervised state-of-the-art models trained on smaller corpora. We argue that Web-based models should therefore be used as a baseline for, rather than an alternative to, standard supervised models." @default.
- W2000577984 created "2016-06-24" @default.
- W2000577984 creator A5041024491 @default.
- W2000577984 creator A5054936589 @default.
- W2000577984 date "2005-02-01" @default.
- W2000577984 modified "2023-09-30" @default.
- W2000577984 title "Web-based models for natural language processing" @default.
- W2000577984 cites W2014516359 @default.
- W2000577984 cites W2047295649 @default.
- W2000577984 cites W2112642950 @default.
- W2000577984 cites W2115983295 @default.
- W2000577984 cites W2118996379 @default.
- W2000577984 cites W2121357917 @default.
- W2000577984 cites W2124416056 @default.
- W2000577984 doi "https://doi.org/10.1145/1075389.1075392" @default.
- W2000577984 hasPublicationYear "2005" @default.
- W2000577984 type Work @default.
- W2000577984 sameAs 2000577984 @default.
- W2000577984 citedByCount "152" @default.
- W2000577984 countsByYear W20005779842012 @default.
- W2000577984 countsByYear W20005779842013 @default.
- W2000577984 countsByYear W20005779842014 @default.
- W2000577984 countsByYear W20005779842015 @default.
- W2000577984 countsByYear W20005779842016 @default.
- W2000577984 countsByYear W20005779842017 @default.
- W2000577984 countsByYear W20005779842018 @default.
- W2000577984 countsByYear W20005779842019 @default.
- W2000577984 countsByYear W20005779842021 @default.
- W2000577984 countsByYear W20005779842022 @default.
- W2000577984 countsByYear W20005779842023 @default.
- W2000577984 crossrefType "journal-article" @default.
- W2000577984 hasAuthorship W2000577984A5041024491 @default.
- W2000577984 hasAuthorship W2000577984A5054936589 @default.
- W2000577984 hasConcept C108757681 @default.
- W2000577984 hasConcept C118643609 @default.
- W2000577984 hasConcept C119857082 @default.
- W2000577984 hasConcept C136197465 @default.
- W2000577984 hasConcept C136764020 @default.
- W2000577984 hasConcept C137293760 @default.
- W2000577984 hasConcept C137546455 @default.
- W2000577984 hasConcept C154945302 @default.
- W2000577984 hasConcept C184337299 @default.
- W2000577984 hasConcept C199360897 @default.
- W2000577984 hasConcept C204321447 @default.
- W2000577984 hasConcept C41008148 @default.
- W2000577984 hasConcept C60048249 @default.
- W2000577984 hasConceptScore W2000577984C108757681 @default.
- W2000577984 hasConceptScore W2000577984C118643609 @default.
- W2000577984 hasConceptScore W2000577984C119857082 @default.
- W2000577984 hasConceptScore W2000577984C136197465 @default.
- W2000577984 hasConceptScore W2000577984C136764020 @default.
- W2000577984 hasConceptScore W2000577984C137293760 @default.
- W2000577984 hasConceptScore W2000577984C137546455 @default.
- W2000577984 hasConceptScore W2000577984C154945302 @default.
- W2000577984 hasConceptScore W2000577984C184337299 @default.
- W2000577984 hasConceptScore W2000577984C199360897 @default.
- W2000577984 hasConceptScore W2000577984C204321447 @default.
- W2000577984 hasConceptScore W2000577984C41008148 @default.
- W2000577984 hasConceptScore W2000577984C60048249 @default.
- W2000577984 hasIssue "1" @default.
- W2000577984 hasLocation W20005779841 @default.
- W2000577984 hasOpenAccess W2000577984 @default.
- W2000577984 hasPrimaryLocation W20005779841 @default.
- W2000577984 hasRelatedWork W1530216171 @default.
- W2000577984 hasRelatedWork W1530957496 @default.
- W2000577984 hasRelatedWork W1700330385 @default.
- W2000577984 hasRelatedWork W2041167939 @default.
- W2000577984 hasRelatedWork W2105076537 @default.
- W2000577984 hasRelatedWork W2140830074 @default.
- W2000577984 hasRelatedWork W2359001871 @default.
- W2000577984 hasRelatedWork W4380301020 @default.
- W2000577984 hasRelatedWork W4385571594 @default.
- W2000577984 hasRelatedWork W54512559 @default.
- W2000577984 hasVolume "2" @default.
- W2000577984 isParatext "false" @default.
- W2000577984 isRetracted "false" @default.
- W2000577984 magId "2000577984" @default.
- W2000577984 workType "article" @default.