Matches in SemOpenAlex for { <https://semopenalex.org/work/W2071117649> ?p ?o ?g. }
Showing items 1 to 80 of
80
with 100 items per page.
- W2071117649 abstract "This paper describes an implementation to compute positional ngram statistics (i.e. Frequency and Mutual Expectation) based on masks, suffix array-based data structures and multidimensional arrays. Positional ngrams are ordered sequences of words that represent continuous or discontinuous substrings of a corpus. In particular, the positional ngram model has shown successful results for the extraction of discontinuous collocations from large corpora. However, its computation is heavy. For instance, 4.299.742 positional ngrams (n=1..7) can be generated from a 100.000-word size corpus in a seven-word size window context. In comparison, only 700.000 ngrams would be computed for the classical ngram model. It is clear that huge efforts need to be made to process positional ngram statistics in reasonable time and space. Our solution shows O(h(F) N log N) time complexity where N is the corpus size and h(F) a function of the window context." @default.
- W2071117649 created "2016-06-24" @default.
- W2071117649 creator A5038268027 @default.
- W2071117649 creator A5043440123 @default.
- W2071117649 date "2003-01-01" @default.
- W2071117649 modified "2023-10-14" @default.
- W2071117649 title "Using masks, suffix array-based data structures and multidimensional arrays to compute positional ngram statistics from corpora" @default.
- W2071117649 cites W1607753411 @default.
- W2071117649 cites W1969173824 @default.
- W2071117649 cites W2000484009 @default.
- W2071117649 cites W2008434289 @default.
- W2071117649 cites W2149468555 @default.
- W2071117649 cites W2245946652 @default.
- W2071117649 cites W2293095839 @default.
- W2071117649 doi "https://doi.org/10.3115/1119282.1119286" @default.
- W2071117649 hasPublicationYear "2003" @default.
- W2071117649 type Work @default.
- W2071117649 sameAs 2071117649 @default.
- W2071117649 citedByCount "13" @default.
- W2071117649 countsByYear W20711176492012 @default.
- W2071117649 countsByYear W20711176492014 @default.
- W2071117649 crossrefType "proceedings-article" @default.
- W2071117649 hasAuthorship W2071117649A5038268027 @default.
- W2071117649 hasAuthorship W2071117649A5043440123 @default.
- W2071117649 hasBestOaLocation W20711176491 @default.
- W2071117649 hasConcept C111919701 @default.
- W2071117649 hasConcept C11413529 @default.
- W2071117649 hasConcept C138885662 @default.
- W2071117649 hasConcept C151730666 @default.
- W2071117649 hasConcept C154945302 @default.
- W2071117649 hasConcept C162319229 @default.
- W2071117649 hasConcept C182407805 @default.
- W2071117649 hasConcept C199360897 @default.
- W2071117649 hasConcept C2524010 @default.
- W2071117649 hasConcept C2778751112 @default.
- W2071117649 hasConcept C2779259728 @default.
- W2071117649 hasConcept C2779343474 @default.
- W2071117649 hasConcept C2779804580 @default.
- W2071117649 hasConcept C33923547 @default.
- W2071117649 hasConcept C41008148 @default.
- W2071117649 hasConcept C41895202 @default.
- W2071117649 hasConcept C45374587 @default.
- W2071117649 hasConcept C86803240 @default.
- W2071117649 hasConcept C90805587 @default.
- W2071117649 hasConceptScore W2071117649C111919701 @default.
- W2071117649 hasConceptScore W2071117649C11413529 @default.
- W2071117649 hasConceptScore W2071117649C138885662 @default.
- W2071117649 hasConceptScore W2071117649C151730666 @default.
- W2071117649 hasConceptScore W2071117649C154945302 @default.
- W2071117649 hasConceptScore W2071117649C162319229 @default.
- W2071117649 hasConceptScore W2071117649C182407805 @default.
- W2071117649 hasConceptScore W2071117649C199360897 @default.
- W2071117649 hasConceptScore W2071117649C2524010 @default.
- W2071117649 hasConceptScore W2071117649C2778751112 @default.
- W2071117649 hasConceptScore W2071117649C2779259728 @default.
- W2071117649 hasConceptScore W2071117649C2779343474 @default.
- W2071117649 hasConceptScore W2071117649C2779804580 @default.
- W2071117649 hasConceptScore W2071117649C33923547 @default.
- W2071117649 hasConceptScore W2071117649C41008148 @default.
- W2071117649 hasConceptScore W2071117649C41895202 @default.
- W2071117649 hasConceptScore W2071117649C45374587 @default.
- W2071117649 hasConceptScore W2071117649C86803240 @default.
- W2071117649 hasConceptScore W2071117649C90805587 @default.
- W2071117649 hasLocation W20711176491 @default.
- W2071117649 hasOpenAccess W2071117649 @default.
- W2071117649 hasPrimaryLocation W20711176491 @default.
- W2071117649 hasRelatedWork W153119118 @default.
- W2071117649 hasRelatedWork W1568850903 @default.
- W2071117649 hasRelatedWork W1593730394 @default.
- W2071117649 hasRelatedWork W2022280812 @default.
- W2071117649 hasRelatedWork W2109062349 @default.
- W2071117649 hasRelatedWork W2176972898 @default.
- W2071117649 hasRelatedWork W2186217540 @default.
- W2071117649 hasRelatedWork W2374526264 @default.
- W2071117649 hasRelatedWork W2736693933 @default.
- W2071117649 hasRelatedWork W2996215016 @default.
- W2071117649 isParatext "false" @default.
- W2071117649 isRetracted "false" @default.
- W2071117649 magId "2071117649" @default.
- W2071117649 workType "article" @default.