Matches in SemOpenAlex for { <https://semopenalex.org/work/W2173604056> ?p ?o ?g. }
Showing items 1 to 78 of 78 with 100 items per page.
- W2173604056 endingPage "147" @default.
- W2173604056 startingPage "135" @default.
- W2173604056 abstract "There is a growing trend in linguistics to use large corpora as a tool in the study of language. Through the investigation of the different contexts a word occurs in, it is possible to gain insight in the meanings associated with the word. Concordances are commonly used as a tool in lexicography, but while the study of concordances is fruitful it is also tedious, so statistical methods are gaining grounds in corpus linguistics. Several statistical measures have been introduced to measure the strength in association between two words, e.g. t-score (Barnbrook 1996:97-98), mutual information, MI (Charniak 1993; McEnery & Wilson 1996; Oakes 1998) and Berry-Rogghe’s z-score (1973). Those measures are designed to measure the strength of association between words occurring at a close distance from each other, i.e. immediately next to each other or within a fixed window span. Research that uses the sentence as a linguistic unit of study has also been presented. For example, antonymous concepts have been shown to co-occur in the same sentence more often than chance predicts by Justeson & Katz 1991, 1992 and Fellbaum 1995. A problem using the sentence as unit of study is that the lengths of the sentences vary from sentence to sentence. This has an impact on the statistical calculation – it is more likely to find two given words in a long sentence than in a short one. The probability of finding two given words co-occurring in the same sentence is thus affected. We introduce an exact expression for the calculation of the expected number of sentential co-occurrences. The p-value is calculated assuming that the number of random co-occurrences follows a Poisson distribution. A formal proof justifying this approximation is provided in the appendix. Apart from the statistical methods that account for the variation in sentence length, a case study is presented as an application of the statistical method. 
The study replicates Justeson and Katz’s 1991 study that shows that English antonyms co-occur sententially more frequently than chance predicts. The results of our study show that the variation in sentence length causes the chance for co-occurrence of two given words to increase. However, the main finding of Justeson & Katz is reinforced: antonyms co-occur significantly more often in the same sentence than expected by chance." @default.
- W2173604056 created "2016-06-24" @default.
- W2173604056 creator A5045448454 @default.
- W2173604056 creator A5052050793 @default.
- W2173604056 date "2001-01-01" @default.
- W2173604056 modified "2023-09-24" @default.
- W2173604056 title "Statistics for sentential co-occurrence" @default.
- W2173604056 cites W1501095784 @default.
- W2173604056 cites W1547269118 @default.
- W2173604056 cites W1994851566 @default.
- W2173604056 cites W2007780422 @default.
- W2173604056 cites W2012301138 @default.
- W2173604056 cites W2021034890 @default.
- W2173604056 cites W2052505931 @default.
- W2173604056 cites W2079730890 @default.
- W2173604056 cites W2091384152 @default.
- W2173604056 cites W2100322854 @default.
- W2173604056 cites W2798655805 @default.
- W2173604056 cites W565739154 @default.
- W2173604056 hasPublicationYear "2001" @default.
- W2173604056 type Work @default.
- W2173604056 sameAs 2173604056 @default.
- W2173604056 citedByCount "9" @default.
- W2173604056 countsByYear W21736040562012 @default.
- W2173604056 countsByYear W21736040562015 @default.
- W2173604056 crossrefType "journal-article" @default.
- W2173604056 hasAuthorship W2173604056A5045448454 @default.
- W2173604056 hasAuthorship W2173604056A5052050793 @default.
- W2173604056 hasConcept C138885662 @default.
- W2173604056 hasConcept C154945302 @default.
- W2173604056 hasConcept C204321447 @default.
- W2173604056 hasConcept C2777530160 @default.
- W2173604056 hasConcept C2780009758 @default.
- W2173604056 hasConcept C33923547 @default.
- W2173604056 hasConcept C41008148 @default.
- W2173604056 hasConcept C41895202 @default.
- W2173604056 hasConcept C77088390 @default.
- W2173604056 hasConcept C90805587 @default.
- W2173604056 hasConceptScore W2173604056C138885662 @default.
- W2173604056 hasConceptScore W2173604056C154945302 @default.
- W2173604056 hasConceptScore W2173604056C204321447 @default.
- W2173604056 hasConceptScore W2173604056C2777530160 @default.
- W2173604056 hasConceptScore W2173604056C2780009758 @default.
- W2173604056 hasConceptScore W2173604056C33923547 @default.
- W2173604056 hasConceptScore W2173604056C41008148 @default.
- W2173604056 hasConceptScore W2173604056C41895202 @default.
- W2173604056 hasConceptScore W2173604056C77088390 @default.
- W2173604056 hasConceptScore W2173604056C90805587 @default.
- W2173604056 hasLocation W21736040561 @default.
- W2173604056 hasOpenAccess W2173604056 @default.
- W2173604056 hasPrimaryLocation W21736040561 @default.
- W2173604056 hasRelatedWork W1547269118 @default.
- W2173604056 hasRelatedWork W1633743783 @default.
- W2173604056 hasRelatedWork W1894217104 @default.
- W2173604056 hasRelatedWork W1954301502 @default.
- W2173604056 hasRelatedWork W2022223514 @default.
- W2173604056 hasRelatedWork W2038721957 @default.
- W2173604056 hasRelatedWork W2044340178 @default.
- W2173604056 hasRelatedWork W2046567002 @default.
- W2173604056 hasRelatedWork W2066683727 @default.
- W2173604056 hasRelatedWork W2086481551 @default.
- W2173604056 hasRelatedWork W2091384152 @default.
- W2173604056 hasRelatedWork W2100322854 @default.
- W2173604056 hasRelatedWork W2124814209 @default.
- W2173604056 hasRelatedWork W2129056111 @default.
- W2173604056 hasRelatedWork W2131267020 @default.
- W2173604056 hasRelatedWork W2131518270 @default.
- W2173604056 hasRelatedWork W2137973529 @default.
- W2173604056 hasRelatedWork W3089644846 @default.
- W2173604056 hasRelatedWork W388854203 @default.
- W2173604056 hasRelatedWork W1539724416 @default.
- W2173604056 hasVolume "48" @default.
- W2173604056 isParatext "false" @default.
- W2173604056 isRetracted "false" @default.
- W2173604056 magId "2173604056" @default.
- W2173604056 workType "article" @default.