Matches in SemOpenAlex for { <https://semopenalex.org/work/W343877837> ?p ?o ?g. }
- W343877837 endingPage "240" @default.
- W343877837 startingPage "235" @default.
- W343877837 abstract "In this paper, we investigate the impact of several local and global weighting schemes on Latent Semantic Analysis' (LSA) ability to capture semantic similarity between two texts. We worked with texts varying in size from sentences to paragraphs. We present a comparison of 3 local and 3 global weighting schemes across 3 different standardized data sets related to semantic similarity tasks. For local weighting, we used binary weighting, term-frequency, and log-type. For global weighting, we relied on binary, inverted document frequencies (IDF) collected from the English Wikipedia, and entropy, which is the standard weighting scheme used by most LSA-based applications. We studied all possible combinations of these weighting schemes on the following three tasks and corresponding data sets: paraphrase identification at sentence level using the Microsoft Research Paraphrase Corpus, paraphrase identification at sentence level using data from the intelligent tutoring system iSTART, and mental model detection based on student-articulated paragraphs in MetaTutor, another intelligent tutoring system. Our experiments revealed that for sentence-level texts, type frequency local weighting in combination with either IDF or binary global weighting works best. For paragraph-level texts, a log-type local weighting in combination with binary global weighting works best. We also found that global weights have a greater impact for sentence-level similarity as the local weight is undermined by the small size of such texts." @default.
- W343877837 created "2016-06-24" @default.
- W343877837 creator A5026888231 @default.
- W343877837 creator A5038432216 @default.
- W343877837 creator A5045839835 @default.
- W343877837 creator A5081661094 @default.
- W343877837 date "2010-05-06" @default.
- W343877837 modified "2023-09-23" @default.
- W343877837 title "The Role of Local and Global Weighting in Assessing the Semantic Similarity of Texts Using Latent Semantic Analysis" @default.
- W343877837 cites W138447450 @default.
- W343877837 cites W1570448133 @default.
- W343877837 cites W1732828232 @default.
- W343877837 cites W1980776243 @default.
- W343877837 cites W1987054081 @default.
- W343877837 cites W1990524510 @default.
- W343877837 cites W2003677545 @default.
- W343877837 cites W2058616517 @default.
- W343877837 cites W2072773380 @default.
- W343877837 cites W2250503148 @default.
- W343877837 cites W28532327 @default.
- W343877837 cites W96573554 @default.
- W343877837 hasPublicationYear "2010" @default.
- W343877837 type Work @default.
- W343877837 sameAs 343877837 @default.
- W343877837 citedByCount "18" @default.
- W343877837 countsByYear W3438778372012 @default.
- W343877837 countsByYear W3438778372013 @default.
- W343877837 countsByYear W3438778372014 @default.
- W343877837 countsByYear W3438778372015 @default.
- W343877837 countsByYear W3438778372017 @default.
- W343877837 countsByYear W3438778372018 @default.
- W343877837 countsByYear W3438778372019 @default.
- W343877837 countsByYear W3438778372021 @default.
- W343877837 crossrefType "proceedings-article" @default.
- W343877837 hasAuthorship W343877837A5026888231 @default.
- W343877837 hasAuthorship W343877837A5038432216 @default.
- W343877837 hasAuthorship W343877837A5045839835 @default.
- W343877837 hasAuthorship W343877837A5081661094 @default.
- W343877837 hasConcept C103278499 @default.
- W343877837 hasConcept C115961682 @default.
- W343877837 hasConcept C126838900 @default.
- W343877837 hasConcept C130318100 @default.
- W343877837 hasConcept C136764020 @default.
- W343877837 hasConcept C154945302 @default.
- W343877837 hasConcept C170133592 @default.
- W343877837 hasConcept C183115368 @default.
- W343877837 hasConcept C204321447 @default.
- W343877837 hasConcept C2777206241 @default.
- W343877837 hasConcept C2777530160 @default.
- W343877837 hasConcept C2780922921 @default.
- W343877837 hasConcept C33923547 @default.
- W343877837 hasConcept C41008148 @default.
- W343877837 hasConcept C48372109 @default.
- W343877837 hasConcept C71924100 @default.
- W343877837 hasConcept C94375191 @default.
- W343877837 hasConceptScore W343877837C103278499 @default.
- W343877837 hasConceptScore W343877837C115961682 @default.
- W343877837 hasConceptScore W343877837C126838900 @default.
- W343877837 hasConceptScore W343877837C130318100 @default.
- W343877837 hasConceptScore W343877837C136764020 @default.
- W343877837 hasConceptScore W343877837C154945302 @default.
- W343877837 hasConceptScore W343877837C170133592 @default.
- W343877837 hasConceptScore W343877837C183115368 @default.
- W343877837 hasConceptScore W343877837C204321447 @default.
- W343877837 hasConceptScore W343877837C2777206241 @default.
- W343877837 hasConceptScore W343877837C2777530160 @default.
- W343877837 hasConceptScore W343877837C2780922921 @default.
- W343877837 hasConceptScore W343877837C33923547 @default.
- W343877837 hasConceptScore W343877837C41008148 @default.
- W343877837 hasConceptScore W343877837C48372109 @default.
- W343877837 hasConceptScore W343877837C71924100 @default.
- W343877837 hasConceptScore W343877837C94375191 @default.
- W343877837 hasLocation W3438778371 @default.
- W343877837 hasOpenAccess W343877837 @default.
- W343877837 hasPrimaryLocation W3438778371 @default.
- W343877837 hasRelatedWork W107526599 @default.
- W343877837 hasRelatedWork W13343750 @default.
- W343877837 hasRelatedWork W1561908597 @default.
- W343877837 hasRelatedWork W1566018662 @default.
- W343877837 hasRelatedWork W1647729745 @default.
- W343877837 hasRelatedWork W1732828232 @default.
- W343877837 hasRelatedWork W1741956239 @default.
- W343877837 hasRelatedWork W187228978 @default.
- W343877837 hasRelatedWork W1877481539 @default.
- W343877837 hasRelatedWork W1880262756 @default.
- W343877837 hasRelatedWork W1980776243 @default.
- W343877837 hasRelatedWork W1983578042 @default.
- W343877837 hasRelatedWork W1990524510 @default.
- W343877837 hasRelatedWork W2121184547 @default.
- W343877837 hasRelatedWork W2131273899 @default.
- W343877837 hasRelatedWork W2136930489 @default.
- W343877837 hasRelatedWork W2147152072 @default.
- W343877837 hasRelatedWork W2158997610 @default.
- W343877837 hasRelatedWork W2613678836 @default.
- W343877837 hasRelatedWork W343518776 @default.
- W343877837 isParatext "false" @default.
- W343877837 isRetracted "false" @default.
- W343877837 magId "343877837" @default.
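
The abstract above names three local weighting schemes (binary, term frequency, log) and three global ones (binary, IDF, entropy). The following is a minimal sketch of how such weights are commonly computed, not the authors' implementation. The function names `local_weight` and `global_weights` are hypothetical, the log and entropy formulas are the usual textbook forms rather than ones confirmed by this record, and IDF is estimated here from the toy corpus itself, whereas the paper collected it from the English Wikipedia. The LSA step (SVD of the weighted matrix) is left out.

```python
import math
from collections import Counter


def local_weight(tf, scheme):
    """Weight of a term inside one text, given its raw count tf."""
    if scheme == "binary":
        return 1.0 if tf > 0 else 0.0
    if scheme == "tf":
        return float(tf)
    if scheme == "log":
        # Assumed form: log(1 + tf), the common log local weight.
        return math.log(1 + tf)
    raise ValueError(f"unknown local scheme: {scheme}")


def global_weights(docs, scheme):
    """Weight of each term across a corpus of tokenized documents."""
    n = len(docs)
    counts = [Counter(doc) for doc in docs]
    vocab = set().union(*docs)
    weights = {}
    for term in vocab:
        if scheme == "binary":
            weights[term] = 1.0
        elif scheme == "idf":
            # Document frequency: number of documents containing the term.
            df = sum(1 for c in counts if term in c)
            weights[term] = math.log(n / df)
        elif scheme == "entropy":
            # Assumed form: 1 + sum_j(p_j * log p_j) / log n,
            # where p_j is the share of the term's occurrences in document j.
            gf = sum(c[term] for c in counts)
            ent = 0.0
            for c in counts:
                if c[term]:
                    p = c[term] / gf
                    ent += p * math.log(p)
            weights[term] = 1.0 + ent / math.log(n) if n > 1 else 1.0
        else:
            raise ValueError(f"unknown global scheme: {scheme}")
    return weights


def weighted_vector(doc, gweights, local_scheme):
    """Combine local and global weights into a sparse term vector."""
    tf = Counter(doc)
    return {t: local_weight(c, local_scheme) * gweights.get(t, 0.0)
            for t, c in tf.items()}
```

A weighted vector for one text is then the product of its local weights and the corpus-level global weights, for example `weighted_vector(["cat", "sat"], global_weights(corpus, "idf"), "tf")` for a tokenized `corpus` of your own.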