Matches in SemOpenAlex for { <https://semopenalex.org/work/W1549638892> ?p ?o ?g. }
Showing items 1 to 84 of 84 with 100 items per page.
- W1549638892 endingPage "332" @default.
- W1549638892 startingPage "323" @default.
- W1549638892 abstract "Focused (thematic) crawling is a relatively new, promising approach to improving the recall of expert search on the Web. It involves the automatic classification of visited documents into a user- or community-specific topic hierarchy (ontology). The quality of training data for the classifier is the most critical issue and a potential bottleneck for the effectivity and scale of a focused crawler. This paper presents the BINGO! approach to focused crawling that aims to overcome the limitations of initial training data. To this end, BINGO! identifies, among the crawled and positively classified documents of a topic, characteristic archetypes and uses them for periodically re-training the classifier; this way the crawler is dynamically adapted based on the most significant documents seen so far. Two kinds of archetypes are considered: good authorities as determined by employing Kleinberg's (1999) link analysis algorithm, and documents that have been automatically classified with high confidence using a linear SVM classifier. Our approach is fully implemented in the BINGO! system, and our experiments indicate that the dynamic enhancement of training data based on archetypes extends the knowledge base of the classifier by a substantial margin without loss of classification accuracy." @default.
- W1549638892 created "2016-06-24" @default.
- W1549638892 creator A5053520164 @default.
- W1549638892 creator A5060837952 @default.
- W1549638892 creator A5088135366 @default.
- W1549638892 creator A5089788861 @default.
- W1549638892 date "2002-01-01" @default.
- W1549638892 modified "2023-09-23" @default.
- W1549638892 title "BINGO!: Bookmark-Induced Gathering of Information." @default.
- W1549638892 hasPublicationYear "2002" @default.
- W1549638892 type Work @default.
- W1549638892 sameAs 1549638892 @default.
- W1549638892 citedByCount "4" @default.
- W1549638892 countsByYear W15496388922012 @default.
- W1549638892 crossrefType "journal-article" @default.
- W1549638892 hasAuthorship W1549638892A5053520164 @default.
- W1549638892 hasAuthorship W1549638892A5060837952 @default.
- W1549638892 hasAuthorship W1549638892A5088135366 @default.
- W1549638892 hasAuthorship W1549638892A5089788861 @default.
- W1549638892 hasConcept C100368936 @default.
- W1549638892 hasConcept C105702510 @default.
- W1549638892 hasConcept C110875604 @default.
- W1549638892 hasConcept C11392498 @default.
- W1549638892 hasConcept C119857082 @default.
- W1549638892 hasConcept C12267149 @default.
- W1549638892 hasConcept C124101348 @default.
- W1549638892 hasConcept C136764020 @default.
- W1549638892 hasConcept C13743948 @default.
- W1549638892 hasConcept C149635348 @default.
- W1549638892 hasConcept C154945302 @default.
- W1549638892 hasConcept C173576120 @default.
- W1549638892 hasConcept C23123220 @default.
- W1549638892 hasConcept C2780513914 @default.
- W1549638892 hasConcept C41008148 @default.
- W1549638892 hasConcept C71924100 @default.
- W1549638892 hasConcept C73340581 @default.
- W1549638892 hasConcept C81669768 @default.
- W1549638892 hasConcept C95623464 @default.
- W1549638892 hasConceptScore W1549638892C100368936 @default.
- W1549638892 hasConceptScore W1549638892C105702510 @default.
- W1549638892 hasConceptScore W1549638892C110875604 @default.
- W1549638892 hasConceptScore W1549638892C11392498 @default.
- W1549638892 hasConceptScore W1549638892C119857082 @default.
- W1549638892 hasConceptScore W1549638892C12267149 @default.
- W1549638892 hasConceptScore W1549638892C124101348 @default.
- W1549638892 hasConceptScore W1549638892C136764020 @default.
- W1549638892 hasConceptScore W1549638892C13743948 @default.
- W1549638892 hasConceptScore W1549638892C149635348 @default.
- W1549638892 hasConceptScore W1549638892C154945302 @default.
- W1549638892 hasConceptScore W1549638892C173576120 @default.
- W1549638892 hasConceptScore W1549638892C23123220 @default.
- W1549638892 hasConceptScore W1549638892C2780513914 @default.
- W1549638892 hasConceptScore W1549638892C41008148 @default.
- W1549638892 hasConceptScore W1549638892C71924100 @default.
- W1549638892 hasConceptScore W1549638892C73340581 @default.
- W1549638892 hasConceptScore W1549638892C81669768 @default.
- W1549638892 hasConceptScore W1549638892C95623464 @default.
- W1549638892 hasOpenAccess W1549638892 @default.
- W1549638892 hasRelatedWork W13111539 @default.
- W1549638892 hasRelatedWork W1480207373 @default.
- W1549638892 hasRelatedWork W1780726185 @default.
- W1549638892 hasRelatedWork W1881973443 @default.
- W1549638892 hasRelatedWork W2065168033 @default.
- W1549638892 hasRelatedWork W2080354349 @default.
- W1549638892 hasRelatedWork W2094405361 @default.
- W1549638892 hasRelatedWork W2133870777 @default.
- W1549638892 hasRelatedWork W2164896748 @default.
- W1549638892 hasRelatedWork W2168609510 @default.
- W1549638892 hasRelatedWork W2181540916 @default.
- W1549638892 hasRelatedWork W2276611798 @default.
- W1549638892 hasRelatedWork W2315161676 @default.
- W1549638892 hasRelatedWork W2353753557 @default.
- W1549638892 hasRelatedWork W2401153089 @default.
- W1549638892 hasRelatedWork W2516571161 @default.
- W1549638892 hasRelatedWork W2613535260 @default.
- W1549638892 hasRelatedWork W3129095573 @default.
- W1549638892 hasRelatedWork W91053882 @default.
- W1549638892 hasRelatedWork W128549517 @default.
- W1549638892 isParatext "false" @default.
- W1549638892 isRetracted "false" @default.
- W1549638892 magId "1549638892" @default.
- W1549638892 workType "article" @default.