Matches in SemOpenAlex for { <https://semopenalex.org/work/W3006380594> ?p ?o ?g. }
Showing items 1 to 96 of
96
with 100 items per page.
- W3006380594 endingPage "78" @default.
- W3006380594 startingPage "67" @default.
- W3006380594 abstract "The contents of many valuable web-accessible databases are only accessible through search interfaces and are hence invisible to traditional web “crawlers.” Recent studies have estimated the size of this “hidden web” to be 500 billion pages, while the size of the “crawlable” web is only an estimated two billion pages. Recently, commercial web sites have started to manually organize web-accessible databases into Yahoo!-like hierarchical classification schemes. In this paper, we introduce a method for automating this classification process by using a small number of query probes. To classify a database, our algorithm does not retrieve or inspect any documents or pages from the database, but rather just exploits the number of matches that each query probe generates at the database in question. We have conducted an extensive experimental evaluation of our technique over collections of real documents, including over one hundred web-accessible databases. Our experiments show that our system has low overhead and achieves high classification accuracy across a variety of databases." @default.
- W3006380594 created "2020-02-24" @default.
- W3006380594 creator A5010731709 @default.
- W3006380594 creator A5059607920 @default.
- W3006380594 creator A5090568527 @default.
- W3006380594 date "2001-05-01" @default.
- W3006380594 modified "2023-10-17" @default.
- W3006380594 title "Probe, count, and classify" @default.
- W3006380594 cites W1539477445 @default.
- W3006380594 cites W1978394996 @default.
- W3006380594 cites W2005422315 @default.
- W3006380594 cites W2012426233 @default.
- W3006380594 cites W2016892599 @default.
- W3006380594 cites W2018157370 @default.
- W3006380594 cites W2058982198 @default.
- W3006380594 cites W2060216474 @default.
- W3006380594 cites W2061973112 @default.
- W3006380594 cites W2094934653 @default.
- W3006380594 cites W2099944991 @default.
- W3006380594 cites W2125725207 @default.
- W3006380594 cites W2170654002 @default.
- W3006380594 cites W4247346926 @default.
- W3006380594 doi "https://doi.org/10.1145/376284.375671" @default.
- W3006380594 hasPublicationYear "2001" @default.
- W3006380594 type Work @default.
- W3006380594 sameAs 3006380594 @default.
- W3006380594 citedByCount "23" @default.
- W3006380594 countsByYear W30063805942012 @default.
- W3006380594 countsByYear W30063805942013 @default.
- W3006380594 countsByYear W30063805942014 @default.
- W3006380594 countsByYear W30063805942015 @default.
- W3006380594 countsByYear W30063805942017 @default.
- W3006380594 countsByYear W30063805942021 @default.
- W3006380594 crossrefType "journal-article" @default.
- W3006380594 hasAuthorship W3006380594A5010731709 @default.
- W3006380594 hasAuthorship W3006380594A5059607920 @default.
- W3006380594 hasAuthorship W3006380594A5090568527 @default.
- W3006380594 hasConcept C110875604 @default.
- W3006380594 hasConcept C111919701 @default.
- W3006380594 hasConcept C118689300 @default.
- W3006380594 hasConcept C136197465 @default.
- W3006380594 hasConcept C136764020 @default.
- W3006380594 hasConcept C13743948 @default.
- W3006380594 hasConcept C154945302 @default.
- W3006380594 hasConcept C164120249 @default.
- W3006380594 hasConcept C165696696 @default.
- W3006380594 hasConcept C173576120 @default.
- W3006380594 hasConcept C21959979 @default.
- W3006380594 hasConcept C23123220 @default.
- W3006380594 hasConcept C2779960059 @default.
- W3006380594 hasConcept C38652104 @default.
- W3006380594 hasConcept C41008148 @default.
- W3006380594 hasConcept C46721378 @default.
- W3006380594 hasConcept C61096286 @default.
- W3006380594 hasConcept C77088390 @default.
- W3006380594 hasConcept C97854310 @default.
- W3006380594 hasConceptScore W3006380594C110875604 @default.
- W3006380594 hasConceptScore W3006380594C111919701 @default.
- W3006380594 hasConceptScore W3006380594C118689300 @default.
- W3006380594 hasConceptScore W3006380594C136197465 @default.
- W3006380594 hasConceptScore W3006380594C136764020 @default.
- W3006380594 hasConceptScore W3006380594C13743948 @default.
- W3006380594 hasConceptScore W3006380594C154945302 @default.
- W3006380594 hasConceptScore W3006380594C164120249 @default.
- W3006380594 hasConceptScore W3006380594C165696696 @default.
- W3006380594 hasConceptScore W3006380594C173576120 @default.
- W3006380594 hasConceptScore W3006380594C21959979 @default.
- W3006380594 hasConceptScore W3006380594C23123220 @default.
- W3006380594 hasConceptScore W3006380594C2779960059 @default.
- W3006380594 hasConceptScore W3006380594C38652104 @default.
- W3006380594 hasConceptScore W3006380594C41008148 @default.
- W3006380594 hasConceptScore W3006380594C46721378 @default.
- W3006380594 hasConceptScore W3006380594C61096286 @default.
- W3006380594 hasConceptScore W3006380594C77088390 @default.
- W3006380594 hasConceptScore W3006380594C97854310 @default.
- W3006380594 hasIssue "2" @default.
- W3006380594 hasLocation W30063805941 @default.
- W3006380594 hasOpenAccess W3006380594 @default.
- W3006380594 hasPrimaryLocation W30063805941 @default.
- W3006380594 hasRelatedWork W1505765080 @default.
- W3006380594 hasRelatedWork W2047534223 @default.
- W3006380594 hasRelatedWork W2091065008 @default.
- W3006380594 hasRelatedWork W2162324631 @default.
- W3006380594 hasRelatedWork W2315628598 @default.
- W3006380594 hasRelatedWork W2621313185 @default.
- W3006380594 hasRelatedWork W2783231802 @default.
- W3006380594 hasRelatedWork W2899211245 @default.
- W3006380594 hasRelatedWork W3117083188 @default.
- W3006380594 hasRelatedWork W2592441986 @default.
- W3006380594 hasVolume "30" @default.
- W3006380594 isParatext "false" @default.
- W3006380594 isRetracted "false" @default.
- W3006380594 magId "3006380594" @default.
- W3006380594 workType "article" @default.