Matches in SemOpenAlex for { <https://semopenalex.org/work/W1569165351> ?p ?o ?g. }
Showing items 1 to 88 of 88 with 100 items per page.
- W1569165351 endingPage "1324" @default.
- W1569165351 startingPage "1321" @default.
- W1569165351 abstract "This chapter investigates a system to automatically grab data from data intensive Websites. The system first infers a model that describes the Website as a collection of classes. Each class represents a set of structurally homogeneous pages, and it is associated with a small set of representative pages. Based on the model, a library of wrappers, one per class, is then inferred with the help of an external wrapper generator. The model, together with the library of wrappers, can thus be used to navigate the site and extract the data. The inference process is performed incrementally. The system starts from a given entry point that becomes the first member of the first class in the model. It then refines the model by exploring its boundaries to gather new pages. At each iteration, the system selects a link collection from the model outbound, and iteratively fetches a page by following one of the links in the collection. In order to reduce the number of pages actually visited, after each download the system makes a guess on the class of remaining pages. If looking at the pages already downloaded, there is sufficient evidence that the guess is right, the remaining pages of the collections are assigned to classes without actually fetching them. The process iterates until all the link collections are typed with a known class." @default.
- W1569165351 created "2016-06-24" @default.
- W1569165351 creator A5008291639 @default.
- W1569165351 creator A5018918066 @default.
- W1569165351 creator A5022736687 @default.
- W1569165351 creator A5030700727 @default.
- W1569165351 date "2004-01-01" @default.
- W1569165351 modified "2023-09-26" @default.
- W1569165351 title "An Automatic Data Grabber for Large Web Sites" @default.
- W1569165351 cites W1489992655 @default.
- W1569165351 cites W2004356755 @default.
- W1569165351 cites W2085016361 @default.
- W1569165351 cites W2104086170 @default.
- W1569165351 cites W2112546875 @default.
- W1569165351 cites W2119966992 @default.
- W1569165351 cites W2150721933 @default.
- W1569165351 cites W2160196229 @default.
- W1569165351 cites W2170188121 @default.
- W1569165351 cites W14180986 @default.
- W1569165351 doi "https://doi.org/10.1016/b978-012088469-8.50137-6" @default.
- W1569165351 hasPublicationYear "2004" @default.
- W1569165351 type Work @default.
- W1569165351 sameAs 1569165351 @default.
- W1569165351 citedByCount "9" @default.
- W1569165351 countsByYear W15691653512013 @default.
- W1569165351 countsByYear W15691653512014 @default.
- W1569165351 crossrefType "book-chapter" @default.
- W1569165351 hasAuthorship W1569165351A5008291639 @default.
- W1569165351 hasAuthorship W1569165351A5018918066 @default.
- W1569165351 hasAuthorship W1569165351A5022736687 @default.
- W1569165351 hasAuthorship W1569165351A5030700727 @default.
- W1569165351 hasBestOaLocation W15691653512 @default.
- W1569165351 hasConcept C121332964 @default.
- W1569165351 hasConcept C136764020 @default.
- W1569165351 hasConcept C154945302 @default.
- W1569165351 hasConcept C163258240 @default.
- W1569165351 hasConcept C173576120 @default.
- W1569165351 hasConcept C177264268 @default.
- W1569165351 hasConcept C199360897 @default.
- W1569165351 hasConcept C21959979 @default.
- W1569165351 hasConcept C23123220 @default.
- W1569165351 hasConcept C2777212361 @default.
- W1569165351 hasConcept C2780992000 @default.
- W1569165351 hasConcept C41008148 @default.
- W1569165351 hasConcept C61096286 @default.
- W1569165351 hasConcept C62520636 @default.
- W1569165351 hasConcept C66882249 @default.
- W1569165351 hasConcept C67617509 @default.
- W1569165351 hasConcept C77088390 @default.
- W1569165351 hasConcept C97355855 @default.
- W1569165351 hasConceptScore W1569165351C121332964 @default.
- W1569165351 hasConceptScore W1569165351C136764020 @default.
- W1569165351 hasConceptScore W1569165351C154945302 @default.
- W1569165351 hasConceptScore W1569165351C163258240 @default.
- W1569165351 hasConceptScore W1569165351C173576120 @default.
- W1569165351 hasConceptScore W1569165351C177264268 @default.
- W1569165351 hasConceptScore W1569165351C199360897 @default.
- W1569165351 hasConceptScore W1569165351C21959979 @default.
- W1569165351 hasConceptScore W1569165351C23123220 @default.
- W1569165351 hasConceptScore W1569165351C2777212361 @default.
- W1569165351 hasConceptScore W1569165351C2780992000 @default.
- W1569165351 hasConceptScore W1569165351C41008148 @default.
- W1569165351 hasConceptScore W1569165351C61096286 @default.
- W1569165351 hasConceptScore W1569165351C62520636 @default.
- W1569165351 hasConceptScore W1569165351C66882249 @default.
- W1569165351 hasConceptScore W1569165351C67617509 @default.
- W1569165351 hasConceptScore W1569165351C77088390 @default.
- W1569165351 hasConceptScore W1569165351C97355855 @default.
- W1569165351 hasLocation W15691653511 @default.
- W1569165351 hasLocation W15691653512 @default.
- W1569165351 hasOpenAccess W1569165351 @default.
- W1569165351 hasPrimaryLocation W15691653511 @default.
- W1569165351 hasRelatedWork W1545545132 @default.
- W1569165351 hasRelatedWork W1591904946 @default.
- W1569165351 hasRelatedWork W1840312346 @default.
- W1569165351 hasRelatedWork W2036147130 @default.
- W1569165351 hasRelatedWork W2123828121 @default.
- W1569165351 hasRelatedWork W2144190808 @default.
- W1569165351 hasRelatedWork W2411679502 @default.
- W1569165351 hasRelatedWork W50774052 @default.
- W1569165351 hasRelatedWork W2187430634 @default.
- W1569165351 hasRelatedWork W2592441986 @default.
- W1569165351 isParatext "false" @default.
- W1569165351 isRetracted "false" @default.
- W1569165351 magId "1569165351" @default.
- W1569165351 workType "book-chapter" @default.
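
The incremental inference loop described in the abstract above can be illustrated with a minimal Python sketch. It is not the chapter's implementation: the toy `SITE` dictionary, the `grab` function, the `fetch` callback, the use of a plain string as a page's structural signature, and the `evidence` threshold (how many consecutive agreeing downloads count as "sufficient evidence") are all assumptions made for illustration. The wrapper-generation step is left out, and pages that are typed without being fetched do not contribute their outgoing link collections here.

```python
from collections import deque

# Hypothetical toy site: URL -> (structural signature, list of link collections).
SITE = {
    "/": ("home", [["/t/1", "/t/2", "/t/3", "/t/4"]]),
    "/t/1": ("team", [["/p/1", "/p/2", "/p/3"]]),
    "/t/2": ("team", [["/p/4", "/p/5"]]),
    "/t/3": ("team", [["/p/6"]]),
    "/t/4": ("team", [["/p/7"]]),
    **{f"/p/{i}": ("player", []) for i in range(1, 8)},
}


def grab(entry, fetch, evidence=2):
    """Type every reachable link collection; return (url -> class, fetched urls)."""
    classes = {}               # the model: page URL -> class label
    fetched = set()            # pages that were actually downloaded
    queue = deque([[entry]])   # link collections still to be typed
    while queue:
        collection = queue.popleft()
        guess, agree = None, 0
        for url in collection:
            if url in classes:
                continue
            if guess is not None and agree >= evidence:
                # Enough downloaded pages agreed: assign the class without fetching.
                classes[url] = guess
                continue
            signature, outgoing = fetch(url)
            fetched.add(url)
            classes[url] = signature
            # Count consecutive downloads that match the current guess.
            agree = agree + 1 if signature == guess else 1
            guess = signature
            queue.extend(outgoing)
    return classes, fetched


classes, fetched = grab("/", SITE.__getitem__)
```

On this toy site, `/t/3` and `/t/4` receive the class `team` after only `/t/1` and `/t/2` have been downloaded, which is the saving the abstract describes. A larger `evidence` value trades more downloads for a lower risk of a wrong guess.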