Matches in SemOpenAlex for { <https://semopenalex.org/work/W2087090969> ?p ?o ?g. }
- W2087090969 abstract "Web pages such as product catalogues and Web sites resulting from querying a search engine often follow a global layout template which facilitates the retrieval of information for a user. In this paper we present a technique which makes such content machine-processable by extracting and transforming it into tabular form. We achieve this goal via ViPER, our fully automatic wrapper system, while localizing and extracting structured data records from suchlike Web pages following a sophisticated strategy based on the visual perception of a Web page. The first contribution of this paper is to give deep insight into the post-processing heuristics of ViPER, which become materialized by a set of rules. Once these rules are defined, the regular content of a Web page can be abstracted into a relational view. Second, we show that new, unseen contents rendered with the same layout, only have to be extracted by ViPER, whereas the remaining transformation can be performed by applying the learned rules accordingly" @default.
- W2087090969 created "2016-06-24" @default.
- W2087090969 creator A5028788578 @default.
- W2087090969 creator A5048594908 @default.
- W2087090969 creator A5070204016 @default.
- W2087090969 date "2006-11-01" @default.
- W2087090969 modified "2023-09-25" @default.
- W2087090969 title "Learning Rules to Pre-process Web Data for Automatic Integration" @default.
- W2087090969 cites W2005646337 @default.
- W2087090969 cites W2052198547 @default.
- W2087090969 cites W2052409393 @default.
- W2087090969 cites W2075726969 @default.
- W2087090969 cites W2084130127 @default.
- W2087090969 cites W2084959895 @default.
- W2087090969 cites W2133669904 @default.
- W2087090969 cites W2166686713 @default.
- W2087090969 cites W4233962457 @default.
- W2087090969 cites W4235592330 @default.
- W2087090969 doi "https://doi.org/10.1109/ruleml.2006.16" @default.
- W2087090969 hasPublicationYear "2006" @default.
- W2087090969 type Work @default.
- W2087090969 sameAs 2087090969 @default.
- W2087090969 citedByCount "6" @default.
- W2087090969 countsByYear W20870909692015 @default.
- W2087090969 countsByYear W20870909692016 @default.
- W2087090969 crossrefType "proceedings-article" @default.
- W2087090969 hasAuthorship W2087090969A5028788578 @default.
- W2087090969 hasAuthorship W2087090969A5048594908 @default.
- W2087090969 hasAuthorship W2087090969A5070204016 @default.
- W2087090969 hasConcept C111919701 @default.
- W2087090969 hasConcept C124101348 @default.
- W2087090969 hasConcept C127705205 @default.
- W2087090969 hasConcept C135572916 @default.
- W2087090969 hasConcept C136764020 @default.
- W2087090969 hasConcept C150670458 @default.
- W2087090969 hasConcept C177264268 @default.
- W2087090969 hasConcept C18903297 @default.
- W2087090969 hasConcept C197046077 @default.
- W2087090969 hasConcept C199360897 @default.
- W2087090969 hasConcept C21959979 @default.
- W2087090969 hasConcept C23123220 @default.
- W2087090969 hasConcept C2779448229 @default.
- W2087090969 hasConcept C2780389661 @default.
- W2087090969 hasConcept C41008148 @default.
- W2087090969 hasConcept C5655090 @default.
- W2087090969 hasConcept C77088390 @default.
- W2087090969 hasConcept C86803240 @default.
- W2087090969 hasConcept C98045186 @default.
- W2087090969 hasConceptScore W2087090969C111919701 @default.
- W2087090969 hasConceptScore W2087090969C124101348 @default.
- W2087090969 hasConceptScore W2087090969C127705205 @default.
- W2087090969 hasConceptScore W2087090969C135572916 @default.
- W2087090969 hasConceptScore W2087090969C136764020 @default.
- W2087090969 hasConceptScore W2087090969C150670458 @default.
- W2087090969 hasConceptScore W2087090969C177264268 @default.
- W2087090969 hasConceptScore W2087090969C18903297 @default.
- W2087090969 hasConceptScore W2087090969C197046077 @default.
- W2087090969 hasConceptScore W2087090969C199360897 @default.
- W2087090969 hasConceptScore W2087090969C21959979 @default.
- W2087090969 hasConceptScore W2087090969C23123220 @default.
- W2087090969 hasConceptScore W2087090969C2779448229 @default.
- W2087090969 hasConceptScore W2087090969C2780389661 @default.
- W2087090969 hasConceptScore W2087090969C41008148 @default.
- W2087090969 hasConceptScore W2087090969C5655090 @default.
- W2087090969 hasConceptScore W2087090969C77088390 @default.
- W2087090969 hasConceptScore W2087090969C86803240 @default.
- W2087090969 hasConceptScore W2087090969C98045186 @default.
- W2087090969 hasLocation W20870909691 @default.
- W2087090969 hasOpenAccess W2087090969 @default.
- W2087090969 hasPrimaryLocation W20870909691 @default.
- W2087090969 hasRelatedWork W1509467138 @default.
- W2087090969 hasRelatedWork W1548492051 @default.
- W2087090969 hasRelatedWork W2013725398 @default.
- W2087090969 hasRelatedWork W2087090969 @default.
- W2087090969 hasRelatedWork W2116332923 @default.
- W2087090969 hasRelatedWork W2128719260 @default.
- W2087090969 hasRelatedWork W2411679502 @default.
- W2087090969 hasRelatedWork W3159087444 @default.
- W2087090969 hasRelatedWork W67510309 @default.
- W2087090969 hasRelatedWork W2513545296 @default.
- W2087090969 isParatext "false" @default.
- W2087090969 isRetracted "false" @default.
- W2087090969 magId "2087090969" @default.
- W2087090969 workType "article" @default.