Matches in SemOpenAlex for { <https://semopenalex.org/work/W2040757233> ?p ?o ?g. }
Showing items 1 to 90 of 90 with 100 items per page.
- W2040757233 endingPage "263" @default.
- W2040757233 startingPage "249" @default.
- W2040757233 abstract "Web data extraction has been an important part of many Web data analysis applications. In this paper, we formulate the data extraction problem as the decoding process of page generation based on structured data and tree templates. We propose an unsupervised, page-level data extraction approach to deduce the schema and templates for each individual deep Website, which contains either singleton or multiple data records in one Webpage. FiVaTech applies tree matching, tree alignment, and mining techniques to accomplish this challenging task. In experiments, FiVaTech has much higher precision than EXALG and is comparable with other record-level extraction systems like ViPER and MSE. The experiments show an encouraging result for the test pages used in many state-of-the-art Web data extraction works." @default.
- W2040757233 created "2016-06-24" @default.
- W2040757233 creator A5078524542 @default.
- W2040757233 creator A5085280123 @default.
- W2040757233 date "2010-02-01" @default.
- W2040757233 modified "2023-09-26" @default.
- W2040757233 title "FiVaTech: Page-Level Web Data Extraction from Template Pages" @default.
- W2040757233 cites W2005646337 @default.
- W2040757233 cites W2049461910 @default.
- W2040757233 cites W2065568440 @default.
- W2040757233 cites W2128341918 @default.
- W2040757233 cites W2133669904 @default.
- W2040757233 cites W2134150392 @default.
- W2040757233 cites W2143309843 @default.
- W2040757233 cites W2153072229 @default.
- W2040757233 cites W2160196229 @default.
- W2040757233 cites W4233962457 @default.
- W2040757233 cites W4239866631 @default.
- W2040757233 doi "https://doi.org/10.1109/tkde.2009.82" @default.
- W2040757233 hasPublicationYear "2010" @default.
- W2040757233 type Work @default.
- W2040757233 sameAs 2040757233 @default.
- W2040757233 citedByCount "108" @default.
- W2040757233 countsByYear W20407572332012 @default.
- W2040757233 countsByYear W20407572332013 @default.
- W2040757233 countsByYear W20407572332014 @default.
- W2040757233 countsByYear W20407572332015 @default.
- W2040757233 countsByYear W20407572332016 @default.
- W2040757233 countsByYear W20407572332017 @default.
- W2040757233 countsByYear W20407572332018 @default.
- W2040757233 countsByYear W20407572332019 @default.
- W2040757233 countsByYear W20407572332020 @default.
- W2040757233 countsByYear W20407572332021 @default.
- W2040757233 countsByYear W20407572332022 @default.
- W2040757233 crossrefType "journal-article" @default.
- W2040757233 hasAuthorship W2040757233A5078524542 @default.
- W2040757233 hasAuthorship W2040757233A5085280123 @default.
- W2040757233 hasConcept C113174947 @default.
- W2040757233 hasConcept C124101348 @default.
- W2040757233 hasConcept C134306372 @default.
- W2040757233 hasConcept C136764020 @default.
- W2040757233 hasConcept C137922610 @default.
- W2040757233 hasConcept C17744445 @default.
- W2040757233 hasConcept C195807954 @default.
- W2040757233 hasConcept C199360897 @default.
- W2040757233 hasConcept C199539241 @default.
- W2040757233 hasConcept C21959979 @default.
- W2040757233 hasConcept C23123220 @default.
- W2040757233 hasConcept C2777466982 @default.
- W2040757233 hasConcept C2779473830 @default.
- W2040757233 hasConcept C33923547 @default.
- W2040757233 hasConcept C41008148 @default.
- W2040757233 hasConcept C82714645 @default.
- W2040757233 hasConceptScore W2040757233C113174947 @default.
- W2040757233 hasConceptScore W2040757233C124101348 @default.
- W2040757233 hasConceptScore W2040757233C134306372 @default.
- W2040757233 hasConceptScore W2040757233C136764020 @default.
- W2040757233 hasConceptScore W2040757233C137922610 @default.
- W2040757233 hasConceptScore W2040757233C17744445 @default.
- W2040757233 hasConceptScore W2040757233C195807954 @default.
- W2040757233 hasConceptScore W2040757233C199360897 @default.
- W2040757233 hasConceptScore W2040757233C199539241 @default.
- W2040757233 hasConceptScore W2040757233C21959979 @default.
- W2040757233 hasConceptScore W2040757233C23123220 @default.
- W2040757233 hasConceptScore W2040757233C2777466982 @default.
- W2040757233 hasConceptScore W2040757233C2779473830 @default.
- W2040757233 hasConceptScore W2040757233C33923547 @default.
- W2040757233 hasConceptScore W2040757233C41008148 @default.
- W2040757233 hasConceptScore W2040757233C82714645 @default.
- W2040757233 hasIssue "2" @default.
- W2040757233 hasLocation W20407572331 @default.
- W2040757233 hasOpenAccess W2040757233 @default.
- W2040757233 hasPrimaryLocation W20407572331 @default.
- W2040757233 hasRelatedWork W108340680 @default.
- W2040757233 hasRelatedWork W118350637 @default.
- W2040757233 hasRelatedWork W1548492051 @default.
- W2040757233 hasRelatedWork W1973356180 @default.
- W2040757233 hasRelatedWork W2049318906 @default.
- W2040757233 hasRelatedWork W2151311386 @default.
- W2040757233 hasRelatedWork W2151928232 @default.
- W2040757233 hasRelatedWork W2355247546 @default.
- W2040757233 hasRelatedWork W2371618206 @default.
- W2040757233 hasRelatedWork W2390855652 @default.
- W2040757233 hasVolume "22" @default.
- W2040757233 isParatext "false" @default.
- W2040757233 isRetracted "false" @default.
- W2040757233 magId "2040757233" @default.
- W2040757233 workType "article" @default.