Matches in SemOpenAlex for { <https://semopenalex.org/work/W2402156761> ?p ?o ?g. }
- W2402156761 endingPage "192" @default.
- W2402156761 startingPage "169" @default.
- W2402156761 abstract "This paper studies structured data extraction from template-generated Web pages. Such pages contain most of the structured data on the Web. Extracted structured data can later be integrated and reused in a wide range of applications, such as price comparison portals, business intelligence tools, and various mashups. This encourages industry and academia to seek automatic solutions. To tackle the problem of automatic structured Web data extraction, we present a new approach: structured data extraction based on clustering visually similar Web page elements. Our method, called ClustVX, combines visual and pure HTML features of a Web page to cluster visually similar Web page elements and then extract structured Web data. ClustVX can extract structured data from Web pages where more than one data record is present. Through extensive experimental evaluation on three benchmark datasets, we demonstrate that ClustVX achieves better results than other state-of-the-art automatic structured Web data extraction methods." @default.
- W2402156761 created "2016-06-24" @default.
- W2402156761 creator A5062980566 @default.
- W2402156761 creator A5064548188 @default.
- W2402156761 date "2014-01-01" @default.
- W2402156761 modified "2023-09-26" @default.
- W2402156761 title "Unsupervised Structured Data Extraction from Template-generated Web Pages" @default.
- W2402156761 cites W104909239 @default.
- W2402156761 cites W1519606823 @default.
- W2402156761 cites W1553019137 @default.
- W2402156761 cites W1617966546 @default.
- W2402156761 cites W1803802947 @default.
- W2402156761 cites W1976373002 @default.
- W2402156761 cites W2002956097 @default.
- W2402156761 cites W2005646337 @default.
- W2402156761 cites W2015551056 @default.
- W2402156761 cites W2022158118 @default.
- W2402156761 cites W2022166150 @default.
- W2402156761 cites W2040757233 @default.
- W2402156761 cites W2044515729 @default.
- W2402156761 cites W2049461910 @default.
- W2402156761 cites W2065568440 @default.
- W2402156761 cites W2069388662 @default.
- W2402156761 cites W2071628657 @default.
- W2402156761 cites W2072936489 @default.
- W2402156761 cites W2084801987 @default.
- W2402156761 cites W2108223890 @default.
- W2402156761 cites W2128341918 @default.
- W2402156761 cites W2133669904 @default.
- W2402156761 cites W2134150392 @default.
- W2402156761 cites W2134907429 @default.
- W2402156761 cites W2135767707 @default.
- W2402156761 cites W2139599797 @default.
- W2402156761 cites W2140116426 @default.
- W2402156761 cites W2143309843 @default.
- W2402156761 cites W2148210463 @default.
- W2402156761 cites W2160189941 @default.
- W2402156761 cites W2160196229 @default.
- W2402156761 cites W2163072729 @default.
- W2402156761 cites W2168358004 @default.
- W2402156761 cites W2171364811 @default.
- W2402156761 cites W2185093343 @default.
- W2402156761 cites W2405968456 @default.
- W2402156761 cites W2471366537 @default.
- W2402156761 cites W2542557639 @default.
- W2402156761 cites W343945789 @default.
- W2402156761 cites W836144344 @default.
- W2402156761 cites W270849871 @default.
- W2402156761 hasPublicationYear "2014" @default.
- W2402156761 type Work @default.
- W2402156761 sameAs 2402156761 @default.
- W2402156761 citedByCount "4" @default.
- W2402156761 countsByYear W24021567612014 @default.
- W2402156761 countsByYear W24021567612015 @default.
- W2402156761 countsByYear W24021567612016 @default.
- W2402156761 countsByYear W24021567612017 @default.
- W2402156761 crossrefType "journal-article" @default.
- W2402156761 hasAuthorship W2402156761A5062980566 @default.
- W2402156761 hasAuthorship W2402156761A5064548188 @default.
- W2402156761 hasConcept C130436687 @default.
- W2402156761 hasConcept C13280743 @default.
- W2402156761 hasConcept C136764020 @default.
- W2402156761 hasConcept C154945302 @default.
- W2402156761 hasConcept C162005631 @default.
- W2402156761 hasConcept C173576120 @default.
- W2402156761 hasConcept C17744445 @default.
- W2402156761 hasConcept C185798385 @default.
- W2402156761 hasConcept C196126337 @default.
- W2402156761 hasConcept C197046077 @default.
- W2402156761 hasConcept C199539241 @default.
- W2402156761 hasConcept C205649164 @default.
- W2402156761 hasConcept C21959979 @default.
- W2402156761 hasConcept C23123220 @default.
- W2402156761 hasConcept C2777466982 @default.
- W2402156761 hasConcept C2779473830 @default.
- W2402156761 hasConcept C41008148 @default.
- W2402156761 hasConcept C73555534 @default.
- W2402156761 hasConcept C79373723 @default.
- W2402156761 hasConceptScore W2402156761C130436687 @default.
- W2402156761 hasConceptScore W2402156761C13280743 @default.
- W2402156761 hasConceptScore W2402156761C136764020 @default.
- W2402156761 hasConceptScore W2402156761C154945302 @default.
- W2402156761 hasConceptScore W2402156761C162005631 @default.
- W2402156761 hasConceptScore W2402156761C173576120 @default.
- W2402156761 hasConceptScore W2402156761C17744445 @default.
- W2402156761 hasConceptScore W2402156761C185798385 @default.
- W2402156761 hasConceptScore W2402156761C196126337 @default.
- W2402156761 hasConceptScore W2402156761C197046077 @default.
- W2402156761 hasConceptScore W2402156761C199539241 @default.
- W2402156761 hasConceptScore W2402156761C205649164 @default.
- W2402156761 hasConceptScore W2402156761C21959979 @default.
- W2402156761 hasConceptScore W2402156761C23123220 @default.
- W2402156761 hasConceptScore W2402156761C2777466982 @default.
- W2402156761 hasConceptScore W2402156761C2779473830 @default.
- W2402156761 hasConceptScore W2402156761C41008148 @default.
- W2402156761 hasConceptScore W2402156761C73555534 @default.
- W2402156761 hasConceptScore W2402156761C79373723 @default.
- W2402156761 hasLocation W24021567611 @default.
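
The matches above follow a regular shape: a leading dash, then subject, predicate, object, and a graph name after `@`. As a minimal sketch, assuming every line keeps that shape, the listing could be read into tuples as shown below. The names `parse_listing` and `group_by_predicate` are hypothetical helpers written for this illustration and are not part of SemOpenAlex or any library.

```python
import re
from collections import defaultdict

# Assumed line shape (inferred from the listing, not from a published spec):
#   - <subject> <predicate> <object> @<graph>.
LINE_RE = re.compile(r'^-\s+(\S+)\s+(\S+)\s+(.*?)\s+@(\w+)\.$')


def parse_listing(text):
    """Return (subject, predicate, object, graph) tuples; skip other lines."""
    quads = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # e.g. the "Matches in SemOpenAlex for ..." header line
        subject, predicate, obj, graph = match.groups()
        # Literals are wrapped in double quotes; identifiers are bare.
        if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
            obj = obj[1:-1]
        quads.append((subject, predicate, obj, graph))
    return quads


def group_by_predicate(quads):
    """Collect all object values under each predicate name."""
    grouped = defaultdict(list)
    for _, predicate, obj, _ in quads:
        grouped[predicate].append(obj)
    return dict(grouped)


# A few lines copied from the listing above.
sample = '''- W2402156761 startingPage "169" @default.
- W2402156761 endingPage "192" @default.
- W2402156761 cites W104909239 @default.
- W2402156761 cites W1519606823 @default.
- W2402156761 type Work @default.'''

quads = parse_listing(sample)
grouped = group_by_predicate(quads)

# Page span implied by startingPage and endingPage, counted inclusively.
page_count = int(grouped["endingPage"][0]) - int(grouped["startingPage"][0]) + 1
```

Grouping by predicate makes repeated properties such as `cites` or `hasConcept` available as plain lists, which is usually the first step before counting or joining them against other works.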