Matches in SemOpenAlex for { <https://semopenalex.org/work/W2166817967> ?p ?o ?g. }
Showing items 1 to 91 of
91
with 100 items per page.
- W2166817967 abstract "We have developed an unsupervised framework for simultaneously extracting and normalizing attributes of products from multiple Web pages originated from different sites. Our framework is designed based on a probabilistic graphical model that can model the page-independent content information and the page-dependent layout information of the text fragments in Web pages. One characteristic of our framework is that previously unseen attributes can be discovered from the clue contained in the layout format of the text fragments. Our framework tackles both extraction and normalization tasks by jointly considering the relationship between the content and layout information. Dirichlet process prior is employed leading to another advantage that the number of discovered product attributes is unlimited. An unsupervised inference algorithm based on variational method is presented. The semantics of the normalized attributes can be visualized by examining the term weights in the model. Our framework can be applied to a wide range of Web mining applications such as product matching and retrieval. We have conducted extensive experiments from four different domains consisting of over 300 Web pages from over 150 different Web sites, demonstrating the robustness and effectiveness of our framework." @default.
- W2166817967 created "2016-06-24" @default.
- W2166817967 creator A5010910554 @default.
- W2166817967 creator A5017247401 @default.
- W2166817967 creator A5018582154 @default.
- W2166817967 date "2008-07-20" @default.
- W2166817967 modified "2023-10-18" @default.
- W2166817967 title "An unsupervised framework for extracting and normalizing product attributes from multiple web sites" @default.
- W2166817967 cites W1536860849 @default.
- W2166817967 cites W1616576116 @default.
- W2166817967 cites W1774330103 @default.
- W2166817967 cites W1994382199 @default.
- W2166817967 cites W2072169887 @default.
- W2166817967 cites W2119577990 @default.
- W2166817967 cites W2127498532 @default.
- W2166817967 cites W2158266063 @default.
- W2166817967 cites W2158823144 @default.
- W2166817967 cites W2164456230 @default.
- W2166817967 cites W2167138081 @default.
- W2166817967 cites W2171472464 @default.
- W2166817967 cites W2421105961 @default.
- W2166817967 doi "https://doi.org/10.1145/1390334.1390343" @default.
- W2166817967 hasPublicationYear "2008" @default.
- W2166817967 type Work @default.
- W2166817967 sameAs 2166817967 @default.
- W2166817967 citedByCount "46" @default.
- W2166817967 countsByYear W21668179672012 @default.
- W2166817967 countsByYear W21668179672013 @default.
- W2166817967 countsByYear W21668179672014 @default.
- W2166817967 countsByYear W21668179672015 @default.
- W2166817967 countsByYear W21668179672016 @default.
- W2166817967 countsByYear W21668179672017 @default.
- W2166817967 countsByYear W21668179672018 @default.
- W2166817967 countsByYear W21668179672019 @default.
- W2166817967 countsByYear W21668179672020 @default.
- W2166817967 countsByYear W21668179672023 @default.
- W2166817967 crossrefType "proceedings-article" @default.
- W2166817967 hasAuthorship W2166817967A5010910554 @default.
- W2166817967 hasAuthorship W2166817967A5017247401 @default.
- W2166817967 hasAuthorship W2166817967A5018582154 @default.
- W2166817967 hasConcept C104317684 @default.
- W2166817967 hasConcept C124101348 @default.
- W2166817967 hasConcept C136764020 @default.
- W2166817967 hasConcept C136886441 @default.
- W2166817967 hasConcept C144024400 @default.
- W2166817967 hasConcept C154945302 @default.
- W2166817967 hasConcept C171686336 @default.
- W2166817967 hasConcept C185592680 @default.
- W2166817967 hasConcept C19165224 @default.
- W2166817967 hasConcept C21959979 @default.
- W2166817967 hasConcept C23123220 @default.
- W2166817967 hasConcept C2776214188 @default.
- W2166817967 hasConcept C41008148 @default.
- W2166817967 hasConcept C49937458 @default.
- W2166817967 hasConcept C500882744 @default.
- W2166817967 hasConcept C55493867 @default.
- W2166817967 hasConcept C63479239 @default.
- W2166817967 hasConceptScore W2166817967C104317684 @default.
- W2166817967 hasConceptScore W2166817967C124101348 @default.
- W2166817967 hasConceptScore W2166817967C136764020 @default.
- W2166817967 hasConceptScore W2166817967C136886441 @default.
- W2166817967 hasConceptScore W2166817967C144024400 @default.
- W2166817967 hasConceptScore W2166817967C154945302 @default.
- W2166817967 hasConceptScore W2166817967C171686336 @default.
- W2166817967 hasConceptScore W2166817967C185592680 @default.
- W2166817967 hasConceptScore W2166817967C19165224 @default.
- W2166817967 hasConceptScore W2166817967C21959979 @default.
- W2166817967 hasConceptScore W2166817967C23123220 @default.
- W2166817967 hasConceptScore W2166817967C2776214188 @default.
- W2166817967 hasConceptScore W2166817967C41008148 @default.
- W2166817967 hasConceptScore W2166817967C49937458 @default.
- W2166817967 hasConceptScore W2166817967C500882744 @default.
- W2166817967 hasConceptScore W2166817967C55493867 @default.
- W2166817967 hasConceptScore W2166817967C63479239 @default.
- W2166817967 hasLocation W21668179671 @default.
- W2166817967 hasOpenAccess W2166817967 @default.
- W2166817967 hasPrimaryLocation W21668179671 @default.
- W2166817967 hasRelatedWork W1983719983 @default.
- W2166817967 hasRelatedWork W1985125789 @default.
- W2166817967 hasRelatedWork W2001975024 @default.
- W2166817967 hasRelatedWork W2144753143 @default.
- W2166817967 hasRelatedWork W2251363724 @default.
- W2166817967 hasRelatedWork W2279018116 @default.
- W2166817967 hasRelatedWork W2294903680 @default.
- W2166817967 hasRelatedWork W2370554703 @default.
- W2166817967 hasRelatedWork W2594155836 @default.
- W2166817967 hasRelatedWork W4309940794 @default.
- W2166817967 isParatext "false" @default.
- W2166817967 isRetracted "false" @default.
- W2166817967 magId "2166817967" @default.
- W2166817967 workType "article" @default.