Matches in SemOpenAlex for { <https://semopenalex.org/work/W2023374068> ?p ?o ?g. }
- W2023374068 endingPage "180" @default.
- W2023374068 startingPage "168" @default.
- W2023374068 abstract "Unsupervised web page classification refers to the problem of clustering the pages in a web site so that each cluster includes a set of web pages that can be classified using a unique class. The existing proposals to perform web page classification do not fulfill a number of requirements that would make them suitable for enterprise web information integration, namely: to be based on lightweight crawling, so as to avoid interfering with the normal operation of the web site; to be unsupervised, which avoids the need for a training set of pre-classified pages; or to use features from outside the page to be classified, which avoids having to download it. In this article, we propose CALA, a new automated proposal to generate URL-based web page classifiers. Our proposal builds a number of URL patterns that represent the different classes of pages in a web site, so further pages can be classified by matching their URLs to the patterns. Its salient features are that it fulfills all of the previous requirements, and it has been validated by a number of experiments using real-world, top-visited web sites. Our validation proves that CALA is very effective and efficient in practice." @default.
- W2023374068 created "2016-06-24" @default.
- W2023374068 creator A5024884892 @default.
- W2023374068 creator A5032959798 @default.
- W2023374068 creator A5044105054 @default.
- W2023374068 creator A5057698424 @default.
- W2023374068 date "2014-02-01" @default.
- W2023374068 modified "2023-10-03" @default.
- W2023374068 title "CALA: An unsupervised URL-based web page classification system" @default.
- W2023374068 cites W1489949474 @default.
- W2023374068 cites W1537655875 @default.
- W2023374068 cites W1590352526 @default.
- W2023374068 cites W1979584682 @default.
- W2023374068 cites W2003471189 @default.
- W2023374068 cites W2008638605 @default.
- W2023374068 cites W2013761541 @default.
- W2023374068 cites W2013970953 @default.
- W2023374068 cites W2017224880 @default.
- W2023374068 cites W2032304665 @default.
- W2023374068 cites W2040075907 @default.
- W2023374068 cites W2053049304 @default.
- W2023374068 cites W2055886766 @default.
- W2023374068 cites W2059586463 @default.
- W2023374068 cites W2067698488 @default.
- W2023374068 cites W2072489225 @default.
- W2023374068 cites W2081980673 @default.
- W2023374068 cites W2101351004 @default.
- W2023374068 cites W2102218101 @default.
- W2023374068 cites W2121641626 @default.
- W2023374068 cites W2121871415 @default.
- W2023374068 cites W2129595335 @default.
- W2023374068 cites W2134150392 @default.
- W2023374068 cites W2134491992 @default.
- W2023374068 cites W2135514714 @default.
- W2023374068 cites W2137313854 @default.
- W2023374068 cites W2138621811 @default.
- W2023374068 cites W2140316698 @default.
- W2023374068 cites W2149228935 @default.
- W2023374068 cites W2150721933 @default.
- W2023374068 cites W2152805927 @default.
- W2023374068 cites W2156346468 @default.
- W2023374068 cites W2162872186 @default.
- W2023374068 cites W2165466912 @default.
- W2023374068 cites W2169899598 @default.
- W2023374068 cites W2177500510 @default.
- W2023374068 cites W2421105961 @default.
- W2023374068 cites W2913389685 @default.
- W2023374068 cites W4234580423 @default.
- W2023374068 cites W4254697110 @default.
- W2023374068 doi "https://doi.org/10.1016/j.knosys.2013.12.019" @default.
- W2023374068 hasPublicationYear "2014" @default.
- W2023374068 type Work @default.
- W2023374068 sameAs 2023374068 @default.
- W2023374068 citedByCount "24" @default.
- W2023374068 countsByYear W20233740682013 @default.
- W2023374068 countsByYear W20233740682014 @default.
- W2023374068 countsByYear W20233740682015 @default.
- W2023374068 countsByYear W20233740682016 @default.
- W2023374068 countsByYear W20233740682017 @default.
- W2023374068 countsByYear W20233740682018 @default.
- W2023374068 countsByYear W20233740682019 @default.
- W2023374068 countsByYear W20233740682020 @default.
- W2023374068 countsByYear W20233740682021 @default.
- W2023374068 countsByYear W20233740682022 @default.
- W2023374068 countsByYear W20233740682023 @default.
- W2023374068 crossrefType "journal-article" @default.
- W2023374068 hasAuthorship W2023374068A5024884892 @default.
- W2023374068 hasAuthorship W2023374068A5032959798 @default.
- W2023374068 hasAuthorship W2023374068A5044105054 @default.
- W2023374068 hasAuthorship W2023374068A5057698424 @default.
- W2023374068 hasBestOaLocation W20233740682 @default.
- W2023374068 hasConcept C100368936 @default.
- W2023374068 hasConcept C105702510 @default.
- W2023374068 hasConcept C136764020 @default.
- W2023374068 hasConcept C13743948 @default.
- W2023374068 hasConcept C154945302 @default.
- W2023374068 hasConcept C162005631 @default.
- W2023374068 hasConcept C173576120 @default.
- W2023374068 hasConcept C177264268 @default.
- W2023374068 hasConcept C195409031 @default.
- W2023374068 hasConcept C199360897 @default.
- W2023374068 hasConcept C21959979 @default.
- W2023374068 hasConcept C23123220 @default.
- W2023374068 hasConcept C2780719617 @default.
- W2023374068 hasConcept C41008148 @default.
- W2023374068 hasConcept C61096286 @default.
- W2023374068 hasConcept C71924100 @default.
- W2023374068 hasConcept C73555534 @default.
- W2023374068 hasConceptScore W2023374068C100368936 @default.
- W2023374068 hasConceptScore W2023374068C105702510 @default.
- W2023374068 hasConceptScore W2023374068C136764020 @default.
- W2023374068 hasConceptScore W2023374068C13743948 @default.
- W2023374068 hasConceptScore W2023374068C154945302 @default.
- W2023374068 hasConceptScore W2023374068C162005631 @default.
- W2023374068 hasConceptScore W2023374068C173576120 @default.
- W2023374068 hasConceptScore W2023374068C177264268 @default.
- W2023374068 hasConceptScore W2023374068C195409031 @default.
- W2023374068 hasConceptScore W2023374068C199360897 @default.