Matches in SemOpenAlex for { <https://semopenalex.org/work/W2568642289> ?p ?o ?g. }
Showing items 1 to 59 of
59
with 100 items per page.
- W2568642289 abstract "With the exponential growth of both the amount and the diversity of the web information, web site mining is highly desirable for automatically discovering and classifying topic-specific web resources from the World Wide Web. Nevertheless, existing web site mining methods have not yet handled adequately how to make use of all the correlative contextual semantic clues and how to denoise the content of web sites effectually so as to obtain a better classification accuracy. This paper circumstantiates three issues to be solved for designing an effective and efficient web site mining algorithm, i.e., the sampling size, the analysis granularity, and the representation structure of web sites. On the basis, this paper proposes a novel multiscale tree representation model of web sites, and presents a multiscale web site mining approach that contains an HMT-based two-phase classification algorithm, a context-based interscale fusion algorithm, a two-stage text-based denoising procedure, and an entropy-base pruning strategy. The proposed model and algorithms may be used as a starting-point for further investigating some related issues of web sites, such as query optimization of multiple sites and web usage mining. Experiments also show that the approach achieves in average 16% improvement in classification accuracy and 34.5% reduction in processing time over the baseline system." @default.
- W2568642289 created "2017-01-13" @default.
- W2568642289 creator A5086383286 @default.
- W2568642289 date "2004-01-01" @default.
- W2568642289 modified "2023-09-23" @default.
- W2568642289 title "A Web Site Representation and Mining Algorithm Using the Multiscale Tree Model" @default.
- W2568642289 hasPublicationYear "2004" @default.
- W2568642289 type Work @default.
- W2568642289 sameAs 2568642289 @default.
- W2568642289 citedByCount "0" @default.
- W2568642289 crossrefType "journal-article" @default.
- W2568642289 hasAuthorship W2568642289A5086383286 @default.
- W2568642289 hasConcept C108010975 @default.
- W2568642289 hasConcept C119857082 @default.
- W2568642289 hasConcept C124101348 @default.
- W2568642289 hasConcept C136764020 @default.
- W2568642289 hasConcept C162005631 @default.
- W2568642289 hasConcept C197046077 @default.
- W2568642289 hasConcept C23123220 @default.
- W2568642289 hasConcept C35578498 @default.
- W2568642289 hasConcept C41008148 @default.
- W2568642289 hasConcept C6557445 @default.
- W2568642289 hasConcept C86803240 @default.
- W2568642289 hasConceptScore W2568642289C108010975 @default.
- W2568642289 hasConceptScore W2568642289C119857082 @default.
- W2568642289 hasConceptScore W2568642289C124101348 @default.
- W2568642289 hasConceptScore W2568642289C136764020 @default.
- W2568642289 hasConceptScore W2568642289C162005631 @default.
- W2568642289 hasConceptScore W2568642289C197046077 @default.
- W2568642289 hasConceptScore W2568642289C23123220 @default.
- W2568642289 hasConceptScore W2568642289C35578498 @default.
- W2568642289 hasConceptScore W2568642289C41008148 @default.
- W2568642289 hasConceptScore W2568642289C6557445 @default.
- W2568642289 hasConceptScore W2568642289C86803240 @default.
- W2568642289 hasOpenAccess W2568642289 @default.
- W2568642289 hasRelatedWork W1487872886 @default.
- W2568642289 hasRelatedWork W1489740164 @default.
- W2568642289 hasRelatedWork W1503687901 @default.
- W2568642289 hasRelatedWork W1512719150 @default.
- W2568642289 hasRelatedWork W1540134652 @default.
- W2568642289 hasRelatedWork W1543509794 @default.
- W2568642289 hasRelatedWork W1942397591 @default.
- W2568642289 hasRelatedWork W2084475300 @default.
- W2568642289 hasRelatedWork W2115488054 @default.
- W2568642289 hasRelatedWork W2125571602 @default.
- W2568642289 hasRelatedWork W2169187188 @default.
- W2568642289 hasRelatedWork W2183837240 @default.
- W2568642289 hasRelatedWork W2237566883 @default.
- W2568642289 hasRelatedWork W2390291968 @default.
- W2568642289 hasRelatedWork W2590212966 @default.
- W2568642289 hasRelatedWork W2951370462 @default.
- W2568642289 hasRelatedWork W3121507283 @default.
- W2568642289 hasRelatedWork W574985642 @default.
- W2568642289 hasRelatedWork W890554776 @default.
- W2568642289 hasRelatedWork W2243439986 @default.
- W2568642289 isParatext "false" @default.
- W2568642289 isRetracted "false" @default.
- W2568642289 magId "2568642289" @default.
- W2568642289 workType "article" @default.