Matches in SemOpenAlex for { <https://semopenalex.org/work/W2253603516> ?p ?o ?g. }
Showing items 1 to 83 of
83
with 100 items per page.
- W2253603516 endingPage "353" @default.
- W2253603516 startingPage "352" @default.
- W2253603516 abstract "Automated generation of wrappers has been one of important topics in Web mining because wrappers bridge between HTML hypertext pages on the Web and business applications that need useful information from the HTML pages in a structured form. In this paper, we propose an efficient and accurate wrapper induction technique to elicits concise patterns that occur frequently in the HTML pages using minimum description length (MDL) principle and a suffix-tree sequence storage mechanism. To induce accurate and concise wrapper patterns from Web pages, our algorithm, MDL-Wrapper, uses MDL principle as a tradeoff criterion between the number of occurrence of important patterns and the length of the patterns. The estimation of the occurrence is efficiently calculated by and obtained from suffix tree storage mechanism. Experiments on wrappers for price information and news information from popular Web pages as unlabeled examples show that MDL-Wrapper is efficient and effective for wrapper induction tasks." @default.
- W2253603516 created "2016-06-24" @default.
- W2253603516 creator A5029780795 @default.
- W2253603516 creator A5053598975 @default.
- W2253603516 date "2007-07-01" @default.
- W2253603516 modified "2023-09-26" @default.
- W2253603516 title "Wrapper Induction based on Minimum Description Length using a Suffix Tree" @default.
- W2253603516 cites W1500874501 @default.
- W2253603516 cites W2054658115 @default.
- W2253603516 cites W2059513841 @default.
- W2253603516 hasPublicationYear "2007" @default.
- W2253603516 type Work @default.
- W2253603516 sameAs 2253603516 @default.
- W2253603516 citedByCount "1" @default.
- W2253603516 countsByYear W22536035162015 @default.
- W2253603516 crossrefType "journal-article" @default.
- W2253603516 hasAuthorship W2253603516A5029780795 @default.
- W2253603516 hasAuthorship W2253603516A5053598975 @default.
- W2253603516 hasConcept C113174947 @default.
- W2253603516 hasConcept C124101348 @default.
- W2253603516 hasConcept C134306372 @default.
- W2253603516 hasConcept C136764020 @default.
- W2253603516 hasConcept C138885662 @default.
- W2253603516 hasConcept C154945302 @default.
- W2253603516 hasConcept C162215914 @default.
- W2253603516 hasConcept C162319229 @default.
- W2253603516 hasConcept C199360897 @default.
- W2253603516 hasConcept C21959979 @default.
- W2253603516 hasConcept C23123220 @default.
- W2253603516 hasConcept C2779804580 @default.
- W2253603516 hasConcept C2781166958 @default.
- W2253603516 hasConcept C33923547 @default.
- W2253603516 hasConcept C41008148 @default.
- W2253603516 hasConcept C41895202 @default.
- W2253603516 hasConcept C81639021 @default.
- W2253603516 hasConcept C87465248 @default.
- W2253603516 hasConceptScore W2253603516C113174947 @default.
- W2253603516 hasConceptScore W2253603516C124101348 @default.
- W2253603516 hasConceptScore W2253603516C134306372 @default.
- W2253603516 hasConceptScore W2253603516C136764020 @default.
- W2253603516 hasConceptScore W2253603516C138885662 @default.
- W2253603516 hasConceptScore W2253603516C154945302 @default.
- W2253603516 hasConceptScore W2253603516C162215914 @default.
- W2253603516 hasConceptScore W2253603516C162319229 @default.
- W2253603516 hasConceptScore W2253603516C199360897 @default.
- W2253603516 hasConceptScore W2253603516C21959979 @default.
- W2253603516 hasConceptScore W2253603516C23123220 @default.
- W2253603516 hasConceptScore W2253603516C2779804580 @default.
- W2253603516 hasConceptScore W2253603516C2781166958 @default.
- W2253603516 hasConceptScore W2253603516C33923547 @default.
- W2253603516 hasConceptScore W2253603516C41008148 @default.
- W2253603516 hasConceptScore W2253603516C41895202 @default.
- W2253603516 hasConceptScore W2253603516C81639021 @default.
- W2253603516 hasConceptScore W2253603516C87465248 @default.
- W2253603516 hasLocation W22536035161 @default.
- W2253603516 hasOpenAccess W2253603516 @default.
- W2253603516 hasPrimaryLocation W22536035161 @default.
- W2253603516 hasRelatedWork W1492472380 @default.
- W2253603516 hasRelatedWork W1510758414 @default.
- W2253603516 hasRelatedWork W1981863669 @default.
- W2253603516 hasRelatedWork W1995855187 @default.
- W2253603516 hasRelatedWork W2023094000 @default.
- W2253603516 hasRelatedWork W2084886972 @default.
- W2253603516 hasRelatedWork W2101662882 @default.
- W2253603516 hasRelatedWork W2106568316 @default.
- W2253603516 hasRelatedWork W2134907429 @default.
- W2253603516 hasRelatedWork W2151928232 @default.
- W2253603516 hasRelatedWork W2163155321 @default.
- W2253603516 hasRelatedWork W2167200772 @default.
- W2253603516 hasRelatedWork W2259445367 @default.
- W2253603516 hasRelatedWork W2382405935 @default.
- W2253603516 hasRelatedWork W2613027510 @default.
- W2253603516 hasRelatedWork W28465383 @default.
- W2253603516 hasRelatedWork W2912161846 @default.
- W2253603516 hasRelatedWork W3034400204 @default.
- W2253603516 hasRelatedWork W3081339119 @default.
- W2253603516 hasRelatedWork W857958923 @default.
- W2253603516 isParatext "false" @default.
- W2253603516 isRetracted "false" @default.
- W2253603516 magId "2253603516" @default.
- W2253603516 workType "article" @default.