Matches in SemOpenAlex for { <https://semopenalex.org/work/W2001104490> ?p ?o ?g. }
Showing items 1 to 91 of 91 with 100 items per page.
- W2001104490 endingPage "52" @default.
- W2001104490 startingPage "21" @default.
- W2001104490 abstract "Several studies have concentrated on the generation of wrappers for web data sources. As wrappers can be easily described as grammars, the grammatical inference heritage could play a significant role in this research field. Recent results have identified a new subclass of regular languages, called prefix mark-up languages, that nicely abstract the structures usually found in HTML pages of large web sites. This class has been proven to be identifiable in the limit, and a PTIME unsupervised learning algorithm has been previously developed. Unfortunately, many real-life web pages do not fall in this class of languages. In this article we analyze the roots of the problem and we propose a technique to transform pages in order to bring them into the class of prefix mark-up languages. In this way, we have a practical solution without renouncing to the formal background defined within the grammatical inference framework. We report on some experiments that we have conducted on real-life web pages to evaluate the approach; the results of this activity demonstrate the effectiveness of the presented techniques." @default.
- W2001104490 created "2016-06-24" @default.
- W2001104490 creator A5022736687 @default.
- W2001104490 creator A5030700727 @default.
- W2001104490 date "2008-02-06" @default.
- W2001104490 modified "2023-10-12" @default.
- W2001104490 title "WRAPPER INFERENCE FOR AMBIGUOUS WEB PAGES" @default.
- W2001104490 cites W1519606823 @default.
- W2001104490 cites W2001341021 @default.
- W2001104490 cites W2005646337 @default.
- W2001104490 cites W2072548741 @default.
- W2001104490 cites W2072936489 @default.
- W2001104490 cites W2092386826 @default.
- W2001104490 cites W2093009437 @default.
- W2001104490 cites W2096496923 @default.
- W2001104490 cites W2135479443 @default.
- W2001104490 cites W2150721933 @default.
- W2001104490 cites W2154444297 @default.
- W2001104490 cites W2162340487 @default.
- W2001104490 cites W2170021258 @default.
- W2001104490 doi "https://doi.org/10.1080/08839510701853093" @default.
- W2001104490 hasPublicationYear "2008" @default.
- W2001104490 type Work @default.
- W2001104490 sameAs 2001104490 @default.
- W2001104490 citedByCount "24" @default.
- W2001104490 countsByYear W20011044902012 @default.
- W2001104490 countsByYear W20011044902013 @default.
- W2001104490 countsByYear W20011044902014 @default.
- W2001104490 countsByYear W20011044902015 @default.
- W2001104490 countsByYear W20011044902016 @default.
- W2001104490 countsByYear W20011044902017 @default.
- W2001104490 countsByYear W20011044902018 @default.
- W2001104490 countsByYear W20011044902019 @default.
- W2001104490 countsByYear W20011044902020 @default.
- W2001104490 crossrefType "journal-article" @default.
- W2001104490 hasAuthorship W2001104490A5022736687 @default.
- W2001104490 hasAuthorship W2001104490A5030700727 @default.
- W2001104490 hasBestOaLocation W20011044901 @default.
- W2001104490 hasConcept C11413529 @default.
- W2001104490 hasConcept C134026603 @default.
- W2001104490 hasConcept C136764020 @default.
- W2001104490 hasConcept C138885662 @default.
- W2001104490 hasConcept C141603448 @default.
- W2001104490 hasConcept C154945302 @default.
- W2001104490 hasConcept C199360897 @default.
- W2001104490 hasConcept C204321447 @default.
- W2001104490 hasConcept C21959979 @default.
- W2001104490 hasConcept C23123220 @default.
- W2001104490 hasConcept C2776214188 @default.
- W2001104490 hasConcept C2777212361 @default.
- W2001104490 hasConcept C311688 @default.
- W2001104490 hasConcept C41008148 @default.
- W2001104490 hasConcept C41895202 @default.
- W2001104490 hasConcept C53893814 @default.
- W2001104490 hasConceptScore W2001104490C11413529 @default.
- W2001104490 hasConceptScore W2001104490C134026603 @default.
- W2001104490 hasConceptScore W2001104490C136764020 @default.
- W2001104490 hasConceptScore W2001104490C138885662 @default.
- W2001104490 hasConceptScore W2001104490C141603448 @default.
- W2001104490 hasConceptScore W2001104490C154945302 @default.
- W2001104490 hasConceptScore W2001104490C199360897 @default.
- W2001104490 hasConceptScore W2001104490C204321447 @default.
- W2001104490 hasConceptScore W2001104490C21959979 @default.
- W2001104490 hasConceptScore W2001104490C23123220 @default.
- W2001104490 hasConceptScore W2001104490C2776214188 @default.
- W2001104490 hasConceptScore W2001104490C2777212361 @default.
- W2001104490 hasConceptScore W2001104490C311688 @default.
- W2001104490 hasConceptScore W2001104490C41008148 @default.
- W2001104490 hasConceptScore W2001104490C41895202 @default.
- W2001104490 hasConceptScore W2001104490C53893814 @default.
- W2001104490 hasIssue "1-2" @default.
- W2001104490 hasLocation W20011044901 @default.
- W2001104490 hasOpenAccess W2001104490 @default.
- W2001104490 hasPrimaryLocation W20011044901 @default.
- W2001104490 hasRelatedWork W1569841287 @default.
- W2001104490 hasRelatedWork W1782169904 @default.
- W2001104490 hasRelatedWork W1997090502 @default.
- W2001104490 hasRelatedWork W2141137168 @default.
- W2001104490 hasRelatedWork W2167662847 @default.
- W2001104490 hasRelatedWork W2411679502 @default.
- W2001104490 hasRelatedWork W3107474891 @default.
- W2001104490 hasRelatedWork W3147354785 @default.
- W2001104490 hasRelatedWork W4245313431 @default.
- W2001104490 hasRelatedWork W2513545296 @default.
- W2001104490 hasVolume "22" @default.
- W2001104490 isParatext "false" @default.
- W2001104490 isRetracted "false" @default.
- W2001104490 magId "2001104490" @default.
- W2001104490 workType "article" @default.