Matches in SemOpenAlex for { <https://semopenalex.org/work/W2275993472> ?p ?o ?g. }
Showing items 1 to 88 of
88
with 100 items per page.
- W2275993472 abstract "Structured Data extraction from deep Web pages is a challenging task due to the underlying complex structures of such pages. Also website developer generally follows different web page design technique. Data extraction from webpage is highly useful to build our own database from number applications. A large number of techniques have been proposed to address this problem, but all of them have inherent limitations because they present different limitations and constraints for extracting data from such webpages. This paper presents two different approaches to get structured data extraction. The first approach is non-generic solution which is based on template detection using intersection of Document Object Model Tree of various webpages from the same website. This approach is giving better result in terms of efficiency and accurately locating the main data at the particular webpage. The second approach is based on partial tree alignment mechanism based on using important visual features such as length, size, and position of web table available on the webpages. This approach is a generic solution as it does not depend on one particular website and its webpage template. It is perfectly locating the multiple data regions, data records and data items within a given web page. We have compared our work’s result with existing mechanism and found our result much better for number webpage." @default.
- W2275993472 created "2016-06-24" @default.
- W2275993472 creator A5027892604 @default.
- W2275993472 creator A5037055375 @default.
- W2275993472 creator A5083240076 @default.
- W2275993472 date "2015-11-18" @default.
- W2275993472 modified "2023-09-23" @default.
- W2275993472 title "Web Content Mining Based on Dom Intersection and Visual Features Concept" @default.
- W2275993472 cites W1566513354 @default.
- W2275993472 cites W1927338256 @default.
- W2275993472 cites W1988121588 @default.
- W2275993472 cites W1989338554 @default.
- W2275993472 cites W2040075907 @default.
- W2275993472 cites W2049781914 @default.
- W2275993472 cites W2096806473 @default.
- W2275993472 cites W2101909884 @default.
- W2275993472 cites W2108770713 @default.
- W2275993472 cites W2136072238 @default.
- W2275993472 cites W2160189941 @default.
- W2275993472 cites W343945789 @default.
- W2275993472 doi "https://doi.org/10.6084/m9.figshare.1605578.v1" @default.
- W2275993472 hasPublicationYear "2015" @default.
- W2275993472 type Work @default.
- W2275993472 sameAs 2275993472 @default.
- W2275993472 citedByCount "0" @default.
- W2275993472 crossrefType "journal-article" @default.
- W2275993472 hasAuthorship W2275993472A5027892604 @default.
- W2275993472 hasAuthorship W2275993472A5037055375 @default.
- W2275993472 hasAuthorship W2275993472A5083240076 @default.
- W2275993472 hasConcept C113174947 @default.
- W2275993472 hasConcept C124101348 @default.
- W2275993472 hasConcept C127413603 @default.
- W2275993472 hasConcept C134306372 @default.
- W2275993472 hasConcept C136764020 @default.
- W2275993472 hasConcept C137922610 @default.
- W2275993472 hasConcept C146978453 @default.
- W2275993472 hasConcept C201995342 @default.
- W2275993472 hasConcept C21959979 @default.
- W2275993472 hasConcept C23123220 @default.
- W2275993472 hasConcept C2780451532 @default.
- W2275993472 hasConcept C33923547 @default.
- W2275993472 hasConcept C41008148 @default.
- W2275993472 hasConcept C45235069 @default.
- W2275993472 hasConcept C64543145 @default.
- W2275993472 hasConcept C68476402 @default.
- W2275993472 hasConceptScore W2275993472C113174947 @default.
- W2275993472 hasConceptScore W2275993472C124101348 @default.
- W2275993472 hasConceptScore W2275993472C127413603 @default.
- W2275993472 hasConceptScore W2275993472C134306372 @default.
- W2275993472 hasConceptScore W2275993472C136764020 @default.
- W2275993472 hasConceptScore W2275993472C137922610 @default.
- W2275993472 hasConceptScore W2275993472C146978453 @default.
- W2275993472 hasConceptScore W2275993472C201995342 @default.
- W2275993472 hasConceptScore W2275993472C21959979 @default.
- W2275993472 hasConceptScore W2275993472C23123220 @default.
- W2275993472 hasConceptScore W2275993472C2780451532 @default.
- W2275993472 hasConceptScore W2275993472C33923547 @default.
- W2275993472 hasConceptScore W2275993472C41008148 @default.
- W2275993472 hasConceptScore W2275993472C45235069 @default.
- W2275993472 hasConceptScore W2275993472C64543145 @default.
- W2275993472 hasConceptScore W2275993472C68476402 @default.
- W2275993472 hasLocation W22759934721 @default.
- W2275993472 hasOpenAccess W2275993472 @default.
- W2275993472 hasPrimaryLocation W22759934721 @default.
- W2275993472 hasRelatedWork W1507164096 @default.
- W2275993472 hasRelatedWork W1521051295 @default.
- W2275993472 hasRelatedWork W1544491176 @default.
- W2275993472 hasRelatedWork W16977880 @default.
- W2275993472 hasRelatedWork W1941664561 @default.
- W2275993472 hasRelatedWork W1968053850 @default.
- W2275993472 hasRelatedWork W2026635729 @default.
- W2275993472 hasRelatedWork W2131817001 @default.
- W2275993472 hasRelatedWork W2147858496 @default.
- W2275993472 hasRelatedWork W2158026699 @default.
- W2275993472 hasRelatedWork W2187337573 @default.
- W2275993472 hasRelatedWork W2201534957 @default.
- W2275993472 hasRelatedWork W2307003804 @default.
- W2275993472 hasRelatedWork W2349303672 @default.
- W2275993472 hasRelatedWork W2371618206 @default.
- W2275993472 hasRelatedWork W2405794981 @default.
- W2275993472 hasRelatedWork W2609375942 @default.
- W2275993472 hasRelatedWork W3195407093 @default.
- W2275993472 hasRelatedWork W43928412 @default.
- W2275993472 hasRelatedWork W6578894 @default.
- W2275993472 isParatext "false" @default.
- W2275993472 isRetracted "false" @default.
- W2275993472 magId "2275993472" @default.
- W2275993472 workType "article" @default.