Matches in SemOpenAlex for { <https://semopenalex.org/work/W2794115504> ?p ?o ?g. }
Showing items 1 to 84 of
84
with 100 items per page.
- W2794115504 abstract "Web news extraction is a very important step in the process of Web intelligent information processing. It is the basis of research and application of network public opinion monitoring, heterogeneous Web data source integration and information retrieval. Therefore, the research and design of Web news content information extraction method has important research and application value. Using the idea of web information extraction based on statistics and web structure, this paper improves an existing webpage text extraction algorithm named ERBDF and designs a web news text extraction algorithm based on statistics and DOM tree structure (EETD). Finally, two algorithms are tested and compared in the accuracy and speed of text extraction and the results show that EETD has a better overall performance." @default.
- W2794115504 created "2018-03-29" @default.
- W2794115504 creator A5014547918 @default.
- W2794115504 creator A5043289848 @default.
- W2794115504 creator A5064078720 @default.
- W2794115504 creator A5073201466 @default.
- W2794115504 date "2017-08-01" @default.
- W2794115504 modified "2023-09-23" @default.
- W2794115504 title "An efficient method for extracting web news content" @default.
- W2794115504 cites W1553019137 @default.
- W2794115504 cites W2007278884 @default.
- W2794115504 cites W2019577381 @default.
- W2794115504 cites W2049488566 @default.
- W2794115504 cites W2171364811 @default.
- W2794115504 cites W2515878395 @default.
- W2794115504 cites W2557512290 @default.
- W2794115504 cites W2574209248 @default.
- W2794115504 cites W2580751378 @default.
- W2794115504 cites W2604105233 @default.
- W2794115504 cites W2619036841 @default.
- W2794115504 cites W2688602461 @default.
- W2794115504 doi "https://doi.org/10.1109/icengtechnol.2017.8308202" @default.
- W2794115504 hasPublicationYear "2017" @default.
- W2794115504 type Work @default.
- W2794115504 sameAs 2794115504 @default.
- W2794115504 citedByCount "0" @default.
- W2794115504 crossrefType "proceedings-article" @default.
- W2794115504 hasAuthorship W2794115504A5014547918 @default.
- W2794115504 hasAuthorship W2794115504A5043289848 @default.
- W2794115504 hasAuthorship W2794115504A5064078720 @default.
- W2794115504 hasAuthorship W2794115504A5073201466 @default.
- W2794115504 hasConcept C111919701 @default.
- W2794115504 hasConcept C124101348 @default.
- W2794115504 hasConcept C130436687 @default.
- W2794115504 hasConcept C136764020 @default.
- W2794115504 hasConcept C17744445 @default.
- W2794115504 hasConcept C195807954 @default.
- W2794115504 hasConcept C199539241 @default.
- W2794115504 hasConcept C21959979 @default.
- W2794115504 hasConcept C23123220 @default.
- W2794115504 hasConcept C2777466982 @default.
- W2794115504 hasConcept C2779473830 @default.
- W2794115504 hasConcept C41008148 @default.
- W2794115504 hasConcept C98045186 @default.
- W2794115504 hasConceptScore W2794115504C111919701 @default.
- W2794115504 hasConceptScore W2794115504C124101348 @default.
- W2794115504 hasConceptScore W2794115504C130436687 @default.
- W2794115504 hasConceptScore W2794115504C136764020 @default.
- W2794115504 hasConceptScore W2794115504C17744445 @default.
- W2794115504 hasConceptScore W2794115504C195807954 @default.
- W2794115504 hasConceptScore W2794115504C199539241 @default.
- W2794115504 hasConceptScore W2794115504C21959979 @default.
- W2794115504 hasConceptScore W2794115504C23123220 @default.
- W2794115504 hasConceptScore W2794115504C2777466982 @default.
- W2794115504 hasConceptScore W2794115504C2779473830 @default.
- W2794115504 hasConceptScore W2794115504C41008148 @default.
- W2794115504 hasConceptScore W2794115504C98045186 @default.
- W2794115504 hasLocation W27941155041 @default.
- W2794115504 hasOpenAccess W2794115504 @default.
- W2794115504 hasPrimaryLocation W27941155041 @default.
- W2794115504 hasRelatedWork W1963553539 @default.
- W2794115504 hasRelatedWork W2139830916 @default.
- W2794115504 hasRelatedWork W2248392221 @default.
- W2794115504 hasRelatedWork W2249532877 @default.
- W2794115504 hasRelatedWork W2254934377 @default.
- W2794115504 hasRelatedWork W2352473746 @default.
- W2794115504 hasRelatedWork W2355197776 @default.
- W2794115504 hasRelatedWork W2369251627 @default.
- W2794115504 hasRelatedWork W2370892791 @default.
- W2794115504 hasRelatedWork W2374762067 @default.
- W2794115504 hasRelatedWork W2376787882 @default.
- W2794115504 hasRelatedWork W2383967660 @default.
- W2794115504 hasRelatedWork W2385723522 @default.
- W2794115504 hasRelatedWork W2389364963 @default.
- W2794115504 hasRelatedWork W2391518440 @default.
- W2794115504 hasRelatedWork W2439421558 @default.
- W2794115504 hasRelatedWork W2777295365 @default.
- W2794115504 hasRelatedWork W2805704837 @default.
- W2794115504 hasRelatedWork W3002216803 @default.
- W2794115504 hasRelatedWork W820653439 @default.
- W2794115504 isParatext "false" @default.
- W2794115504 isRetracted "false" @default.
- W2794115504 magId "2794115504" @default.
- W2794115504 workType "article" @default.