Matches in SemOpenAlex for { <https://semopenalex.org/work/W206874103> ?p ?o ?g. }
Showing items 1 to 95 of
95
with 100 items per page.
- W206874103 abstract "World Wide Web (WWW) is a huge repository of interlinked hypertext documents known as web pages. Users access these hypertext documents via Internet . Since its inception in 1990, WWW has become many folds in size, and now it contains more than 50 bil lion publicly accessible web documents distributed all over the world on thousands of web servers and still gro wing at exponential rate. It is very difficult to s earch information from such a huge collection of WWW as the web pages or documents are not organized as books on shelves in a library, nor are web pages complete ly catalogued at one central location. Search engin e is basic information retrieval tool, used to access informat ion from WWW. In response to the search query provided by users, Search engines use their database to sear ch the relevant documents and produce the result af ter ranking on the basis of relevance. In fact, the Sea rch engine builds its database, with the help of We bCrawlers. To maximize the download rate and to retrieve the whole or significant portion of the Web, search engi nes run multiple crawlers in parallel. Overlapping of downloaded web documents, quality, network bandwidth and refreshing of web documents are the major challenging problems faced by existin g parallel WebCrawlers that are addressed in this w ork. A Multi Threaded (MT) server based novel architecture for incremental parallel web crawler has been desi gned that helps to reduce overlapping, quality and netwo rk bandwidth problems. Additionally, web page change detection methods have been developed to refresh th e web document by detecting the structural, present ation and content level changes in web documents. These change detection methods help to detect whether vers ion of a web page, existing at Search engine side has g ot changed from the one existing at Web server end or not. If it has got changed, the WebCrawler should replac e the existing version at Search engine database si de to keep its repository up-to-date" @default.
- W206874103 created "2016-06-24" @default.
- W206874103 creator A5018650026 @default.
- W206874103 creator A5037897289 @default.
- W206874103 creator A5055173970 @default.
- W206874103 creator A5073464815 @default.
- W206874103 date "2012-01-01" @default.
- W206874103 modified "2023-09-28" @default.
- W206874103 title "AN APPROACH TO DESIGN INCREMENTAL PARALLEL WEBCRAWLER" @default.
- W206874103 cites W110443600 @default.
- W206874103 cites W1510634602 @default.
- W206874103 cites W1565845362 @default.
- W206874103 cites W1566984846 @default.
- W206874103 cites W1613836731 @default.
- W206874103 cites W173995639 @default.
- W206874103 cites W1767717098 @default.
- W206874103 cites W182619827 @default.
- W206874103 cites W1854214752 @default.
- W206874103 cites W1956559956 @default.
- W206874103 cites W1976232673 @default.
- W206874103 cites W1997438973 @default.
- W206874103 cites W2001832505 @default.
- W206874103 cites W2029341294 @default.
- W206874103 cites W2029500199 @default.
- W206874103 cites W2038378248 @default.
- W206874103 cites W2066636486 @default.
- W206874103 cites W2117085788 @default.
- W206874103 cites W2131321260 @default.
- W206874103 cites W2147164982 @default.
- W206874103 cites W2152304295 @default.
- W206874103 cites W2153083299 @default.
- W206874103 cites W2295141584 @default.
- W206874103 cites W2538492099 @default.
- W206874103 cites W2649339340 @default.
- W206874103 cites W2785282843 @default.
- W206874103 hasPublicationYear "2012" @default.
- W206874103 type Work @default.
- W206874103 sameAs 206874103 @default.
- W206874103 citedByCount "2" @default.
- W206874103 countsByYear W2068741032013 @default.
- W206874103 countsByYear W2068741032016 @default.
- W206874103 crossrefType "journal-article" @default.
- W206874103 hasAuthorship W206874103A5018650026 @default.
- W206874103 hasAuthorship W206874103A5037897289 @default.
- W206874103 hasAuthorship W206874103A5055173970 @default.
- W206874103 hasAuthorship W206874103A5073464815 @default.
- W206874103 hasConcept C110875604 @default.
- W206874103 hasConcept C11392498 @default.
- W206874103 hasConcept C136764020 @default.
- W206874103 hasConcept C13743948 @default.
- W206874103 hasConcept C162215914 @default.
- W206874103 hasConcept C173576120 @default.
- W206874103 hasConcept C21959979 @default.
- W206874103 hasConcept C23123220 @default.
- W206874103 hasConcept C41008148 @default.
- W206874103 hasConcept C521815418 @default.
- W206874103 hasConcept C61096286 @default.
- W206874103 hasConceptScore W206874103C110875604 @default.
- W206874103 hasConceptScore W206874103C11392498 @default.
- W206874103 hasConceptScore W206874103C136764020 @default.
- W206874103 hasConceptScore W206874103C13743948 @default.
- W206874103 hasConceptScore W206874103C162215914 @default.
- W206874103 hasConceptScore W206874103C173576120 @default.
- W206874103 hasConceptScore W206874103C21959979 @default.
- W206874103 hasConceptScore W206874103C23123220 @default.
- W206874103 hasConceptScore W206874103C41008148 @default.
- W206874103 hasConceptScore W206874103C521815418 @default.
- W206874103 hasConceptScore W206874103C61096286 @default.
- W206874103 hasLocation W2068741031 @default.
- W206874103 hasOpenAccess W206874103 @default.
- W206874103 hasPrimaryLocation W2068741031 @default.
- W206874103 hasRelatedWork W147348152 @default.
- W206874103 hasRelatedWork W1548519504 @default.
- W206874103 hasRelatedWork W1551580192 @default.
- W206874103 hasRelatedWork W1562586804 @default.
- W206874103 hasRelatedWork W1787988175 @default.
- W206874103 hasRelatedWork W1793206274 @default.
- W206874103 hasRelatedWork W1980656088 @default.
- W206874103 hasRelatedWork W1986870096 @default.
- W206874103 hasRelatedWork W199725966 @default.
- W206874103 hasRelatedWork W2026800377 @default.
- W206874103 hasRelatedWork W2044968286 @default.
- W206874103 hasRelatedWork W2056598903 @default.
- W206874103 hasRelatedWork W2151932833 @default.
- W206874103 hasRelatedWork W2182971975 @default.
- W206874103 hasRelatedWork W2337162139 @default.
- W206874103 hasRelatedWork W2345526731 @default.
- W206874103 hasRelatedWork W2619549793 @default.
- W206874103 hasRelatedWork W2993396824 @default.
- W206874103 hasRelatedWork W94036982 @default.
- W206874103 hasRelatedWork W750555316 @default.
- W206874103 isParatext "false" @default.
- W206874103 isRetracted "false" @default.
- W206874103 magId "206874103" @default.
- W206874103 workType "article" @default.