Matches in SemOpenAlex for { <https://semopenalex.org/work/W827894772> ?p ?o ?g. }
Showing items 1 to 99 of
99
with 100 items per page.
- W827894772 abstract "Web robots or crawlers are an essential component of all search engines. Major search engines such as Google and AltaVista use their own robots (GoogleBot and Mercator) to crawl and index billions of Web pages over the Internet. Web robots are also increasingly adopted by digital libraries to collect data and on-line documents. The crawling process requires massive amounts of hardware and network resources as well as time. However, when only information about a predefined topic set is desired, the use of traditional crawling strategy becomes inefficient and cost ineffective. This thesis presents issues in developing a focused crawler - CNDROBOT, which only explores well-selected domain sites and collects potential on-topic documents for the CINDI digital library. The research was concerned with the studies on various search engines, types of Web robots, and crawling strategies. The research primarily involved the design and implementation of the CNDROBOT as well as the integration of the Document Filtering Subsystem. Finally, a Web application for the CNDROT was developed and an extensive test was conducted for various components and functions of this system. This thesis demonstrates that the CNDROBOT is capable of effectively and efficiently discovering large amounts of desired documents and supplying them for the CINDI digital library" @default.
- W827894772 created "2016-06-24" @default.
- W827894772 creator A5020502531 @default.
- W827894772 date "2005-01-01" @default.
- W827894772 modified "2023-09-27" @default.
- W827894772 title "CNDROBOT : a robot for the CINDI digital library system" @default.
- W827894772 hasPublicationYear "2005" @default.
- W827894772 type Work @default.
- W827894772 sameAs 827894772 @default.
- W827894772 citedByCount "5" @default.
- W827894772 crossrefType "dissertation" @default.
- W827894772 hasAuthorship W827894772A5020502531 @default.
- W827894772 hasConcept C100368936 @default.
- W827894772 hasConcept C105702510 @default.
- W827894772 hasConcept C110875604 @default.
- W827894772 hasConcept C111919701 @default.
- W827894772 hasConcept C11392498 @default.
- W827894772 hasConcept C118643609 @default.
- W827894772 hasConcept C121332964 @default.
- W827894772 hasConcept C124952713 @default.
- W827894772 hasConcept C127413603 @default.
- W827894772 hasConcept C134306372 @default.
- W827894772 hasConcept C136764020 @default.
- W827894772 hasConcept C13743948 @default.
- W827894772 hasConcept C142362112 @default.
- W827894772 hasConcept C154945302 @default.
- W827894772 hasConcept C164913051 @default.
- W827894772 hasConcept C168167062 @default.
- W827894772 hasConcept C173576120 @default.
- W827894772 hasConcept C177264268 @default.
- W827894772 hasConcept C199360897 @default.
- W827894772 hasConcept C21959979 @default.
- W827894772 hasConcept C23123220 @default.
- W827894772 hasConcept C33923547 @default.
- W827894772 hasConcept C36503486 @default.
- W827894772 hasConcept C41008148 @default.
- W827894772 hasConcept C513874922 @default.
- W827894772 hasConcept C71924100 @default.
- W827894772 hasConcept C73340581 @default.
- W827894772 hasConcept C90509273 @default.
- W827894772 hasConcept C97355855 @default.
- W827894772 hasConcept C98045186 @default.
- W827894772 hasConceptScore W827894772C100368936 @default.
- W827894772 hasConceptScore W827894772C105702510 @default.
- W827894772 hasConceptScore W827894772C110875604 @default.
- W827894772 hasConceptScore W827894772C111919701 @default.
- W827894772 hasConceptScore W827894772C11392498 @default.
- W827894772 hasConceptScore W827894772C118643609 @default.
- W827894772 hasConceptScore W827894772C121332964 @default.
- W827894772 hasConceptScore W827894772C124952713 @default.
- W827894772 hasConceptScore W827894772C127413603 @default.
- W827894772 hasConceptScore W827894772C134306372 @default.
- W827894772 hasConceptScore W827894772C136764020 @default.
- W827894772 hasConceptScore W827894772C13743948 @default.
- W827894772 hasConceptScore W827894772C142362112 @default.
- W827894772 hasConceptScore W827894772C154945302 @default.
- W827894772 hasConceptScore W827894772C164913051 @default.
- W827894772 hasConceptScore W827894772C168167062 @default.
- W827894772 hasConceptScore W827894772C173576120 @default.
- W827894772 hasConceptScore W827894772C177264268 @default.
- W827894772 hasConceptScore W827894772C199360897 @default.
- W827894772 hasConceptScore W827894772C21959979 @default.
- W827894772 hasConceptScore W827894772C23123220 @default.
- W827894772 hasConceptScore W827894772C33923547 @default.
- W827894772 hasConceptScore W827894772C36503486 @default.
- W827894772 hasConceptScore W827894772C41008148 @default.
- W827894772 hasConceptScore W827894772C513874922 @default.
- W827894772 hasConceptScore W827894772C71924100 @default.
- W827894772 hasConceptScore W827894772C73340581 @default.
- W827894772 hasConceptScore W827894772C90509273 @default.
- W827894772 hasConceptScore W827894772C97355855 @default.
- W827894772 hasConceptScore W827894772C98045186 @default.
- W827894772 hasLocation W8278947721 @default.
- W827894772 hasOpenAccess W827894772 @default.
- W827894772 hasPrimaryLocation W8278947721 @default.
- W827894772 hasRelatedWork W1539380046 @default.
- W827894772 hasRelatedWork W1563959936 @default.
- W827894772 hasRelatedWork W2011642193 @default.
- W827894772 hasRelatedWork W2066309116 @default.
- W827894772 hasRelatedWork W2105123614 @default.
- W827894772 hasRelatedWork W2106390192 @default.
- W827894772 hasRelatedWork W2140279085 @default.
- W827894772 hasRelatedWork W2186692612 @default.
- W827894772 hasRelatedWork W2321471970 @default.
- W827894772 hasRelatedWork W2548298479 @default.
- W827894772 hasRelatedWork W2898209107 @default.
- W827894772 hasRelatedWork W2941499861 @default.
- W827894772 hasRelatedWork W3162086139 @default.
- W827894772 hasRelatedWork W3202833648 @default.
- W827894772 hasRelatedWork W3205545366 @default.
- W827894772 hasRelatedWork W40206232 @default.
- W827894772 hasRelatedWork W1513150269 @default.
- W827894772 hasRelatedWork W2185961408 @default.
- W827894772 hasRelatedWork W2264139282 @default.
- W827894772 hasRelatedWork W2552722382 @default.
- W827894772 isParatext "false" @default.
- W827894772 isRetracted "false" @default.
- W827894772 magId "827894772" @default.
- W827894772 workType "dissertation" @default.