Abstract

A COMPARATIVE STUDY OF SCHEDULING ALGORITHMS FOR WEB CRAWLING USING VB.NET TECHNOLOGY

Sushil Kumar, Dr. Anuj Kumar

071-079

Vol: 1, Issue: 3, 2011

Under the present study, Web Crawler simulator has been designed than analyze the different web crawling algorithm to evaluate their performance. Web crawler is a computer program or software. Web crawler is an essential component of search engines, data mining and other Internet applications. Scheduling Web pages to be downloaded is an important aspect of crawling. Previous research on Web crawl focused on optimizing either crawl speed or quality of the Web pages downloaded. While both metrics are important, scheduling using one of them alone is insufficient and can bias or hurt overall crawl process. This paper is all about the comparative study of scheduling algorithm for Web Crawling using VB.NET Technology

Download PDF

    References

  1. http://en.wikipedia.org/wiki/Web_crawler#Examples_of_Web_crawlers
  2. http://www.chato.cl/papers/castillo04_scheduling_algorithms_web_crawling.pdf
  3. http://ieeexplore.ieee.org/iel5/2/34424/01642621.pdf
  4. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.1.9569&rep=rep1&type=pdf.
  5. http://dollar.biz.uiowa.edu/~pant/Papers/crawling.pdf
  6. Marc Najork, Allan Heydon SRC Research Report 173, “High-Performance Web
  7. Sergey Brin and Lawrence Page, ”Theanatomy of a large-scale hyper textual Web search engine”, In Proceedings of the Seventh International World Wide Web Conference, pages 107–117, April 1998
  8. . [Ard¨o A]. (2005). “Combine Web crawler,” Software package for general and focused Web-crawling. http://combine.it.lth.se/.
Back

Disclaimer: Indexing of published papers is subject to the evaluation and acceptance criteria of the respective indexing agencies. While we strive to maintain high academic and editorial standards, International Journal of Research in Science and Technology does not guarantee the indexing of any published paper. Acceptance and inclusion in indexing databases are determined by the quality, originality, and relevance of the paper, and are at the sole discretion of the indexing bodies.

We are one of the best in the field of watches and we take care of the needs of our customers and produce replica watches of very good quality as per their demands.