BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER

Authors

  • Prateek Raman*, Ravi Kant Gautam, Ravi Yadav, Manish Kumar Sharma

Keywords:

Best First Search, Priority Strategy of Web Grasping, B-tree Algorithm, Web Revisiting Strategy, Recommendation System.

Abstract

Web crawlers are Internet bots that automatically traverse the hyperlink structure of the World Wide Web to locate and retrieve information. This paper describes a web crawling approach based on B-tree search and an HTML parser. The goal of the crawler is to selectively seek out pages that are relevant to given keywords. Rather than collecting and indexing all available web documents in order to answer all possible queries, the crawler analyzes its crawl boundary to find the links that are likely to be most relevant to the crawl and avoids irrelevant links in the document.
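
The selective strategy summarised in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no details of the B-tree search, so an ordinary binary heap stands in for the ordered frontier, relevance is assumed to be a simple keyword count, and the names LinkExtractor, score, focused_crawl, fetch and PAGES are hypothetical. Pages are served from an in-memory dictionary so that the example runs without network access.

```python
import heapq
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects (href, anchor text) pairs and the text fragments of a page."""

    def __init__(self):
        super().__init__()
        self.links = []     # list of (href, anchor_text)
        self.text = []      # all text fragments seen on the page
        self._href = None   # href of the <a> tag currently open, if any
        self._anchor = []   # text fragments inside that <a> tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._anchor = []

    def handle_data(self, data):
        self.text.append(data)
        if self._href is not None:
            self._anchor.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, " ".join(self._anchor)))
            self._href = None


def score(text, keywords):
    """Assumed relevance measure: keyword occurrences among whitespace-split words."""
    words = text.lower().split()
    return sum(words.count(k.lower()) for k in keywords)


def focused_crawl(seed, fetch, keywords, max_pages=10):
    """Best-first crawl: always expand the most promising known link first."""
    frontier = [(0, seed)]  # min-heap of (-priority, url); a stand-in for the B-tree
    seen = {seed}
    relevant = []
    while frontier and len(relevant) < max_pages:
        _, url = heapq.heappop(frontier)
        html = fetch(url)
        if html is None:
            continue
        parser = LinkExtractor()
        parser.feed(html)
        if score(" ".join(parser.text), keywords) > 0:
            relevant.append(url)
        for href, anchor in parser.links:
            priority = score(anchor, keywords)
            if href not in seen and priority > 0:  # links judged irrelevant are never queued
                seen.add(href)
                heapq.heappush(frontier, (-priority, href))
    return relevant


# Hypothetical four-page "web"; "c" is reachable only through an irrelevant link.
PAGES = {
    "a": '<p>crawler index</p><a href="b">web crawler guide</a><a href="c">cooking recipes</a>',
    "b": '<p>A crawler visits pages.</p><a href="d">crawler crawler frontier</a>',
    "c": "<p>pasta</p>",
    "d": "<p>frontier of a crawler</p>",
}
print(focused_crawl("a", PAGES.get, ["crawler"]))
```

In this sketch a link is followed only when its anchor text mentions a keyword, so page "c" is never fetched. A real crawler would replace PAGES.get with an HTTP fetch and would need URL normalisation and politeness rules, none of which are described in the abstract.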

Published

2016-05-30

Section

Articles

How to Cite

BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER. (2016). International Journal of Engineering Sciences & Management Research, 3(5), 102-106. https://ijesmr.com/index.php/ijesmr/article/view/232