BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER
Keywords:
Best First Search, Priority Strategy of Web Grasping, B-tree Algorithm, Web Revisiting strategy Recommendation System.Abstract
Web crawlers are Internet bot that automatically traverse the hyper-link structure of the world wide web in order to locate and retrieve information. This paper describes a web crawling approach based on B-tree search and HTML Parser. As the goal of crawler is to selectively seek out pages that are relevant to given keywords. Rather than collecting and indexing all available web documents to be able to answer all possible queries, a crawler analyze its crawl boundary to hit upon the links that are likely to be most relevant for the crawl, and avoids irrelevant links of the document.
Downloads
Published
2016-05-30
Issue
Section
Articles
How to Cite
BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER. (2016). International Journal of Engineering Sciences & Management Research, 3(5), 102-106. https://ijesmr.com/index.php/ijesmr/article/view/232

