BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER

Prateek Raman*, Ravi Kant Gautam, Ravi Yadav, Manish Kumar Sharma

BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER

Authors

Prateek Raman*, Ravi Kant Gautam, Ravi Yadav, Manish Kumar Sharma Author

Keywords:

Best First Search, Priority Strategy of Web Grasping, B-tree Algorithm, Web Revisiting strategy Recommendation System.

Abstract

Web crawlers are Internet bot that automatically traverse the hyper-link structure of the world wide web in order to locate and retrieve information. This paper describes a web crawling approach based on B-tree search and HTML Parser. As the goal of crawler is to selectively seek out pages that are relevant to given keywords. Rather than collecting and indexing all available web documents to be able to answer all possible queries, a crawler analyze its crawl boundary to hit upon the links that are likely to be most relevant for the crawl, and avoids irrelevant links of the document.

Downloads

Published

2016-05-30

Issue

Vol. 3 No. 5 (2016): May 2016

Section

Articles

How to Cite

BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER. (2016). International Journal of Engineering Sciences & Management Research, 3(5), 102-106. https://ijesmr.com/index.php/ijesmr/article/view/232

Download Citation

BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B-TREE SEARCH AND HTML PARSER

Authors

Keywords:

Abstract

Downloads

Published

Issue

Section

How to Cite

Language

Information

Indexed IN