UGC Approved Journal no 63975(19)

ISSN: 2349-5162 | ESTD Year : 2014
Call for Paper
Volume 11 | Issue 4 | April 2024

JETIREXPLORE- Search Thousands of research papers



WhatsApp Contact
Click Here

Published in:

Volume 6 Issue 2
February-2019
eISSN: 2349-5162

UGC and ISSN approved 7.95 impact factor UGC Approved Journal no 63975

7.95 impact factor calculated by Google scholar

Unique Identifier

Published Paper ID:
JETIRAE06092


Registration ID:
198768

Page Number

397-402

Share This Article


Jetir RMS

Title

Intelligent Web Crawler for Deep Web Search using Page Rank Mechanism

Abstract

Web pages available in the internet are growing tremendously now days. In such a situation searching more relevant information in the Internet is a very hard task. Very big information is hidden behind query forms, this information interface to undetermined databases containing high quality structured data. Conventional search engines cannot access and index this hidden part of the Web. Retraining this hidden information from web is very challenging task. Therefore, we introduce a two types of framework, namely SmartCrawler, for effectively harvesting deep web interfaces. In the first stage that is site discovering, centre pages are searched with the help of search engines which in turn avoid visiting a large number of pages. To achieve more rigid results for a focused crawl, SmartCrawler ranks websites to prioritize highly suited ones for a given topic. In the second stage, adaptive link - ranking achieves fast in - site searching by excavating most suited links. To eliminate bias on visiting some highly related links in hidden web directories, we design a link tree data structure to achieve immense coverage for a website. The SmartCrawler techniques only consider an url. So we use SmartSearch technique for queries using page rank algorithm. The experimental results on a set of representative domains show the dexterity and accuracy of proposed crawler framework, which efficiently retrieves deep-web interfaces from large - scale sites and access higher harvest rates than other crawlers.

Key Words

Clustering, classification and association rules, data mining

Cite This Article

"Intelligent Web Crawler for Deep Web Search using Page Rank Mechanism ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.6, Issue 2, page no.397-402, February-2019, Available :http://www.jetir.org/papers/JETIRAE06092.pdf

ISSN


2349-5162 | Impact Factor 7.95 Calculate by Google Scholar

An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 7.95 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator

Cite This Article

"Intelligent Web Crawler for Deep Web Search using Page Rank Mechanism ", International Journal of Emerging Technologies and Innovative Research (www.jetir.org | UGC and issn Approved), ISSN:2349-5162, Vol.6, Issue 2, page no. pp397-402, February-2019, Available at : http://www.jetir.org/papers/JETIRAE06092.pdf

Publication Details

Published Paper ID: JETIRAE06092
Registration ID: 198768
Published In: Volume 6 | Issue 2 | Year February-2019
DOI (Digital Object Identifier):
Page No: 397-402
Country: -, -, - .
Area: Engineering
ISSN Number: 2349-5162
Publisher: IJ Publication


Preview This Article


Downlaod

Click here for Article Preview

Download PDF

Downloads

0002910

Print This Page

Current Call For Paper

Jetir RMS