TELKOMNIKA (Telecommunication Computing Electronics and Control)
Vol 12, No 4: December 2014

A Model of Vertical Crawler Based on Hidden Markov Chain

Ye Hu (School of Computer Science and Technology, Hubei University of Technology)
Jun Tu (School of Computer Science and Technology, Hubei University of Technology)
Wangyu Tong (School of Computer Science and Technology, Hubei University of Technology)



Article Info

Publish Date
01 Dec 2014

Abstract

The large size and the dynamic nature of the Web make it necessary to continually maintain Web based information retrieval systems. In order to get more objects by visiting few irrelevant web pages, the web crawler usually takes the heuristic searching strategy that ranks urls by their importance and preferentially visits the more important web pages. While some systems rely on crawlers that exhaustively crawl the Web, others incorporate “focus” within their crawlers to harvest application or topic-specific collections. In this paper, using the Hidden Markov Model(HMM) learning ability to solve the problem of the theme of the crawler drift, has obtained the certain effect.

Copyrights © 2014






Journal Info

Abbrev

TELKOMNIKA

Publisher

Subject

Computer Science & IT

Description

Submitted papers are evaluated by anonymous referees by single blind peer review for contribution, originality, relevance, and presentation. The Editor shall inform you of the results of the review as soon as possible, hopefully in 10 weeks. Please notice that because of the great number of ...