用户名: 密码: 验证码:
Full similarity-based page ranking.
详细信息   
文摘
The World Wide Web creates many new challenges to information retrieval [1]. As the information on Web grows so rapidly, the need of a user efficiently searching some specific piece of information becomes increasingly imperative. The sheer mass and almost anarchic structure of the Web makes effective search difficult. Some good search engine alleviate the problem to some extent by ranking the search results based on the relevancy of the Web pages to users query. They aim to place the most prominent pages at high ranks. Most of current search engines work by first retrieving a set of Web pages based on traditional text-based search engine and then applying link-based page ranking algorithms to rank this set of Web pages. Though those link-based ranking algorithm do extract the potential relationship among the Web pages and improve the Internet search to some extent, they have several problems. One of the most important problems is computation complexity since the convergence of those eigenvector-based ranking algorithms requires iteration which is computationally expensive. Full Similarity-based Ranking FSBR) using densely connected clustering, a novel approach for Web page ranking, is proposed by Prof. Xinhua Zhuang. Under his advising, I did thorough literature overview, proposed a novel Subgraph Chaining Expansion algorithm, built testbed, implemented FSBR algorithm, and conducted simulation and extensive experiments. FSBR is a generic full similarity-based ranking scheme. It allows similarity measures built on link structure and other ranking contributable features. It finds similarity-based densely connected clusters, which are believed to play a significant role in page ranking and topic analysis, and use them in page ranking. Unlike popular eigenvector-based ranking algorithms, our scheme does not require complex iterations; hence it has a much lower computational complexity. The experimental results also show that FSBR provides much higher accuracy than the HITS page ranking algorithm.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700