To read this content please select one of the options below:

Archive knowledge discovery by proxy cache

Hsiang‐Fu Yu (Computer Center, at the National Central University, Taiwan, ROC)
Yi‐Ming Chen (Department of Information Management, at the National Central University, Taiwan, ROC)
Li‐Ming Tseng (Distributed System Laboratory, Department of Computer Science & Information Engineering, at the National Central University, Taiwan, ROC)

Internet Research

ISSN: 1066-2243

Article publication date: 1 February 2004

1441

Abstract

An archive is a file containing several related files. Many Internet resources, such as freeware, shareware and trail software, are often packaged into archives for easy installation and taking. Additionally, thousands of users search for archives and download them from different sources everyday. In this paper, previous research on archive downloading is extended via proxy cache to support archive searching. Internet proxy cache servers are used to gather a significant number of Web pages, detect those that contain archive links, and then use the obtained data to search archives by description or filename. Two schemes, iterative and backtracking, are proposed to obtain Web pages with archive links. The experimental results indicate that the precision that both of the schemes can achieve is about the same; however, the backtracking scheme reduces the number of checked pages by a factor of 26. Finally, a real system was implemented to demonstrate the proposed approaches.

Keywords

Citation

Yu, H., Chen, Y. and Tseng, L. (2004), "Archive knowledge discovery by proxy cache", Internet Research, Vol. 14 No. 1, pp. 34-47. https://doi.org/10.1108/10662240410516309

Publisher

:

Emerald Group Publishing Limited

Copyright © 2004, Emerald Group Publishing Limited

Related articles