Abstract
In web page retrievals, search engines are usually used. However, conventional search engines have a problem in that their update intervals are very long because they are based on centralized architecture, which gathers documents using robots. So we proposed the Cooperative Search Engine (CSE) in order to reduce the update interval. CSE is a distributed search engine, which integrates small local search engines into a large global search engine by using local meta search engines. A local meta search engine hides a local search engine in each web site. Although CSE can reduce the update interval, the retrieval performance is not enough. So, we proposed several speed up techniques. In this paper, we describe the structure and behavior of CSE and its efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Nobuyoshi Sato, Minoru Uehara, Yoshifumi Sakai, Hideki Mori: Fresh Information Retrieval in Cooperative Search Engine. Proc. of 2nd Software Engineering Artificial Intelligence, Networking & Parallel / Distributed Computing 2001 (SNPD’01). pp. 104–111 (2001)
Nobuyoshi Sato, Minoru Uehara, Yoshifumi Sakai, Hideki Mori: On Updating in Very Short Time by Distributed Search Engines. Proc. of The 2002 Symposium on Applications and the Internet (SAINT2002). pp. 176–183 (2002)
Minoru Uehara, Nobuyoshi Sato, Takashi Yamamoto, Yoshihiro Nishida, Hideki Mori: Minimizing Query Targets in CSE. Proc. of 7th Workshop on Multimedia Communication and Distributed Processing Systems (DPSWS’99). pp. 85–90 (1999) (in Japanese)
Namazu Project: Namazu: a Full-Text Search Engine. http://www.namazu.org/
Sony Corp.: SonyDrive Search Engine. http://www.sony.co.jp/sd/Search/
C. Mic Bowman, Peter B. Danzig, Darren R. Hardy, Udi Manber, Michael F. Schwartz: The Harvest Information Discovery and Access System. Proc. of 2nd International WWW Conference. http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/Searching/schwartz.harvest/schwartz.harvest.html (1994)
Lycos Inc.: WebAnts. http://polarbear.eng.lycos.com/
Hayato Yamana, et al.: Experiments of Collecting in WWW Information using Distributed WWW Robots. Proc. of 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’98). pp. 379–380 (1998)
M. Koster: A Standard for Robot Exclusion. http://info.webcrawler.com/mak/projects/robots/norobots.html (1994)
Tokiharu Noto, Hiroshi Takeno: Design and Implementation of a Scalable WWW Information Collection Robot. Proc. of 8th Workshop on Multimedia Communication and Distributed Processing Systems (DPSWS 2000). pp. 7–12 (2000) (in Japanese)
Steve Kirsch: Infoseek’s approach to distributed search. Report of the Distributed Indexing / Searching Workshop (DISW’96). http://www.w3.org/Search/9605-Indexing-Workshop/Papers/Kirsch@Infoseek.html (1996)
FreshEye Corp.: FreshEye. http://www.fresheye.com/
Info Space Inc.: MetaCrawler. http://www.meracrawer.com/
CNET Networks, Inc.: SavvySearch. http://www.savvysearch.com/
S. Lawrence, C. L. Gills: The NECI Metasearch Engine. http://www.neci.nec.com/~lawrence/inquirus.html
C. Weider, J. Fullton, S. Spero: Architecture of the Whois++ Index Service. RFC1913 (1996)
Nippon Telegraph and Telephone Corp.: Ingrid. http://www.ingrid.org/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sato, N., Uehara, M., Sakai, Y., Mori, H. (2002). Fresh Information Retrieval Using Cooperative Meta Search Engines. In: Chong, I. (eds) Information Networking: Wireless Communications Technologies and Network Applications. ICOIN 2002. Lecture Notes in Computer Science, vol 2344. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45801-8_62
Download citation
DOI: https://doi.org/10.1007/3-540-45801-8_62
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44255-4
Online ISBN: 978-3-540-45801-2
eBook Packages: Springer Book Archive