Abstract
A vast amount of information is available on the WWW. There are a lot of Web pages whose content is ‘local’ and interesting for people in a very narrow regional area. Usually, users search for information with search engines, even though finding or excluding local information may still be difficult. In this paper,we propose a newinformation retrieval method that is based on the localness degree for discovering or excluding local information from theWWW. Localness degree is a new notion for estimating the local dependence and ubiquitous nature of Web pages. The localness degree is computed by 1) a content analysis of the Web page itself, to determine the frequency of occurrence of geographical words, and the geographical area (i.e., latitude and longitude) covered by the location information given on the page, and 2) a comparison of the Web page with other pages with respect to daily (ubiquitous) content.We also show some results of our preliminary experiments of the retrieval method based on the localness degree.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Qiang Ma, Kazutoshi Sumiya, and Katsumi Tanaka. Information filtering based on timeseries features for data dissemination systems. TOD7, 41(SIG6):46–57, 8 2000.
Qiang Ma, Shinya Miyazaki, and Katsumi Tanaka. Webscan: Discovering and notifying important changes of web sites. In DEXA, volume 2113 of Lecture Notes in Computer Science, pages 587–598. Springer, 9 2001.
Shinya Miyazaki, Qiang Ma, and Katsumi Tanaka. Webscan: Content-based change discovery and broadcast-notification of web sites. TOD10, 42(SIG8):96–107, 7 2001.
MACHIgoo. http://machi.goo.ne.jp. 10 Chiyako Matsumoto, Qiang Ma and Katsumi Tanaka
Nobuyuki Miura, Katsumi Takahashi, Seiji Yokoji, and Kenichi Shima. Location oriented information integration mobile info search 2 experiment. The 57th National Convention of IPSJ, 3:637–638, 10 1998.Web Information Retrieval Based on the Localness Degree 181
Orkut Buyukkokten, Junghoo Cho, Hector Garcia-Molina, Luis Gravano, and Narayanan Shivakumar. Exploiting geographical location information of web pages. InWebDB (Informal Proceedings), pages 91–96, 1999.
Kaoru Hiramatsu and Toru Ishida. An augmented web space for digital cities. In SAINT, pages 105–112, 2001.
Daniel Egnor. Geographic search. Technical report, Google Programming Contest, 2002.
Soumen Chakurabarti, Btron Dom, David Gibson, Jon M. Kleinberg, S. Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, and Andrew Tomkins. Hypersearching the web. scientific american, 1999.
Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604–632, 1999.
Antonin Guttman. R-trees: A dynamic index structure for spatial searching. Proc. ACM SIGMOD Conference on Management of Data, 14(2):47–57, 1984.
Carlo Zaniolo, Stefano Ceri, Christos Faloutsos, Richard T. Snodgrass, V. S. Subrahmanian, and Roberto Zicari. Advanced Database Systems. The Morgan Kaufmann, 1997.
T. Takeda. The latitude / longitude position database of all-prefectures cities, towns and villages in japan, 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Matsumoto, C., Ma, Q., Tanaka, K. (2002). Web Information Retrieval Based on the Localness Degree. In: Hameurlain, A., Cicchetti, R., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2002. Lecture Notes in Computer Science, vol 2453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46146-9_18
Download citation
DOI: https://doi.org/10.1007/3-540-46146-9_18
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44126-7
Online ISBN: 978-3-540-46146-3
eBook Packages: Springer Book Archive