NaviMoz: Mining Navigational Patterns in Portal Catalogs

Christodoulou, Eleni; Dalamagas, Theodore; Sellis, Timos

doi:10.1007/11896548_60

Eleni Christodoulou²⁶,
Theodore Dalamagas²⁶ &
Timos Sellis²⁶

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4254))

Included in the following conference series:

International Conference on Extending Database Technology

646 Accesses

Abstract

Portal Catalogs is a popular means of searching for information on the Web. They provide querying and browsing capabilities on data organized in a hierarchy, on a category/subcategory basis. This paper presents mining techniques on user navigational patterns in the hierarchies of portal catalogs. Specifically, we study and implement navigation retrieval methods and clustering tasks based on navigational patterns. The above mining tasks are quite useful for portal administrators, since they can be used to observe users’ behavior, extract personal preferences and re-organize the structure of the portal to satisfy better user needs and navigational habits. These mining tasks have been implemented in the NaviMoz, a prototype system for mining navigational patterns in portal catalogs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Modeling user interests from web browsing activities

Article 01 November 2016

Automatic Generation of Sitemaps Based on Navigation Systems

An Unsupervised Method for Concept Association Analysis in Text Collections

References

Eirinaki, M., Vazirgiannis, M.: Web mining for web personalization. ACM Transactions on Internet Technology (TOIT) 3(1) (2003)
Google Scholar
Anderson, C.R., Horvitz, E.: A Dynamic Personalized Start Page. In: Proceedings of the 11th WWW Conference 2002 (2002)
Google Scholar
Ajijth, A., Ramos, V.: Web Usage Mining Using Artificial Ant Colony Clustering and Genetic Programming. In: Congress on Evolutionary Computation, Canberra, Australia, December 2003, pp. 1384–1391. IEEE Press, Los Alamitos (2003)
Google Scholar
Fang, X., Liu Sheng, O.R.: Designing a Better Web Portal for Digital Government: A Web-Mining Based Approach, http://diggov.org/library/library/dgo2005/demosb/fang_designing.pdf
Kaneta, Y., Munna Ahaduzzaman, M.M., Ohkawa, T.: A Method of Extracting Sentences Related to Protein Interaction from Literature using a Structure Database. In: Proceedings of the 2nd European Workshop on Data Mining and Text Mining for Bioinformatics (ECML/PKDD 2004), Italy (September 2004)
Google Scholar
Kamdar, T., Joshi, A.: On Creating Adaptive Web Sites using Web Log Mining, TR-CS-00-05. Department of Computer Science and Electrical Engineering University of Maryland, Baltimore Country (2000)
Google Scholar
Krishnamurthy, L., Nadeau, J., Ozsoyoglu, G., Ozsoyoglu, M., Schaeffer, G., Tasan, M., Xu, W.: Pathways Database System: An integrated set of tools for biological pathways. Bioinformatics 19(8) (2003)
Google Scholar
Mobasher, B., Dai, H., Luo, T., Sung, Y., Zhu, J.: Integrating Web Usage and Content Mining for More Effective Personalization. In: Proceedings of the International Conference on E-Commerce and Web Technologies, Greenwich, UK, pp. 165–176 (2000)
Google Scholar
Pensa, R.G., Leschi, C., Besson, J., Boulicaut, J.: Assessment of discretization techniques for relevant pattern discovery from gene expression data. In: Proceedings of the 2nd Workshop on Data Mining in Bioinformatics, Seattle, USA (August 2004)
Google Scholar
Pierrakos, D., Paliouras, G., Papatheodorou, C., Karkaletsis, V., Dikaiakos, M.: Web community directories: A new approach to web personalization. In: Berendt, B., Hotho, A., Mladenič, D., van Someren, M., Spiliopoulou, M., Stumme, G. (eds.) EWMF 2003. LNCS, vol. 3209, pp. 113–129. Springer, Heidelberg (2004)
Chapter Google Scholar
Toolan, F., Kusmerick, N.: Mining web logs for personalized site maps. In: Proceedings of the 3rd International Conference on Web Information Systems Engineering (WISE 2002) (2002)
Google Scholar
Wagner, R.A., Fischer, M.J.: The String to String Correction Problem. Journal of the Association for the Computer Machinery 21(1), 168–173 (1974)
MATH MathSciNet Google Scholar
Rasmussen, E.: Clustering algorithms. In: Frakes, W., Baeza-Yates, R. (eds.) Information Retrieval: Data Structures and Algorithms. Prentice Hall, Englewood Cliffs (1992)
Google Scholar
Halkidi, M., Batistakis, Y., Vazirgiannis, M.: Clustering algorithms and validity measures. In: Proceedings of the SSDBM Conference, Virginia, USA (2001)
Google Scholar
Dalamagas, T., Cheng, T., Winkel, K.J., Sellis, T.: A Methodology for Clustering XML Documents by Structure. In: Information Systems. Elsevier, Amsterdam (2004)
Google Scholar
Hubert, L.J., Levin, J.R.: A general statistical framework for accessing categorical clustering in free recall. Psychological Bulletin 83, 1072–1082 (1976)
Article Google Scholar
Baumgarten, M., Buchner, A.G., Anand, S.S., Mulvenna, M.D., Hughes, J.G.: User-Driven Navigation Pattern Discovery from Internet Data, pp. 74–91
Google Scholar
Agrawal, R., Psaila, G., Wimmers, E.L., Zat, M.: Querying shapes of histories. In: Proceedings of 21st International Conference on Very Large Data Bases, pp. 502–514. Morgan Kaufmann, San Francisco (1995)
Google Scholar
XML path language, XPath: www.w3.org/TR/xpath
Jardine, N., van Rijsbergen, C.J.: The use of hierarchical clustering in information retrieval. Information storage and retrieval 7, 217–240 (1971)
Article Google Scholar
Voorhees, E.: The effectiveness and efficiency of agglomerative hierarchic clustering in document retrieval, Ph.D. thesis, Cornell University, Ithaca, New York (October 1985)
Google Scholar
Hearst, M., Pedersen, J.O.: Reexamining the cluster hypothesis: Scatter/gather on retrieval results. In: Proceedings of the ACM SIGIR Conference, Zurich, Switzerland, pp. 76–84 (1996)
Google Scholar
Cormen, T., Leiserson, C., Rivest, R.: Introduction to algorithms. MIT Press, Cambridge (1990)
MATH Google Scholar
Gower, J.C., Ross, G.J.S.: Minimum spanning trees and single linkage cluster analysis. Applied Statistics 18, 54–64 (1969)
Article MathSciNet Google Scholar
Milligan, G.W., Cooper, M.C.: An examination of procedures for determining the number of clusters in a data set. Psychometrika 50, 159–179 (1985)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Electr. and Comp. Engineering, National Techn., University of Athens, Athens, GR 15773, USA
Eleni Christodoulou, Theodore Dalamagas & Timos Sellis

Authors

Eleni Christodoulou
View author publications
You can also search for this author in PubMed Google Scholar
Theodore Dalamagas
View author publications
You can also search for this author in PubMed Google Scholar
Timos Sellis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Technische Universität München, Germany
Torsten Grust
International University in Germany, Germany
Hagen Höpfner
University of the Basque Country,
Arantza Illarramendi
University of Bayreuth, Bayreuth, Germany
Stefan Jablonski
Università di Milano, Italy
Marco Mesiti
University of Erlangen-Nurmberg, Germany
Sascha Müller
Institut für Informatik, Ludwig-Maximilians-Universität, Oettingenstr. 67, 80538, München, Germany
Paula-Lavinia Patranjan
Dept. of Computer Science and Automation, TU Ilmenau
Kai-Uwe Sattler
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou
Université de Mons-Hainaut, Mons, Belgium
Jef Wijsen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Christodoulou, E., Dalamagas, T., Sellis, T. (2006). NaviMoz: Mining Navigational Patterns in Portal Catalogs. In: Grust, T., et al. Current Trends in Database Technology – EDBT 2006. EDBT 2006. Lecture Notes in Computer Science, vol 4254. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11896548_60

Download citation

DOI: https://doi.org/10.1007/11896548_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-46788-5
Online ISBN: 978-3-540-46790-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics