Skip to main content

Model-Driven Development of Multidimensional Models from Web Log Files

  • Conference paper
Advances in Conceptual Modeling – Applications and Challenges (ER 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6413))

Included in the following conference series:

Abstract

Analyzing Web log data is important in order to study the usage of a website. Even though some approaches propose data warehousing techniques for structuring the Web log data into a multidimensional model, they present two main drawbacks: (i) they are based on informal guidelines and must be manually applied; and (ii) they consider data tailored to a specific Web log format, thus being restricted to specific analysis tools. To overcome these limitations, we present a model-driven approach for obtaining a conceptual multidimensional model from Web log data in a comprehensive, integrated and automatic manner. This approach consists of the following steps: (i) obtaining a conceptual model of the Web log data based on a unified metamodel, (ii) deriving a multidimensional model from this Web log model by formally defining a set of QVT (Query/View/Transformation) transformation rules.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alves, R., Belo, O.: Mining clickstream-based data cubes. In: 6th International Conference on Enterprise Information Systems, pp. 583–586 (2004)

    Google Scholar 

  2. Alves, R., Belo, O., Cavalcanti, F., Ferreira, P.: Clickstreams, the basis to establish user navigation patterns on web sites. In: Fifth International Conference on Data Mining, Text Mining and their Business Applications, pp. 87–96. WIT Press, Southampton (2004)

    Google Scholar 

  3. Aurélio, D.M., Jorge, A.M., Soares, C., Leal, J.P., Machado, P.: A data warehouse for web intelligence. In: Neves, J., Santos, M.F., Machado, J.M. (eds.) EPIA 2007. LNCS (LNAI), vol. 4874, pp. 487–499. Springer, Heidelberg (2007)

    Google Scholar 

  4. Cooley, R., Mobasher, B., Srivastava, J.: Data preparation for mining world wide web browsing patterns. Knowl. Inf. Syst. 1, 5–32 (1999)

    Article  Google Scholar 

  5. Eirinaki, M., Vazirgiannis, M.: Web mining for web personalization. ACM Trans. Internet Techn. 3, 1–27 (2003)

    Article  Google Scholar 

  6. Fraternali, P., Lanzi, P.L., Matera, M., Maurino, A.: Model-driven web usage analysis for the evaluation of web application quality. J. Web Eng. 3, 124–152 (2004)

    Google Scholar 

  7. Golfarelli, M., Maio, D., Rizzi, S.: The Dimensional Fact Model: A conceptual model for data warehouses. Int. J. Cooperative Inf. Syst. 7, 215–247 (1998)

    Article  Google Scholar 

  8. Hüsemann, B., Lechtenbörger, J., Vossen, G.: Conceptual data warehouse modeling. In: 2nd Intl. Workshop on Design and Management of Data Warehouses, pp. 6–1–6–11 (2000)

    Google Scholar 

  9. Jensen, M.R., Holmgren, T., Pedersen, T.B.: Discovering multidimensional structure in relational data. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2004. LNCS, vol. 3181, pp. 138–148. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  10. Joshi, K.P., Joshi, A., Yesha, Y.: On using a warehouse to analyze web logs. Distributed and Parallel Databases 13, 161–180 (2003)

    Article  MATH  Google Scholar 

  11. Kimball, R., Merz, R.: The data webhouse toolkit: building the web-enabled data warehouse. John Wiley & Sons, Inc., New York (2000)

    Google Scholar 

  12. Lopes, C.T., David, G.: Higher education web information system usage analysis with a data webhouse. In: Gavrilova, M.L., Gervasi, O., Kumar, V., Tan, C.J.K., Taniar, D., Laganá, A., Mun, Y., Choo, H. (eds.) ICCSA 2006. LNCS, vol. 3983, pp. 78–87. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  13. Luján-Mora, S., Trujillo, J., Song, I.Y.: A uml profile for multidimensional modeling in data warehouses. Data Knowl. Eng. 59, 725–769 (2006)

    Article  Google Scholar 

  14. Mazón, J.N., Trujillo, J.: A model driven modernization approach for automatically deriving multidimensional models in data warehouses. In: Parent, C., Schewe, K.-D., Storey, V.C., Thalheim, B. (eds.) ER 2007. LNCS, vol. 4801, pp. 56–71. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  15. Mazón, J.N., Trujillo, J.: A hybrid model driven development framework for the multidimensional modeling of data warehouses. SIGMOD Record 38, 12–17 (2009)

    Article  Google Scholar 

  16. Phipps, C., Davis, K.C.: Automating data warehouse conceptual schema design and evaluation. In: 4th Intl. Workshop on Design and Management of Data Warehouses, pp. 23–32 (2002)

    Google Scholar 

  17. Rizzi, S., Abelló, A., Lechtenbörger, J., Trujillo, J.: Research in data warehouse modeling and design: dead or alive? In: 9th International Workshop on Data Warehousing and OLAP, pp. 3–10 (2006)

    Google Scholar 

  18. The Apache Software Foundation: Log files, http://eregie.premier-ministre.gouv.fr/manual/logs.html

  19. W3C Consortium: Extended common log file format, http://www.w3.org/TR/WD-logfile.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hernández, P., Garrigós, I., Mazón, JN. (2010). Model-Driven Development of Multidimensional Models from Web Log Files. In: Trujillo, J., et al. Advances in Conceptual Modeling – Applications and Challenges. ER 2010. Lecture Notes in Computer Science, vol 6413. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16385-2_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16385-2_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16384-5

  • Online ISBN: 978-3-642-16385-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics