Skip to main content
Log in

OLAP and bibliographic databases

  • Published:
Scientometrics Aims and scope Submit manuscript

Abstract

The application of online analytical processing (OLAP) technology to bibliographic databases is addressed. We show that OLAP tools can be used by librarians for periodic and ad hoc reporting, quality assurance, and data integrity checking, as well as by research policy makers for monitoring the development of science and evaluating or comparing disciplines, fields or research groups. It is argued that traditional relational database management systems, used mainly for day-to-day data storage and transactional processing, are not appropriate for performing such tasks on a regular basis. For the purpose, a fully functional OLAP solution has been implemented on Biomedicina Slovenica, a Slovenian national bibliographic database. We demonstrate the system's usefulness by extracting data for studying a selection of scientometric issues: changes in the number of published papers, citations and pure citations over time, their dependence on the number of co-operating authors and on the number of organisations the authors are affiliated to, and time-patterns of citations. Hardware, software and feasibility considerations are discussed and the phases of the process of developing bibliographic OLAP applications are outlined.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • AGRAWAL, R., GUPTA, A., SARAWAGI, S. (1997), Modeling multidimensional databases. In: GRAY, A., LARSON, P. (Eds), Proceedings of the Thirteenth International Conference on Data Engineering, April 7-11, 1997 Birmingham U.K., IEEE Computer Society, pp. 232.243.

    Google Scholar 

  • BARNETT, G. A., FINK, E. L., DEBUS, M. B. (1989), A mathematical model of academic citation age. Communication Research, 16 (4): 510.531.

    Google Scholar 

  • BURRELL, Q. L. (2002), Modelling citation age data: Simple graphical methods from reliability theory. Scientometrics, 55 (2): 273.285.

    Article  Google Scholar 

  • CODD, E. F., CODD, S. B., SALLEY, C. T. (1993), Beyond decision support. Computerworld, 27 (30): 87.89.

    Google Scholar 

  • EGGHE, L. (2000), Lectures on Informetrics and Scientometrics. Bangalore: Sarada Rangathan Endowement for Library Science.

    Google Scholar 

  • EGGHE, L., RAO, I. K. R. (1992), Citation age data and the obsolescence function: Fits and explanations. Information Processing and Management, 28 (2): 201.217.

    Article  Google Scholar 

  • EGGHE, L., ROUSSEAU, R. (2000), Aging, obsolescence, impact, growth and utilization: Definitions and relations. Journal of the American Society for Information Science, 51 (11): 1004.1017.

    Article  Google Scholar 

  • FAYYAD, U., PIATETSKY-SHAPIRO, G., SMYTH, P. (1996a), From data mining to knowledge discovery: An overview. In: FAYYAD, U., PIATETSKY-SHAPIRO, G., SMYTH, P. (Eds), Advances in Knowledge Discovery and Data Mining, MIT Press, Cambridge, MA, pp. 1.34.

    Google Scholar 

  • FAYYAD, U., PIATETSKY-SHAPIRO, G., SMYTH, P. (1996b). From data mining to knowledge discovery in databases. AI Magazine, 17 (3): 37.54.

    Google Scholar 

  • GORDON, M. D., LINDSAY, R. K. (1996), Toward discovery support systems: A replication, re-examination, and extension of Swanson.s work on literature-based discovery of a connection between Raynaud.s and fish oil. Journal of the American Society for Information Science, 47 (2): 116.128.

    Article  Google Scholar 

  • HAN, J., KAMBER, M. (2001), Data Mining: Concepts and Techniques. San Francisco: Morgan Kaufmann.

    Google Scholar 

  • HRISTOVSKI, D., ROŽIć-HRISTOVSKI, A., ADAMIČ, Š. (1996), A decision support system for biomedical research evaluation. In: BRENDER, J., CHRISTENSEN, J. P., SCHERRER, J., MCNAIR, P. (Eds), Medical Informatics Europe '96, IOS Press, Tokyo, pp. 609.613.

    Google Scholar 

  • HRISTOVSKI, D., STARE, J., PETERLIN, B., DžEROSKI, S. (2001), Supporting discovery in medicine by association rule mining in MEDLINE and UMLS. In: PATEL, V. L., ROGERS, R., HAUX, R. (Eds), MEDINFO 2001, IOS Press, Amsterdam, pp. 1344.1348.

    Google Scholar 

  • KATZ, J. S., HICKS, D. (1997). Desktop Scientometrics. Scientometrics, 38 (1): 141.153.

    Article  Google Scholar 

  • LINDSAY, R. K., GORDON, M. D. (1999). Literature-based discovery by lexical statistics. Journal of the American Society for Information Science, 50 (7): 574.587.

    Article  Google Scholar 

  • MARCUM, J. W. (2001), From information center to discovery system: The next step for libraries? Journal of Academic Librarianship, 27 (2): 97.106.

    MathSciNet  Google Scholar 

  • NICHOLSON, S. (2002). Bibliomining: Data Mining for Libraries. [http://www.bibliomining.com/index.html]

  • Oracle Corporation (2000). Oracle Express: Database Administration Guide (release 6.3.1). Redwood City: Oracle.

    Google Scholar 

  • PENDSE, N. (2003), What is OLAP? [http://www.olapreport.com/fasmi.htm, part of The OLAP Report website]

  • POE, V. (1996), Building a Data Warehouse for Decision Support. Upper Saddle River: Prentice Hall.

    Google Scholar 

  • SU, S. T., NEEDAMANGALA, A. (2000), Harvesting information from a library data warehouse. Information Technology and Libraries, 19 (1): 17.28.

    Google Scholar 

  • SOTOLONGO-AGUILAR, G. R., SUAREZ-BALSEIRO, C. A., GUZMAN-SANCHEZ, M. V. (2000), Modular Bibliometric Information System with Proprietary Software (MOBIS-ProSoft): A versatile approach to bibliometric research tools. LIBRES: Library and Information Science Electronic Journal, 10 (2). [http://libres.curtin.edu.au/libres10n2/modular.htm]

  • SWANSON, D. R. (1986), Fish oil, Raynaud.s syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine, 30 (1): 7.18.

    Google Scholar 

  • SWANSON, D. R. (1989). Online search for logically-related noninteractive medical literatures: A systematic trial-and-error strategy. Journal of the American Society for Information Science, 47 (2): 116.128.

    MathSciNet  Google Scholar 

  • THOMSEN, E. (2002), OLAP Solutions: Building Multidimensional Information Systems (2nd ed.). New York: Wiley Computer Publishing.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Emil Hudomalj.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hudomalj, E., Vidmar, G. OLAP and bibliographic databases. Scientometrics 58, 609–622 (2003). https://doi.org/10.1023/B:SCIE.0000006883.28709.d2

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/B:SCIE.0000006883.28709.d2

Keywords

Navigation