Abstract
XML is the most popular platform-independent data expression language which is used to specify various digital content such as web content, multimedia content, bio-chemical data, etc. These various forms of XML data are continuously increasing by a large amount and there is a strong demand on effectively managing such data in digital libraries or archives. The most popular query language to search and retrieve information from such semi-structured XML digital libraries is XQuery. XQuery has a very powerful syntax which allows users to iterate over data items and perform calculation, string matching, and output formatting. However, it lacks a simple and easy way to group and provide summaries on vast amounts of XML data. This grouping and summary function is especially important for large digital archives where users like to obtain an overview or summary of the contents in the digital library. Our work is focused on providing an easy way for grouping in XQuery at the query language level. We provide several cases where this can be considered to be effective. We have also implemented an XQuery processing system with grouping functions based on the eXist Native XML Database.
* This work was supported by the Korea Research Foundation Grant (KRF-2004-041-D00572) and also partially supported by the second stage of the BK21 program.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
XML(eXtensible Markup Language), http://www.w3.org/XML/
XQuery (XML Query Language), http://www.w3.org/XML/Query/
XML Path Language (XPath) 2.0, http://www.w3.org/TR/2005/WD-xpath20-20050404/
eXist(An Open Source Native XML Database), http://exist.sourceforge.net/
Chatziantoniou, D., Ross, K.A.: Querying multiple features of groups in relational databases. In: VLDB (1996)
Chatziantoniou, D., Ross, K.A.: Groupwise processing of relational queries. In: VLDB (1997)
Chaudhuri, S., Kaushik, R., Naughton, J.F.: On Relational Support for XML Publishing: Beyond Sorting and Tagging. In: SIGMOD (2003)
McHugh, J., Abiteboul, S., Goldman, R., Quass, D., Widom, J.: Lore: A Database Management System for Semistructured Data. SIGMOD Record 26(3), 54–66 (1997)
Jagadish, H.V., Al-Khalifa, S., Chapman, A., Lakshmanan, L.V.S., Nierman, A., Paparizos, S., Patel, J.M., Srivastava, D., Wiwatwattana, N., Wu, Y., Yu, C.: TIMBER: A Native XML Database. VLDB Journal 11(4) (2002)
Deutsch, A., Papakonstantinou, Y., Xu, Y.: Minimization and Group-By Detection for Nested XQueries. In: Int’l Conference on Data Engineering (ICDE), p. 839 (2004)
Beyer, K., Chamberlin, D., Colby, L.S., Ozcan, F., Pirahesh, H., Xu, Y.: XML query, update, and search: Extending XQuery for analytics. In: Proceedings of the 2005 ACM SIGMOD Int’l Conference on Management of Data, June 2005, pp. 503–514 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, M., Song, S., Kim, Y., Shin, H. (2006). Supporting Efficient Grouping and Summary Information for Semistructured Digital Libraries. In: Sugimoto, S., Hunter, J., Rauber, A., Morishima, A. (eds) Digital Libraries: Achievements, Challenges and Opportunities. ICADL 2006. Lecture Notes in Computer Science, vol 4312. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11931584_43
Download citation
DOI: https://doi.org/10.1007/11931584_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49375-4
Online ISBN: 978-3-540-49377-8
eBook Packages: Computer ScienceComputer Science (R0)