skip to main content
10.1145/1066677.1066819acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

Distributed evaluation of generalized path queries

Published:13 March 2005Publication History

ABSTRACT

Nowadays, we are required to deal with more complex data, prime examples of which are data on the Web, XML data, biological data, etc. There are already proposed abstractions to handle these kinds of data, in particular in terms of semistructured data models. A semistructured model conceives a database essentially as a finite directed labeled graph whose nodes represent objects, and whose edges represent relationships between objects. In this paper, we focus on path queries, which are considered the basic querying mechanism for semistructured data. In essence, such queries are used to navigate, or discover paths that conform to specifications captured by regular expressions. In order to make the navigation more useful, we consider generalized path queries, in which the symbols could optionally be weighted by numbers. Such numbers can express a variety of information about the data that the query could possibly match or navigate.Motivated by the plethora of today's applications utilizing Web services and peer-to-peer architectures, we present a distributed algorithm for evaluating generalized path queries. We follow a realistic model with distributed (non-shared) memory and message-passing between processors. An optimal solution to the problem lies in the intersection of ideas related to distributed query evaluation, distributed shortest path computation, and queueing systems.

References

  1. Abiteboul, S., Buneman, P., and Suciu, D. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann, San Francisco, CA, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Aho, A., Hopcroft, J. E., and Ullman, J. D. The Design and Analysis of Computer Algorithms. Addison-Wesley, Reading, MA, 1974. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Calvanese, D., Giacomo, G., Lenzerini, M., and Vardi, M. Y., Rewriting of regular expressions and regular path queries. In Proceedings of the 19th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS '99). ACM Press, New York, NY, 1999, 194--204. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Calvanese, D., Giacomo, G., Lenzerini, M., and Vardi, M. Y. View-based query processing and constraint satisfaction. In Proceedings of the 15th Annual IEEE Symposium on Logic in Computer Science (LICS '00). IEEE Press, Washington, DC, 2000, pp. 361--371. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Cooper, R. Introduction to Queueing Theory. Elsevier, North-Holland, 1981.Google ScholarGoogle Scholar
  6. Flesca, S., Furfaro, F., and Greco, S. Weighted path queries on web data. In Proceedings of the 4th International Workshop on the Web and Databases (WebDB '01). Informal Proceedings, pp. 7--12.Google ScholarGoogle Scholar
  7. Gallo, G., and Pallottino, S. Shortest path methods in transportation models. In: (M. Florian, ed.) Transportation Planning Models. Elsevier, North-Holland, 1984, pp. 227--256.Google ScholarGoogle Scholar
  8. Grahne, G., and Thomo, A. Approximate reasoning in semistructured data. In Proceedings of the 8th International Workshop on Knowledge Representation meets Databases (KRDB '01). Online Proceedings, http://ceur-ws.org/Vol-45/03-thomo.psGoogle ScholarGoogle Scholar
  9. Grahne, G., and Thomo, A. Query answering and containment for regular path queries under distortions. In Proceedings of the 3rd International Symposium on Foundations of Information and Knowledge Systems (FoIKS '04). Lecture Notes in Computer Science 2942, Springer, 2004, pp. 98--115.Google ScholarGoogle ScholarCross RefCross Ref
  10. Hopcroft, J., E., and Ullman, J., D. Introduction to Automata Theory, Languages, and Computation. Addison-Wesley, Reading, MA, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hribar, M., R., Taylor, V., E., and Boyce, D., E. Implementing parallel shortest path for parallel transportation applications. Parallel Computing, 27 (12), 2001, pp. 1537--1568.Google ScholarGoogle ScholarCross RefCross Ref
  12. Matocha, J., and Camp, T. A taxonomy of distributed termination detection algorithms. Journal of Systems and Software, 43 (3), 1998, pp. 207--221. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Mendelzon, A., O., Mihaila, G., A., and Milo, T. Querying the World Wide Web. International Journal on Digital Libraries, 1 (1), 1997, pp. 57--67.Google ScholarGoogle Scholar
  14. Mendelzon A. O., and Wood T. P., Finding regular simple paths in graph databases. SIAM Journal on Computing, 24 (6), 1995, pp. 1235--1258. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Suciu D., Distributed query evaluation on semistructured data. ACM Transactions on Database Systems, 27 (1), 2002, pp. 1--62. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Yadin, M., and Naor, P. Queueing systems with a removable service station. Operations Research Quarterly, 14 (4), 1963, pp. 393--405.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Distributed evaluation of generalized path queries

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SAC '05: Proceedings of the 2005 ACM symposium on Applied computing
            March 2005
            1814 pages
            ISBN:1581139640
            DOI:10.1145/1066677

            Copyright © 2005 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 13 March 2005

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            Overall Acceptance Rate1,650of6,669submissions,25%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader