skip to main content
research-article
Free Access

Distributed selection: a missing piece of data aggregation

Published:01 September 2008Publication History
Skip Abstract Section

Abstract

In this article, we study the problem of distributed selection from a theoretical point of view. Given a general connected graph of diameter D consisting of n nodes in which each node holds a numeric element, the goal of a k-selection algorithm is to determine the kth smallest of these elements. We prove that distributed selection indeed requires more work than other aggregation functions such as, e.g., the computation of the average or the maximum of all elements. On the other hand, we show that the kth smallest element can be computed efficiently by providing both a randomized and a deterministic k-selection algorithm, dispelling the misconception that solving distributed selection through in-network aggregation is infeasible.

References

  1. Blum, M., Floyd, R. W., Pratt, V, Rivest, R. L, and Tarjan, R. E. Time bounds for selection. Journal of Computer and System Sciences. 7:448--461, 1973.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Burri, N., von Rickenbach, P., and Wattenhofer, R. Dozer. Ultra-low power data gathering in sensor networks. In International Conference on Information Processing in Sensor Networks (IPSN), 2007 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chin, F. Y. L. and Ting, H. F. An improved algorithm for finding the median distributively. Algorithmica. 2:77--86, 1987.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Frederickson, G. N. Tradeoffs for selection in distributed networks. In Proceedings of the 2nd Annual ACM Symposium on Principles of Distributed Computing (PODC), pp. 154--160, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Goel, A. and Estrin, D. Simultaneous optimization for concave costs: Single sink aggregation or single source buy-at-bulk. Algorithmica, 43(l-2):5--15, 2005.Google ScholarGoogle Scholar
  6. Jia, L, Lin, G., Noubir, G., Rajaraman, R. and Sundaram, R. Universal approximations for TSP, Steiner Tree, and set cover. In 37th Annual ACM Symposium on Theory of Computing (STOC), pp. 386--395, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Kempe, D., Dobra, A., and Gehrke, J. Gossip-based computation of aggregate information. In Proceedings of the 44th Annual IEEE Symposium on Foundations of Computer Science (FOCS), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Madden, S., Franklin, M. J., Hellerstein, J. M., and Hong, W. TAG: a tiny aggregation service for ad-hoc sensor networks. In Proceedings of the 5th Annual Symposium on Operating Systems Design and Implementation (OSDI), pp. 131--146,2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Marberg, J. M. and Gafini, E. An optimal shout-echo algorithm for selection in distributed sets. In Proceedings of the 23rd Allerton Conference on Communication, Control, and Computing, 1985.Google ScholarGoogle Scholar
  10. Moscibroda, T. The worst-case capacity of wireless sensor networks. In 6th International Conference on Information Processing in Sensor Networks (IPSN), 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Moscibroda, T. and Wattenhofer, R. The complexity of connectivity in wireless networks. In 25th Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM), 2006.Google ScholarGoogle ScholarCross RefCross Ref
  12. Negro, A., Santoro, N., and Urrutia, J. Efficient distributed selection with bounded messages. IEEE Transactions of Parallel and Distributed Systems. 8(4):397--401,1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Patt-Shamir, B. A note on efficient aggregate queries in sensor networks. In Proceedings of the 23rd Annual ACM Symposium on Principles of Distributed Computing (PODC), pp. 283--289,2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Peleg, D. Distributed Computing: A Locality-Sensitive Approach. SIAM Monographs on Discrete Mathematics and Applications, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Rodeh, M. Finding the median distributively. Journal of Computer and System Science, 24(2):162--166. 1982.Google ScholarGoogle ScholarCross RefCross Ref
  16. Rodem, D., Santoro, N., and Sidney, J. Shout-echo selection in distributed files. Networks. 16:235--249, 1986.Google ScholarGoogle Scholar
  17. Santoro, N., Scheutzow, M., and Sidney, J. B. On the expected complexity of distributed selection. Journal on Parallel and Distributed Computing, 5(2):194--203, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Santoro, N., Sidney, J. B., and Sidney, S. J. A distributed selection algorithm and its expected communication complexity. Theoretical Computer Science. 100(1):185--204, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Schönhage, A., Paterson, M. S., and Pippenger, N. Finding the median. Journal of Computer and System Sciences, 13:184--199, 1976.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Shrira, L, Francez, N., and Rodeh. M. Distributed k-selection: From a sequential to a distributed algorithm. In Proceedings of the 2nd Annual ACM Symposium on Principles of Distributed Computing (PODC), pp. 143--153, 1983. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Tiwari, P. Lower bounds on communication complexity in distributed computer networks. Journal of the ACM (JACM). 34(4):921,1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Yao, A. Some complexity questions related to distributive computing. In Proceedings of the Annual ACM Symposium on Theory of Computing (STOC), pp. 209,1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Yao, Y and Gehrke, J. The Cougar approach to in-network query processing in sensor networks. ACM SIGMOD Record, 31(3):9--18, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Zhao, J., Govindan, R., and Estrin, D. Computing aggregates for monitoring wireless sensor networks. In Proceedings of the 1st IEEE International Workshop on Sensor Network Protocols and Applications (SNPA), 2003.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Distributed selection: a missing piece of data aggregation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image Communications of the ACM
        Communications of the ACM  Volume 51, Issue 9
        Enterprise information integration: and other tools for merging data
        September 2008
        124 pages
        ISSN:0001-0782
        EISSN:1557-7317
        DOI:10.1145/1378727
        Issue’s Table of Contents

        Copyright © 2008 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 September 2008

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Popular
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format