Skip to main content

Maintaining the Dominant Representatives on Data Streams

  • Conference paper
Database and Expert Systems Applications (DEXA 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5690))

Included in the following conference series:

Abstract

It is well known that traditional skyline query is very likely to return over many but less informative data points in the result, especially when the querying dataset is high-dimensional or anti-correlated. In data stream applications where large amounts of data are continuously generated, this problem becomes much more serious since the full skyline result cannot be obtained efficiently and analyzed easily. To cope with this difficulty, in this paper, we propose a new concept called Combinatorial Dominant relationship to abstract dominant representatives of stream data. Based on this concept, we propose three novel skyline queries, namely basic convex skyline query (BCSQ), dynamic convex skyline query (DCSQ), andreverse convex skyline query (RCSQ), combining the concepts of convex derived from geometry and the traditional skyline for the first time. These queries can adaptively abstract the contour of skyline points without specifying the size of result set in advance and promote information content of the query result. To efficiently process these queries and maintain their results, we design and analyze algorithms by exploiting a memory indexing structure called DCEL which is used to represent and store the arrangement of data in the sliding window. We convert the problems of points in the primal plane into those of lines in dual plane through dual transformation, which helps us avoid expensive full skyline computation and speeds up the candidate set selection. Finally, through extensive experiments with both real and synthetic datasets, we validate the representative capability of CSQs, as well as the performance of our proposed algorithms.

This research is supported by National 863 Hi-Tech R & D Plan of China under Grant No. 2008AA01Z120, NSFC under Grant Nos. 60673138, 60603046, Program for New Century Excellent Talents in University and Union Project with BMEC on industry-study-research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agarwal, P., Erickson, J.: Geometric Range Searching and Its Relatives. In: Advances in Discrete and Computational Geometry, Contemporary Mathematics, vol. 223, pp. 1–56 (1999)

    Google Scholar 

  2. Preparata, F.P., Shamos, M.I.: Computational geometry: An introduction. Springer, Heidelberg (1985)

    Book  MATH  Google Scholar 

  3. Agarwal, P.K., Sharir, M.: Arrangements and Their Applications. In: Handbook of Computational Geometry, ch. 2, pp. 49–119. Elsevier, Amsterdam (2000)

    Chapter  Google Scholar 

  4. De Berg, M., Cheong, O., Van Kreveld, M., Overmars, M.: Computational Geometry: Algorithms and Applications (March 2008)

    Google Scholar 

  5. Dellis, E., Seeger, B.: Efficient Computation of Reverse Skyline Queries. In: VLDB (2007)

    Google Scholar 

  6. Das, G., Gunopulos, D.: Ad hoc Top-k Query Answering for Data Streams. In: VLDB (2007)

    Google Scholar 

  7. Chan, C.Y., Jagadish, H.V., Tan, K.L., Tung, A.K.H., Zhang, Z.: Finding k-dominant skylines in high dimensional space. In: SIGMOD (2006)

    Google Scholar 

  8. Chan, C.Y., Jagadish, H.V., Tan, K.L., Tung, A.K.H., Zhang, Z.: On high dimensional skylines. In: Ioannidis, Y., Scholl, M.H., Schmidt, J.W., Matthes, F., Hatzopoulos, M., Böhm, K., Kemper, A., Grust, T., Böhm, C. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 478–495. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  9. Ramsak, F., Kossmann, D., Rost, S.: Shooting stars in the sky: An online algorithm for skyline queries. In: VLDB (2002)

    Google Scholar 

  10. Fu, G., Papadias, D., Tao, Y., Seeger, B.: An optimal and progressive algorithm for skyline queries. In: SIGMOD (2003)

    Google Scholar 

  11. Tan, K., et al.: Efficient progressive skyline computation. In: VLDB (2001)

    Google Scholar 

  12. Lin, X., Yuan, Y., Zhang, Q., Zhang, Y.: Selecting stars: The k most representative skyline operator. In: ICDE (2007)

    Google Scholar 

  13. Morse, M., Patel, J., Jagadish, H.V.: Efficient skyline computation over low-cardinality domains. In: VLDB (2007)

    Google Scholar 

  14. Pei, J., Jin, W., Ester, M., Tao, Y.: Catching the best views of skyline: a semantic approach based on decisive subspaces. In: VLDB (2005)

    Google Scholar 

  15. Kossmann, D., Borzsonyi, S., Stocker, K.: The skyline operator. In: ICDE (2001)

    Google Scholar 

  16. Sharifzadeh, M., Shahabi, C.: The spatial skyline queries. In: VLDB Conference (2006)

    Google Scholar 

  17. Anthony, K.H., Tung, L., Wang, X.S., Ooi, B.C.: Efficient skyline query processing on peer-to-peer networks. In: ICDE (2007)

    Google Scholar 

  18. Zheng, J.X., Balke, W.-T., Guntzer, U.: Efficient distributed skylining for web information systems. In: EBDT (2004)

    Google Scholar 

  19. Wang, W., Lin, X., Yuan, Y., Lu, H.: Stabbing the sky:efficient skyline computation over sliding windows. In: ICDE (2005)

    Google Scholar 

  20. Yuan, Y., Lin, X., Liu, Q., Wang, W., Yu, J.X., Zhang, Q.: Efficient computation of the skyline cube. In: VLDB (2005)

    Google Scholar 

  21. Kossmann, D., Ramsak, F., Rost, S.: Shooting stars in the sky: An online algorithm for skyline queries. In: Proc. of the Int’l Conf. in VLDB (2002)

    Google Scholar 

  22. Li, H.J., Tan, Q.Z., Lee, W.C.: Efficient progressive processing of skyline queries in peer-to-peer systems. In: Proc. of the 1st Int’l Conf., In INFOSCALE (2006)

    Google Scholar 

  23. Papadias, D., Tao, Y., Fu, G., Seeger, B.: Progressive skyline computation in database systems. ACM Trans. Database Syst. 30(1), 41–82 (2005)

    Article  Google Scholar 

  24. Kung, H.T., Luccio, F., Preparata, F.P.: On finding the maxima of a set of vectors. JACM 22(4) (1975)

    Google Scholar 

  25. Stojmenovic, I., Miyakawa, M.: An optimal paralle lalgorithm for solving the maximal elements problem in the plane. In: Parallel Computing (1988)

    Google Scholar 

  26. Matousek, J.: Computing dominances in e n. Inf. Process. Lett. (1991)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

He, W., Li, C., Chen, H. (2009). Maintaining the Dominant Representatives on Data Streams. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2009. Lecture Notes in Computer Science, vol 5690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03573-9_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03573-9_58

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03572-2

  • Online ISBN: 978-3-642-03573-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics