Abstract
It is well known that traditional skyline query is very likely to return over many but less informative data points in the result, especially when the querying dataset is high-dimensional or anti-correlated. In data stream applications where large amounts of data are continuously generated, this problem becomes much more serious since the full skyline result cannot be obtained efficiently and analyzed easily. To cope with this difficulty, in this paper, we propose a new concept called Combinatorial Dominant relationship to abstract dominant representatives of stream data. Based on this concept, we propose three novel skyline queries, namely basic convex skyline query (BCSQ), dynamic convex skyline query (DCSQ), andreverse convex skyline query (RCSQ), combining the concepts of convex derived from geometry and the traditional skyline for the first time. These queries can adaptively abstract the contour of skyline points without specifying the size of result set in advance and promote information content of the query result. To efficiently process these queries and maintain their results, we design and analyze algorithms by exploiting a memory indexing structure called DCEL which is used to represent and store the arrangement of data in the sliding window. We convert the problems of points in the primal plane into those of lines in dual plane through dual transformation, which helps us avoid expensive full skyline computation and speeds up the candidate set selection. Finally, through extensive experiments with both real and synthetic datasets, we validate the representative capability of CSQs, as well as the performance of our proposed algorithms.
This research is supported by National 863 Hi-Tech R & D Plan of China under Grant No. 2008AA01Z120, NSFC under Grant Nos. 60673138, 60603046, Program for New Century Excellent Talents in University and Union Project with BMEC on industry-study-research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agarwal, P., Erickson, J.: Geometric Range Searching and Its Relatives. In: Advances in Discrete and Computational Geometry, Contemporary Mathematics, vol. 223, pp. 1–56 (1999)
Preparata, F.P., Shamos, M.I.: Computational geometry: An introduction. Springer, Heidelberg (1985)
Agarwal, P.K., Sharir, M.: Arrangements and Their Applications. In: Handbook of Computational Geometry, ch. 2, pp. 49–119. Elsevier, Amsterdam (2000)
De Berg, M., Cheong, O., Van Kreveld, M., Overmars, M.: Computational Geometry: Algorithms and Applications (March 2008)
Dellis, E., Seeger, B.: Efficient Computation of Reverse Skyline Queries. In: VLDB (2007)
Das, G., Gunopulos, D.: Ad hoc Top-k Query Answering for Data Streams. In: VLDB (2007)
Chan, C.Y., Jagadish, H.V., Tan, K.L., Tung, A.K.H., Zhang, Z.: Finding k-dominant skylines in high dimensional space. In: SIGMOD (2006)
Chan, C.Y., Jagadish, H.V., Tan, K.L., Tung, A.K.H., Zhang, Z.: On high dimensional skylines. In: Ioannidis, Y., Scholl, M.H., Schmidt, J.W., Matthes, F., Hatzopoulos, M., Böhm, K., Kemper, A., Grust, T., Böhm, C. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 478–495. Springer, Heidelberg (2006)
Ramsak, F., Kossmann, D., Rost, S.: Shooting stars in the sky: An online algorithm for skyline queries. In: VLDB (2002)
Fu, G., Papadias, D., Tao, Y., Seeger, B.: An optimal and progressive algorithm for skyline queries. In: SIGMOD (2003)
Tan, K., et al.: Efficient progressive skyline computation. In: VLDB (2001)
Lin, X., Yuan, Y., Zhang, Q., Zhang, Y.: Selecting stars: The k most representative skyline operator. In: ICDE (2007)
Morse, M., Patel, J., Jagadish, H.V.: Efficient skyline computation over low-cardinality domains. In: VLDB (2007)
Pei, J., Jin, W., Ester, M., Tao, Y.: Catching the best views of skyline: a semantic approach based on decisive subspaces. In: VLDB (2005)
Kossmann, D., Borzsonyi, S., Stocker, K.: The skyline operator. In: ICDE (2001)
Sharifzadeh, M., Shahabi, C.: The spatial skyline queries. In: VLDB Conference (2006)
Anthony, K.H., Tung, L., Wang, X.S., Ooi, B.C.: Efficient skyline query processing on peer-to-peer networks. In: ICDE (2007)
Zheng, J.X., Balke, W.-T., Guntzer, U.: Efficient distributed skylining for web information systems. In: EBDT (2004)
Wang, W., Lin, X., Yuan, Y., Lu, H.: Stabbing the sky:efficient skyline computation over sliding windows. In: ICDE (2005)
Yuan, Y., Lin, X., Liu, Q., Wang, W., Yu, J.X., Zhang, Q.: Efficient computation of the skyline cube. In: VLDB (2005)
Kossmann, D., Ramsak, F., Rost, S.: Shooting stars in the sky: An online algorithm for skyline queries. In: Proc. of the Int’l Conf. in VLDB (2002)
Li, H.J., Tan, Q.Z., Lee, W.C.: Efficient progressive processing of skyline queries in peer-to-peer systems. In: Proc. of the 1st Int’l Conf., In INFOSCALE (2006)
Papadias, D., Tao, Y., Fu, G., Seeger, B.: Progressive skyline computation in database systems. ACM Trans. Database Syst. 30(1), 41–82 (2005)
Kung, H.T., Luccio, F., Preparata, F.P.: On finding the maxima of a set of vectors. JACMÂ 22(4) (1975)
Stojmenovic, I., Miyakawa, M.: An optimal paralle lalgorithm for solving the maximal elements problem in the plane. In: Parallel Computing (1988)
Matousek, J.: Computing dominances in e n. Inf. Process. Lett. (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
He, W., Li, C., Chen, H. (2009). Maintaining the Dominant Representatives on Data Streams. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2009. Lecture Notes in Computer Science, vol 5690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03573-9_58
Download citation
DOI: https://doi.org/10.1007/978-3-642-03573-9_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03572-2
Online ISBN: 978-3-642-03573-9
eBook Packages: Computer ScienceComputer Science (R0)