Skip to main content

Non-index Based Skyline Analysis on High Dimensional Data with Uncertain Dimensions

  • Conference paper
  • First Online:
Databases and Information Systems (DB&IS 2018)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 838))

Included in the following conference series:

Abstract

The notion of skyline query is to find a set of objects that is not dominated by any other objects. Regrettably, existing works lack on how to conduct skyline queries on high dimensional uncertain data with objects represented as continuous ranges and exact values, which in this paper is referred to as uncertain dimensions. Hence, in this paper we define skyline queries over data with uncertain dimensions and propose an algorithm, SkyQUD, to efficiently answer skyline queries. The SkyQUD algorithm determines skyline objects through three methods that guaranteed the probability of each object being in the final skyline results: exact domination, range domination, and uncertain domination. The algorithm has been validated through extensive experiments employing real and synthetic datasets. Results exhibit our proposed algorithm is efficient and scalable in answering skyline query on high dimensional and large datasets with uncertain dimensions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Atallah, M., Qi, Y.: Computing all skyline probabilities for uncertain data. In: Proceedings of the ACM SIGMOD-SIGACT-SIGART Symposium of the Principles of Database Systems (PODS), pp. 279–287 (2009)

    Google Scholar 

  2. Berchtold, S., Keim, D.A., Kriegel, H.P.: The X-tree: an index structure for high-dimensional data. In: Proceedings of the 22nd International Conference on Very Large Data Bases (VLDB), pp. 28–39 (1996)

    Google Scholar 

  3. Böhm, C., Fiedler, F., Oswald, A., Plant, C., Wackersreuther, B.: Probabilistic skyline queries. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 651–660 (2009)

    Google Scholar 

  4. Börzsönyi, S., Kossmann, D., Stocker, K.: The skyline operator. In: Proceedings of the 17th International Conference on Data Engineering (ICDE), pp. 421–430 (2001)

    Google Scholar 

  5. Chan, C.-Y., Jagadish, H.V., Tan, K.-L., Tung, A.K.H., Zhang, Z.: On high dimensional skylines. In: Ioannidis, Y., et al. (eds.) EDBT 2006. LNCS, vol. 3896, pp. 478–495. Springer, Heidelberg (2006). https://doi.org/10.1007/11687238_30

    Chapter  Google Scholar 

  6. Chan, C.-Y., Jagadish, H.V., Tan, K.-L., Tung, A.K.H., Zhang, Z.: Finding k-dominant skylines in high dimensional space. In: Proceedings of International Conference on Management of Data (SIGMOD), pp. 503–514 (2006)

    Google Scholar 

  7. Chomicki, J., Godfrey, P., Gryz, J., Liang, D.: Skyline with presorting. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 717–816 (2003)

    Google Scholar 

  8. Godfrey, P., Shipley, R., Gryz, J.: Maximal vector computation in large data sets. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 229–240 (2005)

    Google Scholar 

  9. Khalefa, M.E., Mokbel, M.F., Levandoski, J.J.: Skyline query processing for uncertain data. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM), pp. 1293–1296 (2010)

    Google Scholar 

  10. Kossmann, D., Ramsak, F., Rost, S.: Shooting stars in the sky: an online algorithm for skyline queries. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 275–286 (2002)

    Google Scholar 

  11. Li, X., Wang, Y., Li, X., Wang, G.: Skyline query processing on interval uncertain data. In: IEEE 15th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing Workshops, pp. 87–92 (2012)

    Google Scholar 

  12. Mokbel, M.F., Levandoski, J.J.: Toward context and preference-aware location-based services. In: Proceedings of the International Workshop on Data Engineering for Wireless and Mobile Access, pp. 25–35 (2009)

    Google Scholar 

  13. Papadias, D., Tao, Y., Fu, G., Seeger, B.: Progressive skyline computation in database systems. ACM Trans. Database Syst. 30(1), 41–82 (2005)

    Article  Google Scholar 

  14. Pei, J., Jiang, B., Lin, X., Yuan, Y.: Probabilistic skylines on uncertain data. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 15–26 (2007)

    Google Scholar 

  15. Ross, S.M.: Introduction to Probability Models, 8th edn. American Press, San Diego (2003)

    MATH  Google Scholar 

  16. Tan, K.L., Eng, P.K., Ooi, B.C.: Efficient progressive skyline computation. In: Proceedings of International Conference on Very Large Data Bases (VLDB), pp. 301–310 (2001)

    Google Scholar 

  17. Yong, H., Kim, J.-H., Hwang. S.-W.: Skyline ranking for uncertain data with maybe confidence. In: Proceedings of the 2008 IEEE 24th International Conference on Data Engineering Workshop (ICDEW), pp. 572–579 (2008)

    Google Scholar 

Download references

Acknowledgements

This research was supported by Ministry of Science, Technology, and Innovation under the Fundamental Research Grant Scheme (Grant no. 08-01-16-1853FR). All opinions, findings, conclusions and recommendations in this paper are those of the authors and do not necessarily reflect the views of the funding agencies. We thank the anonymous reviewers for their comments.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nurul Husna Mohd Saad .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mohd Saad, N.H., Ibrahim, H., Sidi, F., Yaakob, R. (2018). Non-index Based Skyline Analysis on High Dimensional Data with Uncertain Dimensions. In: Lupeikiene, A., Vasilecas, O., Dzemyda, G. (eds) Databases and Information Systems. DB&IS 2018. Communications in Computer and Information Science, vol 838. Springer, Cham. https://doi.org/10.1007/978-3-319-97571-9_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-97571-9_22

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-97570-2

  • Online ISBN: 978-3-319-97571-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics