Abstract
From a multidimensional dataset, a skyline query extracts the data which satisfy the multiple preferences given by the user. The real challenge in skyline computation is to retrieve such data, in the optimum time. When the datasets are huge, the challenge becomes critical. In this paper, we address exactly this issue focusing on the big data. For this, we aim at utilizing the correlations observed in the user queries. These correlations and the results of historical skyline queries, executed on the same dataset, are very much helpful in optimizing the response time of further skyline computation. For the same purpose, we have earlier proposed a novel structure namely Query Profiler (QP). In this paper, we present a technique namely SkyQP to assert the effectiveness of this concept against the big data. We have also presented the time and space analysis of the proposed technique. The experimental results obtained assert the efficacy of the SkyQP technique.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Kulkarni RD, Momin BF (2016) Skyline computation for frequent queries in update intensive environment. J King Saud Univ Comput Inf Sci 28(4):447–456
Borzsonyi S, Kossmann D, Stocker K (2001) The skyline operator. In: Proceedings IEEE international conference on data engineering, pp 421–430
Chomicki J, Godfrey P, Gryz J, Liang D (2003) Skyline with presorting. In: Proceedings IEEE international conference on data engineering, pp 717–719
Godfrey P, Shipley R, Gryz J (2005) Maximal vector computation in large data sets. In: Proceedings IEEE international conference on very large databases, pp 229–240
Bartolini I, Ciaccia P, Patella M (2006) SaLSa: computing the skyline without scanning the whole sky. In: Proceedings IEEE international conference on information and knowledge management, pp 405–411
Kossmann D, Ramsak F, Rost S (2002) Shooting stars in the sky: an online algorithm for skyline queries. In: Proceedings IEEE international conference on very large databases, pp 275–286
Papadias D, Tao Y, Fu G, Seeger B (2005) Progressive skyline computation in database systems. ACM Trans Database Syst 30(1):41–82
Xia T, Zhang D (2005) Refreshing the sky: the compressed skycube with efficient support for frequent updates. In: Proceedings ACM SIGMOD International Conference on Management of Data, pp 493–501
Yuan Y, Lin X, Liu Q, Wang W, Yu JX, Zhang Q (2005) Efficient computation of the skyline cube. In: Proceedings IEEE international conference on very large databases, pp 241–252
Zhang N, Li C, Hassan N, Rajasekaran S, Das G (2014) On skyline groups. IEEE Trans Knowl Data Eng 26(4):942–956
Zheng W, Zou L, Lian X, Hong L, Zhao D (2014) Efficient subgraph skyline search over large graphs. In: Proceedings ACM international conference on conference on information and knowledge management, pp 1529–1538
Lin J, Wei J (2008) Constrained skyline computing over data streams. In: Proceedings IEEE international conference on e-business, engineering, pp 155–161
Wu P, Zhang C, Feng Y, Zhao B, Agrawal D, Abbadi A (2006) Parallelizing skyline queries for scalable distribution. In: Proceedings IEEE international conference on extending database technology, pp 112–130
Wang S, Ooi B, Tung A, Xu L (2007) Efficient skyline query processing on peer-to-peer net-works. In: Proceedings IEEE international conference on data engineering, pp 1126–1135
Chen L, Cui B, Lu H, Xu L, Xu Q (2008) iSky: efficient and progressive skyline computing in a structured P2P network. In: Proceedings IEEE international conference on distributed computing systems, pp 160–167
Wang S, Vu Q, Ooi B, Tung A, Xu L (2009) Skyframe: a framework for skyline query processing in peer-to-peer systems. VLDB J 18(1):345–362
Jensen HC, Lu H, Ooi HB (2006) Skyline queries against mobile lightweight devices in MANETs. In: Proceedings IEEE international conference on data engineering, pp 66–72
Hose K, Lemke C, Sattler K (2006) Processing relaxed skylines in PDMS using distributed data summaries. In: Proceedings IEEE international conference on information and knowledge management, pp 425–434
Hose K, Lemke C, Sattler K, Zinn D (2007) A relaxed but not necessarily constrained way from the top to the sky. In: Proceedings international conference on cooperative information systems, pp 339–407
Zhang B, Zhou S, Guan J (2011) Adapting skyline computation to the MapReduce framework: algorithms and experiments. In: Proceedings international conference on database systems for advanced applications, pp 403–414
Park Y, Min J-K, Shim K (2013) Parallel computation of skyline and reverse skyline queries using MapReduce. J VLDB Endow 6(14):2002–2013
Mullesgaard K, Pederseny JL, Lu H, Zhou Y (2014) Efficient skyline computation in MapReduce. In: Proceedings international conference on extending database technology, pp 37–48
Chen L, Hwang K, Wu J (2012) MapReduce skyline query processing with a new angular partitioning approach. In: Proceedings international conference on parallel and distributed processing symposium, pp 403–414
Bgh K, Aasent I, Maghni M (2013) Efficient GPU-based skyline computation. In: Proceedings international workshop on data management on new hardware, Article no. 5
Bgh K, Chester S, Assent I (2015) Work-efficient parallel skyline computation for the GPU. J Very Large Data Bases Endow 962–973
Choi W, Liu L, Yu B (2012) Multi-criteria decision making with skyline computation. In: Proceedings IEEE international conference on information reuse and integration, pp 316–323
Woods L, Alonso G, Teubner J (2013) Parallel computation of skyline queries. In: Proceedings IEEE international conference on field-programmable custom computing machines, pp 1–8
Woods L, Alonso G, Teubner J (2015) Parallelizing data processing on FPGAs with shifter lists. J ACM Trans Reconfig Technol Syst 8(2)
Bhattacharya A, Teja P, Dutta S (2011) Caching stars in the sky: a semantic caching approach to accelerate skyline queries. In: Proceedings international conference on database and expert systems applications, pp 493–501
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kulkarni, R.D., Momin, B.F. (2019). Skyline Computation for Big Data. In: Mishra, D., Yang, XS., Unal, A. (eds) Data Science and Big Data Analytics. Lecture Notes on Data Engineering and Communications Technologies, vol 16. Springer, Singapore. https://doi.org/10.1007/978-981-10-7641-1_23
Download citation
DOI: https://doi.org/10.1007/978-981-10-7641-1_23
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7640-4
Online ISBN: 978-981-10-7641-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)