Abstract
Recently, many application domains, such as sensor network monitoring and Location-Based Service, raise the issue of uncertain data management. Uncertain objects, a kind of uncertain data, have some uncertain attributes whose values are ranges instead of points. In this paper, we study a new kind of top-k queries, Probabilistic Fuzzy Top-k queries (PF-Topk queries) which can return k results from uncertain objects for fuzzy query conditions. We formally define the problem of PF-Topk query and present a framework for answering this kind of queries. We propose an exact algorithm, Envelope Planes of Membership Function (EPMF) algorithm based on the upper and lower bounding functions, which answers fuzzy top-k queries over uncertain objects in high-dimensional query space efficiently. We also propose an approximate algorithm which improves efficiency while ensuring high precision by setting a proper value of parameter. To reduce the search space, a pruning method is proposed to safely prune some objects before querying. The effectiveness and efficiency of our algorithms is demonstrated by the theoretical analysis and experiments with synthetic and real datasets.
The research was partially supported by NSF of China under Grants Nos. 60873009, 60933001 and 60773220.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Deshpande, A., Guestrin, C., Madden, S.R., Hellerstein, J.M., Hong, W.: Model-driven data acquisition in sensor networks. In: VLDB ’04, pp. 588–599 (2004)
Chen, L., Özsu, M., Oria, V.: Robust and fast similarity search for moving object trajectories. In: SIGMOD ’05, pp. 491–502 (2005)
Christian, B., Alexey, P., Matthias, S.: The gauss-tree: Efficient object identification in databases of probabilistic feature vectors. In: ICDE ’06 (2006)
Cheng, R., Prabhakar, S., Kalashnikov, D.V.: Querying imprecise data in moving object environments. IEEE TKDE 16 (2002)
Lian, X., Chen, L.: Probabilistic ranked queries in uncertain databases. In: EDBT ’08, pp. 511–522 (2008)
Beskales, G., Soliman, M.A., IIyas, I.F.: Efficient search for the top-k probable nearest neighbors in uncertain databases. VLDB Endow 1(1), 326–339 (2008)
Bodenhofer, U., Küng, J., Saminger, S.: Flexible query answering using distance-based fuzzy relations. In: de Swart, H., Orłowska, E., Schmidt, G., Roubens, M. (eds.) TARSKI 2006. LNCS (LNAI), vol. 4342, pp. 207–228. Springer, Heidelberg (2006)
Chen, S.M., Jong, W.T.: Fuzzy query translation for relational database systems. IEEE Transactions on Systems, Man, and Cybernetics, Part B 27(4), 714–721 (1997)
Tao, Y., Xiao, X., Cheng, R.: Range search on multidimensional uncertain data. ACM Trans. Database Syst. 32(3), 15 (2007)
Ishikawa, Y., Iijima, Y., Yu, J.X.: Spatial range querying for gaussian-based imprecise query objects. In: ICDE ’09, pp. 676–687 (2009)
Soliman, M.A., Ilyas, I.F., Chang, K.C.: Top-k query processing in uncertain databases. In: ICDE’ 07, pp. 896–905 (2007)
Yi, K., Li, F., Kollios, G., Srivastava, D.: Efficient processing of top-k queries in uncertain databases with x-relations. IEEE Trans. on Knowl. and Data Eng. 20(12), 1669–1682 (2008)
Hua, M., Pei, J., Zhang, W., Lin, X.: Efficiently answering probabilistic threshold top-k queries on uncertain data. In: ICDE’08, pp. 1403–1405 (2008)
Topologically integrated geographic encoding and referencing (tiger) system, http://www.census.gov/geo/www/tiger/
RANDLIB, http://biostatistics.mdanderson.org/softwaredownload/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, C., Wang, Y., Lin, S., Gu, Y., Qiao, J. (2010). Efficient Fuzzy Top-k Query Processing over Uncertain Objects. In: Bringas, P.G., Hameurlain, A., Quirchmayr, G. (eds) Database and Expert Systems Applications. DEXA 2010. Lecture Notes in Computer Science, vol 6261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15364-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-15364-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15363-1
Online ISBN: 978-3-642-15364-8
eBook Packages: Computer ScienceComputer Science (R0)