Skip to main content

Efficient Fuzzy Top-k Query Processing over Uncertain Objects

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6261))

Abstract

Recently, many application domains, such as sensor network monitoring and Location-Based Service, raise the issue of uncertain data management. Uncertain objects, a kind of uncertain data, have some uncertain attributes whose values are ranges instead of points. In this paper, we study a new kind of top-k queries, Probabilistic Fuzzy Top-k queries (PF-Topk queries) which can return k results from uncertain objects for fuzzy query conditions. We formally define the problem of PF-Topk query and present a framework for answering this kind of queries. We propose an exact algorithm, Envelope Planes of Membership Function (EPMF) algorithm based on the upper and lower bounding functions, which answers fuzzy top-k queries over uncertain objects in high-dimensional query space efficiently. We also propose an approximate algorithm which improves efficiency while ensuring high precision by setting a proper value of parameter. To reduce the search space, a pruning method is proposed to safely prune some objects before querying. The effectiveness and efficiency of our algorithms is demonstrated by the theoretical analysis and experiments with synthetic and real datasets.

The research was partially supported by NSF of China under Grants Nos. 60873009, 60933001 and 60773220.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Deshpande, A., Guestrin, C., Madden, S.R., Hellerstein, J.M., Hong, W.: Model-driven data acquisition in sensor networks. In: VLDB ’04, pp. 588–599 (2004)

    Google Scholar 

  2. Chen, L., Özsu, M., Oria, V.: Robust and fast similarity search for moving object trajectories. In: SIGMOD ’05, pp. 491–502 (2005)

    Google Scholar 

  3. Christian, B., Alexey, P., Matthias, S.: The gauss-tree: Efficient object identification in databases of probabilistic feature vectors. In: ICDE ’06 (2006)

    Google Scholar 

  4. Cheng, R., Prabhakar, S., Kalashnikov, D.V.: Querying imprecise data in moving object environments. IEEE TKDE 16 (2002)

    Google Scholar 

  5. Lian, X., Chen, L.: Probabilistic ranked queries in uncertain databases. In: EDBT ’08, pp. 511–522 (2008)

    Google Scholar 

  6. Beskales, G., Soliman, M.A., IIyas, I.F.: Efficient search for the top-k probable nearest neighbors in uncertain databases. VLDB Endow 1(1), 326–339 (2008)

    Google Scholar 

  7. Bodenhofer, U., Küng, J., Saminger, S.: Flexible query answering using distance-based fuzzy relations. In: de Swart, H., Orłowska, E., Schmidt, G., Roubens, M. (eds.) TARSKI 2006. LNCS (LNAI), vol. 4342, pp. 207–228. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  8. Chen, S.M., Jong, W.T.: Fuzzy query translation for relational database systems. IEEE Transactions on Systems, Man, and Cybernetics, Part B 27(4), 714–721 (1997)

    Article  Google Scholar 

  9. Tao, Y., Xiao, X., Cheng, R.: Range search on multidimensional uncertain data. ACM Trans. Database Syst. 32(3), 15 (2007)

    Article  Google Scholar 

  10. Ishikawa, Y., Iijima, Y., Yu, J.X.: Spatial range querying for gaussian-based imprecise query objects. In: ICDE ’09, pp. 676–687 (2009)

    Google Scholar 

  11. Soliman, M.A., Ilyas, I.F., Chang, K.C.: Top-k query processing in uncertain databases. In: ICDE’ 07, pp. 896–905 (2007)

    Google Scholar 

  12. Yi, K., Li, F., Kollios, G., Srivastava, D.: Efficient processing of top-k queries in uncertain databases with x-relations. IEEE Trans. on Knowl. and Data Eng. 20(12), 1669–1682 (2008)

    Article  Google Scholar 

  13. Hua, M., Pei, J., Zhang, W., Lin, X.: Efficiently answering probabilistic threshold top-k queries on uncertain data. In: ICDE’08, pp. 1403–1405 (2008)

    Google Scholar 

  14. Topologically integrated geographic encoding and referencing (tiger) system, http://www.census.gov/geo/www/tiger/

  15. RANDLIB, http://biostatistics.mdanderson.org/softwaredownload/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, C., Wang, Y., Lin, S., Gu, Y., Qiao, J. (2010). Efficient Fuzzy Top-k Query Processing over Uncertain Objects. In: Bringas, P.G., Hameurlain, A., Quirchmayr, G. (eds) Database and Expert Systems Applications. DEXA 2010. Lecture Notes in Computer Science, vol 6261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15364-8_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15364-8_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15363-1

  • Online ISBN: 978-3-642-15364-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics