Skip to main content

CrowdIQ: A Declarative Crowdsourcing Platform for Improving the Quality of Web Tables

  • Conference paper
  • First Online:
Web and Big Data (APWeb-WAIM 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10367))

Abstract

Web tables provide us with high-quality sources of structured data. However, we could not use those valuable tables directly owing to various problems such as conflict data and missing headers. We present CrowdIQ, a scalable platform that integrates crowdsourcing technology for improving the quality of web tables. We design CrowdIQL, which is a declarative language aiming at helping requesters operate tables more exactly and flexibly. Crowdsourcing task is also optimized in this platform by providing candidate items and minimizing useless data, which help requesters to get higher quality tables with less cost.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amazon Mechanical Turk. http://www.mturk.com

  2. Park, H., Widom, J.: Collecting structured data from the crowd. In: SIGMOD (2014)

    Google Scholar 

  3. Liu, H., Wang, N., Ren, X.: CrowdSR: a crowd enabled system for semantic recovering of web tables. In: Dong, X.L., Yu, X., Li, J., Sun, Y. (eds.) WAIM 2015. LNCS, vol. 9098, pp. 581–583. Springer, Cham (2015). doi:10.1007/978-3-319-21042-1_67

    Chapter  Google Scholar 

  4. Franklin, M.J., Kossmann, D., Kraska, T., et al.: CrowdDB: answering queries with crowdsourcing. In: SIGMOD (2011)

    Google Scholar 

  5. Parameswaran, A., Park, H., et al.: Deco: declarative crowdsouring. In: ACM International Conference on Information and Knowledge Management (2012)

    Google Scholar 

Download references

Acknowledgment

This work is supported by National Natural Science Foundation of China (Grant No. 61370060). We would also like to give our thanks for the support from Microsoft Research Asia.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ning Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Xi, Y., Wang, N., Wu, X., Bao, Y., Zhou, W. (2017). CrowdIQ: A Declarative Crowdsourcing Platform for Improving the Quality of Web Tables. In: Chen, L., Jensen, C., Shahabi, C., Yang, X., Lian, X. (eds) Web and Big Data. APWeb-WAIM 2017. Lecture Notes in Computer Science(), vol 10367. Springer, Cham. https://doi.org/10.1007/978-3-319-63564-4_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63564-4_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63563-7

  • Online ISBN: 978-3-319-63564-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics