Abstract
Web tables provide us with high-quality sources of structured data. However, we could not use those valuable tables directly owing to various problems such as conflict data and missing headers. We present CrowdIQ, a scalable platform that integrates crowdsourcing technology for improving the quality of web tables. We design CrowdIQL, which is a declarative language aiming at helping requesters operate tables more exactly and flexibly. Crowdsourcing task is also optimized in this platform by providing candidate items and minimizing useless data, which help requesters to get higher quality tables with less cost.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Amazon Mechanical Turk. http://www.mturk.com
Park, H., Widom, J.: Collecting structured data from the crowd. In: SIGMOD (2014)
Liu, H., Wang, N., Ren, X.: CrowdSR: a crowd enabled system for semantic recovering of web tables. In: Dong, X.L., Yu, X., Li, J., Sun, Y. (eds.) WAIM 2015. LNCS, vol. 9098, pp. 581–583. Springer, Cham (2015). doi:10.1007/978-3-319-21042-1_67
Franklin, M.J., Kossmann, D., Kraska, T., et al.: CrowdDB: answering queries with crowdsourcing. In: SIGMOD (2011)
Parameswaran, A., Park, H., et al.: Deco: declarative crowdsouring. In: ACM International Conference on Information and Knowledge Management (2012)
Acknowledgment
This work is supported by National Natural Science Foundation of China (Grant No. 61370060). We would also like to give our thanks for the support from Microsoft Research Asia.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Xi, Y., Wang, N., Wu, X., Bao, Y., Zhou, W. (2017). CrowdIQ: A Declarative Crowdsourcing Platform for Improving the Quality of Web Tables. In: Chen, L., Jensen, C., Shahabi, C., Yang, X., Lian, X. (eds) Web and Big Data. APWeb-WAIM 2017. Lecture Notes in Computer Science(), vol 10367. Springer, Cham. https://doi.org/10.1007/978-3-319-63564-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-63564-4_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63563-7
Online ISBN: 978-3-319-63564-4
eBook Packages: Computer ScienceComputer Science (R0)