Skip to main content

A Divide-and-Conquer Approach for Crowdsourced Data Enumeration

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8238))

Abstract

Crowdsourced data enumeration, in which the Web crowd is requested to enumerate data items within a specified range, is important in many Web applications such as hotel reviews. This paper presents a processing method for crowdsourced data enumeration on microtask-based crowdsourcing platforms. A general approach to achieving a high recall in data enumeration is to apply the divide-and-conquer principle. However, how to apply the principle to data enumeration on microtask-based crowdsourcing platforms is not trivial. The proposed method is unique in that the workers join the process of generating smaller tasks in a divide-and-conquer fashion, and the programmer does not need to provide many microtasks in advance. This paper explains the method, provides theoretical results to show the method works well with microtask-based platforms, and explains our experimental results that suggest the proposed method can achieve higher recalls and produces appropriate tasks for microtask-based crowdsourcing.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. ESP Game, http://www.gwap.com/gwap/gamesPreview/espgame/

  2. Franklin, M.J., Kossmann, D., Kraska, T., Ramesh, S., Xin, R.: CrowdDB: answering queries with crowdsourcing. In: SIGMOD 2011, pp. 61–72 (2011)

    Google Scholar 

  3. Jain, S., Parkes, D.C.: A Game-Theoretic Analysis of Games with a Purpose. In: Papadimitriou, C., Zhang, S. (eds.) WINE 2008. LNCS, vol. 5385, pp. 342–350. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Kulkarni, A.P., Can, M., Hartmann, B.: Collaboratively crowdsourcing workflows with turkomatic. In: CSCW 2012, pp. 1003–1012 (2012)

    Google Scholar 

  5. Morishima, A., Shinagawa, N., Mochizuki, S.: The Power of Integrated Abstraction for Data-Centric Human/Machine Computations. In: VLDS 2011, pp. 7–10 (2011)

    Google Scholar 

  6. Morishima, A., Shinagawa, N., Mitsuishi, T., Aoki, H., Fukusumi, S.: CyLog/Crowd4U: A Declarative Platform for Complex Data-centric Crowdsourcing. PVLDB 5(12), 1918–1921 (2012)

    Google Scholar 

  7. Marcus, A., Wu, E., Karger, D.R., Madden, S., Miller, R.C.: Human-powered Sorts and Joins. PVLDB 5(1), 13–24 (2011)

    Google Scholar 

  8. Marcus, A., Wu, E., Madden, S., Miller, R.C.: Crowdsourced Databases: Query Processing with People. In: CIDR 2011, pp. 211–214 (2011)

    Google Scholar 

  9. Parameswaran, A.G., Polyzotis, N.: Answering Queries using Humans, Algorithms and Databases. In: CIDR 2011, pp. 160–166 (2011)

    Google Scholar 

  10. Tripadvisor, http://www.tripadvisor.com/

  11. Trushkowsky, B., Kraska, T., Franklin, M.J., Sarkar, P.: Crowdsourced enumeration queries. In: ICDE 2013, pp. 673–684 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Aoki, H., Morishima, A. (2013). A Divide-and-Conquer Approach for Crowdsourced Data Enumeration. In: Jatowt, A., et al. Social Informatics. SocInfo 2013. Lecture Notes in Computer Science, vol 8238. Springer, Cham. https://doi.org/10.1007/978-3-319-03260-3_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-03260-3_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-03259-7

  • Online ISBN: 978-3-319-03260-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics