skip to main content
10.1145/2213836.2213905acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
tutorial

Mob data sourcing

Published:20 May 2012Publication History

ABSTRACT

Crowdsourcing is an emerging paradigm that harnesses a mass of users to perform various types of tasks. We focus in this tutorial on a particular form of crowdsourcing, namely crowd (or mob) datasourcing whose goal is to obtain, aggregate or process data. We overview crowd datasourcing solutions in various contexts, explain the need for a principled solution, describe advances towards achieving such a solution, and highlight remaining gaps.

References

  1. S. Abiteboul, O. Benjelloun, and T. Milo. The active xml project: an overview. VLDB J., 17(5), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Abiteboul, M. Bienvenu, A. Galland, and E. Antoine. A rule-based language for web data management. In PODS, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. L. Antova, T. Jansen, C. Koch, and D. Olteanu. "Fast and Simple Relational Processing of Uncertain Data". In Proc. ICDE, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. O. Benjelloun, A. D. Sarma, C. Hayworth, and J. Widom. An introduction to uldbs and the trio system. IEEE Data Eng. Bull., 29(1):5--16, 2006.Google ScholarGoogle Scholar
  6. D. C. Brabham. Crowdsourcing as a Model for Problem Solving: An Introduction and Cases. Convergence, 14(1):75--90, 2008.Google ScholarGoogle Scholar
  7. P. Buneman, J. Cheney, and S. Vansummeren. On the expressiveness of implicit provenance in query and update languages. ACM Trans. Database Syst., 33(4), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. Buneman, S. Khanna, and W. C. Tan. Why and where: A characterization of data provenance. In Proc. of ICDT, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Cheney, S. Chong, N. Foster, M. I. Seltzer, and S. Vansummeren. Provenance: a future history. In Proc. of OOPSLA, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Deutch, C. Koch, and T. Milo. On probabilistic fixpoint and markov chain query languages. In PODS '10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. J. Franklin, D. Kossmann, T. Kraska, S. Ramesh, and R. Xin. Crowddb: answering queries with crowdsourcing. In SIGMOD Conference, pages 61--72, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating information from disagreeing views. In WSDM '10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. Gravano, P. G. Ipeirotis, N. Koudas, and D. Srivastava. Text joins in an rdbms for web data integration. In Proceedings of the 12th international conference on World Wide Web, WWW '03, pages 90--101, New York, NY, USA, 2003. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. T. J. Green, G. Karvounarakis, Z. G. Ives, and V. Tannen. Update exchange with mappings and provenance. In Proc. of VLDB, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. J. Green, G. Karvounarakis, and V. Tannen. Provenance semirings. In Proc. of PODS, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Imdb. http://www.imdb.com/.Google ScholarGoogle Scholar
  17. R. Jampani, F. Xu, M. Wu, L. L. Perez, C. Jermaine, and P. J. Haas. Mcdb: a monte carlo approach to managing uncertain data. In SIGMOD '08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. H. Ma, R. Chandrasekar, C. Quirk, and A. Gupta. Improving search engines using human computation games. In CIKM '09. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Madhavan, S. R. Jeffery, S. Cohen, X. (luna Dong, D. Ko, C. Yu, A. Halevy, and G. Inc. Web-scale data integration: You can only afford to pay as you go. In CIDR, 2007.Google ScholarGoogle Scholar
  20. A. Marcus, E. Wu, S. Madden, and R. C. Miller. Crowdsourced databases: Query processing with people. In CIDR, pages 211--214, 2011.Google ScholarGoogle Scholar
  21. A. Marian and M. Wu. Corroborating information from web sources. IEEE Data Eng. Bull., 34(3):11--17, 2011.Google ScholarGoogle Scholar
  22. Amazon's mechanical turk. https://www.mturk.com/.Google ScholarGoogle Scholar
  23. A. Parameswaran, A. D. Sarma, H. G.-M. andNeoklis Polyzotis, and J. Widom. Human-assisted graph search: It's okay to ask questions. In VLDB, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. R. Ramakrishnan and J. D. Ullman. A survey of research on deductive database systems. Journal of Logic Programming, 1993.Google ScholarGoogle Scholar
  25. J. Stoyanovich, S. Davidson, T. Milo, and V. Tannen. Deriving probabilistic databases with inference ensembles. In To appear in Proc. of ICDE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Top coder. http://www.topcoder.com/.Google ScholarGoogle Scholar
  27. Tripadvisor. http://www.tripadvisor.com/.Google ScholarGoogle Scholar
  28. L. von Ahn and L. Dabbish. Designing games with a purpose. Commun. ACM, 51(8):58--67, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Waze. http://www.waze.com/.Google ScholarGoogle Scholar
  30. Wikiepdia. http://www.wikipedia.org/.Google ScholarGoogle Scholar

Index Terms

  1. Mob data sourcing

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '12: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
      May 2012
      886 pages
      ISBN:9781450312479
      DOI:10.1145/2213836

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 20 May 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • tutorial

      Acceptance Rates

      SIGMOD '12 Paper Acceptance Rate48of289submissions,17%Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader