skip to main content
10.1145/2872427.2883075acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Just in Time: Controlling Temporal Performance in Crowdsourcing Competitions

Published:11 April 2016Publication History

ABSTRACT

Many modern data analytics applications in areas such as crisis management, stock trading, and healthcare, rely on components capable of nearly real-time processing of streaming data produced at varying rates. In addition to automatic processing methods, many tasks involved in those applications require further human assessment and analysis. However, current crowdsourcing platforms and systems do not support stream processing with variable loads. In this paper, we investigate how incentive mechanisms in competition based crowdsourcing can be employed in such scenarios. More specifically, we explore techniques for stimulating workers to dynamically adapt to both anticipated and sudden changes in data volume and processing demand, and we analyze effects such as data processing throughput, peak-to-average ratios, and saturation effects. To this end, we study a wide range of incentive schemes and utility functions inspired by real world applications. Our large-scale experimental evaluation with more than 900 participants and more than 6200 hours of work spent by crowd workers demonstrates that our competition based mechanisms are capable of adjusting the throughput of online workers and lead to substantial on-demand performance boosts.

References

  1. GamifIR '14: Proceedings of the First International Workshop on Gamification for Information Retrieval, 2014. ACM.Google ScholarGoogle Scholar
  2. M. S. Bernstein, J. Brandt, R. C. Miller, and D. R. Karger. Crowds in two seconds: Enabling realtime crowd-powered interfaces. In Proceedings of the 24th annual ACM symposium on User interface software and technology, pages 33--42. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A. Biem, H. Feng, A. V. Riabov, and D. S. Turaga. Real-time analysis and management of big time-series data. IBM J. Res. Dev., 57(3--4):1:8--1:8, May 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. P. Bigham, C. Jayant, H. Ji, G. Little, A. Miller, R. C. Miller, R. Miller, A. Tatarowicz, B. White, S. White, et al. Vizwiz: nearly real-time answers to visual questions. In Proceedings of the 23nd annual ACM symposium on User interface software and technology, pages 333--342, 2010. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Dahi and S. Tabbane. Sigmoid utility function formulation for handoff reducing access model in cognitive radio. In Communications and Information Technologies (ISCIT), 2013 13th International Symposium on, pages 166--170, 2013. Google ScholarGoogle ScholarCross RefCross Ref
  6. A. Damasceno, R. A. Mini, J. M. Almeida, and H. Marques-Neto. A base station workload-aware dynamic pricing scheme for mobile internet access. In Proceedings of the 10th International Workshop on Mobility in the Evolving Internet Architecture, MobiArch '15, pages 45--50, 2015. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Das Sarma, A. Parameswaran, H. Garcia-Molina, and A. Halevy. Crowd-powered find algorithms. In Data Engineering (ICDE), 2014 IEEE 30th International Conference on, pages 964--975. 2014, IEEE. Google ScholarGoogle ScholarCross RefCross Ref
  8. D. E. Difallah, M. Catasta, G. Demartini, and P. Cudré-Mauroux. Scaling-up the crowd: Micro-task pricing schemes for worker retention and latency improvement. In Second AAAI Conference on Human Computation and Crowdsourcing, 2014.Google ScholarGoogle Scholar
  9. J. Fan, M. Zhang, S. Kok, M. Lu, and B. C. Ooi. Crowdop: Query optimization for declarative crowdsourcing systems. Knowledge and Data Engineering, IEEE Transactions on, 27(8):2078--2092, Aug 2015.Google ScholarGoogle Scholar
  10. S. Faradani, B. Hartmann, and P. G. Ipeirotis. What's the right price? pricing tasks for finishing on time. Human computation, 11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. O. Feyisetan, E. Simperl, M. V. Kleek, and N. Shadbolt. Improving paid microtasks through gamification and adaptive furtherance incentives. In Proceedings of the 24th International Conference on World Wide Web, WWW '15, pages 333--343, 2015. International World Wide Web Conferences Steering Committee. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Gao and A. Parameswaran. Finish them!: Pricing algorithms for human computation. Proceedings of the VLDB Endowment, 7(14):1965--1976, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. He, M. Bron, L. Azzopardi, and A. de Vries. Studying user browsing behavior through gamified search tasks. In Proceedings of the First International Workshop on Gamification for Information Retrieval, GamifIR '14, pages 49--52, 2014. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Imran, C. Castillo, J. Lucas, P. Meier, and S. Vieweg. Aidr: Artificial intelligence for disaster response. In Proceedings of the companion publication of the 23rd international conference on World wide web companion, pages 159--162, 2014. International World Wide Web Conferences Steering Committee. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Imran, C. Castillo, J. Lucas, M. Patrick, and J. Rogstadius. Coordinating human and machine intelligence to classify microblog communications in crises. Proc. of ISCRAM, 2014.Google ScholarGoogle Scholar
  16. M. Imran, I. Lykourentzou, Y. Naudet, and C. Castillo. Engineering crowdsourced stream processing systems. arXiv preprint arXiv:1310.5463, 2013.Google ScholarGoogle Scholar
  17. P. G. Ipeirotis and P. K. Paritosh. Managing crowdsourced human computation: a tutorial. In Proceedings of the 20th international conference companion on World wide web, pages 287--288. ACM, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. E. D. Jensen. Asynchronous decentralized realtime computer systems. In Real Time Computing, pages 347--371. Springer, 1994. Google ScholarGoogle ScholarCross RefCross Ref
  19. E. D. Jensen, C. D. Locke, and H. Tokuda. A time-driven scheduling model for real-time operating systems. In RTSS, volume 85, pages 112--122, 1985.Google ScholarGoogle Scholar
  20. T. Johnson, S. Muthukrishnan, and I. Rozenbaum. Sampling algorithms in a stream operator. In Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, SIGMOD '05, pages 1--12, 2005. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. N. Kumar, A. C. Berg, P. N. Belhumeur, and S. K. Nayar. Attribute and Simile Classifiers for Face Verification. In IEEE International Conference on Computer Vision (ICCV), Oct 2009. Google ScholarGoogle ScholarCross RefCross Ref
  22. P. Li, B. Ravindran, and E. Jensen. Adaptive time-critical resource management using time/utility functions: past, present, and future. In Computer Software and Applications Conference, 2004. COMPSAC 2004. Proceedings of the 28th Annual International, volume 2, pages 12--13 vol.2, Sept 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. W. Mason and D. J. Watts. Financial incentives and the "performance of crowds". SIGKDD Explor. Newsl., 11(2):100--108, May 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. E. Minack, W. Siberski, and W. Nejdl. Incremental diversification for very large sets: A streaming-based approach. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '11, pages 585--594, 2011. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. A.-H. Mohsenian-Rad and A. Leon-Garcia. Optimal residential load control with price prediction in real-time electricity pricing environments. Smart Grid, IEEE Transactions on, 1(2):120--133, Sept 2010.Google ScholarGoogle Scholar
  26. J. Noronha, E. Hysen, H. Zhang, and K. Z. Gajos. Platemate: crowdsourcing nutritional analysis from food photographs. In Proceedings of the 24th annual ACM symposium on User interface software and technology, pages 1--12. 2011, ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. C. Perlich, F. Provost, and J. S. Simonoff. Tree induction vs. logistic regression: A learning-curve analysis. The Journal of Machine Learning Research, 4:211--255, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. D. Pothineni, P. Mishra, A. Rasheed, and D. Sundararajan. Incentive design to mould online behavior: A game mechanics perspective. In Proceedings of the First International Workshop on Gamification for Information Retrieval, GamifIR '14, pages 27--32, 2014. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. S. Roche, E. Propeck-Zimmermann, and B. Mericskay. Geoweb and crisis management: issues and perspectives of volunteered geographic information. GeoJournal, 78(1):21--40, 2013. Google ScholarGoogle ScholarCross RefCross Ref
  30. M. Rokicki, S. Chelaru, S. Zerr, and S. Siersdorfer. Competitive game designs for improving the cost effectiveness of crowdsourcing. In Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, CIKM '14, pages 1469--1478, 2014. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. M. Rokicki, S. Zerr, and S. Siersdorfer. Groupsourcing: Team competition designs for crowdsourcing. In Proceedings of the 24th International Conference on World Wide Web, WWW '15, pages 906--915, 2015.International World Wide Web Conferences Steering Committee. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. P. Samadi, A.-H. Mohsenian-Rad, R. Schober, V. Wong, and J. Jatskevich. Optimal real-time pricing algorithm based on utility maximization for smart grid. In Smart Grid Communications (SmartGridComm), 2010 First IEEE International Conference on, pages 415--420, Oct 2010. Google ScholarGoogle ScholarCross RefCross Ref
  33. D. R. Stockwell and A. T. Peterson. Effects of sample size on accuracy of species distribution models. Ecological modelling, 148(1):1--13, 2002. Google ScholarGoogle ScholarCross RefCross Ref
  34. J. Wang and B. Ravindran. Time-utility function-driven switched ethernet: Packet scheduling algorithm, implementation, and feasibility analysis. Parallel and Distributed Systems, IEEE Transactions on, 15(2):119--133, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Y. Wang, J.-G. Kim, and S.-F. Chang. Content-based utility function prediction for real-time mpeg-4 video transcoding. In Image Processing, 2003. ICIP 2003. Proceedings. 2003 International Conference on, volume 1, pages I--189--92 vol.1, Sept 2003. Google ScholarGoogle ScholarCross RefCross Ref
  36. %B. L. Welch.% The significance of the difference between two means when the% population variances are unequal.% Biometrika, 29(3--4):350--362, 1938.Google ScholarGoogle Scholar
  37. M. Xiao, N. B. Shroff, and E. K. Chong. A utility-baed power-control scheme in wireless cellular systems. Networking, IEEE/ACM Transactions on, 11(2):210--221, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. D. Yang, E. A. Rundensteiner, and M. O. Ward. Summarization and matching of density-based clusters in streaming environments. Proc. VLDB Endow., 5(2):121--132, Oct. 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. S. Yousefi, M. P. Moghaddam, and V. J. Majd. Optimal real time pricing in an agent-based retail market using a comprehensive demand response model. Energy, 36(9):5716--5727, 2011. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Just in Time: Controlling Temporal Performance in Crowdsourcing Competitions

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      WWW '16: Proceedings of the 25th International Conference on World Wide Web
      April 2016
      1482 pages
      ISBN:9781450341431

      Copyright © 2016 Copyright is held by the International World Wide Web Conference Committee (IW3C2)

      Publisher

      International World Wide Web Conferences Steering Committee

      Republic and Canton of Geneva, Switzerland

      Publication History

      • Published: 11 April 2016

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      WWW '16 Paper Acceptance Rate115of727submissions,16%Overall Acceptance Rate1,899of8,196submissions,23%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader