Skip to main content

Aggregating Crowd Wisdom with Instance Grouping Methods

  • Conference paper
  • First Online:
  • 2224 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9931))

Abstract

With the blooming of crowdsourcing platforms, utilizing crowd wisdom becomes popular. Label aggregation is one of the key topics in crowdsourcing research. The goal is to infer true labels from multiple labels provided by different users. Most researchers make their efforts in modeling user ability and instance difficulty. However, these methods may suffer from sparsity of labels in practice. In this paper, we consider label aggregation from the view of grouping instances. We assume instances are sampled from latent groups and instances in the same group share the same true label. A probabilistic graphical model named InGroup (Instance Grouping model) is constructed to infer latent group assignments as well as true labels. Further, we combine user ability and group difficulty into InGroup to achieve a better model called InGroup+ (InGroup Plus). The experiments conducted on a real-world dataset show the advantages of instance grouping methods compared with other methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Bachrach, Y., Graepel, T., Minka, T., Guiver, J.: How to grade a test without knowing the answers–a bayesian graphical model for adaptive crowdsourcing and aptitude testing. In: Proceedings of the 29th International Conference on Machine Learning (ICML 2012), pp. 1183–1190 (2012)

    Google Scholar 

  2. Galland, A., Abiteboul, S., Marian, A., Senellart, P.: Corroborating information from disagreeing views. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, pp. 131–140. ACM (2010)

    Google Scholar 

  3. Li, Q., Li, Y., Gao, J., Su, L., Zhao, B., Demirbas, M., Fan, W., Han, J.: A confidence-aware approach for truth discovery on long-tail data. Proc. VLDB Endow. 8(4), 425–436 (2014)

    Article  Google Scholar 

  4. Pasternack, J., Roth, D.: Knowing what to believe (when you already know something). In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 877–885. Association for Computational Linguistics (2010)

    Google Scholar 

  5. Qi, G.-J., Aggarwal, C.C., Han, J., Huang, T.: Mining collective intelligence in diverse groups. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1041–1052. International World Wide Web Conferences Steering Committee (2013)

    Google Scholar 

  6. Raykar, V.C., Yu, S., Zhao, L.H., Valadez, G.H., Florin, C., Bogoni, L., Moy, L.: Learning from crowds. J. Mach. Learn. Res. 11, 1297–1322 (2010)

    MathSciNet  Google Scholar 

  7. Venanzi, M., Guiver, J., Kazai, G., Kohli, P., Shokouhi, M.: Community-based bayesian aggregation models for crowdsourcing. In: Proceedings of the 23rd International Conference on World Wide Web, pp. 155–164. International World Wide Web Conferences Steering Committee (2014)

    Google Scholar 

  8. Welinder, P., Branson, S., Mita, T., Wah, C., Schroff, F., Belongie, S., Perona, P.: Caltech-UCSD birds 200

    Google Scholar 

  9. Welinder, P., Branson, S., Perona, P., Belongie, S.J.: The multidimensional wisdom of crowds. In: Advances in Neural Information Processing Systems, pp. 2424–2432 (2010)

    Google Scholar 

  10. Whitehill, J., Wu, T.-F., Bergsma, J., Movellan, J.R., Ruvolo, P.L.: Whose vote should count more: optimal integration of labels from labelers of unknown expertise. In: Advances in Neural Information Processing Systems, pp. 2035–2043 (2009)

    Google Scholar 

  11. Yin, L., Han, J., Yu, Y.: Label aggregation with instance grouping model. In: Proceedings of the 25th International Conference Companion on World Wide Web, pp. 135–136. International World Wide Web Conferences Steering Committee (2016)

    Google Scholar 

  12. Yin, X., Han, J., Yu, P.S.: Truth discovery with multiple conflicting information providers on the web. IEEE Trans. Knowl. Data Eng. 20(6), 796–808 (2008)

    Article  Google Scholar 

  13. Zhi, S., Zhao, B., Tong, W., Gao, J., Yu, D., Ji, H., Han, J.: Modeling truth existence in truth discovery. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1543–1552. ACM (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li’ang Yin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Yin, L., Li, Z., Han, J., Yu, Y. (2016). Aggregating Crowd Wisdom with Instance Grouping Methods. In: Li, F., Shim, K., Zheng, K., Liu, G. (eds) Web Technologies and Applications. APWeb 2016. Lecture Notes in Computer Science(), vol 9931. Springer, Cham. https://doi.org/10.1007/978-3-319-45814-4_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-45814-4_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-45813-7

  • Online ISBN: 978-3-319-45814-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics