Aggregating Crowd Wisdom with Instance Grouping Methods

Yin, Li’ang; Li, Zhengbo; Han, Jianhua; Yu, Yong

doi:10.1007/978-3-319-45814-4_38

Aggregating Crowd Wisdom with Instance Grouping Methods

Li’ang Yin¹⁷,
Zhengbo Li¹⁷,
Jianhua Han¹⁷ &
…
Yong Yu¹⁷

Conference paper
First Online: 17 September 2016

2224 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9931))

Abstract

With the blooming of crowdsourcing platforms, utilizing crowd wisdom becomes popular. Label aggregation is one of the key topics in crowdsourcing research. The goal is to infer true labels from multiple labels provided by different users. Most researchers make their efforts in modeling user ability and instance difficulty. However, these methods may suffer from sparsity of labels in practice. In this paper, we consider label aggregation from the view of grouping instances. We assume instances are sampled from latent groups and instances in the same group share the same true label. A probabilistic graphical model named InGroup (Instance Grouping model) is constructed to infer latent group assignments as well as true labels. Further, we combine user ability and group difficulty into InGroup to achieve a better model called InGroup+ (InGroup Plus). The experiments conducted on a real-world dataset show the advantages of instance grouping methods compared with other methods.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bachrach, Y., Graepel, T., Minka, T., Guiver, J.: How to grade a test without knowing the answers–a bayesian graphical model for adaptive crowdsourcing and aptitude testing. In: Proceedings of the 29th International Conference on Machine Learning (ICML 2012), pp. 1183–1190 (2012)
Google Scholar
Galland, A., Abiteboul, S., Marian, A., Senellart, P.: Corroborating information from disagreeing views. In: Proceedings of the Third ACM International Conference on Web Search and Data Mining, pp. 131–140. ACM (2010)
Google Scholar
Li, Q., Li, Y., Gao, J., Su, L., Zhao, B., Demirbas, M., Fan, W., Han, J.: A confidence-aware approach for truth discovery on long-tail data. Proc. VLDB Endow. 8(4), 425–436 (2014)
Article Google Scholar
Pasternack, J., Roth, D.: Knowing what to believe (when you already know something). In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 877–885. Association for Computational Linguistics (2010)
Google Scholar
Qi, G.-J., Aggarwal, C.C., Han, J., Huang, T.: Mining collective intelligence in diverse groups. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1041–1052. International World Wide Web Conferences Steering Committee (2013)
Google Scholar
Raykar, V.C., Yu, S., Zhao, L.H., Valadez, G.H., Florin, C., Bogoni, L., Moy, L.: Learning from crowds. J. Mach. Learn. Res. 11, 1297–1322 (2010)
MathSciNet Google Scholar
Venanzi, M., Guiver, J., Kazai, G., Kohli, P., Shokouhi, M.: Community-based bayesian aggregation models for crowdsourcing. In: Proceedings of the 23rd International Conference on World Wide Web, pp. 155–164. International World Wide Web Conferences Steering Committee (2014)
Google Scholar
Welinder, P., Branson, S., Mita, T., Wah, C., Schroff, F., Belongie, S., Perona, P.: Caltech-UCSD birds 200
Google Scholar
Welinder, P., Branson, S., Perona, P., Belongie, S.J.: The multidimensional wisdom of crowds. In: Advances in Neural Information Processing Systems, pp. 2424–2432 (2010)
Google Scholar
Whitehill, J., Wu, T.-F., Bergsma, J., Movellan, J.R., Ruvolo, P.L.: Whose vote should count more: optimal integration of labels from labelers of unknown expertise. In: Advances in Neural Information Processing Systems, pp. 2035–2043 (2009)
Google Scholar
Yin, L., Han, J., Yu, Y.: Label aggregation with instance grouping model. In: Proceedings of the 25th International Conference Companion on World Wide Web, pp. 135–136. International World Wide Web Conferences Steering Committee (2016)
Google Scholar
Yin, X., Han, J., Yu, P.S.: Truth discovery with multiple conflicting information providers on the web. IEEE Trans. Knowl. Data Eng. 20(6), 796–808 (2008)
Article Google Scholar
Zhi, S., Zhao, B., Tong, W., Gao, J., Yu, D., Ji, H., Han, J.: Modeling truth existence in truth discovery. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1543–1552. ACM (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, Shanghai Jiao Tong University, No. 800 Dongchuan Road, Shanghai, China
Li’ang Yin, Zhengbo Li, Jianhua Han & Yong Yu

Authors

Li’ang Yin
View author publications
You can also search for this author in PubMed Google Scholar
Zhengbo Li
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua Han
View author publications
You can also search for this author in PubMed Google Scholar
Yong Yu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Li’ang Yin .

Editor information

Editors and Affiliations

School of Computing, University of Utah, Salt Lake City, Utah, USA
Feifei Li
School of Electrical Engineering, Seoul National University, Seoul, Korea (Republic of)
Kyuseok Shim
Soochow University , Suzhou, China
Kai Zheng
Soochow University , Suzhou, China
Guanfeng Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yin, L., Li, Z., Han, J., Yu, Y. (2016). Aggregating Crowd Wisdom with Instance Grouping Methods. In: Li, F., Shim, K., Zheng, K., Liu, G. (eds) Web Technologies and Applications. APWeb 2016. Lecture Notes in Computer Science(), vol 9931. Springer, Cham. https://doi.org/10.1007/978-3-319-45814-4_38

Download citation

DOI: https://doi.org/10.1007/978-3-319-45814-4_38
Published: 17 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45813-7
Online ISBN: 978-3-319-45814-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics