Web Engineering with Human-in-the-Loop

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13362)

Abstract

Modern Web applications employ sophisticated Machine Learning models to rank news, posts, products, and other items that are presented to users or contributed by them. To keep these models useful, one has to constantly train, evaluate, and monitor them on freshly annotated data, which can be obtained via crowdsourcing. In this tutorial we will present a portion of our six-year experience in solving real-world tasks with human-in-the-loop pipelines that combine the efforts of humans and machines. We will introduce data labeling via public crowdsourcing marketplaces and present the critical components of efficient data labeling. Then, we will run a practical session in which participants address a challenging real-world Information Retrieval task for e-Commerce, experiment with settings for the labeling process, and launch their label collection projects on a real crowd within the tutorial session. We will present useful quality control techniques and give attendees an opportunity to discuss their annotation ideas. The methods and techniques described in this tutorial can be applied to any crowdsourced data and are not bound to any specific crowdsourcing platform.
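As an illustration of the kind of quality control the abstract refers to, the sketch below aggregates redundant crowdsourced labels by majority vote, one of the simplest quality control techniques for crowdsourced data. It is a minimal, hypothetical Python example written for this summary: the task identifiers, labels, and the majority_vote helper are illustrative and do not reproduce the authors' actual pipeline or any particular crowdsourcing platform API.

from collections import Counter

def majority_vote(annotations):
    # Aggregate redundant crowd labels per task by majority vote.
    # `annotations` is an iterable of (task_id, worker_id, label) triples,
    # e.g. relevance judgments collected from a crowdsourcing marketplace.
    votes = {}
    for task_id, _worker, label in annotations:
        votes.setdefault(task_id, Counter())[label] += 1
    # Pick the most frequent label for every task.
    return {task: counts.most_common(1)[0][0] for task, counts in votes.items()}

# Hypothetical relevance judgments for an e-Commerce search task:
# three workers judge whether an item matches a query.
annotations = [
    ("query-1/item-7", "worker-a", "relevant"),
    ("query-1/item-7", "worker-b", "relevant"),
    ("query-1/item-7", "worker-c", "irrelevant"),
]
print(majority_vote(annotations))  # {'query-1/item-7': 'relevant'}

In practice, stronger aggregation models such as Dawid-Skene weight each worker by an estimated accuracy instead of counting votes equally, which typically improves label quality when worker reliability varies.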

Author information

Corresponding author

Correspondence to Boris Tseytlin.

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Cite this paper

Ustalov, D., Pavlichenko, N., Tseytlin, B., Baidakova, D., Drutsa, A. (2022). Web Engineering with Human-in-the-Loop. In: Di Noia, T., Ko, IY., Schedl, M., Ardito, C. (eds) Web Engineering. ICWE 2022. Lecture Notes in Computer Science, vol 13362. Springer, Cham. https://doi.org/10.1007/978-3-031-09917-5_45

  • DOI: https://doi.org/10.1007/978-3-031-09917-5_45

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-09916-8

  • Online ISBN: 978-3-031-09917-5

  • eBook Packages: Computer Science, Computer Science (R0)
