Skip to main content

Semantic Annotation in Collaborative Document Transcription: A Gamified Citizen Science Approach

  • Conference paper
  • First Online:
Culture and Computing (HCII 2023)

Abstract

Human or manual transcription is the task where a person reads a handwritten document and types in a digital environment the text she is reading. The manual transcription task is a long and time-consuming process for one person. However, the intelligence of the transcriber is provided with better results than automatic alternatives. This article introduces a collaborative transcription platform called Transcriptor, in which the community members can upload digitized manuscripts and collaborate in transcribing them, defining different transcription layers which will be represented using semantic web technologies and gamification techniques. The article provides two experimentation that shows that Transcriptor had a good acceptance and novelty.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://fromthepage.com accessed on 8th february 2023.

  2. 2.

    http://scribeproject.github.io, accessed on 8th february 2023.

  3. 3.

    http://zooniverse.org/.

  4. 4.

    https://www.nypl.org/.

  5. 5.

    https://github.com/cientopolis/transcriptor-web accessed on 7th february 2023.

  6. 6.

    https://github.com/cientopolis/transcriptor-backend accessed on 7th february 2023.

References

  1. Bangor, A., Kortum, P., Miller, J.: Determining what individual SUS scores mean: adding an adjective rating scale. J. Usability Stud. 4(3), 114–123 (2009)

    Google Scholar 

  2. Bazillon, T., Estève, Y., Luzzati, D.: Manual vs assisted transcription of prepared and spontaneous speech. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08). European Language Resources Association (ELRA), Marrakech, Morocco (May 2008). http://www.lrec-conf.org/proceedings/lrec2008/pdf/277_paper.pdf

  3. Berners-Lee, T., Hendler, J., Lassila, O.: The semantic web. Sci. Am. 284(5), 34–43 (2001). JSTOR (2001)

    Google Scholar 

  4. Granollers, T., Lorés, J.: Incorporation of users in the evaluation of usability by cognitive walkthrough. In: Navarro-Prieto, R., Vidal, J. (eds) HCI related papers of Interacción 2004. Springer, Dordrecht (2006). https://doi.org/10.1007/1-4020-4205-1_20

  5. Guha, R.V., Brickley, D., Macbeth, S.: Schema.org: evolution of structured data on the web. Commun. ACM 59(2), 44–51 (2016)

    Google Scholar 

  6. Huitfeldt, C., Sperberg-McQueen, C.: What is transcription? Literary Linguistic Comput. 23, 295–310 (09 2008). https://doi.org/10.1093/llc/fqn013

  7. Kortum, P., Acemyan, C.Z., Oswald, F.L.: Is it time to go positive? assessing the positively worded system usability scale (sus). Hum. Factors 63(6), 987–998 (2021)

    Article  Google Scholar 

  8. Laugwitz, B., Held, T., Schrepp, M.: Construction and evaluation of a user experience questionnaire. In: Holzinger, A. (ed.) USAB 2008. LNCS, vol. 5298, pp. 63–76. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-89350-9_6

    Chapter  Google Scholar 

  9. Laugwitz, B., Schrepp, M., Held, T.: Konstruktion eines fragebogens zur messung der user experience von softwareprodukten. In: Mensch & Computer, pp. 125–134 (2006)

    Google Scholar 

  10. Memon, J., Sami, M., Khan, R.A., Uddin, M.: Handwritten optical character recognition (ocr): a comprehensive systematic literature review (slr). IEEE Access 8, 142642–142668 (2020). https://doi.org/10.1109/ACCESS.2020.3012542

    Article  Google Scholar 

  11. Mori, S., Nishida, H., Yamada, H.: Optical character recognition. John Wiley & Sons, Inc. (1999)

    Google Scholar 

  12. Moyle, M., Tonra, J., Wallace, V.: Manuscript transcription by crowdsourcing: transcribe bentham. Liber Quart. 20, 3 (2010). https://doi.org/10.18352/lq.7999

  13. Ortiz Baco, J., Guzman, A., Palacios, A.A.: Fromthepage collection owner user study report. Enabling and Reusing Multilingual Citizen Contributions in the Archival Record-NEH Grant Documentation (2020)

    Google Scholar 

  14. del Rio, M.G., Hernández, N., De León, R.: Manos en la masa (textual): introducción al editatón colaborativo con fromthepage. In: III Congreso Internacional de la Asociación de Humanidades Digitales (AAHD). La cultura de los datos (2018)

    Google Scholar 

  15. Shadbolt, N., Berners-Lee, T., Hall, W.: The semantic web revisited. IEEE Intell. Syst. 21(3), 96–101 (2006). IEEE (2006)

    Google Scholar 

  16. Simpson, R., Page, K.R., De Roure, D.: Zooniverse: observing the world’s largest citizen science platform. In: Proceedings of the 23rd International Conference on World Wide Web, pp. 1049–1054 (2014)

    Google Scholar 

  17. Vershbow, B.: Scribe: turning text into structured information through the power of the crowd (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Diego Torres .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bracco, C.E., Cuevas, L.E., Torres, D. (2023). Semantic Annotation in Collaborative Document Transcription: A Gamified Citizen Science Approach. In: Rauterberg, M. (eds) Culture and Computing. HCII 2023. Lecture Notes in Computer Science, vol 14035. Springer, Cham. https://doi.org/10.1007/978-3-031-34732-0_38

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-34732-0_38

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-34731-3

  • Online ISBN: 978-3-031-34732-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics