Abstract
In this paper, we present OTTO, a web-based tool designed for the diplomatic transcription of historical language data. The tool supports fast and accurate typing through user-defined special characters while simultaneously providing a view of the manuscript that is as close to the original as possible. It also supports annotation with rich, user-defined header information. Users can log in and operate OTTO from anywhere through a standard web browser.
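To illustrate how user-defined special characters might combine fast typing with a near-original view, the following is a minimal sketch and not OTTO's actual implementation. It assumes a hypothetical rule table (`RULES`) that maps easy-to-type ASCII sequences to the Unicode characters shown in the diplomatic view; the rule syntax and the function name `to_diplomatic` are illustrative assumptions.

```python
import re

# Hypothetical user-defined rules (an assumption, not OTTO's rule syntax):
# typed ASCII sequence -> Unicode character(s) shown in the diplomatic view.
RULES = {
    "$": "\u017f",       # long s
    "u\\o": "u\u0366",   # u with combining small letter o above
    "a\\e": "a\u0364",   # a with combining small letter e above
}


def to_diplomatic(typed: str, rules: dict[str, str]) -> str:
    """Replace every typed sequence with its user-defined Unicode form."""
    if not rules:
        return typed
    # Try longer sequences first so they win over shorter overlapping ones.
    keys = sorted(rules, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in keys))
    return pattern.sub(lambda m: rules[m.group(0)], typed)


if __name__ == "__main__":
    # The transcriber types plain ASCII; the view shows the special characters.
    print(to_diplomatic("da$ bu\\och", RULES))
```

Keeping the rules in a plain table reflects the idea that each project can define its own character inventory; the typed ASCII form stays available for storage or search while the converted form is used for display.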
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dipper, S., Schnurrenberger, M. (2011). OTTO: A Tool for Diplomatic Transcription of Historical Texts. In: Vetulani, Z. (ed.) Human Language Technology. Challenges for Computer Science and Linguistics. LTC 2009. Lecture Notes in Computer Science, vol 6562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20095-3_42
DOI: https://doi.org/10.1007/978-3-642-20095-3_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20094-6
Online ISBN: 978-3-642-20095-3
eBook Packages: Computer Science (R0)