Analysis of Expert Manual Annotation of the Russian Spontaneous Monologue: Evidence from Sentence Boundary Detection

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8113)

Abstract

This paper describes a corpus of Russian spontaneous monologues and the results of its expert manual annotation. The corpus is balanced with respect to speakers’ social characteristics and text genre. Analysis of the manually labelled transcriptions reveals disagreement among experts in sentence boundary detection. The paper demonstrates that labelled boundaries may have different status. We also show that speakers’ social characteristics (gender and speech usage) and text genre influence inter-labeller agreement.

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Stepikhov, A. (2013). Analysis of Expert Manual Annotation of the Russian Spontaneous Monologue: Evidence from Sentence Boundary Detection. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science, vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_5

  • DOI: https://doi.org/10.1007/978-3-319-01931-4_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01930-7

  • Online ISBN: 978-3-319-01931-4

  • eBook Packages: Computer Science, Computer Science (R0)
