Skip to main content

Orthographic Transcription for Spoken Tunisian Arabic

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2013)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7816))

Abstract

Transcribing spoken Arabic dialects is an important task for building speech corpora. Therefore, it is necessary to follow a definite orthography and a definite annotation to transcribe speech data. In this paper, we present OTTA, Orthographic Transcription for Tunisian Arabic. This convention proposes the use of some rules based on the standard Arabic transcription conventions and we define a set of conventions which preserve the particularities of Tunisian dialect.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Al-Saidat, E., Al-Momani, I.: Future Markers in Modern Standard Arabic and Jordanian Arabic: A Contrastive Study. European Journal of Social Sciences 12(3) (2010)

    Google Scholar 

  2. Diab, M., Habash, N.: Arabic Dialect Processing Tutorial. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the ACL, Rochester, pp. 5–6. Association for Computational Linguistics (April 2007)

    Google Scholar 

  3. Alorifi, F.S.: Automatic identification of Arabic dialects using Hidden Markov Models. In mémoire de thèse, Université de Pittsburgh (2008)

    Google Scholar 

  4. Almeman, K., Lee, M.: Towards developing a Multi-dialect Morphological analyzer for Arabic. In: 4th International Conference on Arabic Language Processing, Rabat, Morocco, May 2-3 (2012)

    Google Scholar 

  5. Khalfaoui, A.: A cognitive approach to analyzing demonstratives in Tunisian Arabic. In PhD thesis of university of Minnesota (November 2009)

    Google Scholar 

  6. Graja, M., Jaoua, M., HadrichBelguith, L.: Lexical Study of A Spoken Dialogue Corpus in Tunisian Dialect. In: ACIT 2010: The International Arab Conference on Information Technology, Benghazi - Libya, December 14-16 (2010)

    Google Scholar 

  7. Mejri, S., Said, M., Sfar, I.: Pluringuisme et diglossie en Tunisie. In: Synergies Tunisie, vol. (1), pp. 53–74 (2009)

    Google Scholar 

  8. Tilmatine, M.: Substrat Et Convergences: Le Berbère Et L’arabe Nord-Africain. Estudios de Dialectologia Norteafricana y Andalusl 4, 99–119 (1999)

    Google Scholar 

  9. Kirchhoff, K., Bilmes, J., Das, S., Duta, N., Egan, M., Ji, G., He, F., Henderson, J., Liu, D., Noamany, M., Schone, P., Schwartz, R., Vergyri, D.: Novel approaches to Arabic speech recognition: report from the 2002 Johns-Hopkins Summer Workshop. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2003), Missouri, USA, vol. 1, pp. 344–347 (April 2003)

    Google Scholar 

  10. Mejri, S., Baccouche, T.: L’atlas linguistique de Tunisie: repères méthodologiques pour la description du système dialectal. In: Lentin, J., Lonnet, A. (eds.) Mélanges David Cohen, pp. 47–54. Maisonneuve & Larose, Paris (2003)

    Google Scholar 

  11. Quitout, M.: Parlons l’arabe tunisien. In book edited by L’Harmattan (2006)

    Google Scholar 

  12. Ouerhani, B.: Interférence entre le dialectal et le littéral en Tunisie: Le cas de la morphologie verbale. In: Synergies Tunisie, vol. (1), pp. 75–84 (2009)

    Google Scholar 

  13. Maalej, Z.: Passives in modern standard and Tunisian Arabic. Matériaux Arabes et Sudarabiques-Gellas 9, 51–76 (1999)

    Google Scholar 

  14. Bouzemni, A.: Linguistic situation in Tunisia: French and Arabic code switching. In: INTERLINGĂœISTICA, vol. 16(1), pp. 217–223 (2005) ISSN 1134-8941

    Google Scholar 

  15. Zawaydeh, B., Stallard, D., Makhoul, J. (2003), http://ldc.upenn.edu/Catalog/docs/LDC2005S08/BBN-Babylon-transcription-guidelines.pdf

  16. Maamouri, M., Buckwalter, T., Cieri, C.: Dialectal Arabic Telephone Speech Corpus: Principles, Tool Design, and Transcription Conventions. In: NEMLAR International Conference on Arabic Language Resources and Tools, Cairo, September 22-23 (2004)

    Google Scholar 

  17. Habash, N., Diab, M., Rambow, O.: Conventional Orthography for Dialectal Arabic. In: Proceedings of the Language Resources and Evaluation Conference (LREC), Istanbul (2012)

    Google Scholar 

  18. Bertrand, R., Blache, P., Espesser, R., Ferré, G., Meunier, C., Priego-Valverde, B., Rauzy, S.: Le CID - Corpus of Interactional Data - Annotation et Exploitation Multimodale de Parole Conversationnelle. Traitement Automatique des Langues 49(3), 105–134 (2008)

    Google Scholar 

  19. Heeman, P., Allen, J.: Detecting and correcting speech repairs. In: Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics, Las Cruces, New Mexico, pp. 295–302 (1994)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zribi, I., Graja, M., Khmekhem, M.E., Jaoua, M., Belguith, L.H. (2013). Orthographic Transcription for Spoken Tunisian Arabic. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37247-6_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37247-6_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37246-9

  • Online ISBN: 978-3-642-37247-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics