Abstract
Transcribing spoken Arabic dialects is an important task for building speech corpora, and it requires a well-defined orthography and annotation scheme for the speech data. In this paper, we present OTTA, Orthographic Transcription for Tunisian Arabic. This convention adopts rules from standard Arabic transcription conventions and adds a set of conventions that preserve the particularities of the Tunisian dialect.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zribi, I., Graja, M., Khmekhem, M.E., Jaoua, M., Belguith, L.H. (2013). Orthographic Transcription for Spoken Tunisian Arabic. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37247-6_13
DOI: https://doi.org/10.1007/978-3-642-37247-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37246-9
Online ISBN: 978-3-642-37247-6
eBook Packages: Computer Science, Computer Science (R0)