A Karaka Dependency Based Dialog Act Tagging for Telugu Using Combination of LMs and HMM

Dowlagar, Suman; Mamidi, Radhika

doi:10.1007/978-3-319-75477-2_48

Suman Dowlagar¹⁴ &
Radhika Mamidi¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9623))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

1338 Accesses

Abstract

The main goal of this paper is to perform the dialog act(DA) tagging for Telugu corpus. Annotation of utterances with dialog acts is necessary to recognize the intent of speaker in dialog systems. While English language follows a strict subject–verb–object(SVO) syntax, Telugu is a free word order language. The n-gram DA tagging methods proposed for English language will not work for free word order languages like Telugu. In this paper, we propose a method to perform DA tagging for Telugu corpus using advanced machine learning techniques combined with karaka dependency relation modifiers. In other words, we use syntactic features obtained from karaka dependencies and apply combination of language models(LMs) at utterance level with Hidden Markov Model(HMM) at context level for DA tagging. The use of karaka dependencies for free word order languages like Telugu helps in extracting the modifier-modified relationships between words or word clusters for an utterance. The modifier-modified relationships remain fixed even though the word order in an utterance changes. These extracted modifier-modified relationships appear similar to n-grams. Statistical machine learning methods such as combination of LMs and HMM are applied to predict DA for an utterance in a dialog. The proposed method is compared with several baseline tagging algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The dialogs given here were originally written in Telugu but translated to English for reader’s understanding.

References

Austin, J.L.: How to do Things with Words, vol. 367. Oxford University Press, Cambridge (1975)
Book Google Scholar
Král, P., Cerisara, C.: Automatic dialogue act recognition with syntactic features. Lang. Resour. Eval. 48, 419–441 (2014)
Article Google Scholar
Ivanovic, E.: Dialogue act tagging for instant messaging chat sessions. In: Proceedings of the ACL Student Research Workshop, pp. 79–84. Association for Computational Linguistics (2005)
Google Scholar
Garner, P.N., Browning, S.R., Moore, R.K., Russell, M.J.: A theory of word frequencies and its application to dialogue move recognition. In: ICSLP 1996 Proceedings of Fourth International Conference on Spoken Language, vol. 3, pp. 1880–1883. IEEE (1996)
Google Scholar
Louwerse, M.M., Crossley, S.A.: Dialog act classification using n-gram algorithms. In: FLAIRS Conference, pp. 758–763 (2006)
Google Scholar
Webb, N., Ferguson, M.: Automatic extraction of cue phrases for cross-corpus dialogue act classification. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp. 1310–1317. Association for Computational Linguistics (2010)
Google Scholar
Král, P., Cerisara, C.: Dialogue act recognition approaches. Comput. Inf. 29, 227–250 (2012)
MATH Google Scholar
Bharati, A., Sangal, R., Sharma, D.M., Bai, L.: AnnCorra: annotating corpora guidelines for POS and chunk annotation for Indian languages. LTRC-TR31 (2006)
Google Scholar
Bharati, A., Chaitanya, V., Sangal, R., Ramakrishnamacharyulu, K.: Natural Language Processing: A Paninian Perspective. Prentice-Hall of India, New Delhi (1995)
Google Scholar
Begum, R., Husain, S., Dhwaj, A., Sharma, D.M., Bai, L., Sangal, R.: Dependency annotation scheme for Indian languages. In: IJCNLP, pp. 721–726. Citeseer (2008)
Google Scholar
Mohanan, K.P.: Grammatical relations and clause structure in Malayalam. Ment. Represent. Gramm. Relat. 504, 589 (1982)
Google Scholar
Dowlagar, S., Mamidi, R.: A semi supervised dialog act tagging for Telugu. In: ICON 2015 : 12th International Conference on Natural Language Processing (2015)
Google Scholar
Brants, T., Popat, A.C., Xu, P., Och, F.J., Dean, J.: Large language models in machine translation. In: Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Citeseer (2007)
Google Scholar
Jurafsky, D., Martin, J.H.: Speech & Language Processing. Pearson Education India, Noida (2000)
Google Scholar
Core, M.G., Allen, J.: Coding dialogs with the DAMSL annotation scheme. In: AAAI Fall Symposium on Communicative Action in Humans and Machines, Boston, MA, pp. 28–35 (1997)
Google Scholar
PVS, A., Karthik, G.: Part-of-speech tagging and chunking using conditional random fields and transformation based learning. Shallow Parsing South Asian Lang. 21 (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Kohli Center on Intelligent Systems (KCIS), International Institute of Information Technology, Hyderabad, Gachibowli, Hyderabad, 500032, Telangana, India
Suman Dowlagar & Radhika Mamidi

Authors

Suman Dowlagar
View author publications
You can also search for this author in PubMed Google Scholar
Radhika Mamidi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Suman Dowlagar .

Editor information

Editors and Affiliations

CIC, Instituto Politécnico Nacional, Mexico City, Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dowlagar, S., Mamidi, R. (2018). A Karaka Dependency Based Dialog Act Tagging for Telugu Using Combination of LMs and HMM. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2016. Lecture Notes in Computer Science(), vol 9623. Springer, Cham. https://doi.org/10.1007/978-3-319-75477-2_48

Download citation

DOI: https://doi.org/10.1007/978-3-319-75477-2_48
Published: 21 March 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-75476-5
Online ISBN: 978-3-319-75477-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics