ABSTRACT
The Telangana dialect of Telugu is used mainly in everyday spoken interaction. Official Telugu is the language of books, newspapers, academic journals, and other types of literature. Only a small body of literature and written material exists in the Telangana dialect, mostly in the form of documentary series. Despite numerous attempts, use of the dialect remains largely confined to speech. To mitigate this barrier and help the Telangana dialect spread in electronic form, we are building a dataset of Telangana words gathered from documents, novels, essays, plays, and the everyday interactions of native speakers. In the first phase of the work, we surveyed research papers relevant to the topic to gain further insight into the objective. In the second phase, we collected words in the Telangana dialect to build the dataset. In the third phase, we used methods such as tokenization to implement the proposed work, in which Telangana dialect text is translated into English.
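As an illustration of the third phase, the following is a minimal sketch of tokenization followed by a word-level dictionary lookup. It is not the authors' implementation: the function names, the lookup strategy, and the lexicon entries are assumptions made for this example, and the entries are placeholders rather than real Telangana words.

```python
import string


def tokenize(text):
    """Split text on whitespace and trim ASCII punctuation from each token.

    Whitespace splitting is used instead of a regex such as \\w+ because
    Telugu-script words contain combining vowel signs that \\w does not match.
    """
    tokens = []
    for raw in text.split():
        token = raw.strip(string.punctuation)
        if token:
            tokens.append(token)
    return tokens


def translate_tokens(tokens, lexicon):
    """Replace each token found in the lexicon; leave unknown tokens as-is."""
    return [lexicon.get(token.lower(), token) for token in tokens]


def translate(text, lexicon):
    """Tokenize the text, look up each token, and join the results."""
    return " ".join(translate_tokens(tokenize(text), lexicon))


# Hypothetical placeholder entries (assumption): a real lexicon would map
# collected Telangana dialect words to their English equivalents.
PLACEHOLDER_LEXICON = {
    "dialectword1": "water",
    "dialectword2": "come",
}

if __name__ == "__main__":
    print(translate("dialectword1, dialectword2!", PLACEHOLDER_LEXICON))
```

A word-for-word lookup of this kind ignores word order and inflection, so it should be read only as a baseline that shows how the collected dataset could feed the translation step.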