poster

Dialect Translation of English Language to Telangana: Mexin Project

Authors:
Hashwanth Sutharapu

Ambedkar National Institute of Technology, India

Ambedkar National Institute of Technology, India

0009-0008-1159-9924
View Profile

,
Akshit Duggal

Ambedkar National Institute of Technology, India

Ambedkar National Institute of Technology, India

0000-0002-1602-9989
View Profile

,
Sanju Tiwari

UAT, India

UAT, India

0000-0001-7197-0766
View Profile

,
Edlira Vakaj

Birmingham City University, United Kingdom

Birmingham City University, United Kingdom

0000-0002-0712-0959
View Profile

,
Fernando Ortiz-Rodriguez

Universidad Autonoma de Tamaulipas, Mexico

Universidad Autonoma de Tamaulipas, Mexico

0000-0003-2084-3462
View Profile

,
Ruben Barrera-Hernandez

Universidad Autonoma de Tamaulipas, Mexico

Universidad Autonoma de Tamaulipas, Mexico

0000-0003-3628-022X
View Profile

DGO '23: Proceedings of the 24th Annual International Conference on Digital Government ResearchJuly 2023Pages 671–673https://doi.org/10.1145/3598469.3598553

Published:11 July 2023Publication History

DGO '23: Proceedings of the 24th Annual International Conference on Digital Government Research

Pages 671–673

ABSTRACT

Generally Telangana dialect is frequently spoken in vocal daily interactions. Official Telugu is the language used in books, newspapers, academic journals, and other types of literature. Telangana only produces a small quantity of literature and written material in documentary series form. Despite numerous attempts, the Telangana language’s range is still confined to vocal forms. We are attempting to build a dataset of Telangana words, that are obtained from various documents, novels, essays, plays, and everyday interactions of native speakers, to mitigate this barrier and enable the electronic profusion of Telangana dialect. The first phase of the work consisted of extracting some research papers relevant to the topic and gaining some more insight into the objective focused. We then moved on to collect words in the Telangana language as a second phase, i.e., making a dataset. Then using other methods such as tokenization we began with the third phase of our project to implement the proposed work where finally conversion of Telangana dialects is translated to English..

References

[n.d.]. Languages of India - Wikipedia — en.wikipedia.org. https://en.wikipedia.org/wiki/Languages_of_India. [Accessed 09-Jun-2023].Google Scholar
[n.d.]. Telugu language - Wikipedia — en.wikipedia.org. https://en.wikipedia.org/wiki/Telugu_language. [Accessed 09-Jun-2023].Google Scholar
Shimaa Ibrahim, Said Fathalla, Hamed Shariat Yazdi, Jens Lehmann, and Hajira Jabeen. 2019. From monolingual to multilingual ontologies: The role of cross-lingual ontology enrichment. In Semantic Systems. The Power of AI and Knowledge Graphs: 15th International Conference, SEMANTiCS 2019, Karlsruhe, Germany, September 9–12, 2019, Proceedings 15. Springer International Publishing, 215–230.Google ScholarCross Ref
Taisiya Kostareva, Svetlana Chuprina, and Alexandr Nam. 2016. Using Ontology-Driven Methods to Develop Frameworks for Tackling NLP Problems.. In AIST (Supplement). 102–113.Google Scholar
Jose Melchor Medina-Quintero, Demian Abrego-Almazán, and Fernando Ortiz-Rodríguez. 2018. Use and usefulness of the information systems measurement. a quality approach at the mexican northeastern region. Cuadernos de Administración 31, 56 (2018), 7–30.Google Scholar
Diego Moussallem and Ricardo Choren. 2015. Using ontology-based context in the portuguese-english translation of homographs in textual dialogues. arXiv preprint arXiv:1510.01886 (2015).Google Scholar
Fernando Ortiz-Rodriguez, Jose Melchor Medina-Quintero, Sanju Tiwari, and Vicente Villanueva. 2022. EGODO ontology: sharing, retrieving, and exchanging legal documentation across e-government. In Futuristic Trends for Sustainable Development and Sustainable Ecosystems. IGI Global, 261–276.Google Scholar
Fernando Ortiz-Rodriguez, Sanju Tiwari, Ronak Panchal, Jose Melchor Medina-Quintero, and Ruben Barrera. 2022. MEXIN: multidialectal ontology supporting NLP approach to improve government electronic communication with the Mexican Ethnic Groups. In DG. O 2022: The 23rd Annual International Conference on Digital Government Research. 461–463.Google ScholarDigital Library
AC Schalley. [n.d.]. Ontologies and ontological methods in linguistics. Lang. Linguist. Compass 13 (11), e12356 (2019).Google ScholarCross Ref
Hashwanth Sutharapu, Akshit Duggal, Sanju Tiwari, Nisha Chaurasia, and Fernando Ortiz-Rodriguez. 2022. Dialect Translation of English Language to Telangana. Proceedings http://ceur-ws. org ISSN 1613 (2022), 0073.Google Scholar
Sanju Tiwari, Onur Dogan, MA Jabbar, Shishir Kumar Shandilya, Fernando Ortiz-Rodriguez, Sailesh Bajpai, and Sourav Banerjee. 2022. Applications of machine learning approaches to combat COVID-19: a survey. Lessons from COVID-19 (2022), 263–287.Google ScholarCross Ref

Recommendations

A Malay Dialect Translation and Synthesis System: Proposal and Preliminary System
IALP '12: Proceedings of the 2012 International Conference on Asian Language Processing

Malay is a language from the Austronesian family. Malay is the official language in Malaysia, Indonesia, Singapore, and Brunei. However, Malay spoken in different countries, and even within a country itself might vary in terms of pronunciation and ...
Read More
A Basic Language Resource Kit Implementation for the IgboNLP Project

Igbo, an African language with around 32 million speakers worldwide, is one of the many languages having few or none of the language processing resources needed for advanced language technology applications. In this article, we describe the approach ...
Read More
Orthographic and morphological processing for English---Arabic statistical machine translation

Much of the work on statistical machine translation (SMT) from morphologically rich languages has shown that morphological tokenization and orthographic normalization help improve SMT quality because of the sparsity reduction they contribute. In this ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
DGO '23: Proceedings of the 24th Annual International Conference on Digital Government Research
July 2023
711 pages
ISBN:9798400708374
DOI:10.1145/3598469
Editors:
David Duenas Cid,
Nadzeya Sabatini,
Loni Hagen,
Hsin-chung Liao
Copyright © 2023 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 July 2023
Check for updates
Author Tags
Dialect
NLP
Tokenization
Translation
Qualifiers
- poster
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate150of271submissions,55%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 23
  Total Downloads
- Downloads (Last 12 months)23
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Dialect Translation of English Language to Telangana: Mexin Project

DGO '23: Proceedings of the 24th Annual International Conference on Digital Government Research

ABSTRACT

References

Cited By

Recommendations

A Malay Dialect Translation and Synthesis System: Proposal and Preliminary System

A Basic Language Resource Kit Implementation for the IgboNLP Project

Orthographic and morphological processing for English---Arabic statistical machine translation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Dialect Translation of English Language to Telangana: Mexin Project

DGO '23: Proceedings of the 24th Annual International Conference on Digital Government Research

ABSTRACT

References

Cited By

Recommendations

A Malay Dialect Translation and Synthesis System: Proposal and Preliminary System

A Basic Language Resource Kit Implementation for the IgboNLP Project

Orthographic and morphological processing for English---Arabic statistical machine translation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media