DOI: 10.1145/3460112.3471946
research-article

Early Results from Automating Voice-based Question-Answering Services Among Low-income Populations in India

Published: 23 September 2021

Abstract

Question-answering systems, in which users ask questions based on emergent needs and experts or peers answer them, have emerged as an important information-seeking modality on digital platforms. Automating this process, by identifying relevant answers from pre-existing question-answer databases, has been an active area of research for many years. We report on the feasibility of running automated question-answering systems for rural and less-literate users in India, accessed through IVR (Interactive Voice Response) systems. We use commercial speech recognition APIs to convert audio questions asked by users in Hindi into transcripts in real time, and use deep-learning-based architectures to retrieve candidate answers, which are played to the users instantly. We report several insights from an earlier phase in which question-answering programmes were run manually, describe how the service was transitioned to an automated setup, and document the user experiences during this journey.
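The retrieval step described above (matching a transcribed question against a pre-existing question-answer database) can be illustrated with a minimal sketch. It is not the authors' method: the abstract describes deep-learning-based retrieval, whereas this sketch swaps in plain Jaccard word overlap as a simple stand-in. The function names, the toy FAQ entries, and the English example text are all hypothetical, and the speech-recognition step is assumed to have already produced the transcript.

```python
from typing import List, Set, Tuple


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and split on whitespace into a set of words."""
    return set(text.lower().split())


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard index: size of the intersection divided by size of the union."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def retrieve(transcript: str,
             faq: List[Tuple[str, str]],
             top_k: int = 1) -> List[Tuple[float, str]]:
    """Rank FAQ answers by word overlap between the transcript and each stored question.

    `faq` is a list of (question, answer) pairs. Returns up to `top_k`
    (score, answer) pairs, best match first.
    """
    query = tokenize(transcript)
    scored = [(jaccard(query, tokenize(question)), answer)
              for question, answer in faq]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical toy database; a real deployment would hold Hindi questions
# paired with pre-recorded audio answers.
toy_faq = [
    ("how to apply for a ration card", "ANSWER_RATION"),
    ("when will my pension be paid", "ANSWER_PENSION"),
]

# Hypothetical transcript, as if returned by a speech recognition API.
best = retrieve("how do i apply for ration card", toy_faq)
print(best[0][1])
```

In an IVR setting, the identifier returned for the best match would map to the audio answer played back to the caller. Word overlap is only a baseline here; a learned sentence-similarity model would take its place in `retrieve` without changing the surrounding flow.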


Cited By

  • (2024) Experiences from Running a Participatory Media Platform for Women and Led by Women in Rural North India. Proceedings of the ACM on Human-Computer Interaction 8:CSCW1 (1-23). DOI: 10.1145/3653685. Online publication date: 26-Apr-2024
  • (2024) A Design Vocabulary for Scaffolding Group Interaction Archetypes through Synchronous Telephony. Proceedings of the ACM on Human-Computer Interaction 8:CSCW1 (1-22). DOI: 10.1145/3637289. Online publication date: 26-Apr-2024

Published In

COMPASS '21: Proceedings of the 4th ACM SIGCAS Conference on Computing and Sustainable Societies
June 2021
462 pages
ISBN:9781450384537
DOI:10.1145/3460112

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. FAQ retrieval
  2. Interactive Voice Response systems
  3. natural language processing
  4. question-answering
  5. speech recognition

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

COMPASS '21

Acceptance Rates

Overall Acceptance Rate 25 of 50 submissions, 50%
