Abstract
Conversational agents are used to communicating with humans in a friendly manner. To achieve the highest level of performance, agents need to respond assertively and fastly. Transformer architectures are shown to produce excellent performances on recent tasks; however, for tasks involving conversational agents, they may have a lower speed performance. The main goal of this study is to evaluate and propose a HoPE (Healthcare Obstetric in PrEgnancy) model that is tailored to pregnancy data. We carried out a dataset extraction and construction process based on collections of health documents related to breastfeeding, childcare, pregnant care, nutrition, risks, vaccines, exams, and physical exercises. We evaluated two pre-trained models in the Portuguese language for the conversational agent architecture proposal and chose the one with the best performance to compose the HoPE architecture. The BERTimbau model, which has been trained on data augmentation strategies, proves to be able to retrieve information quickly and most accurately than others. For the fine-tuning process, we achieved a Spearman correlation of 95.55 on BERTimbau augmented with a few pairs (1.500 pairs). The HoPE model architecture achieved an F1-Score of 0.89, outperforming other combinations tested in this study. We will evaluate this approach for clinical studies in future studies.
















Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Alambo A, Padhee S, Banerjee T, Thirunarayan K (2021) COVID-19 and mental health/substance use disorders on reddit: a longitudinal study. In: International conference on pattern recognition, pp 20–27. Springer
Alfeo AL, Cimino MG, Vaglini G (2021) Technological troubleshooting based on sentence embedding with deep transformers. J Intell Manuf 32:1–12
ALMarwi H, Ghurab M, Al-Baltah I (2020) A hybrid semantic query expansion approach for Arabic information retrieval. J Big Data 7(1):1–19
Alomari A, Idris N, Sabri AQM, Alsmadi I (2021) Deep reinforcement and transfer learning for abstractive text summarization: a review. Comput Speech Lang 71:101276
Altinok D (2018) An ontology-based dialogue management system for banking and finance dialogue systems. arXiv:1804.04838
Amith M, Anna Z, Cunningham R, Rebecca L, Savas L, Laura S, Yong C, Yang G, Julie B, Roberts K et al (2019) Early usability assessment of a conversational agent for HPV vaccination. Stud Health Technol Inform 257:17
Avila CVS, Calixto AB, Rolim TV, Franco W, Venceslau AD, Vidal VM, Pequeno VM, De Moura FF (2019) MediBot: an ontology based chatbot for Portuguese speakers drug’s users
Bakouan M, Kone T, Kamagate BH, Oumtanaga S, Babri M (2018) A chatbot for automatic processing of learner concerns in an online learning platform. Int J Adv Comput Sci Appl 9(5):168–176
Barbosa A, Godoy A (2021) Augmenting customer support with an NLP-based receptionist. arXiv:2112.01959
Beltagy I, Lo K, Cohan A (2019) SciBERT: a pretrained language model for scientific text. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP), pp 3615–3620. Association for Computational Linguistics, Hong Kong, China. https://doi.org/10.18653/v1/D19-1371. https://aclanthology.org/D19-1371
Benesty J, Chen J, Huang Y, Cohen I (2009) Pearson correlation coefficient. In: Noise reduction in speech processing, pp 1–4. Springer
Bickmore TW, Pfeifer LM, Byron D, Forsythe S, Henault LE, Jack BW, Silliman R, Paasche-Orlow MK (2010) Usability of conversational agents by patients with inadequate health literacy: evidence from two clinical trials. J Health Commun 15(S2):197–210
Bickmore TW, Pfeifer LM, Jack BW (2009) Taking the time to care: empowering low health literacy hospital patients with virtual nurse agents. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 1265–1274. ACM
Bickmore TW, Utami D, Matsuyama R, Paasche-Orlow MK (2016) Improving access to online health information with conversational agents: a randomized controlled experiment. J Medical Internet Res 18(1):e1
Bjelke M, Martinsson AK, Lendahls L, Oscarsson M (2016) Using the Internet as a source of information during pregnancy-a descriptive cross-sectional study in Sweden. Midwifery 40:187–191
Boonstra L (2021) Getting started with dialogflow essentials. In: The definitive guide to conversational AI with dialogflow and google cloud, pp 29–57. Springer
Boudjellal N, Zhang H, Khan A, Ahmad A, Naseem R, Shang J, Dai L (2021) ABioNER: a BERT-based model for Arabic biomedical named-entity recognition. Complexity
Cabezudo MAS, Inácio M, Rodrigues AC, Casanova E, de Sousa RF (2019) NILC at ASSIN 2: exploring multilingual approaches. In: ASSIN@ STIL, pp 49–58
Carlsson F, Gogoulou E, Ylipää E, Gyllensten AC, Sahlgren M (2021) Semantic re-tuning with contrastive tension. In: International conference on learning representations
Chang WC, Yu HF, Zhong K, Yang Y, Dhillon I (2019) Taming pretrained transformers for extreme multi-label text classification. arXiv:1905.02331
Choi H, Kim J, Joe S, Gwon Y (2021) Evaluation of BERT and ALBERT sentence embedding performance on downstream NLP tasks. In: 2020 25th International conference on pattern recognition (ICPR), pp 5482–5487. IEEE
Consortium WWW, et al (2014) Rdf 1.1 concepts and abstract syntax
Criss S, Baidal JAW, Goldman RE, Perkins M, Cunningham C, Taveras EM (2015) The role of health information sources in decision-making among Hispanic mothers during their children’s first 1000 days of life. Matern Child Health J 19(11):2536–2543
Croux C, Dehon C (2010) Influence functions of the Spearman and Kendall correlation measures. Statistical Methods & Applications 19(4):497–515
Dai Z, Wang X, Ni P, Li Y, Li G, Bai X (2019) Named entity recognition using BERT BiLSTM CRF for Chinese electronic health records. In: 2019 12th international congress on image and signal processing, biomedical engineering and informatics (cisp-bmei), pp 1–5. IEEE
Deng X, Liu Q, Deng Y, Mahadevan S (2016) An improved method to construct basic probability assignment based on the confusion matrix for classification problem. Information Sciences 340:250–261
Devlin J, Chang MW, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 conference of the north american chapter of the association for computational linguistics: human language technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics, Minneapolis, Minnesota. https://doi.org/10.18653/v1/N19-1423. https://aclanthology.org/N19-1423
Emygdio JL, Almeida MB (2019) Representações formais do conhecimento aplicadas à interoperabilidade semântica de terminologias clínicas. Múltiplos Olhares em Ciência da Informação 9(2)
Engelmann D, Damasio J, Krausburg T, Borges O, Colissi M, Panisson AR, Bordini RH (2021) Dial4jaca–a communication interface between multi-agent systems and chatbots. In: International conference on practical applications of agents and multi-agent systems, pp 77–88. Springer
Esteva A, Kale A, Paulus R, Hashimoto K, Yin W, Radev D, Socher R (2021) COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization. NPJ Digital Medicine 4(1):1–9
Farinelli: ONTONEO (2018). http://bioportal.bioontology.org/ontologies/ONTONEO
Floridi L, Chiriatti M (2020) GPT-3: its nature, scope, limits, and consequences. Minds and Machines 30(4):681–694
Fushiki T (2011) Estimation of prediction error by using k-fold cross-validation. Statistics and Computing 21(2):137–146
Ganhotra J, Roitman H, Cohen D, Mills N, Gunasekara C, Mass Y, Joshi S, Lastras L, Konopnicki D (2020) Conversational document prediction to assist customer care agents. arXiv:2010.02305
Goeuriot L, Jones GJ, Kelly L, Müller H, Zobel J (2016) Medical information retrieval: introduction to the special issue. Information Retrieval Journal 19(1–2):1–5
Greche L, Jazouli M, Es-Sbai N, Majda A, Zarghili A (2017) Comparison between Euclidean and Manhattan distance measure for facial expressions classification. In: 2017 International conference on wireless technologies, embedded and intelligent systems (WITS), pp 1–4. IEEE
Guo J, Fan Y, Pang L, Yang L, Ai Q, Zamani H, Wu C, Croft WB, Cheng X (2020) A deep look into neural ranking models for information retrieval. Information Processing & Management 57(6):102067
Han X, Eisenstein J (2019) Unsupervised domain adaptation of contextualized embeddings for sequence labeling. arXiv:1904.02817
Henderson M, Al-Rfou R, Strope B, Sung YH, Lukács L, Guo R, Kumar S, Miklos B, Kurzweil R (2017) Efficient natural language response suggestion for smart reply. arXiv:1705.00652
Hersh W, Hersh W (2020) Information retrieval: a biomedical and health perspective. Springer
Huertas-García Á, Huertas-Tato J, Martín A, Camacho D (2021) Countering misinformation through semantic-aware multilingual models. In: International conference on intelligent data engineering and automated learning, pp 312–323. Springer
Humeau S, Shuster K, Lachaux MA, Weston J (2019) Poly-encoders: transformer architectures and pre-training strategies for fast and accurate multi-sentence scoring. arXiv:1905.01969
Inamdar VA, Shivanand R (2019) Development of college enquiry chatbot using snatchbot. DEVELOPMENT 6(07):1615–1618
Jack B, Bickmore T, Hempstead M, Yinusa-Nyahkoon L, Sadikova E, Mitchell S, Gardiner P, Adigun F, Penti B, Schulman D et al (2015) Reducing preconception risks among African American women with conversational agent technology. The Journal of the American Board of Family Medicine 28(4):441–451
Júnior VODS, Branco JAC, De Oliveira MA, Da Silva TLC, Cruz LA, Magalhaes RP (2021) A natural language understanding model COVID-19 based for chatbots. In: 2021 IEEE 21st International conference on bioinformatics and bioengineering (BIBE), pp 1–7. IEEE
Kadri Y, Nie JY (2006) Effective stemming for Arabic information retrieval. In: Proceedings of the challenge of arabic for NLP/MT conference, Londres, Royaume-Uni, pp 68–74
Kankaria RV, Agrawal A, Barot H, Godbole A (2021) RAAH.ai: an interactive chatbot for stress relief using deep learning and natural language processing. In: 2021 12th International conference on computing communication and networking technologies (ICCCNT), pp 1–7. IEEE
Kasilingam DL (2020) Understanding the attitude and intention to use smartphone chatbots for shopping. Technology in Society 62:101280
Khattab O, Zaharia M (2020) ColBERT: efficient and effective passage search via contextualized late interaction over BERT. In: Proceedings of the 43rd International ACM SIGIR conference on research and development in information retrieval, pp 39–48
Kilmer S, Marshall C, Senger S (2020) Dot product chains. arXiv:2006.11467
Kim T, Yoo KM, Lee SG (2021) Self-guided contrastive learning for BERT sentence representations. arXiv:2106.07345
Lagan BM, Sinclair M, George Kernohan W (2010) Internet use in pregnancy informs women’s decision making: a web-based survey. Birth 37(2):106–115
Larkey LS, Ballesteros L, Connell ME (2002) Improving stemming for Arabic information retrieval: light stemming and co-occurrence analysis. In: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp 275–282
Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, Kang J (2020) BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36(4):1234–1240
Li B, Zhou H, He J, Wang M, Yang Y, Li L (2020) On the sentence embeddings from pre-trained language models. arXiv:2011.05864
Li D, Zeng W (2018) Distance measure of Pythagorean fuzzy sets. Int J Intell Syst 33(2):348–361
Liu X, Wang Y, Ji J, Cheng H, Zhu X, Awa E, He P, Chen W, Poon H, Cao G, et al (2020) The microsoft toolkit of multi-task deep neural networks for natural language understanding. arXiv:2002.07972
Lv Y, Zhai C (2011) When documents are very long, BM25 fails! In: Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval, pp 1103–1104
Malizia A, Onorati T, Diaz P, Aedo I, Astorga-Paliza F (2010) SEMA4A: an ontology for emergency notification systems accessibility. Expert Systems with Applications 37(4):3380–3391
Maroengsit W, Piyakulpinyo T, Phonyiam K, Pongnumkul S, Chaovalit P, Theeramunkong T (2019) A survey on evaluation methods for chatbots. In: Proceedings of the 2019 7th International conference on information and education technology, pp 111–119
Mellado-Silva R, Faúndez-Ugalde A, Lobos MB (2020) Learning tax regulations through rules-based chatbots using decision trees: a case study at the time of COVID-19. In: 2020 39th International conference of the chilean computer science society (SCCC), pp 1–8. IEEE
Mikolov T, Yih WT, Zweig G (2013) Linguistic regularities in continuous space word representations. In: Proceedings of the 2013 conference of the north american chapter of the association for computational linguistics: Human language technologies, pp 746–751
Montenegro JLZ, da Costa CA, da Rosa Righi R (2019) Survey of conversational agents in health. Expert Syst Appl 129:56–67
Moyer CA, Compton SD, Kaselitz E, Muzik M (2020) Pregnancy-related anxiety during COVID-19: a nationwide survey of 2740 pregnant women. Arch Women’s Mental Health 23(6):757–765
Mukherjee S, Liu X, Zheng G, Hosseini S, Cheng H, Yang G, Meek C, Awadallah AH, Gao J (2021) CLUES: few-shot learning evaluation in natural language understanding. arXiv:2111.02570
Nazir A, Khan MY, Ahmed T, Jami SI, Wasi S (2019) A novel approach for ontology-driven information retrieving chatbot for fashion brands. Int J Adv Comput Sci Appl IJACSA 10(9):546–552
Nguyen T, Rosenberg M, Song X, Gao J, Tiwary S, Majumder R, Deng L (2016) MS MARCO: a human generated machine reading comprehension dataset. In: CoCo@ NIPS
Nouri SS, Rudd RE (2015) Health literacy in the oral exchange: an important element of patient-provider communication. Patient Education and Counseling 98(5):565–571
Noy NF, McGuinness DL, et al (2001) Ontology development 101: a guide to creating your first ontology
Oliveira LE, Gebeluca CP, Silva AM, Moro CM, Hasan SA, Farri O (2017) A statistics and UMLS-based tool for assisted semantic annotation of Brazilian clinical documents. In: 2017 IEEE International conference on bioinformatics and biomedicine (BIBM), pp 1072–1078. IEEE
Padaki R, Dai Z, Callan J (2020) Rethinking query expansion for BERT reranking. In: European conference on information retrieval, pp 297–304. Springer
Palanica A, Flaschner P, Thommandram A, Li M, Fossat Y (2019) Physicians’ perceptions of chatbots in health care: cross-sectional web-based survey. Journal of medical Internet research 21(4):e12887
Patel V, Garrison P, de Jesus Mari J, Minas H, Prince M, Saxena S (2008) The Lancet’s series on global mental health: 1 year on. The Lancet 372(9646):1354–1357
Pereira I, Sousa J, Costa PB, Barbosa SD, Colcher S (2020) SucupiraBot: an interactive question-answering system for the Sucupira platform. In: Proceedings of the brazilian symposium on multimedia and the web, pp 277–280
Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L (2018) Deep contextualized word representations. arXiv:1802.05365
Podgorny I, Khaburzaniya Y, Geisler J (2019) Conversational agents and community question answering. In: CHI 2019 Workshops, Glasgow, United Kingdom
Qian Y, Liang J, Dang C (2009) Knowledge structure, knowledge granulation and knowledge distance in a knowledge base. Int J Approximate Reason 50(1):174–188
Quamar A, Özcan F, Miller D, Moore RJ, Niehus R, Kreulen J (2020) Conversational BI: an ontology-driven conversation system for business intelligence applications. Proceedings of the VLDB Endowment 13(12):3369–3381
Ragab M, Abdel Aal AM, Jifri AO, Omran NF (2021) Enhancement of predicting students performance model using ensemble approaches and educational data mining techniques. Wireless Communications and Mobile Computing 2021
Raji P, Surendran S (2016) RDF approach on social network analysis. In: 2016 International conference on research advances in integrated navigation systems (RAINS), pp 1–4. IEEE
Rajosoa M, Hantach R, Abbes SB, Calvez P (2019) Hybrid question answering system based on natural language processing and SPARQL query
Rajpurkar P, Zhang J, Lopyrev K, Liang P (2016) Squad: 100,000+ questions for machine comprehension of text
Reimers N, Gurevych I (2019) Sentence-BERT: sentence embeddings using siamese BERT-networks. arXiv:1908.10084
Rodrigues R, Couto P, Rodrigues I (2019) IPR: the semantic textual similarity and recognizing textual entailment systems. In: ASSIN@ STIL, pp 39–48
Rodrigues RC, Rodrigues J, de Castro, PVQ, da Silva NFF, Soares A (2020) Portuguese language models and word embeddings: evaluating on semantic similarity tasks. In: International conference on computational processing of the portuguese language, pp 239–248. Springer
Rogers A, Kovaleva O, Rumshisky A (2021) A primer in BERTology: what we know about how BERT works. Transactions of the Association for Computational Linguistics 8:842–866
Rubinstein A (2018) Hardness of approximate nearest neighbor search. In: Proceedings of the 50th annual ACM SIGACT symposium on theory of computing, pp 1260–1268
Rychalska B, Glabska H, Wroblewska A (2018) Multi-intent hierarchical natural language understanding for chatbots. In: 2018 Fifth international conference on social networks analysis, management and security (SNAMS), pp 256–259. IEEE
Safder I, Hassan SU (2019) Bibliometric-enhanced information retrieval: a novel deep feature engineering approach for algorithm searching from full-text publications. Scientometrics 119(1):257–277
Samimi P, Ravana SD (2014) Creation of reliable relevance judgments in information retrieval systems evaluation experimentation through crowdsourcing: a review. Sci World J 2014
Sankhavara J (2020) Feature weighting in finding feedback documents for query expansion in biomedical document retrieval. SN Computer Science 1(2):1–7
Sayakhot P, Carolan-Olah M (2016) Internet use by pregnant women seeking pregnancy-related information: a systematic review. BMC Pregnancy and Childbirth 16(1):1–10
Senese MA, Rizzo G, Dragoni M, Morisio M (2020) MTSI-BERT: a session-aware knowledge-based conversational agent. In: Proceedings of The 12th language resources and evaluation conference, pp 717–725
Sheth A, Yip HY, Iyengar A, Tepper P (2019) Cognitive services and intelligent chatbots: current perspectives and special issue introduction. IEEE Internet Computing 23(2):6–12
Singh I, Scarton C, Bontcheva K (2021) Multistage BiCross encoder: team GATE entry for MLIA multilingual semantic search task 2. arXiv:2101.03013
Singh S, Mahmood A (2021) The NLP cookbook: modern recipes for transformer based deep learning architectures. IEEE Access 9:68675–68702
Souza F, Nogueira R, Lotufo R (2020) BERTimbau: pretrained BERT models for Brazilian Portuguese. In: Brazilian conference on intelligent systems, pp 403–417. Springer
de Souza JVA, Oliveira LESE, Gumiel YB, Carvalho DR, Moro CMC (2020) Exploiting siamese neural networks on short text similarity tasks for multiple domains and languages. In: International conference on computational processing of the portuguese language, pp 357–367. Springer
Tanana MJ, Soma CS, Srikumar V, Atkins DC, Imel ZE (2019) Development and evaluation of ClientBot: patient-like conversational agent to train basic counseling skills. J Med Int Res 21(7):e12529
Teixeira, M.S., Maran, V., Dragoni, M (2021) The interplay of a conversational ontology and AI planning for health dialogue management. In: Proceedings of the 36th annual ACM symposium on applied computing, pp 611–619
Thakur N, Reimers N, Daxenberger J, Gurevych I (2020) Augmented SBERT: data augmentation method for improving bi-encoders for pairwise sentence scoring tasks. arXiv:2010.08240
Traylor CS, Johnson J, Kimmel MC, Manuck TA (2020) Effects of psychological stress on adverse pregnancy outcomes and non-pharmacologic approaches for reduction: an expert review. Am J Obstet Gynecol MFM, 100229
Trivedi S, Gildersleeve R, Franco S, Kanter AS, Chaudhry A (2020) Evaluation of a concept mapping task using named entity recognition and normalization in unstructured clinical text. J Healthc Inform Res 4(4):395–410
Vakili Tahami A, Ghajar K, Shakery A (2020) Distilling knowledge for fast retrieval-based chat-bots. In: Proceedings of the 43rd International ACM SIGIR conference on research and development in information retrieval, pp 2081–2084
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. arXiv:1706.03762
Wagner Filho JA, Wilkens R, Idiart M, Villavicencio A (2018) The brWaC corpus: a new open resource for Brazilian Portuguese. In: Proceedings of the eleventh international conference on language resources and evaluation (LREC 2018)
Wang A, Singh A, Michael J, Hill F, Levy O, Bowman SR (2019) GLUE: a multi-task benchmark and analysis platform for natural language understanding. In: International conference on learning representations. https://openreview.net/forum?id=rJ4km2R5t7
Wang K, Yang B, Xu G, He X (2019) Medical question retrieval based on siamese neural network and transfer learning method. In: International conference on database systems for advanced applications, pp 49–64. Springer
Wang Y, Liu F, Verspoor K, Baldwin T (2020) Evaluating the utility of model configurations and data augmentation on clinical semantic textual similarity. In: Proceedings of the 19th SIGBioMed workshop on biomedical language processing, pp 105–111
Wang Y, Rastegar-Mojarad M, Elayavilli RK, Liu S, Liu H (2016) An ensemble model of clinical information extraction and information retrieval for clinical decision support. In: TREC
Whissell JS, Clarke CL (2011) Improving document clustering using Okapi BM25 feature weighting. Information Retrieval 14(5):466–487
Wu Z, Liang J, Zhang Z, Lei J (2021) Exploration of text matching methods in Chinese disease q&a systems: a method using ensemble based on BERT and boosted tree models. J Biomed Inform 115:103683
Xie Y, Yang W, Tan L, Xiong K, Yuan NJ, Huai B, Li M, Lin J (2020) Distant supervision for multi-stage fine-tuning in retrieval-based question answering. Proceed Web Conference 2020:2934–2940
Yang W, Xie Y, Lin A, Li X, Tan L, Xiong K, Li M, Lin J (2019) End-to-end open-domain question answering with BERTserini. arXiv:1902.01718
Yang W, Xie Y, Tan L, Xiong K, Li M, Lin J (2019) Data augmentation for BERT fine-tuning in open-domain question answering. arXiv:1904.06652
Yin X, Zhang W, Zhu W, Liu S, Yao T (2020) Improving sentence representations via component focusing. Applied Sciences 10(3):958
Yoo S, Jeong O (2020) An intelligent chatbot utilizing BERT model and knowledge graph. J Soc e-Business Stud 24(3)
Yoon S, Kang WY, Jeon S, Lee S, Han C, Park J, Kim ES (2020) Image-to-image retrieval by learning similarity between scene graphs. arXiv:2012.14700
Zhang J, et al (2021) S-SimCSE: sampled sub-networks for contrastive learning of sentence embedding. arXiv:2111.11750
Zhang Z, Bickmore TW, Paasche-Orlow MK (2017) Perceived organizational affiliation and its effects on patient trust: role modeling with embodied conversational agents. Patient Educ Couns 100(9):1730–1737
Acknowledgements
The authors would like to thank the Brazilian National Council for Scientific and Technological Development - CNPq (Grant Numbers 303640/2017-0 and 405354/2016-9) for supporting this work.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Montenegro, J.L.Z., da Costa, C.A. The HoPE Model Architecture: a Novel Approach to Pregnancy Information Retrieval Based on Conversational Agents. J Healthc Inform Res 6, 253–294 (2022). https://doi.org/10.1007/s41666-022-00115-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41666-022-00115-0