Abstract
This paper proposes an ensemble model for the Stanford Question Answering Dataset (SQuAD) that aims to improve performance over baseline models such as ALBERT and ELECTRA. The ensemble incorporates Sentence Attention (SA-Net) and Answer Attention (AA-Net) components, which use attention mechanisms to emphasize important information in sentences and answers, respectively. The model also adopts a Read+Verify architecture: in the Read stage it focuses on accurately predicting the answer text, while in the Verify stage it estimates the probability that an answer exists, determining whether the question is answerable. To enrich the training data, data augmentation techniques including synonym replacement and random insertion are applied. Experimental results show significant improvements over the ALBERT and ELECTRA baselines, demonstrating the effectiveness of the proposed ensemble model on SQuAD.
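To make the Read+Verify idea concrete, the sketch below shows one plausible way to attach such a head to an encoder like ALBERT or ELECTRA: a span predictor (Read) and an answerability classifier (Verify). The class names, pooling choice, and score-combination rule are illustrative assumptions, not the authors' exact formulation.

```python
import torch
import torch.nn as nn


class ReadVerifyHead(nn.Module):
    """Hypothetical Read+Verify output head (illustrative sketch).

    `hidden` is assumed to be the sequence output of a pretrained
    encoder such as ALBERT or ELECTRA, with shape
    (batch, seq_len, hidden_size).
    """

    def __init__(self, hidden_size: int):
        super().__init__()
        self.span_head = nn.Linear(hidden_size, 2)    # Read: start/end logits
        self.verify_head = nn.Linear(hidden_size, 1)  # Verify: has-answer logit

    def forward(self, hidden: torch.Tensor):
        # Read stage: per-token start and end logits for span extraction.
        start_logits, end_logits = self.span_head(hidden).split(1, dim=-1)
        # Verify stage: pool the [CLS] position and predict the
        # probability that the question has an answer at all.
        has_answer_prob = torch.sigmoid(self.verify_head(hidden[:, 0]))
        return start_logits.squeeze(-1), end_logits.squeeze(-1), has_answer_prob
```

At inference, one common recipe (assumed here, in the spirit of SQuAD 2.0 systems) is to keep the best-scoring span only when `has_answer_prob` exceeds a tuned threshold and to predict "no answer" otherwise; the threshold would be chosen on the development set.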
L. Tang and Q. Qi—These authors contributed equally to this work.
References
Aniol, A., Pietron, M., Duda, J.: Ensemble approach for natural language question answering problem. In: 2019 Seventh International Symposium on Computing and Networking Workshops (CANDARW), pp. 180–183. IEEE (2019)
Clark, K., Luong, M.T., Le, Q.V., Manning, C.D.: ELECTRA: pre-training text encoders as discriminators rather than generators. arXiv preprint arXiv:2003.10555 (2020)
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Lan, Z., Chen, M., Goodman, S., Gimpel, K., Sharma, P., Soricut, R.: ALBERT: a lite BERT for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942 (2019)
Liu, Y., et al.: RoBERTa: a robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692 (2019)
Rajpurkar, P., Jia, R., Liang, P.: Know what you don’t know: unanswerable questions for SQuAD. arXiv preprint arXiv:1806.03822 (2018)
Rajpurkar, P., Zhang, J., Lopyrev, K., Liang, P.: SQuAD: 100,000+ questions for machine comprehension of text. arXiv preprint arXiv:1606.05250 (2016)
Wang, A., Singh, A., Michael, J., Hill, F., Levy, O., Bowman, S.R.: GLUE: a multi-task benchmark and analysis platform for natural language understanding. arXiv preprint arXiv:1804.07461 (2018)
Wei, J., Zou, K.: EDA: easy data augmentation techniques for boosting performance on text classification tasks. arXiv preprint arXiv:1901.11196 (2019)
Yang, Z., Dai, Z., Yang, Y., Carbonell, J., Salakhutdinov, R.R., Le, Q.V.: XLNet: generalized autoregressive pretraining for language understanding. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
Zhang, Z., Wu, Y., Zhou, J., Duan, S., Zhao, H., Wang, R.: SG-Net: syntax-guided machine reading comprehension. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 9636–9643 (2020)
Zhang, Z., Yang, J., Zhao, H.: Retrospective reader for machine reading comprehension. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 14506–14514 (2021)
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tang, L. et al. (2024). Boosting QA Performance Through SA-Net and AA-Net with the Read+Verify Framework. In: Benavides-Prado, D., Erfani, S., Fournier-Viger, P., Boo, Y.L., Koh, Y.S. (eds) Data Science and Machine Learning. AusDM 2023. Communications in Computer and Information Science, vol 1943. Springer, Singapore. https://doi.org/10.1007/978-981-99-8696-5_6
DOI: https://doi.org/10.1007/978-981-99-8696-5_6
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8695-8
Online ISBN: 978-981-99-8696-5