Improving BERT with Focal Loss for Paragraph Segmentation of Novels

Iikura, Riku; Okada, Makoto; Mori, Naoki

doi:10.1007/978-3-030-53036-5_3

Riku Iikura²⁰,
Makoto Okada²⁰ &
Naoki Mori²⁰

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1237))

Included in the following conference series:

International Symposium on Distributed Computing and Artificial Intelligence

858 Accesses
9 Citations

Abstract

In this study, we address the problem of paragraph segmentation from the perspective of understanding the content of a novel. Estimating the paragraph of a text can be considered a binary classification problem regarding whether two given sentences belong to the same paragraph. When the number of paragraphs is small relative to the number of sentences, it is necessary to consider the imbalance in the number of data. We applied the bidirectional encoder representations from transformer (BERT), which has shown high accuracy in various natural language processing tasks, to paragraph segmentation. We improved the performance of the model using the focal loss as the loss function of the classifier. As a result, the effectiveness of the proposed model was confirmed on multiple datasets with different ratios of data in each class.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A More Effective Sentence-Wise Text Segmentation Approach Using BERT

Multi-task Learning for Newspaper Image Segmentation and Baseline Detection Using Attention-Based U-Net Architecture

On Text Tiling for Documents: A Neural-Network Approach

Notes

1.
https://www.gutenberg.org/.

References

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Lin, T.-Y., Goyal, P., Girshick, R.B., He, K., Dollár, P.: Focal loss for dense object detection. CoRR, abs/1708.02002 (2017)
Google Scholar
Chawla, N.V., Bowyer, K.W., Hall, L.O., Philip Kegelmeyer, W.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16(1), 321–357 (2002)
Article Google Scholar
Glavaš, G., Nanni, F., Ponzetto, S.P.: Unsupervised text segmentation using semantic relatedness graphs. In: Proceedings of the Fifth Joint Conference on Lexical and Computational Semantics, Berlin, Germany, pp. 125–130. Association for Computational Linguistics, August 2016
Google Scholar
Koshorek, O., Cohen, A., Mor, N., Rotman, M., Berant, J.: Text segmentation as a supervised learning task. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), New Orleans, Louisiana, pp. 469–473. Association for Computational Linguistics, June 2018
Google Scholar
Badjatiya, P., Kurisinkel, L.J., Gupta, M., Varma, V.: Attention-based neural text segmentation. CoRR, abs/1808.09935 (2018)
Google Scholar
Loper, E., Bird, S., Klein, E.: Natural Language Processing with Python. O’Reilly Media Inc., Sebastopol (2009)
MATH Google Scholar
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (eds.) Advances in Neural Information Processing Systems 30, pp. 5998–6008. Curran Associates, Inc. (2017)
Google Scholar
Wang, A., Singh, A., Michael, J., Hill, F., Levy, O., Bowman, S.: GLUE: a multi-task benchmark and analysis platform for natural language understanding. In: Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, Brussels, Belgium, pp. 353–355. Association for Computational Linguistics, November 2018
Google Scholar
Rajpurkar, P., Jia, R., Liang, P.: Know what you don’t know: unanswerable questions for SQuAD. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Melbourne, Australia, pp. 784–789. Association for Computational Linguistics, July 2018
Google Scholar
Madabushi, H.T., Kochkina, E., Castelle, M.: Cost-sensitive BERT for generalisable sentence classification on imbalanced data. In: Proceedings of the Second Workshop on Natural Language Processing for Internet Freedom: Censorship, Disinformation, and Propaganda, Hong Kong, China, pp. 125–134. Association for Computational Linguistics, November 2019
Google Scholar
van der Maaten, L.J.P., Hinton, G.E.: Visualizing high-dimensional data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008)
MATH Google Scholar

Download references

Acknowledgement

This work was supported by JSPS KAKENHI Grant, Grant-in-Aid for Scientific Research(B), 19H04184.

Author information

Authors and Affiliations

Osaka Prefecture University, 1-1 Gakuen-cho, Naka-ku, Sakai, Osaka, Japan
Riku Iikura, Makoto Okada & Naoki Mori

Authors

Riku Iikura
View author publications
You can also search for this author in PubMed Google Scholar
Makoto Okada
View author publications
You can also search for this author in PubMed Google Scholar
Naoki Mori
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Riku Iikura .

Editor information

Editors and Affiliations

Business School, Sichuan University, Chengdu, China
Yucheng Dong
Andalusian Research Institute on Data Science and Computational Intelligence (DaSCI), University of Granada, Granada, Spain
Enrique Herrera-Viedma
Dept. of System Design, Osaka Institute of Technology, Osaka, Japan
Kenji Matsui
Hiroshima University, Osaka, Japan
Shigeru Omatsu
GRASIA Research Group, Facultad de Informática, Universidad Complutense de Madrid, Madrid, Spain
Alfonso González Briones
IoT European Digital Innovation Hub, Bioinformatics Intelligent Systems and Educational Technology Research Group, Department of Computer Science, Faculty of Science, University of Salamanca, Salamanca, Spain
Sara Rodríguez González

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Iikura, R., Okada, M., Mori, N. (2021). Improving BERT with Focal Loss for Paragraph Segmentation of Novels. In: Dong, Y., Herrera-Viedma, E., Matsui, K., Omatsu, S., González Briones, A., Rodríguez González, S. (eds) Distributed Computing and Artificial Intelligence, 17th International Conference. DCAI 2020. Advances in Intelligent Systems and Computing, vol 1237. Springer, Cham. https://doi.org/10.1007/978-3-030-53036-5_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-53036-5_3
Published: 07 August 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-53035-8
Online ISBN: 978-3-030-53036-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Improving BERT with Focal Loss for Paragraph Segmentation of Novels

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A More Effective Sentence-Wise Text Segmentation Approach Using BERT

Multi-task Learning for Newspaper Image Segmentation and Baseline Detection Using Attention-Based U-Net Architecture

On Text Tiling for Documents: A Neural-Network Approach

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Improving BERT with Focal Loss for Paragraph Segmentation of Novels

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A More Effective Sentence-Wise Text Segmentation Approach Using BERT

Multi-task Learning for Newspaper Image Segmentation and Baseline Detection Using Attention-Based U-Net Architecture

On Text Tiling for Documents: A Neural-Network Approach

Notes

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation