Before and After COVID-19 Outbreak Using Variance Representation Comparative Analysis of Newspaper Articles on the Travel Hotel Industry

Yang, Yeqing; Asahi, Yumi

doi:10.1007/978-3-031-60114-9_12

Yeqing Yang²⁶ &
Yumi Asahi²⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14690))

Included in the following conference series:

International Conference on Human-Computer Interaction

400 Accesses

Abstract

This study explores the impact of COVID-19 on Japan’s travel industry by analyzing differences before and after the pandemic through articles from the Nihon Keizai Shimbun. It employs text mining techniques like Latent Semantic Analysis (LSA) and BERT (a natural language model) to process and categorize information from newspaper titles. The analysis involves extracting nouns using MeCab, creating a frequency matrix, decomposing it with NMF for clustering, and setting topics. BERT is used for text classification, focusing on token attention weights and variance representation. The data includes articles from Nikkei Morning News pre- and post-COVID-19, specifically tagged with “Travel & Hotel,” totaling 792 articles. Analysis revealed ten topics such as vaccines, business structures, and financial results. Hierarchical clustering grouped these topics across eight clusters. Findings indicate a shift in topics post-COVID-19 towards financial impacts and business activities, highlighting tokens related to company activities and keywords associated with the pandemic. Future work aims at improving classification accuracy and leveraging data insights.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Using Word2Vec-LDA-Cosine Similarity for Discovering News Dissemination Pattern to Support Government–Citizen Engagement

An Analysis of the news coverage of Fintech in Africa: a natural language processing approach

Article 08 January 2025

Agenda-Setting for COVID-19: A Study of Large-Scale Economic News Coverage Using Natural Language Processing

Article Open access 06 October 2022

References

Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. CoRR, abs/1810.04805 (2018)
Google Scholar
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Xing, E.P., Jebara, T. (eds.) Proceedings of the 31st International Conference on Machine Learning, Proceedings of Machine Learning Research, vol. 32 , pp. 1188–1196, Beijing, China, June 2014. PMLR (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Management, Department of Management, Tokyo University of Science, 1-11-2, Fujimi, Chiyoda-Ku, 102-0071, Tokyo, Japan
Yeqing Yang & Yumi Asahi

Authors

Yeqing Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yumi Asahi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yeqing Yang .

Editor information

Editors and Affiliations

Tokyo City University, Tokyo, Japan
Hirohiko Mori
Tokyo University of Science, Tokyo, Japan
Yumi Asahi

Appendices

Appendix 1: Data Code List

Code related to title data analysis

https://colab.research.google.com/drive/14kVhJuJSRHV0m7zCKCuPuOM0DkPcgKHQ
2019 Nikkei morning edition article data

https://tus.app.box.com/file/998437276219?s=e7hx3pyypf06ekyjjignhpc8b725sdd1
2022 Nikkei morning edition article data

https://tus.app.box.com/file/947471044518?s=bs8c424lfrckgm6o5k8bbh8vuf81axru
Weighted word list - positive

https://tus.box.com/s/53hksp6tk3o1bjnzbwfkgvxywij7ido6
Weighted word list-negetive

https://tus.box.com/s/stsfk4khv7sa7ihvarnwudeudtraeyjb

Appendix 2: Attention Weight Visualization Example

In English, it is easy because only the alphabet and each word is separated, but in Japanese, kanji, hiragana, katakana, emoji and other words are connected, so it is difficult to judge where to cut. So, since the data used this time is all in Japanese, the analysis results are also attached in Japanese as is. Thank you for your understanding.

Example sentences after the coronavirus pandemic and with high $P_{AC}$

Example sentences before the coronavirus pandemic and with high $P_{AC}$

Example sentences before the coronavirus pandemic and with high $P_{BC}$

Example sentences after the coronavirus pandemic and with high $P_{BC}$

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, Y., Asahi, Y. (2024). Before and After COVID-19 Outbreak Using Variance Representation Comparative Analysis of Newspaper Articles on the Travel Hotel Industry. In: Mori, H., Asahi, Y. (eds) Human Interface and the Management of Information. HCII 2024. Lecture Notes in Computer Science, vol 14690. Springer, Cham. https://doi.org/10.1007/978-3-031-60114-9_12

Download citation

DOI: https://doi.org/10.1007/978-3-031-60114-9_12
Published: 01 June 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-60113-2
Online ISBN: 978-3-031-60114-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Before and After COVID-19 Outbreak Using Variance Representation Comparative Analysis of Newspaper Articles on the Travel Hotel Industry