Research on Constructing Technology of Implicit Hierarchical Topic Network Based on FP-Growth

Yu, Wentao; Yi, Mianzhu; Li, Zhufeng

doi:10.1007/978-3-030-24274-9_23

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 11632)

Included in the following conference series:

International Conference on Artificial Intelligence and Security

Abstract

Topic extraction for books is important for intelligent reading systems, question answering systems, and related applications. Compared with the topics of microblogs and scientific and technical literature, the topics of a book are multi-themed, hierarchical, networked, and share information with one another, so extracting them is more complicated and difficult. This paper addresses problems in intelligent reading systems such as quickly locating the content relevant to an answer and retrieving across topics. Starting from topic trees extracted from the chapters of a novel with the TF-IDF algorithm, the FP-Growth algorithm is used to mine association relationships among topic words. These associations are then used to analyze the hidden relationships between topics, and an implicit hierarchical topic network (IHTN) of the novel text is proposed and constructed. The experimental results show that this method can completely extract the topic network of a novel, effectively extract the relationships between chapters, significantly reduce answer retrieval time in the question answering system, and improve the accuracy of the answers.
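The mining step described in the abstract can be illustrated with a minimal sketch. It assumes the TF-IDF stage has already reduced each chapter to a small set of topic words; the chapter names, the words, the `min_support` value, and the rule for linking chapters that share a frequent word set are all hypothetical choices made for illustration, not the authors' published procedure. The miner itself is a compact, generic FP-Growth (FP-tree plus recursive conditional pattern bases).

```python
from collections import defaultdict
from itertools import combinations


class Node:
    """One FP-tree node: an item, its count, and links to parent/children."""
    def __init__(self, item, parent):
        self.item, self.parent = item, parent
        self.count = 0
        self.children = {}


def build_tree(transactions, min_support):
    """Build an FP-tree from (items, count) pairs; return header table and supports."""
    support = defaultdict(int)
    for items, count in transactions:
        for item in set(items):
            support[item] += count
    frequent = {i: s for i, s in support.items() if s >= min_support}
    root, header = Node(None, None), defaultdict(list)
    for items, count in transactions:
        # Keep frequent items only, ordered by descending support (ties by name).
        ordered = sorted((i for i in set(items) if i in frequent),
                         key=lambda i: (-frequent[i], i))
        node = root
        for item in ordered:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += count
            node = child
    return header, frequent


def fp_growth(transactions, min_support, suffix=frozenset()):
    """Yield (itemset, support) for every frequent itemset."""
    header, frequent = build_tree(transactions, min_support)
    for item, nodes in header.items():
        itemset = suffix | {item}
        yield itemset, frequent[item]
        # Conditional pattern base: the prefix path above each occurrence of item.
        conditional = []
        for node in nodes:
            path, parent = [], node.parent
            while parent is not None and parent.item is not None:
                path.append(parent.item)
                parent = parent.parent
            if path:
                conditional.append((path, node.count))
        yield from fp_growth(conditional, min_support, itemset)


def chapter_links(chapters, patterns, min_size=2):
    """Assumed linking rule: connect chapters that share a frequent word set."""
    links = defaultdict(set)
    for itemset in patterns:
        if len(itemset) < min_size:
            continue
        holders = [name for name in sorted(chapters) if itemset <= chapters[name]]
        for a, b in combinations(holders, 2):
            links[(a, b)].add(itemset)
    return links


# Hypothetical per-chapter topic words, standing in for TF-IDF output.
chapters = {
    "ch1": {"harbor", "storm", "captain"},
    "ch2": {"harbor", "captain", "letter"},
    "ch3": {"storm", "captain", "harbor"},
    "ch4": {"letter", "garden"},
}
transactions = [(sorted(words), 1) for words in chapters.values()]
patterns = dict(fp_growth(transactions, min_support=2))
links = chapter_links(chapters, patterns)

for pair, shared in sorted(links.items()):
    print(pair, sorted(sorted(s) for s in shared))
```

In this toy input, chapters 1 to 3 end up linked through word sets such as {captain, harbor}, while chapter 4 stays isolated because none of its word pairs reaches the support threshold. A real system would tune the threshold and the linking rule to the book at hand.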

Author information

Authors and Affiliations

Zhengzhou Information Science and Technology Institute, Zhengzhou, 450001, China
Wentao Yu, Mianzhu Yi & Zhufeng Li

Corresponding author

Correspondence to Zhufeng Li.

Editor information

Editors and Affiliations

Nanjing University of Information Science and Technology, Nanjing, China
Xingming Sun
Nanjing University of Information Science and Technology, Nanjing, China
Zhaoqing Pan
Purdue University, West Lafayette, IN, USA
Elisa Bertino

About this paper

Cite this paper

Yu, W., Yi, M., Li, Z. (2019). Research on Constructing Technology of Implicit Hierarchical Topic Network Based on FP-Growth. In: Sun, X., Pan, Z., Bertino, E. (eds) Artificial Intelligence and Security. ICAIS 2019. Lecture Notes in Computer Science, vol 11632. Springer, Cham. https://doi.org/10.1007/978-3-030-24274-9_23

DOI: https://doi.org/10.1007/978-3-030-24274-9_23
Published: 11 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24273-2
Online ISBN: 978-3-030-24274-9
eBook Packages: Computer Science, Computer Science (R0)
