Personal research idea recommendation using research trends and a hierarchical topic model

Wang, Hei-Chia; Hsu, Tzu-Ting; Sari, Yunita

doi:10.1007/s11192-019-03258-x

Personal research idea recommendation using research trends and a hierarchical topic model

Published: 03 October 2019

Volume 121, pages 1385–1406, (2019)
Cite this article

Scientometrics Aims and scope Submit manuscript

931 Accesses
6 Citations
Explore all metrics

Abstract

In the era of rapid technological advance, it is an important task for all researchers to keep up with trends when performing research. How to efficiently find suitable research topics while the number of papers is increasing rapidly is worthwhile to explore. To solve such problems, some researchers attempted to find research ideas by topic detection and tracking methods. However, these methods do not consider the users’ background knowledge and preferences, and they express a topic with general keywords, which does not effectively help researchers to develop new research ideas. Existing studies support that the title expresses the research idea the best. This study adapts this concept to propose an automatic title generation method that combines personalized recommendation methods and topic trend analysis methods to achieve this task. First, it uses hierarchical latent tree analysis to find the users’ interests for a topic structure and its representative keywords hidden in the existing research. Second, the interesting topic trends, popularity and user preferences in a hybrid recommendation method are considered. Finally, a natural language generation algorithm that is suitable for the titles of academic papers converts the original recommended-keywords into fluent title sentences that are designed for the users. Experiments have found that adding Google Trend indicators and personal factors can improve the performance of topic recommendations. The automatic title generation method using template-based and statistical information methods leads to excellent performances in both grammatical correctness and semantic expression. Moreover, for the users, the title is indeed more inspirational than the simple keywords for users to develop new research ideas.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Path-Based Academic Paper Recommendation

Learning from Titles to Recommend Keywords for Academic Papers

A scientific paper recommendation method using the time decay heterogeneous graph

Article 23 January 2024

References

Allan, J., Papka, R., & Lavrenko, V. (1998). On-line new event detection and tracking. Paper presented at the proceedings of the 21st annual international ACM SIGIR conference on research and development in information retrieval, Melbourne, Australia.
Blei, D. M., Griffiths, T. L., & Jordan, M. I. (2010). The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies. Journal of the ACM (JACM),57(2), 7.
Article MathSciNet Google Scholar
Blei, D. M., & Lafferty, J. D. (2006). Dynamic topic models. Paper presented at the proceedings of the 23rd international conference on machine learning, Pittsburgh, Pennsylvania, USA.
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research,3(Jan), 993–1022.
MATH Google Scholar
Boon, S. (2017). 21st Century science overload. Retrieved from http://blog.cdnsciencepub.com/21st-century-science-overload/. Accessed 7 Jan 2017.
Chen, P., Zhang, N. L., Liu, T., Poon, L. K., Chen, Z., & Khawar, F. (2017). Latent tree models for hierarchical topic detection. Artificial Intelligence,250, 105–124.
Article MathSciNet Google Scholar
Deemter, K. V., Theune, M., & Krahmer, E. (2005). Real versus template-based natural language generation: A false opposition? Computational Linguistics,31(1), 15–24.
Article Google Scholar
Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., & Harshman, R. (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science,41(6), 391.
Article Google Scholar
Hartley, J. (2005). To attract or to inform: What are titles for? Journal of Technical Writing and Communication,35(2), 203–213.
Article Google Scholar
Hofmann, T. (1999). Probabilistic latent semantic analysis. Paper presented at the proceedings of the fifteenth conference on uncertainty in artificial intelligence.
Howald, B., Kondadadi, R., & Schilder, F. (2013). Domain adaptable semantic clustering in statistical NLG. Paper presented at the proceedings of the 10th international conference on computational semantics (IWCS 2013)—Long papers.
Jamali, H. R., & Nikzad, M. (2011). Article title type and its relation with the number of downloads and citations. Scientometrics,88(2), 653–661.
Article Google Scholar
Jinha, A. E. (2010). Article 50 million: An estimate of the number of scholarly articles in existence. Learned Publishing,23(3), 258–263.
Article Google Scholar
Lau, J. H., Baldwin, T., & Newman, D. (2013). On collocations and topic models. ACM Transactions on Speech and Language Processing (TSLP),10(3), 10.
Google Scholar
Lopez, C., Prince, V., & Roche, M. (2011). Automatic titling of articles using position and statistical information. Paper presented at the proceedings of the international conference recent advances in natural language processing 2011.
Lopez, C., Prince, V., & Roche, M. (2014). How can catchy titles be generated without loss of informativeness? Expert Systems with Applications,41(4), 1051–1062.
Article Google Scholar
Lü, L., Medo, M., Yeung, C. H., Zhang, Y.-C., Zhang, Z.-K., & Zhou, T. (2012). Recommender systems. Physics Reports,519(1), 1–49.
Article Google Scholar
Lu, Y., Mei, Q., & Zhai, C. (2011). Investigating task performance of probabilistic topic models: An empirical study of PLSA and LDA. Information Retrieval,14(2), 178–203.
Article Google Scholar
Luong, M.-T., Pham, H., & Manning, C. D. (2015). Effective approaches to attention-based neural machine translation. Paper presented at the proceedings of the 2015 conference on empirical methods in natural language processing.
Mairesse, F., Gašić, M., Jurčíček, F., Keizer, S., Thomson, B., Yu, K., & Young, S. (2010). Phrase-based statistical language generation using graphical models and active learning. Paper presented at the proceedings of the 48th annual meeting of the association for computational linguistics.
Ogawa, T., & Kajikawa, Y. (2017). Generating novel research ideas using computational intelligence: A case study involving fuel cells and ammonia synthesis. Technological Forecasting and Social Change,120, 41–47.
Article Google Scholar
Paisley, J., Wang, C., Blei, D. M., & Jordan, M. I. (2015). Nested hierarchical Dirichlet processes. IEEE Transactions on Pattern Analysis and Machine Intelligence,37(2), 256–270.
Article Google Scholar
Perera, R., & Nand, P. (2017). Recent advances in natural language generation: A survey and classification of the empirical literature. Computing and Informatics,36(1), 1–32.
Article MathSciNet Google Scholar
Reiter, E., & Dale, R. (2000). Building natural language generation systems. Cambridge: Cambridge University Press.
Book Google Scholar
Salakhutdinov, R., & Mnih, A. (2008). Probabilistic matrix factorization. Paper presented at the proceedings of advances in neural information processing systems 20 (NIPS 07) (pp. 1257–1264). ACM Press.
Sasaki, A. (2017). Search engine statistics 2017. Retrieved from https://www.airsassociation.org/airs-articles/search-engine-statistics-2017. Accessed on 5 May 2017.
Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics,6(2), 461–464.
Article MathSciNet Google Scholar
Stent, A., Marge, M., & Singhai, M. (2005). Evaluating evaluation methods for generation in the presence of variation. Paper presented at the international conference on intelligent text processing and computational linguistics.
Sun, L., & Yin, Y. (2017). Discovering themes and trends in transportation research using topic modeling. Transportation Research Part C: Emerging Technologies,77, 49–66.
Article Google Scholar
Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. Paper presented at the advances in neural information processing systems 27.
Wang, H., Wang, N., & Yeung, D. Y. (2015) Collaborative deep learning for recommender systems. Paper presented at the proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, Sydney, NSW, Australia (pp. 1235–1244).
Wang, H., Xingjian, S., & Yeung, D. Y. (2016) Collaborative recurrent autoencoder: Recommend while learning to fill in the blanks. Paper presented at the proceedings of the 30th annual conference on neural information processing systems, Barcelona, Spain (pp. 415–423).
Zhang, Y., Zhang, G., Chen, H., Porter, A. L., Zhu, D., & Lu, J. (2016). Topic analysis and forecasting for science, technology and innovation: Methodology with a case study focusing on big data research. Technological Forecasting and Social Change,105, 179–191.
Article Google Scholar
Zheng, H.-T., Kang, B.-Y., & Kim, H.-G. (2009). Exploiting noun phrases and semantic relationships for text document clustering. Information Sciences,179(13), 2249–2262.
Article Google Scholar

Download references

Acknowledgements

The research is based on work supported by Taiwan Ministry of Science and Technology under Grant No. MOST 107-2410-H-006 040-MY3 and MOST 108-2511-H-006-009.

Author information

Authors and Affiliations

Institute of Information Management, National Cheng Kung University, Tainan, 701, Taiwan
Hei-Chia Wang & Tzu-Ting Hsu
Department of Computer Sciences and Electronics, Faculty of Mathematics and Natural Sciences, Universitas Gadjah Mada, Yogyakarta, Indonesia
Yunita Sari

Authors

Hei-Chia Wang
View author publications
You can also search for this author inPubMed Google Scholar
Tzu-Ting Hsu
View author publications
You can also search for this author inPubMed Google Scholar
Yunita Sari
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Hei-Chia Wang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, HC., Hsu, TT. & Sari, Y. Personal research idea recommendation using research trends and a hierarchical topic model. Scientometrics 121, 1385–1406 (2019). https://doi.org/10.1007/s11192-019-03258-x

Download citation

Received: 17 January 2019
Published: 03 October 2019
Issue Date: December 2019
DOI: https://doi.org/10.1007/s11192-019-03258-x

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Personal research idea recommendation using research trends and a hierarchical topic model

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Path-Based Academic Paper Recommendation

Learning from Titles to Recommend Keywords for Academic Papers

A scientific paper recommendation method using the time decay heterogeneous graph

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now