Skip to main content

User Story-Based Automatic Keyword Extraction Using Algorithms and Analysis

  • Conference paper
  • First Online:
Intelligent Data Engineering and Analytics (FICTA 2023)

Abstract

Writing effective requirement specification documents in the face of dynamic user needs is a significant challenge in modern software development. Keyword extraction algorithms can help to retrieve relevant information from functional requirements provided by users. The process of identifying a concise set of words that capture the essence of user stories without losing important information is accomplished through automatic keyword extraction. This paper presents a comparative study of four popular algorithms for keyword extraction: Rapid Automatic Keyword Extraction (RAKE), Yet Another Keyword Extraction (YAKE), TextRank, and KeyBERT. The algorithms are analyzed based on their ability to extract keywords and provide corresponding scores. Additionally, N-gram analysis is performed using the extracted keywords. The study concludes that the RAKE algorithm exhibits better performance in extracting relevant keywords from user stories compared to the other algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhang, C., Wang, H., Liu, Y., Wu, D., Liao, Y., Wang, B.: Automatic keyword extraction from documents using conditional random fields. J. Comput. Inf. Syst. 3 (2008)

    Google Scholar 

  2. Zhao, H., Zeng, Q.: Micro-blog keyword extraction method based on graph model and semantic space. J. Multimed. 8(5), 611–617 (2013)

    Article  Google Scholar 

  3. Beliga, S.: Keyword Extraction: A Review of Methods and Approaches. University of Rijeka, Department of Informatics, Rijeka (2014)

    Google Scholar 

  4. Beliga, S., Meštrović, A., Martinčić-Ipšić, S.: An overview of graph-based keyword extraction methods and approaches. J. Inf. Organ. Sci. 39(1), 1–20 (2015)

    Google Scholar 

  5. Carretero-Campos, C., Bernaola-Galván, P., Coronado, A.V., Carpena, P.: Improving statistical keyword detection in short texts: Entropic and clustering approaches. Phys. A 392(6), 1481–1492 (2013)

    Article  Google Scholar 

  6. Yang, Z., Lei, J., Fan, K., Lai, Y.: Keyword extraction by entropy difference between the intrinsic and extrinsic mode. Phys. A 392(19), 4523–4531 (2013)

    Article  Google Scholar 

  7. Ventura, J., Silva, J.: Mining concepts from texts. Procedia Comput. Sci. 9, 27–36 (2020)

    Article  Google Scholar 

  8. Hong, B., Zhen, D.: An extended keyword extraction method. Phys. Procedia 24, 1120–1127 (2012)

    Google Scholar 

  9. Miah, M.B.A, Awang, S., Azad, M.S.: Region-based distance analysis of keyphrases: a new unsupervised method for extracting keyphrase feature from articles. In: 2021 International Conference on Software Engineering & Computer Systems and 4th Internal Conference on Computational Science and Information Management (ICSECS-ICOCSIM). pp 124–129, IEEE (2021)

    Google Scholar 

  10. Hulth, A.: Improved Automatic Keyword Extraction Given More Linguistic Knowledge

    Google Scholar 

  11. Huang, Z., Xie, Z.: A patent keywords extraction method using TextRank model with prior public knowledge. Compledx Intell. Syst. 8, 1–12 (2022). 10/1007/s40747-021-00343-8

    Google Scholar 

  12. Merrouni, Z.A., Frikh, B., Ouhbi, B.: Automatic keyphrase extraction: a survey and trends. J. Intell. Inf. Syst. 54(2), 391–424 (2020)

    Google Scholar 

  13. Dutta, A.: A novel extension for automatic keyword extraction. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 6(5), May 2016

    Google Scholar 

  14. Sarwar, T.B., Noor, N.M., Saef Ullah Miah, M.: Evaluating keyphrase extraction algorithms for finding similar news articles using lexical similarity calculation and semantic relatedness measurement by word embedding. Peer J. Comput. Sci. 8, e1024 (2022) 107717/peerj-cs.1024

    Google Scholar 

  15. Kian, H.H., Zahedi, M.: Improving precision in automatic keyword extraction using attention attractive strings. Arab. J. Sci. Eng.

    Google Scholar 

  16. Hasan, M., Sanyal, F., Chaki, D., Ali, H.: An empirical study of important keywords extraction techniques from documents. 978-1-5090-4264-7/17/$31.00 ©2017 IEE

    Google Scholar 

  17. Thushara, M.G., Krishnapriya, M.S., Nair, S.S.: A model for auto-tagging of research papers based on keyphrase extraction methods. In: 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI). IEEE, Udupi, India (2017)

    Google Scholar 

  18. Jayaram, K., Sangeeta, K.: A review: information extraction techniques from research papers. In: IEEE International Conference on Innovative Mechanisms for Industry Applications, ICIMIA 2017—Proceedings, pp. 56–59 (2017)

    Google Scholar 

  19. Thushara, M.G., Dominic, N.: A template based checking and automated tagging algorithm for project documents. In: Second International Conference on Computing Paradigms (International Journal of Control Theory and Applications), vol. 9, no. 10. pp. 4537–4544 (2016)

    Google Scholar 

  20. Thushara, M.G., Mounika, T., Mangamuru, R.: A comparative study on different keyword extraction algorithms. In: Proceedings of the third international conference on computing methodologies and communication (ICCMC 2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to C. Arunkumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jaagruthi, A., Varshitha, M., Vinaya, K.S., Gupta, V.N., Arunkumar, C., Sabarish, B.A. (2023). User Story-Based Automatic Keyword Extraction Using Algorithms and Analysis. In: Bhateja, V., Carroll, F., Tavares, J.M.R.S., Sengar, S.S., Peer, P. (eds) Intelligent Data Engineering and Analytics. FICTA 2023. Smart Innovation, Systems and Technologies, vol 371. Springer, Singapore. https://doi.org/10.1007/978-981-99-6706-3_30

Download citation

Publish with us

Policies and ethics