Skip to main content

Ontological Text Mining of Software Documents

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4592))

Abstract

Documents written in natural languages constitute a major part of the software engineering lifecycle artifacts. Especially during software maintenance or reverse engineering, semantic information conveyed in these documents can provide important knowledge for the software engineer. In this paper, we present a text mining system capable of populating a software ontology with information detected in documents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Sommerville, I.: Software Engineering, 6th edn. Addison-Wesley, Reading (2000)

    Google Scholar 

  2. Seacord, R., Plakosh, D., Lewis, G.: Modernizing Legacy Systems: Software Technologies, Engineering Processes, and Business Practices. SEI Series in SE. Addison-Wesley, Reading (2003)

    Google Scholar 

  3. Jin, D., Cordy, J.: Ontology-Based Software Analysis and Reengineering Tool Integration: The OASIS Service-Sharing Methodology. In: 21st IEEE International Conference on Software Maintenance (ICSM) (2005)

    Google Scholar 

  4. Antoniol, G., Canfora, G., Casazza, G., Lucia, A.D.: Information retrieval models for recovering traceability links between code and documentation. In: Proc. of IEEE Intl. Conf. on Software Maintenance, San Jose, CA, USA (2000)

    Google Scholar 

  5. IEEE: IEEE Standard for Software Maintenance. IEEE 1219 (1998)

    Google Scholar 

  6. Riva, C.: Reverse Architecting: An Industrial Experience Report. In: 7th IEEE Working Conference on Reverse Engineering (WCRE), pp. 42–52 (2000)

    Google Scholar 

  7. Storey, M.A., Sim, S.E., Wong, K.: A Collaborative Demonstration of Reverse Engineering tools. ACM SIGAPP Applied Computing Review 10(1), 18–25 (2002)

    Article  Google Scholar 

  8. Welty, C.: Augmenting Abstract Syntax Trees for Program Understanding. In: Proc. of Int. Conf. on Automated Software Engineering, pp. 126–133. IEEE Computer Society Press, Los Alamitos (1997)

    Chapter  Google Scholar 

  9. Lethbridge, T.C., Nicholas, A.: Architecture of a Source Code Exploration Tool: A Soft-ware Engineering Case Study. Technical Report TR-97-07, Department of Computer Science, University of Ottawa (1997)

    Google Scholar 

  10. Meng, W., Rilling, J., Zhang, Y., Witte, R., Charland, P.: An Ontological Software Comprehension Process Model. In: 3rd Int. Workshop on Metamodels, Schemas, Grammars, and Ontologies for Reverse Engineering (ATEM, Genoa, Italy (October 1st 2006), pp. 28–35 (2006)

    Google Scholar 

  11. Lindvall, M., Sandahl, K.: How well do experienced software developers predict software change? Journal of Systems and Software 43(1), 19–27 (1998)

    Article  Google Scholar 

  12. Johnson-Laird, P.N.: Mental Models: Towards a Cognitive Science of Language, Inference and Consciousness. Harvard University, Cambridge, MA (1983)

    Google Scholar 

  13. Rilling, J., Witte, R., Zhang, Y.: Automatic Traceability Recovery: An Ontological Approach. In: International Symposium on Grand Challenges in Traceability (GCT 2007), Lexington, Kentucky, USA (March 22–23, 2007)

    Google Scholar 

  14. Haarslev, V., Möller, R.: RACER: System Description. In: Goré, R.P., Leitsch, A., Nipkow, T. (eds.) IJCAR 2001. LNCS (LNAI), vol. 2083, pp. 701–705. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  15. Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A framework and graphical development environment for robust NLP tools and applications. In: Proc. of the 40th Anniversary Meeting of the ACL (2002)

    Google Scholar 

  16. Witte, R., Bergler, S.: Fuzzy Coreference Resolution for Summarization. In: Proceedings of 2003 International Symposium on Reference Resolution and Its Applications to Question Answering and Summarization (ARQAS), Venice, Italy, Università Ca’ Foscari (June 23–24 2003), pp. 43–50 http://rene-witte.net

  17. Gaizauskas, R., Hepple, M., Saggion, H., Greenwood, M.A., Humphreys, K.: SUPPLE: A practical parser for natural language engineering applications. In: Proc. of the 9th Intl. Workshop on Parsing Technologies (IWPT2005), Vancouver (2005)

    Google Scholar 

  18. Witte, R., Kappler, T., Baker, C.J.O.: Ontology Design for Biomedical Text Mining. In: Semantic Web: Revolutionizing Knowledge Discovery in the Life Sciences, pp. 281–313. Springer, Heidelberg (2006)

    Google Scholar 

  19. Mencl, V.: Deriving behavior specifications from textual use cases. In: Proceedings of Workshop on Intelligent Technologies for Software Engineering, Linz, Austria, Oesterreichische Computer Gesellschaft, pp. 331–341 (2004)

    Google Scholar 

  20. Ilieva, M., Ormandjieva, O.: Automatic transition of natural language software requirements specification into formal presentation. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 392–397. Springer, Heidelberg (2005)

    Google Scholar 

  21. Kof, L.: Natural language processing: Mature enough for requirements documents analysis? In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 91–102. Springer, Heidelberg (2005)

    Google Scholar 

  22. Marcus, A., Maletic, J.I.: Recovering Documentation-to-Source-Code Traceability Links using Latent Semantic Indexing. In: Proc. of 25th Intl. Conf. on Software Engineering (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Zoubida Kedad Nadira Lammari Elisabeth Métais Farid Meziane Yacine Rezgui

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Witte, R., Li, Q., Zhang, Y., Rilling, J. (2007). Ontological Text Mining of Software Documents. In: Kedad, Z., Lammari, N., Métais, E., Meziane, F., Rezgui, Y. (eds) Natural Language Processing and Information Systems. NLDB 2007. Lecture Notes in Computer Science, vol 4592. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73351-5_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73351-5_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73350-8

  • Online ISBN: 978-3-540-73351-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics