Skip to main content
Log in

Automated ontology generation from a plain text using statistical and NLP techniques

  • Original Article
  • Published:
International Journal of System Assurance Engineering and Management Aims and scope Submit manuscript

Abstract

Major portion of web pages contains the natural language text and understanding of natural language text from the web pages is a major challenge for machines. Due to this lacking search engines are not able to provide relevant information to the users. This problem is tackled by natural language processing techniques and the development of ontologies from natural language text. With the help of such ontologies search of information can increases manifold. Much research in the field of text processing and automatic ontology building from text has been done to address these challenges. The proposed method in this paper is another effort to build automatic ontology from domain specific text. In this method we first extract concepts from a given domain specific text. We have used a Stanford parser to parse the text and the dictionary of basic concepts is created manually containing all the domain specific concepts and their relationships by recognizing laxico-syntactic patterns in the text corpus. Once concepts and relations among concepts as well as properties of concepts are identified, the extracted information can be represented in the form of graph and OWL.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  • ADen L (2004) Natural language understanding. Pearson Publication, New York

    Google Scholar 

  • Agirre E, Ansa O, Hovy E, Martinez D (2000) Enriching very large ontologies using the WWW. In: Proceedings of the ontology learning workshop, ECAl, Berlin, Germany

  • Allen JF (1993) Natural language, knowledge representation and logical form. In: Bates M, Weischedel R (eds) Challenges in natural language processing. Cambridge University Press, Cambridge

    Google Scholar 

  • Bedini I, Nguyen B (2007) Automatic ontology generation: state of the art. In: PRiSM Laboratory Technical Report. University of Versailles

  • Cho M, Kim H, Kirn P (2006) A new method for ontology merging based on concept using wordnet. In: Advanced communication technology, 2006. ICACT 2006. The 8th international conference, vol 3, p 157311576

  • Chomsky N (2002) Theory of syntactic structures. ISBN-13: 978-1614278047

  • Dorr BJ, Jordan PW, Benoit HW (1998) A survey of current paradigms in machine translation. Technical Report: LL\MP-TR-027, UMIACS-TR-98-72, CS-TR-3961, University of Maryland, College Park, December 1998

  • Fromkin V et al (2000) Linguistics: an introduction to linguistic theory, Chapter 3.2 (Constituent Order, Case Marking and Thematic Roles). Blackwell Publishing, Hoboken

    Google Scholar 

  • Gruber TR (1995) Toward principles for the design of ontologies used for knowledge sharing. Int J Hum Comput Stud 43:907–928

    Article  Google Scholar 

  • http://ww.w3.org/2003/08/owlfaq

  • Jurafsky D, Martin J (2000) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition upper saddle river. Prentice Hall, New Jersey

    Google Scholar 

  • Kietz J, Maedche A, Volz R (2000) A method for semi-automatic ontology acquisition from a corporate intranet. In: Proceedings of EKA W-2000 workshop ontologies and text, Juan-Les-Pins, France

  • Kong H, Hwang M, Kim P (2006) Design of the automatic ontology building system about the specific domain knowledge. In: Advanced communication technology, 2006. ICACT 2006. The 8th international conference, vol 2

  • MiUer GA (1995) WORDNET: a lexical database for English. Commun ACM 1:39–41

    Google Scholar 

  • Moldovan DI, Girju R (2000) Domain-specific knowledge acquisition and classification using wordnet. In: Proceedings of the thirteenth international florida artificial intelligence research society conference, pp 224/228. AAAI Press, p 81

  • Noy NF, McGuinness DL. Ontology Development 101: a Guide to creating your first Ontology. http://protege.stanford.edu/publications/ontology_development/ontology101.pdf

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Naresh Kumar.

Ethics declarations

Conflict of interest

None.

Appendix

Appendix

See Tables 1, 2, 3, 4, 5 and 6.

Table 4 Rules for generating nested concepts
Table 5 Dictionary of valid concepts for computer domain
Table 6 Dictionary of valid properties for computer domain

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kumar, N., Kumar, M. & Singh, M. Automated ontology generation from a plain text using statistical and NLP techniques. Int J Syst Assur Eng Manag 7 (Suppl 1), 282–293 (2016). https://doi.org/10.1007/s13198-015-0403-1

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13198-015-0403-1

Keywords

Navigation