Abstract
This paper presents a novel methodology for topic ontology learning from text documents. The proposed methodology, named OntoTermExtraction (Term Extraction for Ontology learning), is based on OntoGen, a semi-automated tool for topic ontology construction, upgraded by using an advanced terminology extraction tool in an iterative, semi-automated ontology construction process. This process consists of (a) document clustering to find the nodes in the topic ontology, (b) term extraction from document clusters, (c) populating the term vocabulary and keyword extraction, and (d) choosing the concept names by comparing the best-ranked terms with the extracted keywords. The approach was successfully used for generating the ontology of topics in Inductive Logic Programming, learned semi-automatically from papers indexed in the ILPnet2 publications database.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Fortuna, B., Mladenić, D., Grobelnik, M.: Semi-automatic construction of topic ontologies. In: Ackermann, M., et al. (eds.) EWMF 2005 and KDO 2005. LNCS (LNAI), vol. 4289, pp. 121–131. Springer, Heidelberg (2006)
Fortuna, B., Grobelnik, M., Mladenić, D.: Semi-automatic data-driven ontology construction system. In: Proceedings of the 9th International Multi-conference Information Society, Ljubljana, Slovenia, pp. 223–226 (2006)
The Protégé project (2000), http://protege.stanford.edu
ILPNet2 publications database, http://www.cs.bris.ac.uk/~ILPnet2/
Sabo, S., Grčar, M., Fabjan, D.A., Ljubič, P., Lavrač, N.: Exploratory analysis of the ILPnet2 social network. In: Proceedings of the 10th International Multi-conference Information Society, Ljubljana, Slovenia, pp. 223–227 (2007)
Grobelnik, M., Mladenić, D.: Simple classification into large topic ontology of web documents. In: Proceedings of the 27th International Conference Information Technology Interfaces, Dubrovnik, Croatia, pp. 188–193 (2005)
The TermExtractor tool, http://lcl2.uniroma1.it/termextractor
Sclano, F., Velardi, P.: TermExtractor: A Web application to learn the common terminology of interest groups and research communities. In: Proceedings of the 9th Conference on Terminology and Artificial Intelligence, Sophia Antipolis, France (2007)
Mladenić, D., Grobelnik, M.: Evaluation of semi-automatic ontology generation in real-world setting. In: Proceedings of the 29th International Conference Information Technology Interfaces, Dubrovnik, Croatia, pp. 547–551 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fortuna, B., Lavrač, N., Velardi, P. (2008). Advancing Topic Ontology Learning through Term Extraction. In: Ho, TB., Zhou, ZH. (eds) PRICAI 2008: Trends in Artificial Intelligence. PRICAI 2008. Lecture Notes in Computer Science(), vol 5351. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89197-0_57
Download citation
DOI: https://doi.org/10.1007/978-3-540-89197-0_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89196-3
Online ISBN: 978-3-540-89197-0
eBook Packages: Computer ScienceComputer Science (R0)