ABSTRACT
Software artifacts, such as requirements, design, source code, documentation, and safety-related artifacts are typically expressed using domain-specific terminology. Automated tools which attempt to analyze software artifacts in order to perform tasks such as trace retrieval and maintenance, domain analysis, program comprehension, or to service natural language queries, need to understand the vocabulary and concepts of the domain in order to achieve acceptable levels of accuracy. Domain concepts can be captured and stored as an ontology. Unfortunately, constructing ontologies is extremely time-consuming and has proven hard to automate. This dissertation proposes a novel approach for semi-automated ontology building that leverages user-defined trace links to identify candidate domain facts. It uses a variety of web-mining, Natural Language Processing, and machine learning techniques to filter and rank the candidate facts, and to assist the user in building a domain-specific ontology. The benefits of the constructed ontology are described and evaluated within the context of automated trace link creation.
- A. Carlson, J. Betteridge, B. Kisiel, B. Settles, E. R. Hruschka, and T. M. Mitchell. Toward an architecture for never-ending language learning. In In AAAI, 2010. Google ScholarDigital Library
- R. Gacitua and P. Sawyer. Ensemble methods for ontology learning - an empirical experiment to evaluate combinations of concept acquisition techniques. In 7th IEEE/ACIS Conf. on Computer and Information Science, ICIS 2008, USA, pages 328--333, 2008. Google ScholarDigital Library
- R. Gacitua, P. Sawyer, and P. Rayson. A flexible framework to experiment with ontology learning techniques. Knowl.-Based Syst., 21(3):192--199, 2008. Google ScholarDigital Library
- T. R. Gruber. A translation approach to portable ontology specifications. Knowledge acquisition, 5(2):199--220, 1993. Google ScholarDigital Library
- J. Guo, J. Cleland-Huang, and B. Berenbach. Foundations for an expert system in domain-specific traceability. In Requirements Engineering Conference (RE), 2013 21st IEEE International, pages 42--51. IEEE, 2013.Google ScholarCross Ref
- J. Guo, N. Monaikul, C. Plepel, and J. Cleland-Huang. Towards an intelligent domain-specific traceability solution. In Proceedings of the 29th ACM/IEEE international conference on Automated software engineering, pages 755--766. ACM, 2014. Google ScholarDigital Library
- J. H. Hayes, A. Dekhtyar, and S. K. Sundaram. Advancing candidate link generation for requirements tracing: The study of methods. IEEE Trans. Softw. Eng., 32(1):4--19, 2006. Google ScholarDigital Library
- M. A. Hearst. Automatic acquisition of hyponyms from large text corpora. In Proceedings of the 14th conference on Computational linguistics-Volume 2, pages 539--545. Association for Computational Linguistics, 1992. Google ScholarDigital Library
- I. Jureta, J. Mylopoulos, and S. Faulkner. A core ontology for requirements. Applied Ontology, 4(3-4):169--244, 2009. Google ScholarDigital Library
- G. A. Miller. Wordnet: a lexical database for english. Communications of the ACM, 38(11):39--41, 1995. Google ScholarDigital Library
- D. Movshovitz-Attias and W. W. Cohen. Grounded discovery of coordinate term relationships between software entities, 2015.Google Scholar
- K. Robles, A. Fraga, J. Morato, and J. Lloréns. Towards an ontology-based retrieval of UML class diagrams. Information & Software Technology, 54(1):72--86, 2012. Google ScholarDigital Library
- M. Ruiz-Casado, E. Alfonseca, and P. Castells. Automatising the learning of lexical patterns: An application to the enrichment of wordnet by extracting semantic relationships from wikipedia. Data Knowl. Eng., 61(3):484--499, 2007. Google ScholarDigital Library
- F. M. Suchanek, G. Kasneci, and G. Weikum. Yago: a core of semantic knowledge. In Proc. 16th international conference on World Wide Web, pages 697--706. ACM, 2007. Google ScholarDigital Library
- B. Verlaine, Y. Dubois, I. Jureta, and S. Faulkner. Towards conceptual foundations for service-oriented requirements engineering: bridging requirements and services ontologies. IET Software, 6(2):85--102, 2012.Google ScholarCross Ref
- C. Wang, A. Kalyanpur, J. Fan, B. K. Boguraev, and D. Gondek. Relation extraction and scoring in deepqa. IBM Journal of Research and Development, 56(3.4):9--1, 2012. Google ScholarDigital Library
- W. Wu et al. Probase: A probabilistic taxonomy for text understanding. In 2012 ACM SIGMOD Int'l Conf on Management of Data, SIGMOD '12, pages 481--492, New York, NY, USA, 2012. ACM. Google ScholarDigital Library
Index Terms
- Ontology learning and its application in software-intensive projects
Recommendations
Arabic ontology learning using deep learning
WI '17: Proceedings of the International Conference on Web IntelligenceOntology, the backbone of Semantic Web, is defined as the formal specification of conceptual hierarchy with relationships between concepts. Ontology Learning (OL) is a process to create an ontology from text automatically or semi-automatically. OL is an ...
Ontology learning: revisted
The term "ontology" comes from the field of philosophy that is concerned with the study of being or existence. In general computer science defines ontology as an "explicit specification of a conceptualization," which is, "the objects, concepts, and ...
Formal semantics-preserving translation from fuzzy ER model to fuzzy OWL DL ontology
Ontology is an important part of the W3C standards for the Semantic Web, and how to quickly and cheaply construct Web ontologies has become a key technology to enable the Semantic Web. However, information imprecision and uncertainty exist in many real-...
Comments