Abstract
This paper focuses on collaborative engineering projects in which knowledge plays a key role. Collaboration is the arena, engineering projects are the target, and knowledge is the currency that brings harmony to the arena, since it can support innovation and, hence, successful collaboration. The building and construction domain faces significant problems in exchanging, sharing and integrating information between actors. For example, semantic gaps, or the lack of meaning definition at the conceptual and technical levels, arise fundamentally from the use of representations that map the ‘world’ into models in an endeavour to anticipate different actors’ views, vocabulary, and objectives. One of the primary research challenges addressed in this work is the formalization and representation of document content, where most existing approaches are limited in capability and take into account only the explicit, word-based information in the document. The research described in this paper explores how traditional knowledge representations can be enriched by incorporating implicit information derived from the complex relationships (the Semantic Associations) modelled by domain ontologies, combined with the information presented in documents, thereby providing a baseline for facilitating knowledge interpretation and sharing between humans and machines. The paper introduces a novel conceptual framework for representing knowledge sources, in which each knowledge source is semantically represented (within its domain of use) by a Semantic Vector. This work contributes to the enrichment of Semantic Vectors by extending the classical vector space model with ontological support, employing ontology concepts and their relations in the enrichment process.
The test bed for assessing the approach is the Building and Construction (B&C) industry, using an appropriate B&C domain ontology. Preliminary results, collected using a clustering algorithm for document classification, indicate that the proposed approach improves the precision and recall of classifications. Future work and open issues are also discussed.
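As an illustration only, the idea of enriching a word-based vector with ontology relations can be sketched as follows. This is a minimal sketch under stated assumptions, not the framework described in the paper: the toy ontology `ONTOLOGY_RELATIONS`, its relation strengths, the `boost` parameter, and the function names `keyword_vector`, `enrich` and `cosine` are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy ontology: concept -> {related concept: relation strength}.
# The concepts and strengths are invented for illustration.
ONTOLOGY_RELATIONS = {
    "beam": {"column": 0.5, "structure": 0.3},
    "column": {"beam": 0.5},
    "concrete": {"cement": 0.6},
}

# Every concept that appears in the toy ontology, as a source or as a target.
CONCEPTS = set(ONTOLOGY_RELATIONS) | {
    related for targets in ONTOLOGY_RELATIONS.values() for related in targets
}


def keyword_vector(tokens, concepts):
    """Word-based vector: normalised frequency of each known concept."""
    counts = Counter(t for t in tokens if t in concepts)
    total = sum(counts.values())
    return {c: n / total for c, n in counts.items()} if total else {}


def enrich(vector, relations, boost=0.5):
    """Add weight to concepts related to those present, then renormalise."""
    enriched = dict(vector)
    for concept, weight in vector.items():
        for related, strength in relations.get(concept, {}).items():
            enriched[related] = enriched.get(related, 0.0) + boost * strength * weight
    total = sum(enriched.values())
    return {c: w / total for c, w in enriched.items()} if total else {}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(c, 0.0) for c, w in a.items())
    norm = math.sqrt(sum(w * w for w in a.values())) * math.sqrt(
        sum(w * w for w in b.values())
    )
    return dot / norm if norm else 0.0


doc1 = keyword_vector(["beam", "beam", "concrete"], CONCEPTS)
doc2 = keyword_vector(["column", "cement"], CONCEPTS)

plain_similarity = cosine(doc1, doc2)
enriched_similarity = cosine(
    enrich(doc1, ONTOLOGY_RELATIONS), enrich(doc2, ONTOLOGY_RELATIONS)
)
```

In this toy case the two documents share no terms, so their word-based vectors are orthogonal, while the enriched vectors overlap through the related concepts. A clustering algorithm run on such enriched vectors could then group documents that a purely word-based representation keeps apart.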
Notes
It contains a list of stop words that is used by the RapidMiner tool.
Cite this article
Costa, R., Lima, C., Sarraipa, J. et al. Facilitating knowledge sharing and reuse in building and construction domain: an ontology-based approach. J Intell Manuf 27, 263–282 (2016). https://doi.org/10.1007/s10845-013-0856-5
DOI: https://doi.org/10.1007/s10845-013-0856-5