skip to main content
10.1145/3148011.3154473acmconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
short-paper

Mixed-Initiative Feature Engineering Using Knowledge Graphs

Published: 04 December 2017 Publication History

Abstract

This paper proposes a mixed-initiative feature engineering approach using explicit knowledge captured in a knowledge graph complemented by a novel interactive visualization method. Using the explicitly captured relations and dependencies between concepts and their properties, feature engineering is enabled in a semi-automatic way. Furthermore, the results (and decisions) obtained throughout the process can be utilized for refining the features and the knowledge graph. Analytical requirements can then be conveniently captured for feature engineering -- enabling integrated semantics-driven data analysis and machine learning.

References

[1]
Taqwa Ahmed Alhaj, Maheyzah Md Siraj, Anazida Zainal, Huwaida Tagelsir Elshoush, and Fatin Elhaj. 2016. Feature Selection Using Information Gain for Improved Structural-Based Alert Correlation. PloS One 11, 11 (2016), e0166017.
[2]
Martin Atzmueller, Joachim Baumeister, Achim Hemsing, Ernst-Jürgen Richter, and Frank Puppe. 2005. Subgroup Mining for Interactive Knowledge Refinement. In Proc. AIME (LNAI 3581). Springer Verlag, Heidelberg, Germany, 453--462.
[3]
Martin Atzmueller, Naveed Hayat, Andreas Schmidt, and Benjamin Klöpper. 2017. Explanation-Aware Feature Selection using Symbolic Time Series Abstraction: Approaches and Experiences in a Petro-Chemical Production Context. In Proc. IEEE INDIN. IEEE Press, Boston, MA, USA.
[4]
Martin Atzmueller, Benjamin Kloepper, Hassan Al Mawla, Benjamin Jäschke, Martin Hollender, Markus Graube, David Arnu, Andreas Schmidt, Sebastian Heinze, Lukas Schorer, Andreas Kroll, Gerd Stumme, and Leon Urbas. 2016. Big Data Analytics for Proactive Industrial Decision Support. atp edition 58, 9 (2016).
[5]
Martin Atzmueller and Frank Puppe. 2008. A Case-Based Approach for Characterization and Analysis of Subgroup Patterns. Appl. Intell. 28, 3 (2008), 210--221.
[6]
Martin Atzmueller and Thomas Roth-Berghofer. 2010. The Mining and Analysis Continuum of Explaining Uncovered. In Proc. AI-2010. London, UK.
[7]
Christian Bizer, Jens Lehmann, Georgi Kobilarov, Sören Auer, Christian Becker, Richard Cyganiak, and Sebastian Hellmann. 2009. DBpedia - A Crystallization Point for the Web of Data. Web Semantics 7, 3 (2009), 154--165.
[8]
Brigitte Boden, Roman Haag, and Thomas Seidl. 2013. Detecting and Exploring Clusters in Attributed Graphs: A Plugin for the Gephi Platform. In Proc. ACM CIKM. ACM, New York, NY, USA, 2505--2508.
[9]
Duen Horng Chau, Leman Akoglu, Jilles Vreeken, Hanghang Tong, and Christos Faloutsos. 2012. TourViz: Interactive Visualization of Connection Pathways in Large Graphs. In Proc. ACM KDD. ACM, New York, NY, USA, 1516--1519.
[10]
Dejing Dou, Hao Wang, and Haishan Liu. 2015. Semantic Data Mining: A Survey of Ontology-based Approaches. In IEEE ICSC. IEEE, 244--251.
[11]
Santo Fortunato. 2010. Community Detection in Graphs. Physics Reports 486, 3-5 (2010), 75--174.
[12]
Ivan Herman, Guy Melançon, and M Scott Marshall. 2000. Graph Visualization and Navigation in Information Visualization: A Survey. IEEE Transactions on Visualization and Computer Graphics 6, 1 (2000), 24--43.
[13]
Johannes Hoffart, Fabian M Suchanek, Klaus Berberich, and Gerhard Weikum. 2013. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia. Artificial Intelligence 194 (2013), 28--61.
[14]
Ahsan Morshed, Ritaban Dutta, and Jagannath Aryal. 2013. Recommending Environmental Knowledge as Linked Open Data Cloud using Semantic Machine Learning. In Proc. IEEE ICDEW. IEEE, 27--28.
[15]
Heiko Paulheim. 2017. Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods. Semantic web 8, 3 (2017), 489--508.
[16]
Heiko Paulheim and Johannes Fümkranz. 2012. Unsupervised Generation of Data Mining Features from Linked Open Data. In Proc. WIMS. ACM, 31.
[17]
Petar Ristoski and Heiko Paulheim. 2016. Semantic Web in Data Mining and Knowledge Discovery: A Comprehensive Survey. Web Semantics 36 (2016), 1--22.
[18]
Ilaria Tiddi, Mathieu d'Aquin, and Enrico Motta. 2015. An Ontology Design Pattern to Define Explanations. In Proc. K-Cap. ACM, New York, NY, USA.
[19]
Anze Vavpetic, Vid Podpecan, and Nada Lavrac. 2014. Semantic Subgroup Explanations. Journal of Intelligent Information Systems 42, 2 (2014), 233--254.
[20]
Xander Wilcke, Peter Bloem, and Victor de Boer. 2017. The Knowledge Graph as the Default Data Model for Learning on Heterogeneous Knowledge. Data Science Preprint (2017), 1--19.

Cited By

View all
  • (2024)AutoCL: AutoML for Concept LearningExplainable Artificial Intelligence10.1007/978-3-031-63787-2_7(117-136)Online publication date: 10-Jul-2024
  • (2023)Advanced Analytics on Complex Industrial DataData Science for Entrepreneurship10.1007/978-3-031-19554-9_9(177-203)Online publication date: 24-Mar-2023
  • (2021)Semantic Data Mining in Ubiquitous Sensing: A SurveySensors10.3390/s2113432221:13(4322)Online publication date: 24-Jun-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
K-CAP '17: Proceedings of the 9th Knowledge Capture Conference
December 2017
271 pages
ISBN:9781450355537
DOI:10.1145/3148011
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 December 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. feature engineering
  2. knowledge graph
  3. machine learning

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

K-CAP 2017
Sponsor:
K-CAP 2017: Knowledge Capture Conference
December 4 - 6, 2017
TX, Austin, USA

Acceptance Rates

Overall Acceptance Rate 55 of 198 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)AutoCL: AutoML for Concept LearningExplainable Artificial Intelligence10.1007/978-3-031-63787-2_7(117-136)Online publication date: 10-Jul-2024
  • (2023)Advanced Analytics on Complex Industrial DataData Science for Entrepreneurship10.1007/978-3-031-19554-9_9(177-203)Online publication date: 24-Mar-2023
  • (2021)Semantic Data Mining in Ubiquitous Sensing: A SurveySensors10.3390/s2113432221:13(4322)Online publication date: 24-Jun-2021
  • (2021)Interpretable Machine Learning: A brief survey from the predictive maintenance perspective2021 26th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA )10.1109/ETFA45728.2021.9613467(01-08)Online publication date: 7-Sep-2021
  • (2019)Onto Model-based Anomalous Link Pattern Mining on Feature-Rich Social Interaction NetworksCompanion Proceedings of The 2019 World Wide Web Conference10.1145/3308560.3316707(1047-1050)Online publication date: 13-May-2019
  • (2019)Neural Feature Search: A Neural Architecture for Automated Feature Engineering2019 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM.2019.00017(71-80)Online publication date: Nov-2019
  • (2018)Explicative human activity recognition using adaptive association rule-based classification2018 IEEE International Conference on Future IoT Technologies (Future IoT)10.1109/FIOT.2018.8325603(1-6)Online publication date: Jan-2018
  • (2018)Knowledge-Based Mining of Exceptional Patterns in Logistics Data: Approaches and Experiences in an Industry 4.0 ContextFoundations of Intelligent Systems10.1007/978-3-030-01851-1_7(67-77)Online publication date: 7-Oct-2018
  • (2018)Declarative Aspects in Explicative Data Mining for Computational SensemakingDeclarative Programming and Knowledge Management10.1007/978-3-030-00801-7_7(97-114)Online publication date: 27-Sep-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media