DOI: 10.1145/3387940.3392161
research-article

DockerKG: A Knowledge Graph of Docker Artifacts

Published: 25 September 2020

Abstract

Docker helps developers reuse software artifacts by providing a lightweight solution to operating-system virtualization. A Docker image carries rich software engineering knowledge, including the sources of software packages, the correlations among them, their installation methods, and information about the underlying operating system. To capture this knowledge, this paper proposes an approach for constructing a knowledge graph of Docker artifacts, named DockerKG, by analyzing a large number of Dockerfiles from Docker Hub, which hosted more than 3.08 million Docker repositories as of February 2020. DockerKG currently contains domain knowledge extracted from approximately 200,000 Dockerfiles in Docker Hub, along with information on Docker repositories and their semantic tags. In future work, DockerKG could support Docker image recommendation and an online Q&A service that provides software engineering domain knowledge.
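
To make the idea of extracting knowledge from Dockerfiles concrete, the following is a minimal sketch of how a single Dockerfile might be turned into subject-relation-object triples. It is not the DockerKG pipeline: the abstract does not describe the extraction method or the graph schema, so the function name `extract_triples`, the relation names (`based_on`, `installs`, `installed_via`), and the small table of recognized install commands are all assumptions made for illustration.

```python
import re

# Hypothetical mapping from install command prefixes to package managers.
# The abstract does not list which installation methods DockerKG recognizes.
INSTALLERS = {
    "apt-get install": "apt",
    "apk add": "apk",
    "yum install": "yum",
    "pip install": "pip",
}


def extract_triples(repo, dockerfile_text):
    """Return (subject, relation, object) triples for one Dockerfile.

    Relation names are illustrative assumptions, not DockerKG's schema.
    """
    triples = []
    # Join backslash-continued lines so each instruction sits on one line.
    text = re.sub(r"\\\s*\n", " ", dockerfile_text)
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        instruction, _, rest = line.partition(" ")
        instruction = instruction.upper()
        if instruction == "FROM":
            # First non-flag token is the base image (skips e.g. --platform=...).
            images = [tok for tok in rest.split() if not tok.startswith("--")]
            if images:
                triples.append((repo, "based_on", images[0]))
        elif instruction == "RUN":
            # A RUN line may chain several shell commands with && or ;
            for command in re.split(r"&&|;", rest):
                command = command.strip()
                for prefix, manager in INSTALLERS.items():
                    if command.startswith(prefix):
                        for token in command[len(prefix):].split():
                            if token.startswith("-"):
                                continue  # skip flags such as -y
                            triples.append((repo, "installs", token))
                            triples.append((token, "installed_via", manager))
    return triples
```

Calling `extract_triples("demo/app", text)` on the contents of a Dockerfile yields a flat list of triples that could then be loaded into a graph store. A realistic extractor would also need to handle shell variables, version pins, and source-based installs, which this sketch ignores.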


Cited By

  • (2024) DocSecKG: A Systematic Approach for Building Knowledge Graph to Understand the Relationship Between Docker Image and Vulnerability. Advanced Intelligent Computing Technology and Applications, pp. 392-404. DOI: 10.1007/978-981-97-5618-6_33. Online publication date: 1-Aug-2024
  • (2022) A Semantic Web-Enabled Approach for Dependency Management. International Journal of Software Engineering and Knowledge Engineering 32:09, pp. 1307-1343. DOI: 10.1142/S0218194022500498. Online publication date: 29-Sep-2022

Published In

ICSEW'20: Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops
June 2020
831 pages
ISBN: 9781450379632
DOI: 10.1145/3387940
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. Docker
  2. Dockerfile
  3. knowledge graph
  4. software package

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICSE '20
ICSE '20: 42nd International Conference on Software Engineering
June 27 - July 19, 2020
Seoul, Republic of Korea

Article Metrics

  • Downloads (Last 12 months): 18
  • Downloads (Last 6 weeks): 1
Reflects downloads up to 08 Feb 2025
