Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter Oldenbourg February 11, 2020

(Deep) FAIR mathematics

  • Katja Berčič

    Dr. Katja Berčič is a postdoc in the research group for Knowledge Representation/Processing (Computer Science) at FAU Erlangen-Nürnberg. While working on her PhD in mathematics (combinatorics) she became interested in how to improve the way mathematicians work with data. She followed this interest during a postdoc at the National Autonomous University of Mexico. Her research interests lie in the areas of knowledge representation and management, particularly for mathematics.

    EMAIL logo
    , Michael Kohlhase

    Prof. Dr. Michael Kohlhase is professor for Knowledge Representation/Processing (Computer Science) at FAU Erlangen-Nürnberg and adjunct associate professor for Computer Science at Carnegie Mellon University. His research interests include knowledge representation for STEM (science, technology, engineering, mathematics), inference-based techniques for natural language processing, computer-supported education, and user assistance. He pursues these (interrelated) topics focusing on the aspects of modular foundations (usually logical methods) and large-scale structures in document corpora. He has pursued these interests during extended visits to Carnegie Mellon University, SRI International, and the Universities of Amsterdam, Edinburgh, and Auckland.

    ORCID logo
    and Florian Rabe

    PD Dr. Florian Rabe is a senior researcher at the Universities of Erlangen-Nürnberg. His research interests include logics, type systems, and programming languages for computer science and mathematics as well as knowledge representation, scalable implementation, and system interoperability for them. He is the creator and main author of the MMT language and system.

Abstract

In this article, we analyze the state of research data in mathematics. We find that while the mathematical community embraces the notion of open data, the FAIR principles are not yet sufficiently realized. Indeed, we claim that the case of mathematical data is special, since the objects of interest are abstract (all properties can be known) and complex (they have a rich inner structure that must be represented). We present a novel classification of mathematical data and derive an extended set of FAIR requirements, which accomodate the special needs of math datasets. We summarize these as deep FAIR. Finally, we show a prototypical system infrastructure, which can realize deep FAIRness for one category (tabular data) of mathematical datasets.

ACM CCS:

Award Identifier / Grant number: RA-18723-1 OAF

Award Identifier / Grant number: 676541

Funding statement: The authors were supported by DFG grant RA-18723-1 OAF and European Commission grant Horizon 2020 ERI 676541 OpenDreamKit. Most of the implementation of the current MathDataHub prototype is due to Tom Wiesing.

About the authors

Katja Berčič

Dr. Katja Berčič is a postdoc in the research group for Knowledge Representation/Processing (Computer Science) at FAU Erlangen-Nürnberg. While working on her PhD in mathematics (combinatorics) she became interested in how to improve the way mathematicians work with data. She followed this interest during a postdoc at the National Autonomous University of Mexico. Her research interests lie in the areas of knowledge representation and management, particularly for mathematics.

Michael Kohlhase

Prof. Dr. Michael Kohlhase is professor for Knowledge Representation/Processing (Computer Science) at FAU Erlangen-Nürnberg and adjunct associate professor for Computer Science at Carnegie Mellon University. His research interests include knowledge representation for STEM (science, technology, engineering, mathematics), inference-based techniques for natural language processing, computer-supported education, and user assistance. He pursues these (interrelated) topics focusing on the aspects of modular foundations (usually logical methods) and large-scale structures in document corpora. He has pursued these interests during extended visits to Carnegie Mellon University, SRI International, and the Universities of Amsterdam, Edinburgh, and Auckland.

Florian Rabe

PD Dr. Florian Rabe is a senior researcher at the Universities of Erlangen-Nürnberg. His research interests include logics, type systems, and programming languages for computer science and mathematics as well as knowledge representation, scalable implementation, and system interoperability for them. He is the creator and main author of the MMT language and system.

References

1. Gunnar Brinkmann et al. “House of Graphs: A database of interesting graphs”. Discrete Appl. Math. 161.1-2 (2013), pp. 311–314. ISSN: 0166-218X. DOI: 10.1016/j.dam.2012.07.018.Search in Google Scholar

2. Hans Ulrich Besche, Bettina Eick, and E.A. O’Brien. “A Millennium Project: Constructing small groups”. Intern. J. Alg. and Comput. 12.05 (2002), pp. 623–644.10.1142/S0218196702001115Search in Google Scholar

3. Katja Bercic. Math Databases wiki. URL: https://github.com/MathHublnfo/Documentation/wiki/Math-Databases (visited on 01/15/2019).Search in Google Scholar

4. Jan De Beule et al. GAP package Digraphs. URL: https://www.gap-system.org/Packages/digraphs.html (visited on 01/25/2019).Search in Google Scholar

5. Katja Bercic, Michael Kohlhase, and Florian Rabe. “Towards a Unified Mathematical Data Infrastructure: Database and Interface Generation”. In: Intelligent Computer Mathematics (CICM 2019). Ed. by Cezary Kaliszyck et al. LNAI 11617. Springer, 2019, pp. 28-43. DOI: 10.1007/978-3-030-23250-4.Search in Google Scholar

6. Sara C. Billey and Bridget E. Tenner. “Fingerprint databases for theorems”. Notices Amer. Math. Soc. 60.8 (2013), pp. 1034–1039. ISSN: 0002-9920. DOI: 10.1090/noti1029.Search in Google Scholar

7. John Cremona. “The L-functions and modular forms database project”. Foundations of Computational Mathematics 16.6 (2016), pp. 1541–1553. ISSN: 1615-3383. DOI: 10.1007/s10208-016-9306-z.Search in Google Scholar

8. DataMathHub - Census of Small, Connected, Cubic, Vertex-Transitive Graphs. URL: https://data.mathhub.info/collection/cvt/ (visited on 12/14/2019).Search in Google Scholar

9. vt_schema.mmt. URL: https://gl.mathhub.info/ODK/mbgen/blob/master/source/cvt_schema.mmt (visited on 12/14/2019).Search in Google Scholar

10. Digraphs file format incompatibility. URL: https://github.com/gap-packages/Digraphs/issues/158 (visited on 01/25/2019).Search in Google Scholar

11. Datasets on MathHub.info. URL: https://data.mathhub.info (visited on 09/24/2019).Search in Google Scholar

12. Dataset Search. URL: https://datasetsearch.research.google.com/ (visited on 01/25/2020).Search in Google Scholar

13. European Commission Expert Group on FAIR Data. Turning FAIR into reality. 2018.Search in Google Scholar

14. GDML. URL: https://en.wikipedia.org/wiki/Global_Digital_Mathematics_Library (visited on 01/28/2019).Search in Google Scholar

15. Michael Kohlhase et al. “Knowledge-Based Interoperability for Mathematical Software Systems”. In: MACIS 2017: Seventh International Conference on Mathematical Aspects of Computer and Information Sciences. Ed. by Johannes Blomer, Temur Kutsia, and Dimitris Simos. LNCS 10693. Springer-Verlag, 2017, pp. 195-210. URL: https://github.com/OpenDreamKit/OpenDreamKit/blob/master/WP6/MACIS17-interop/crc.pdf.10.1007/978-3-319-72453-9_14Search in Google Scholar

16. The L-functions and Modular Forms Database. URL: http://www.lmfdb.org (visited on 02/01/2016).Search in Google Scholar

17. Brendan McKay. Graph formats. URL: http://users.cecs.anu.edu.au/~bdm/data/formats.html (visited on 01/25/2019).Search in Google Scholar

18. MitM: The Math-in-the-Middle Ontology. URL: https://mathhub.info/library/group?id=MitM (visited on 12/05/2019).Search in Google Scholar

19. Ron Ausbrooks et al. Mathematical Markup Language (MathML) Version 3.0. Ed. by David Carlisle, Patrick Ion, and Robert Miner, 2010. URL: http://www.w3.org/TR/MathML3.Search in Google Scholar

20. MMT - Language and System for the Uniform Representation of Knowledge. Project web site. URL: https://uniformal.github.io/ (visited on 01/15/2019).Search in Google Scholar

21. Modelica and the Modelica Association. URL: https://modelica.org/ (visited on 01/22/2019).Search in Google Scholar

22. MathWebSearch. URL: https://github.com/MathWebSearch (visited on 01/05/2020).Search in Google Scholar

23. Developing a 21st Century Global Library for Mathematics Research. Tech. rep. National Research Council, 2014. DOI: 10.17226/18619.10.17226/18619Search in Google Scholar

24. The OBO Foundry. URL: http://www.obofoundry.org/ (visited on 12/13/2019).Search in Google Scholar

25. The On-Line Encyclopedia of Integer Sequences. URL: http://oeis.org (visited on 05/28/2017).Search in Google Scholar

26. Primoz Potocnik, Pablo Spiga, and Gabriel Verret. “Cubic vertex-transitive graphs on up to 1280 vertices”. J. Symbolic Comput. 50 (2013), pp. 465–477. ISSN: 0747-7171. DOI: 10.1016/j.jsc.2012.09.002.Search in Google Scholar

27. The Systems Biology Markup Language. URL: http://sbml.org (visited on 03/17/2017).Search in Google Scholar

28. Wikidata:Introduction. URL: https://wikidata.org/wiki/Wikidata:Introduction (visited on 01/25/2015).Search in Google Scholar

29. Mark D. Wilkinson et al. “The FAIR Guiding Principles for scientific data management and stewardship”. Scientific Data 3 (2016), 160018. DOI: 10.1038/sdata.2016.18.Search in Google Scholar PubMed PubMed Central

30. Tom Wiesing, Michael Kohlhase, and Florian Rabe. “Virtual Theories - A Uniform Interface to Mathematical Knowledge Bases”. In: MACIS 2017: Seventh International Conference on Mathematical Aspects of Computer and Information Sciences. Ed. by Johannes Blomer, Temur Kutsia, and Dimitris Simos. LNCS 10693. Springer-Verlag, 2017, pp. 243-257. URL: https://github.com/OpenDreamKit/OpenDreamKit/blob/master/WP6/MACIS17-vt/crc.pdf.10.1007/978-3-319-72453-9_17Search in Google Scholar

31. Mark Ziemann, Yotam Eren, and Assam El-Osta. “Gene name errors are widespread in the scientific literature”. Genome Biology 17 (2016), 177. DOI: 10.1186/s13059-016-1044-7.Search in Google Scholar PubMed PubMed Central

Received: 2019-07-31
Revised: 2019-12-15
Accepted: 2020-01-25
Published Online: 2020-02-11
Published in Print: 2020-02-25

© 2020 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 25.4.2024 from https://www.degruyter.com/document/doi/10.1515/itit-2019-0028/html
Scroll to top button