Towards measuring the semantic capacity of a physical medium demonstrated with elementary cellular automata

doi:10.1016/j.biosystems.2017.11.007

Biosystems

Volume 164, February 2018, Pages 177-185


Abstract

The organic code concept and its operationalization by molecular codes have been introduced to study the semiotic nature of living systems. This contribution further develops the idea that the semantic capacity of a physical medium can be measured by assessing its ability to implement a code as a contingent mapping. For demonstration and evaluation, the approach is applied to a formal medium: elementary cellular automata (ECA). The semantic capacity is measured by counting the number of ways codes can be implemented. Additionally, a link to information theory is established by using multivariate mutual information to quantify contingency. It is shown how ECAs differ in their semantic capacities, how this is related to various ECA classifications, and how this depends on how a meaning is defined. Interestingly, if the meaning should persist for a certain while, the highest semantic capacity is found in CAs with apparently simple behavior, i.e., the fixed-point and two-cycle class. Synergy can be used as a predictor for a CA's ability to implement codes only if code-implementing contexts are common. For large context spaces with sparse coding contexts, synergy is a weak predictor. In conclusion, the approach presented here can distinguish CA-like systems with respect to their ability to implement contingent mappings. Applying this to physical systems appears straightforward and might lead to a novel physical property indicating how suitable a physical medium is to implement a semiotic system.

Introduction

Understanding the origin of life and its further evolution should include understanding how the first organic semantic information systems appeared (Küppers, 2015, Walker et al., 2017) and how the processing of semantic information evolved over time (Hernández and Jagus, 2016, Barbieri, 2015). The contingency between physical information carriers has been identified as a key property of semantic information (Monod, 1971, Barbieri, 2008). Examples are the mapping of an external signaling molecule to a second messenger inside a cell, or the mapping from triplets to amino acids in the genetic code, which are examples of organic codes (Barbieri, 2015). The development of such arbitrary symbol-meaning mappings may have added great flexibility in an organism's control system and increased its evolvability.

Following this line of thought, a formal method to assess the capacity of a chemical reaction network to process semantic information has recently been suggested (Görlich and Dittrich, 2013) and applied (Görlich et al., 2014, Neu et al., 2017). The basic idea is to measure semantic capacity as the number of molecular codes a network can implement. According to Görlich and Dittrich (2013), a molecular code is a contingent mapping between molecular species, that is, a mapping that cannot be inferred from knowing the network and the species alone. So far, the algorithms for finding all molecular codes of a given reaction network consider only the network's structure and do not require any kinetic information. A computational analysis of some chemical systems has pointed to a large spectrum of semantic capacities (Görlich and Dittrich, 2013, Neu et al., 2017). Basically no semantic capacity was found in a model of the atmospheric chemistry of Mars and in combustion chemistries, whereas biochemical systems possess very high semantic capacities. Consequently, the hypothesis has been derived that life, over the course of evolution, is gaining access to (chemical) systems with increasing semantic capacity, that is, with an increasing ability to implement contingent mappings.

This paper further develops the idea that the semantic capacity of a physical medium can be measured by assessing its ability to implement a contingent mapping, called a code. For that, contingency (also called arbitrariness) is understood as a property of a mapping with respect to a medium. The medium can be physical (e.g., the combustion chemistry of hydrogen) or formal (e.g., a particular metabolic reaction network model or a cellular automaton as studied here). Furthermore, it is required that the medium can be configured in different ways, so that the mapping can be implemented using particularly configured instances of the medium. It can thus be assumed that the mapping maps elements of the medium (called signs) to other elements of the medium (called meanings). Such a mapping is called contingent if an alternative mapping from the same domain of signs to the same codomain of meanings can be implemented by the same medium using a different configuration (called an alternative context). Note that those mappings must be non-trivial, e.g., not constant, which is necessary for useful information processing.
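The definition of contingency above can be illustrated with a toy sketch. It is not the procedure used in the paper: the function names, the representation of a medium as a function `medium(context, sign)`, and the pair count are all assumptions made for illustration only. A mapping is obtained per context, constant mappings are discarded as trivial, and two different mappings count as alternatives if they use the same set of meanings.

```python
from itertools import combinations


def implemented_mappings(medium, signs, contexts):
    """Collect the distinct non-constant sign-to-meaning mappings.

    `medium(context, sign)` is a hypothetical stand-in for a configured
    medium; it returns the meaning produced for `sign` under `context`.
    The result maps each mapping (a tuple of meanings, one per sign)
    to the list of contexts that implement it.
    """
    mappings = {}
    for context in contexts:
        mapping = tuple(medium(context, sign) for sign in signs)
        if len(set(mapping)) > 1:  # drop trivial (constant) mappings
            mappings.setdefault(mapping, []).append(context)
    return mappings


def count_contingent_pairs(medium, signs, contexts):
    """Count pairs of different non-trivial mappings with the same codomain."""
    mappings = implemented_mappings(medium, signs, contexts)
    return sum(1 for a, b in combinations(mappings, 2) if set(a) == set(b))


def xor_medium(context, sign):
    """Toy medium: context 0 implements identity, context 1 negation."""
    return sign ^ context


def and_medium(context, sign):
    """Toy medium: context 0 yields a constant mapping, context 1 identity."""
    return sign & context


print(count_contingent_pairs(xor_medium, (0, 1), (0, 1)))
print(count_contingent_pairs(and_medium, (0, 1), (0, 1)))
```

In this sketch the XOR medium offers two alternative non-trivial mappings on the same signs and meanings, so it can implement a contingent mapping, whereas the AND medium offers only one non-trivial mapping and therefore no alternative.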

For demonstration and evaluation, the approach is applied to elementary cellular automata (ECA) (Wolfram, 1984), which can represent certain physical media (Wolfram, 2002). Despite their simplicity, ECAs display a wide spectrum of different behaviors (Martinez, 2013), including computational universality (Cook, 2004). Because there are only 256 different ECAs, each denoted by a number in a standard way (Wolfram, 1984, Wikipedia, 2017), the space of all ECAs can be easily explored computationally.

Furthermore, in this work a link to information theory (Shannon, 1948) is established by using multivariate mutual information (MMI) (McGill, 1954) to quantify contingency. A negative value of MMI applied to three random variables representing sign, meaning, and context, respectively, is taken as an indication of contingency.
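As a hedged illustration of this measure, the sketch below estimates the MMI of three variables from a list of equally weighted (sign, meaning, context) samples. It assumes the sign convention I(S;M;C) = I(S;M) − I(S;M|C), under which synergistic dependencies give negative values; the opposite convention also appears in the literature, and the excerpt does not spell out which formula the paper uses. The function names are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(samples):
    """Shannon entropy in bits of the empirical distribution of `samples`."""
    counts = Counter(samples)
    total = len(samples)
    return -sum((c / total) * log2(c / total) for c in counts.values())


def multivariate_mutual_information(triples):
    """Estimate I(S;M;C) from (sign, meaning, context) samples.

    Uses inclusion-exclusion over joint entropies:
    H(S) + H(M) + H(C) - H(S,M) - H(S,C) - H(M,C) + H(S,M,C),
    which equals I(S;M) - I(S;M|C) (the sign convention assumed here).
    """
    triples = list(triples)
    s = [t[0] for t in triples]
    m = [t[1] for t in triples]
    c = [t[2] for t in triples]
    return (entropy(s) + entropy(m) + entropy(c)
            - entropy(list(zip(s, m)))
            - entropy(list(zip(s, c)))
            - entropy(list(zip(m, c)))
            + entropy(triples))


# The context selects between identity and negation of the sign.
contingent = [(s, s ^ c, c) for s in (0, 1) for c in (0, 1)]
# The meaning copies the sign regardless of context.
fixed = [(s, s, c) for s in (0, 1) for c in (0, 1)]

print(multivariate_mutual_information(contingent))
print(multivariate_mutual_information(fixed))
```

In the first toy data set the meaning can be predicted only from sign and context together, which gives a negative value (−1 bit) under the assumed convention. In the second, the context is irrelevant and the value is zero.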

The following results show how ECAs differ in their semantic capacities and that there is no trivial relation to a CA's behavioral class (Martinez, 2013), shown for the classification according to Oliveira et al. (2001). Furthermore, it is shown that the semantic capacity depends on how a meaning is defined. Interestingly, if the meaning should persist for a certain while, the highest semantic capacity is found in CAs with apparently simple behavior, i.e., the fixed-point and two-cycle class.

The choice of medium, the definitions of signs, meanings, and contexts, and the way meanings are used to implement mappings all strongly influence the measured semantic capacity, as the subsequent study of ECAs also shows. However, the study also suggests a certain robustness of classifying the semantic capacity. Furthermore, it is possible to understand how a particular choice influences the results, which might lead to a concept of relative semantic capacity, i.e., relating the semantic capacity to how signs and meanings are actually used. This will add a pragmatic level to the theory, which needs to be carefully investigated in the future.

Section snippets

Elementary cellular automaton (ECA)

An elementary cellular automaton (Wolfram, 1984) is a deterministic, homogeneous, one-dimensional, binary cellular automaton with a neighborhood size of two, i.e., the new state c_i(t + 1) ∈ {0, 1} of a cell i at time t + 1 depends on its own previous state c_i(t) and the previous states of its two immediate neighbors i − 1 and i + 1, yielding the update scheme: c_i(t + 1) = f(c_{i−1}(t), c_i(t), c_{i+1}(t)). There are 256 different local state transition functions f : {0, 1}³ → {0, 1} and thus 256
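The update scheme can be sketched in a few lines. This is a minimal illustration and not code from the paper: the function names are hypothetical, the rule number is decoded in the standard Wolfram way (bit k of the rule number is the output for the neighborhood whose three cells read as the binary number k), and periodic boundary conditions are an assumption, since the excerpt does not state how boundaries are handled.

```python
def eca_rule_table(rule_number):
    """Map each neighborhood (left, center, right) to the new cell state."""
    if not 0 <= rule_number <= 255:
        raise ValueError("an ECA rule number lies in 0..255")
    table = {}
    for index in range(8):
        left, center, right = (index >> 2) & 1, (index >> 1) & 1, index & 1
        # Bit `index` of the rule number is the output for this neighborhood.
        table[(left, center, right)] = (rule_number >> index) & 1
    return table


def eca_step(cells, rule_number):
    """Apply one synchronous update, assuming periodic boundaries."""
    table = eca_rule_table(rule_number)
    n = len(cells)
    return [
        table[(cells[(i - 1) % n], cells[i], cells[(i + 1) % n])]
        for i in range(n)
    ]


# Rule 90 sets each cell to the XOR of its two neighbors.
print(eca_step([0, 0, 1, 0, 0], 90))
```

Because each of the 8 possible neighborhoods maps independently to 0 or 1, there are 2^8 = 256 such tables, one per rule number.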

A general approach to assess the semantic capacity of a medium

This paper follows the idea to measure the semantic capacity of a medium by assessing its ability to implement contingent mappings. A medium with a high semantic capacity can easily implement a (non-trivial) mapping f : S → M and a (non-trivial) alternative mapping f′ : S → M on the same domain (called signs) and codomain (called meanings). In order to show this, a number of assumptions are necessary and decisions have to be made that are described in the following procedure. The procedure is

Results

The results section shows how ECAs differ in their semantic capacity and how the semantic capacity is related to the behavioral class and to information-theoretic properties (synergy) of the CA.

Discussion

The ability to implement a contingent mapping (code) has been found in all behavioral classes of elementary cellular automata as defined by Oliveira et al. (2001). There is even a null-class ECA, i.e., an ECA with a single fixed point as the only global attractor, that is able to implement a code. All two-cycle, complex, and chaotic ECAs can implement codes if the meaning needs to persist only for a short amount of time (d = 1, Fig. 6 left).

The actual measurement depends on what is considered to be a

Conclusion

This paper indicates how a medium's capacity to implement contingent mappings, called codes, can be measured objectively, suggesting that physical media can also be distinguished by measuring their ability to implement contingent mappings. Ultimately, this ability becomes a physical property. And since this ability is also suggested to be an important (or even essential) component of semantic information processing and meaningful sign-mediated communication, this work contributes also to a physical

References (24)

S. Wolfram. Universality and complexity in cellular automata. Phys. D: Nonlinear Phenom. (1984)
M. Barbieri. Biosemiotics: a new understanding of life. Naturwissenschaften (2008)
M. Barbieri. Code Biology (2015)
N. Bertschinger et al. Quantifying unique information. Entropy (2014)
J. Čejková et al. Chemotaxis and chemokinesis of living and non-living objects
M. Cook. Universality in elementary cellular automata. Complex Syst. (2004)
D. Görlich et al. Molecular codes in biological and chemical reaction networks. PLoS ONE (2013)
D. Görlich et al. Molecular codes in the human inner-kinetochore model: relating CENPS to function. Biosemiotics (2014)
M. Harder et al. Bivariate measure of redundant information. Phys. Rev. E (2013)
B.O. Küppers. The nucleation of semantic information in prebiotic matter
J.T. Lizier et al. Towards a synergy-based approach to measuring information modification. 2013 IEEE Symposium on Artificial Life (ALIFE) (2013)

